Rishabh's Portfolio

Work Experience

Lead Data Engineer Two Circles, Vancouver

May 2025 - Present

Responsibilities:

Own the end-to-end architecture of the clickstream analytics platform ingesting 10M+ events/day from NFL+ and premium media properties, driving content engagement, subscription retention, and executive KPI reporting.
Drove the design of a 100+ model dbt warehouse on Redshift with conformed fact and dimension layers, SCD Type 2 snapshots, and SLA-backed freshness contracts, enabling self-service analytics for 30+ downstream consumers.
Led a Redshift cost and performance overhaul (WLM redesign, sort/dist key strategy, materialized views, workload isolation), cutting compute spend by 25% and report latency by 40% while doubling concurrent query throughput.
Architected the unified subscription revenue pipeline consolidating Cleeng and Recurly into a single source of truth, shortening finance close from T+5 to T+1.
Established the team's engineering standards, PR review framework, schema migration playbook, and rollback automation; hired and mentored 2 engineers.

Senior Data Engineer WineDirect, Vancouver

Jun 2023 - May 2025

Responsibilities:

Founded and productionized the dbt practice from zero, establishing the modeling layer on Redshift, Airflow orchestration patterns, testing conventions, and documentation standards; shipped 50+ models in the first year.
Re-architected critical Spark pipelines (partitioning strategy, broadcast joins, adaptive query execution, intelligent caching), cutting processing time by 25% and reducing AWS compute spend across petabyte-scale workloads.
Architected a centralized S3 data lake holding petabytes of historical commerce data and designed the client-facing API access layer, enabling thousands of wineries and retailers to query their data programmatically.
Owned the CI/CD pipeline for schema migrations on GitHub Actions, with automated rollback and blue/green deployment patterns, driving production data incidents down 60% QoQ.
Drove data quality maturity by introducing dbt tests, Great Expectations checks, and freshness SLAs across critical revenue and inventory pipelines.

Data Engineer Skillz, Vancouver

Jun 2021 - May 2023

Responsibilities:

Drove the end-to-end Redshift to Snowflake migration covering 400+ tables across marketing, product, and finance domains, with zero data loss and zero downtime for downstream consumers.
Built the real-time ingestion layer on Snowpipe, Tasks, and External Tables, reducing data latency from hours to minutes for gaming and campaign event streams powering attribution and ML feature pipelines.
Owned query optimization and schema design for high-cardinality event tables, cutting dashboard load times 3–5x on critical campaign reports.
Established the dbt + Great Expectations testing framework across analytics pipelines, defining SLA tiers and quality gates that became the team's standard for production data releases.

Big Data Engineer Intern Samsung Electronics, Canada

May 2020 - Dec 2020

Responsibilities:

Migrated production Spark Scala jobs from AWS EMR to EKS, leveraging Spot Instances and Kubernetes autoscaling to reduce infrastructure costs by 15% while processing petabytes of device telemetry.
Built Airflow-orchestrated ETL pipelines executing SQL and Python workloads across terabyte-scale datasets, with retry, alerting, and SLA tracking baked in.
Provisioned cloud infrastructure (AWS Auto Scaling Groups, IAM, and EKS clusters) with Terraform, enabling repeatable, version-controlled deployments across dev/stage/prod.
Contributed to internal functional data engineering standards covering containerization, CI/CD, and distributed Spark tuning.

Software Engineer, Data Tesco, Bengaluru, India

Aug 2015 - Aug 2019

Responsibilities:

Built and maintained ETL pipelines in IBM DataStage 11.5, integrating structured and semi-structured sources (Parquet, CSV, JSON) to deliver consolidated datasets for banking analytics.
Engineered solutions to handle schema evolution and scaling challenges across multi-source ingestion frameworks.
Seconded to the UK for one year to partner directly with business stakeholders, gathering requirements and building a personalized offers engine spanning Savings, Mortgages, and Loans products.

Education

Master's in Computer Science, Big Data from Simon Fraser University, Canada

2019-2021

CGPA: 4.0 / 4.3

Relevant courses taken:

Machine Learning

Statistics

Big Data Systems

Algorithms

Bachelor's in Computer Science from Vellore Institute of Technology, India

2011-2015

CGPA: 9.10 / 10

Relevant courses taken:

Data Mining

Database

Cloud Computing

Academic and Personal Projects

Rishabh Jain

Data Engineer

About Me

What I do

Data Engineer

Cloud Engineer

AI Developer

Technical Skills

Professional Skills

Academic and Personal Projects

Strategic Asset Manager

RETINA

Find your Home

Folio — Portfolio Tracker

Certifications

CKAD: Certified Kubernetes Application Developer

AWS Certified Data Analytics – Specialty

Tableau Desktop Specialist

Data Science Methodology

Featured Posts

Time Series Forecasting: A Deep Dive

Proportion are what’s really needed

Address

Email

Phone

Strategic Asset Manager

Real Time Speech Activated Assistant (RETINA)

Find Your Home

Folio — Portfolio Tracker
