Hemanya Arora

About Me

Senior Data Engineer with proven expertise architecting enterprise-scale data platforms that drive strategic business decisions. Specialized in cloud-native solutions (Azure, Databricks), data modeling, and team leadership. Track record of delivering high-impact data products—from modernizing legacy systems to building ML-powered analytics—that generate measurable business value, cost savings, and operational efficiency across Fortune 500 organizations.

Open for collaboration on interesting data engineering projects and technical leadership opportunities.

Measurable Impact

87%

Processing Time Reduction

40+

Pipelines Migrated to Cloud

$150K+

Annual Cost Savings

6-8

Team Members Led

Volunteering Experience

Code Club Canada

Moderating and co-facilitating regular coding club sessions for students aged 8-12, teaching them the basics of programming languages such as Python and Scratch through Raspberry Pi projects.

References: Madelyn Cugno (Email: maddie@kidscodejeunesse.org)

Skills

Python
SQL
T‑SQL
SSIS, SSMS
Data Modeling
ETL
Databricks
Apache Spark
Azure Data Factory
dbt
Data Lakehouse Platform
Azure DevOps

Resume

Work Experience

Senior Data Engineer: Cineplex Entertainment

April 2024 – Present
  • Platform Migration & Modernization: Spearheaded the migration of 40+ legacy data pipelines from on-premises infrastructure to Databricks, establishing a scalable cloud-native data platform. Orchestrated the end-to-end migration strategy, ensuring zero data loss and minimal business disruption while modernizing critical data workflows.
  • High-Impact Performance Optimization: Re-architected 25-year-old legacy OLAP cubes into modern star schema data models (Media360, Film360) in Databricks, powering executive-level financial reporting in Power BI. Achieved an 87% reduction in processing time (4.5 hours → 36 minutes), enabling faster decision-making for C-suite stakeholders on film performance and revenue distribution across all business verticals.
  • Technical Leadership & Team Management: Led cross-functional teams of 6-8 engineers on enterprise-scale data initiatives. Owned the complete project lifecycle for the Media360 and Film360 data platforms, strategic solutions that drive advertising pricing decisions and power the company's flagship financial reporting system, directly informing multi-million-dollar revenue decisions.
  • Strategic Data Product Development: Architected and deployed production-grade data products integrating multi-source film metadata (YouTube API, Wikipedia, MovieXchange). These pipelines produce foundational datasets for advanced analytics, including long-term attendance forecasting, film recommendation engines, and international market models, enabling data-driven strategic planning across the organization.
  • Production Operations & Reliability Engineering: Established robust monitoring frameworks and business-as-usual (BAU) operational practices, ensuring 99.9%+ pipeline reliability. Implemented proactive alerting, automated recovery mechanisms, and comprehensive data quality validations across mission-critical data flows serving enterprise reporting and ML workloads.
  • Cost Optimization & Vendor Management: Identified and negotiated an alternative data-sourcing strategy, avoiding $150K+ in annual licensing costs while maintaining comprehensive data coverage. Demonstrated business acumen in balancing technical requirements with financial constraints.
  • DevOps & Platform Engineering: Championed infrastructure-as-code practices, implementing CI/CD pipelines with Azure DevOps, service principal authentication, and automated deployment workflows. Enhanced data platform security posture and accelerated development velocity through standardized engineering practices.

Data Science Intern: Kinaxis

September 2022 – December 2023
  • Data Modeling & ETL: Prototyped data-driven models and optimized feature engineering processes, improving AI frameworks for supply chain forecasting.
  • ML Workflow Optimization: Analyzed feature importance and implemented automation algorithms to streamline data segmentation, yielding more accurate forecasting outcomes.
  • Large-Scale Data Handling: Conducted data transformations and exploratory data analysis using Python and PySpark, with a strong focus on data security, compliance, and performance.
  • Agile Collaboration: Engaged in daily scrums with cross-functional teams, ensuring alignment of project deliverables with business goals.

Business Intelligence Analyst (Co-op): Co-operators

August 2021 – October 2021
  • Developed BI solutions (MicroStrategy, IBM Netezza) to improve reporting and analytics.
  • Optimized and validated SQL-based data extractions, ensuring accuracy and consistency.
  • Analyzed relational and non-relational data structures to contribute to more efficient data pipelines.

Programming Instructor: Code Club Canada

September 2020 – September 2022
  • Educated students aged 8–12 in programming basics and Raspberry Pi projects, fostering an early interest in technology.

Certifications & Technical Skills

  • Databricks Certified Data Engineer Associate: Passed on March 9, 2025 (Latest Achievement)
  • Machine Learning – Coursera (Offered by Stanford University), Issued September 2022

Technical Skills

  • Programming: Python, SQL, T‑SQL, PySpark
  • Big Data & Cloud: Databricks, Azure Data Factory, dbt, Apache Spark, Hadoop (HDFS), Azure DevOps
  • Data Engineering: ETL, Data Modeling, Star/Snowflake Schema, Medallion Architecture, CI/CD, Data Orchestration
  • Visualization: Power BI, Tableau, Plotly
  • Workflow/Collaboration: Jira, Confluence, Git (GitHub), Agile/Scrum

Education

Bachelor of Science, Computer Science

York University, Toronto, ON

Technical Expertise

Data Platform Engineering

Enterprise data lakehouse architecture, medallion design patterns, Delta Lake optimization, data mesh principles, star/snowflake schema modeling

Cloud & Infrastructure

Azure ecosystem (ADF, Databricks, DevOps), Infrastructure as Code, CI/CD automation, scalable pipeline orchestration, service principal authentication

Technical Leadership

Cross-functional team management, mentoring data engineers, architectural decision-making, stakeholder communication, project ownership

Let's Connect

Interested in collaboration or discussing data engineering opportunities? Feel free to reach out.

Location:

North York, ON, M3J 2V7
