Codebasics

Powered by
Codebasics DA Bootcamp

 

1

Python Project

About Me

I am a Data Engineer professional with 6+ years of experience building scalable data pipelines, modernizing data ecosystems, and enabling analytics across healthcare, banking, and telecom domains. Along with my strong foundation in SQL, Python, Spark, Databricks, AWS, and Azure, I’ve recently expanded my expertise into Generative AI. I hold the AWS Educate Introduction to Generative AI and Oracle Cloud Infrastructure 2025 Certified Generative AI Professional credentials, which equip me with skills to apply AI/ML techniques, leverage LLMs, and integrate cloud-native AI services into enterprise data workflows. My combined certifications—AWS Certified Data Engineer, Databricks Certified Data Engineer Associate, and the latest Generative AI credentials—position me at the intersection of data engineering and applied AI, enabling me to deliver innovative, secure, and business-driven data solutions

Key Skills

AWS (S3, Redshift, Glue, EMR, Lambda, EC2, RDS, IAM, VPC,

Databricks, Hadoop, Spark, Kafka, BigQuery

python

sql

power bi

tableau

ETL

Agile

Waterfall

My Projects

stock analysis package using Python and the yfinance library
stock analysis package using Python and the yfinance library

Domain/Function: Quantitative Finance · Time-Series Analysis · Technical Indicators

My Experience

Data Engineer, | United Health Group | October 2022 – Present

• Designed scalable ETL/ELT pipelines in Python, SQL, and Spark to process 20TB+ daily healthcare and financial data, ensuring HIPAA compliance.

• Architected Delta Lake data models in Databricks with schema evolution, ACID compliance, and time-travel for audit readiness and regulatory reporting.

• Migrated legacy Informatica workflows to AWS (S3, Glue, Redshift, EMR), cutting processing times by 45% and infrastructure costs by 20%.

• Implemented data quality and governance frameworks (Great Expectations, Airflow), reducing incidents by 30% and improving downstream analytics reliability.

• Partnered with Data Science teams to enable fraud detection and predictive models, accelerating deployment of clinical insights.

• Mentored junior engineers, promoting AWS and Databricks best practices.

Data Engineer, HDFC Bank | August 2020 - December 2021

• Built real-time ingestion pipelines with Kafka, NiFi, and Spark Streaming for banking transactions, enabling fraud detection and RBI/AML compliance.

• Automated reconciliations with AWS Lambda, S3, DynamoDB, reducing manual intervention by 40%.

• Optimized BigQuery datasets, improving performance by 60% for compliance dashboards.

• Integrated external APIs (credit bureau, market data) to enhance risk scoring models.

Data Engineer, Wipro |May 2020- Aug 2020

Conducted business process analysis for telecom provisioning, identifying automation opportunities that cut manual order handling by 15%.

Authored FRDs/BRDs for OSS/BSS integrations; collaborated with cross-functional teams to optimize workflows, reducing SLA violations by 20%.

Data Engineer, Cognizant | June 2017- April 2020

Built SQL pipelines for churn analysis and revenue leakage detection, increasing retention by 10%.

Developed BI dashboards in Tableau and SQL for executives, enhancing decision-making.

Reduced data mismatches by 35% through automated integration workflows across CRM and billing systems.

Awards & Certificate

AWS Certified Data Engineer – Associate

AWS Certified Data Engineer – Associate

Let's Connect

Feel free to get in touch with me. I am always open to discussing new projects, creative ideas or opportunities to be part of your visions.

Download Resume

Resume