I'm a Data Engineer & Senior Data Analyst with 6+ years of experience designing scalable ETL/ELT pipelines, building modern data lakehouse architectures, and implementing cloud-based data platforms.
My expertise spans GCP, AWS, Microsoft Fabric, and large-scale analytics engineering.
I am passionate about solving data quality challenges, automating workflows, and building reliable, high-performing data systems.
💻 Programming: Python • SQL • Unix/Shell
☁️ Cloud & Data Platforms: Google Cloud (BigQuery, Pub/Sub, Dataproc, Dataflow) • AWS (S3, EMR, Glue, Athena, Redshift) • Microsoft Fabric • Snowflake • SSIS • Alteryx
🗄️ Databases: MySQL • PostgreSQL • SQL Server • Teradata • MongoDB
📊 Reporting / Visualization: Power BI • Tableau • SSRS • MS Excel • Matplotlib • Plotly • Seaborn
⚙️ Frameworks & Tools: Apache Spark • dbt • Airflow • Hadoop • Flask • Soda • Great Expectations
🚀 CI/CD & DevOps: GitHub • GitHub Actions • Docker • Kubernetes • Terraform
- DataEngineering-portfolio – End-to-end cloud ETL/ELT architectures, pipeline designs, and reusable components
- Airflow Retail Pipeline (BigQuery + dbt + Soda) – Retail analytics pipeline with automated data quality
- dashboard-portfolio – Power BI, Excel & Tableau dashboards
- Impact of Covid-19 on Digital Learning – Storytelling visualization project using Python, Matplotlib, and Seaborn
- Clustering Profiles using Data Quality Metrics – ML-based profiling for assessing data trust
- Improving Data Quality Metrics – Framework covering completeness, validity, and uniqueness
- Case Study on Data Governance – Governance policies, lineage models, and stewardship
- CI-CD-portfolio – GitHub Actions, Docker, and Kubernetes automation for pipelines
- Big Mart Sales Prediction – Regression models with feature engineering
- 🏅 Google Cloud Professional Data Engineer
- 🏅 Microsoft PL-300: Power BI Data Analyst
- 🏅 Microsoft DP-600: Fabric Analytics Engineer
Let’s collaborate on Data Engineering, Cloud, Data Quality, and Modern Analytics projects!