I'm a Data Engineer with 3+ years of experience building scalable data pipelines, working with large-scale datasets, and developing data-driven solutions. I'm passionate about the intersection of data engineering and machine learning — from designing ETL workflows to fine-tuning transformer models.
- 🎓 University of Michigan
- 📍 Michigan, USA
- 🔭 Currently working on: NLP & ML projects to sharpen my deep learning skills
- 🌱 Learning: Large Language Models, MLOps, and cloud-native data stacks
- 💬 Ask me about: Python, SQL, data pipelines, Spark, and ML workflows
Languages & Data
Machine Learning & NLP
Data Engineering
Tools & Platforms
| Project | Description | Stack |
|---|---|---|
| 🤖 BERT Sentiment Analysis | Fine-tuned bert-base-uncased for airline tweet complaint detection — outperforms TF-IDF + Naive Bayes baseline |
PyTorch · HuggingFace · NLP |
| 🛡️ Spam NLP Detector | SMS spam classification pipeline using TF-IDF and Multinomial Naive Bayes on UCI dataset | scikit-learn · NLP · Python |
| 🗺️ Customer Journey ML | ML-based stage prediction across 5 funnel stages using Logistic Regression, Random Forest & Gradient Boosting | Python · scikit-learn · ML |
| 📊 User Retention Toolkit | Python toolkit for user behavior analytics and retention analysis from event log data | Python · Data Engineering |
| 🚗 Uber Ride Analysis | Exploratory data analysis on Uber ride patterns, demand forecasting, and pricing trends | Pandas · Matplotlib · EDA |
| 🛒 Amazon Recommender | Collaborative filtering-based product recommendation system on Amazon review data | Python · Recommender Systems |
Open to Data Engineering and ML roles — feel free to reach out!