Skip to content
View DevyaniD19's full-sized avatar

Block or report DevyaniD19

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DevyaniD19/README.md

Hi, I'm Devyani Deore 👋

Data Engineer · ML Practitioner

LinkedIn Email GitHub


👩‍💻 About Me

I'm a Data Engineer with 3+ years of experience building scalable data pipelines, working with large-scale datasets, and developing data-driven solutions. I'm passionate about the intersection of data engineering and machine learning — from designing ETL workflows to fine-tuning transformer models.

  • 🎓 University of Michigan
  • 📍 Michigan, USA
  • 🔭 Currently working on: NLP & ML projects to sharpen my deep learning skills
  • 🌱 Learning: Large Language Models, MLOps, and cloud-native data stacks
  • 💬 Ask me about: Python, SQL, data pipelines, Spark, and ML workflows

🛠️ Tech Stack

Languages & Data

Python SQL Bash

Machine Learning & NLP

PyTorch HuggingFace scikit-learn Pandas NumPy

Data Engineering

Apache Spark Airflow dbt

Tools & Platforms

Git Jupyter VS Code


🚀 Featured Projects

Project Description Stack
🤖 BERT Sentiment Analysis Fine-tuned bert-base-uncased for airline tweet complaint detection — outperforms TF-IDF + Naive Bayes baseline PyTorch · HuggingFace · NLP
🛡️ Spam NLP Detector SMS spam classification pipeline using TF-IDF and Multinomial Naive Bayes on UCI dataset scikit-learn · NLP · Python
🗺️ Customer Journey ML ML-based stage prediction across 5 funnel stages using Logistic Regression, Random Forest & Gradient Boosting Python · scikit-learn · ML
📊 User Retention Toolkit Python toolkit for user behavior analytics and retention analysis from event log data Python · Data Engineering
🚗 Uber Ride Analysis Exploratory data analysis on Uber ride patterns, demand forecasting, and pricing trends Pandas · Matplotlib · EDA
🛒 Amazon Recommender Collaborative filtering-based product recommendation system on Amazon review data Python · Recommender Systems

📈 GitHub Stats

Devyani's GitHub Stats    Top Languages


Open to Data Engineering and ML roles — feel free to reach out!

Pinned Loading

  1. Bert-for-sentiment-analysis Bert-for-sentiment-analysis Public

    Fine-tuning BERT for tweet sentiment analysis — complaint detection on airline tweets using PyTorch and Hugging Face Transformers

    Jupyter Notebook

  2. customer-journey-ml customer-journey-ml Public

    Customer journey mapping and ML-based stage prediction across 5 funnel stages using Logistic Regression, Random Forest, and Gradient Boosting

    Python

  3. detect-spam-nlp detect-spam-nlp Public

    NLP pipeline for SMS spam detection using TF-IDF and Multinomial Naive Bayes on the UCI SMS Spam Collection dataset

    Jupyter Notebook

  4. Heart-Stroke-Prediction-Using-Quantitative-Data Heart-Stroke-Prediction-Using-Quantitative-Data Public

    Stroke risk prediction using Random Forest, XGBoost, SVM & Logistic Regression with SMOTE for class imbalance

    Jupyter Notebook

  5. MNC-Stock-Market-Price-Prediction MNC-Stock-Market-Price-Prediction Public

    IBM stock price forecasting using Random Forest, LSTM (4-layer), and ARIMA models with Python

    Jupyter Notebook

  6. retentioneering-tools retentioneering-tools Public

    A simplified Python toolkit for user behavior analytics from event log data

    Python