Projects

A curated set of projects with clear problem → approach → results.

Dynamic Airlift (NYC Air Quality Pipeline)
Automated pipeline + dashboard for real-time air quality monitoring and trend analysis.
Sep 2024 – Nov 2024
Problem
Make air quality monitoring easier by pulling real-time data, analyzing trends, and visualizing pollutant levels with minimal manual work.
Approach
Built a Python pipeline to fetch and process NYC air quality data (API). Orchestrated automated updates via Airflow and built an interactive dashboard for monitoring and analysis.
Results
Reduced manual effort by 50%+ and provided a real-time view into pollution patterns, correlations, and threshold exceedances.
AirflowETLDashboardsStatisticsAPIs
Stack: Python • Airflow • APIs • Matplotlib • Plotly • Pandas
Stock Price Prediction (Time Series Forecasting)
LSTM-based forecasting pipeline with feature engineering and evaluation.
May 2024 – Jul 2024
Problem
Predict stock price movement using historical data and robust time-series modeling.
Approach
Performed EDA + feature engineering; implemented and tuned LSTM models to capture temporal dependencies; evaluated using appropriate forecasting metrics.
Results
Achieved strong forecasting performance (reported ~85% accuracy) and improved stability through iterative tuning and data-driven features.
LSTMTime SeriesFeature EngineeringForecasting
Stack: Python • TensorFlow/Keras • Pandas • NumPy
Chronic Disease Prediction
ML models + Tableau visuals to identify high-risk demographics and improve interpretability.
Sep 2023 – Nov 2023
Problem
Predict chronic disease risk and communicate results clearly for stakeholders.
Approach
Built predictive models with GridSearchCV and compared algorithms (e.g., KNN, Naive Bayes). Built Tableau dashboards to improve explainability.
Results
Improved model accuracy (reported up to ~90%) and increased stakeholder comprehension by ~35% via better visualization.
GridSearchCVClassificationTableauModel Tuning
Stack: Python • scikit-learn • Tableau
Real-time Sentiment Trading App
Crypto sentiment + transaction automation with reduced latency and scalable deployment.
Jan 2025 – Apr 2025
Problem
Combine sentiment signals with real-time trading flows while keeping latency low and deployment repeatable.
Approach
Built Python services integrating sentiment models and REST APIs. Fine-tuned BERT on AWS EC2 and deployed with SageMaker. Optimized execution and pipeline flow.
Results
Reduced data latency by ~70% and improved deployment speed (8h → 2h).
BERTAWSRESTLatency OptimizationDeployment
Stack: Python • BERT/Transformers • AWS EC2 • SageMaker • REST APIs