Anish Dahiya
Data Scientist • Creator

Designing intelligent systems that move people forward.

I'm Anish Dahiya, a data scientist who builds practical AI products with fast-moving teams and documents the process for other builders. I help organizations move from prototype to production through pragmatic model design, robust data pipelines, and repeatable shipping patterns.

Featured Work

AI products that create measurable leverage

From experiment to production, I help teams launch data products that improve revenue, retention, and trust.

2025

Document Theme Identifier

LLM-powered research chatbot that ingests 75+ documents, runs OCR and embeddings, and answers queries with citation-backed summaries.

RAG • NLP • LLM
2023

Diamond Price Prediction

Predictive pricing engine that cleans diamond attribute data, trains ensemble regressors, and serves predictions through a Flask web app.

Regression • Flask • Pricing
2022

Parkinson's Disease Prediction

Biomedical voice analytics pipeline that standardizes acoustic biomarkers and trains interpretable classifiers for early Parkinson's screening.

Healthcare AI • Classification • Logistic Regression
2024

Acoustic Keyboard Detection

Deep learning system that identifies keyboard keystrokes from audio using MFCC features and a custom 1D-CNN, shipped with a Streamlit UI for real-time inference.

Audio AI • Deep Learning • Streamlit
2024

Wafer Fault Detection

End-to-end ML workflow that validates 590-sensor wafer batches, clusters signals, and selects the best Random Forest/XGBoost model per cluster.

Anomaly Detection • MLOps • AWS
Journey

Moments that shaped my craft

Every chapter blends strategy, storytelling, and system design.

  • Apr 2025 – Present

    Data Scientist, Applied AI

    Joined a fast-moving ML team to ship production-ready models, tighten evaluation loops, and translate research spikes into real user impact.

  • 2024

    AI internships + capstone

    Split time between research internships and my final-year project, hardening MLOps pipelines and documenting lessons for the next cohort.

  • 2023

    Bus Congestion Prediction (DIMTS)

    Built a congestion prediction model for the Delhi Integrated Multi-Modal Transit System to forecast bus load and improve scheduling.

  • 2022–2024

    SME — Computer Science (Chegg)

    Solved curriculum-aligned problems and authored explanations for CS learners while pursuing my undergraduate degree.

  • 2021–2025

    B.Tech CSE (AIML), Chandigarh University

    Specialized in Artificial Intelligence & Machine Learning with hands-on projects across CV, NLP, and forecasting.