Dark Mode

Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
ashutoshsinha25
Follow
Working from home

Ashutosh Sinha ashutoshsinha25

Working from home
Data Scientist at CDM Smith

Block or report ashutoshsinha25

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user's behavior. Learn more about reporting abuse.

Report abuse
ashutoshsinha25/README.md

Hi, I'm Ashutosh Sinha

Data Scientist | Generative AI Engineer | MLOps Enthusiast


About Me

I'm a Data Scientist (3+ yrs) experienced in building and deploying AI solutions that drive measurable business value.
Currently working at CDM Smith (Engineering & Construction) where I'm building Generative AI systems, orchestrating LLM pipelines using LangGraph, and developing RAG-based internal tools to automate schedule generation and knowledge retrieval.

I enjoy working at the intersection of Machine Learning, MLOps, and Scalable Systems, transforming ideas into end-to-end production-ready solutions.


Core Competencies

  • Generative AI: RAG, LangChain, LangGraph, Prompt Engineering, Azure OpenAI
  • Machine Learning: Supervised/Unsupervised, NLP, CNNs, Time Series, XAI (SHAP, LIME)
  • MLOps: FastAPI, Docker, MLflow, GitHub Actions, CI/CD
  • Data Engineering: Snowflake, BigQuery, Azure, AWS
  • Visualization & BI: Power BI, Tableau, Matplotlib, Seaborn
  • Languages: Python, SQL, Bash
  • Frameworks: TensorFlow, PyTorch, Scikit-learn, Pyspark

Highlight Projects

Generative AI Platform @ CDM Smith

  • Built a RAG-based LLM platform reducing schedule generation time from 30 days - 5 minutes.
  • Developed FastAPI backend, integrated Azure OpenAI models (GPT-4.1, 4o, o3) with stateful workflows via LangGraph.
  • Added model comparison, intent detection, and Azure Application Insights logging.

Competitive Intelligence Dashboard

  • Automated analysis of 50+ competitors using Azure Document Intelligence and Power BI.
  • Parsed PDFs and XBRL filings with Python to generate actionable insights.

DriveChurn Prediction (Deployed on AWS)

  • Achieved 0.95 F1-score using LightGBM with EDA, feature engineering, and model tuning.
  • Built CI/CD pipelines via GitHub Actions, deployed with Flask, Docker, AWS ECS, and Postman validation.
    View Project

Tech Stack Overview

Programming & ML

Cloud & MLOps

Visualization


Current Focus

  • Expanding LangGraph expertise for agentic orchestration of LLM pipelines.
  • Building domain-specific LLM evaluation frameworks (BLEU, ROUGE, METEOR, BERTScore).
  • Exploring autonomous LLM agents for knowledge extraction and workflow automation.

GitHub Stats


Let's Connect

ashutoshsinha519@gmail.com
LinkedIn
Portfolio / Projects


"Turning data into decisions and AI into impact."

Pinned Loading

  1. InsightLoop InsightLoop Public

    AI-Powered Feedback & Insight Engine for Conversations

    Python

  2. LLM-GenAI LLM-GenAI Public

    Repo containing notebooks related to generativeAI and LLM's, its implementations and projects

    Jupyter Notebook