Hello, I'm

Mithil

Data Science undergraduate building RAG systems, NLP pipelines, and scalable data applications.

I am a Data Science undergraduate specializing in Retrieval-Augmented Generation (RAG), NLP systems, and backend AI infrastructure. I build retrieval pipelines, evaluation frameworks, and low-latency AI applications focused on measurable retrieval quality and production reliability.

Python, SQL, FastAPI, PyTorch, RAG, Docker, PostgreSQL

Work History

Analytics Intern — RAG & Data Analytics

Star Health and Allied Insurance

  • Built analytics pipelines over FY24 to FY25 insurance data using hybrid retrieval and FastAPI serving for scalable market analysis
  • Developed a hybrid RAG pipeline using LangChain and ChromaDB, integrating structured datasets with analytical documents for natural language querying
  • Improved retrieval quality using BM25 and dense vector retrieval with reranking and metadata filtering across internal evaluation workflows
  • Reduced end-to-end latency from 2.4s to 650ms through query routing, context compression, and prompt optimization, while reducing token usage by 42%
  • Evaluated retrieval and answer quality using RAGAS and manual review workflows across a 60-query benchmark
Python, LangChain, RAG, ChromaDB, Sentence-Transformers, Hybrid Search, Streamlit
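
The hybrid retrieval described above combines a keyword ranking (BM25) with a dense vector ranking. One common way to merge two such rankings is reciprocal rank fusion. The sketch below is illustrative only: the source does not say which fusion method was used, and the function name, the document ids, and the constant k=60 are assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into a single ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default (assumption).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical outputs of a BM25 retriever and a dense retriever.
bm25_ranking = ["doc_a", "doc_b", "doc_c"]
dense_ranking = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
print(fused)
```

Documents that appear near the top of both lists rise in the fused ranking, which is why a reranker or metadata filter is usually applied after this step rather than before it.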

Featured Work

Aurora RAG Chatbot

RAG system for real-time event queries deployed during a university event serving 400+ attendees. Reduced repeated-query latency from 4.2s to sub-20ms using multi-tier caching and semantic cache reuse. Led a 6-member development team across retrieval pipeline development, deployment, and testing.

Python, FastAPI, Redis, ChromaDB, Groq, Docker
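
As a rough illustration of multi-tier caching with semantic reuse, the sketch below checks an exact-match tier first and then falls back to a similarity search over cached queries. It is not the project's implementation: in-memory dicts stand in for Redis, `toy_embed` is a hypothetical word-count stand-in for a real embedding model, and the 0.8 threshold and the sample question and answer are made up for the example.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TwoTierCache:
    def __init__(self, embed=toy_embed, threshold=0.8):
        self.embed = embed
        self.threshold = threshold  # assumed similarity cutoff
        self.exact = {}             # tier 1: normalized query -> answer
        self.semantic = []          # tier 2: (embedding, answer) pairs

    @staticmethod
    def _normalize(query):
        return " ".join(query.lower().split())

    def get(self, query):
        key = self._normalize(query)
        # Tier 1: cheap exact lookup on the normalized query string.
        if key in self.exact:
            return self.exact[key], "exact"
        # Tier 2: reuse the answer of the most similar cached query.
        q = self.embed(key)
        best_score, best_answer = 0.0, None
        for emb, answer in self.semantic:
            score = cosine(q, emb)
            if score > best_score:
                best_score, best_answer = score, answer
        if best_answer is not None and best_score >= self.threshold:
            return best_answer, "semantic"
        return None, "miss"  # caller would run the full RAG pipeline here

    def put(self, query, answer):
        key = self._normalize(query)
        self.exact[key] = answer
        self.semantic.append((self.embed(key), answer))


cache = TwoTierCache()
cache.put("when does the keynote start", "10:00 AM")  # made-up data
print(cache.get("When does the  keynote start"))       # exact tier
print(cache.get("when does the keynote start today"))  # semantic tier
print(cache.get("where is the food court"))            # miss
```

The linear scan in the semantic tier is only reasonable for a small cache; a vector index would be the usual choice at larger scale.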

AI Cloud Drive

Built a self-hosted cloud storage system with an integrated RAG pipeline for querying technical PDFs. The retrieval strategy combines hybrid search, reranking, and context sufficiency checks to improve retrieval reliability and context grounding. Features asynchronous document ingestion, citation tracking, and context validation pipelines. Processed and indexed 1,000+ document chunks.

Python, FastAPI, Docker, Groq, ChromaDB
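
A context sufficiency check can be as simple as refusing to generate when too few retrieved chunks score well. The sketch below shows one possible form under stated assumptions: the chunk dictionaries, the `score` field, both thresholds, and the abstain message are hypothetical and not taken from the project.

```python
ABSTAIN_MESSAGE = "Not enough supporting context was found to answer."


def is_context_sufficient(chunks, min_score=0.5, min_chunks=2):
    """Return True when at least `min_chunks` chunks score >= `min_score`."""
    strong = [c for c in chunks if c["score"] >= min_score]
    return len(strong) >= min_chunks


def answer_or_abstain(chunks, generate):
    """Call `generate` only when the retrieved context passes the check."""
    if not is_context_sufficient(chunks):
        return ABSTAIN_MESSAGE
    return generate(chunks)


# Hypothetical reranker output: each chunk carries a relevance score.
good_chunks = [{"score": 0.9}, {"score": 0.7}, {"score": 0.2}]
weak_chunks = [{"score": 0.9}, {"score": 0.1}]


def fake_generate(chunks):
    # Placeholder for an LLM call; reports how many chunks it received.
    return f"answer grounded in {len(chunks)} chunks"


print(answer_or_abstain(good_chunks, fake_generate))
print(answer_or_abstain(weak_chunks, fake_generate))
```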

Open Source Contributions

@mithil27360

My neurons use RAG too – Retrieve And Guess 🧠

Open Source Highlight

Contributor to Keras: pull request addressing Apple Silicon MPS backend issues, reviewed and merged by François Chollet

2024 - 2028

Bachelor of Technology

Data Science & Engineering

Manipal Institute of Technology

Data Analytics & Visualization, Object-Oriented Programming, Database Systems, Data Structures

Clubs & Organizations

The Data Alchemists

Management Committee Member, Aug 2025 - Present

ISTE Manipal

Working Committee, Oct 2024 - Jun 2025
Management Committee, Jul 2025 - Present

Manipal Open Source Society

Management Committee Member, Aug 2025 - Present

AWS Cloud Club

Management Committee Member, Dec 2025 - Present

Finova, MIT Manipal

Working Committee, Nov 2024 - Jun 2025
Management Committee, Jul 2025 - Present

Chords & Co.

HR Manager, Aug 2025 - Present

Open to Collaboration

Around applied projects, research, and systems.
