← Back to homepage
Applied AI for Research
Self-Hosted Bioinformatics AI / Internal RAG System
A local research assistant built around retrieval-augmented generation for curated scientific content, with a focus on internal use rather than generic chatbot behavior.
MSDS Capstone Project
Project Lead
January 2025 - May 2025
What it does
The goal was to build a lab-oriented assistant that could retrieve from curated scientific material locally, rather than rely on a generic external chat workflow. That meant prioritizing ingestion, chunking, indexing, retrieval quality, and practical usability for research contexts.
It was designed as a tool for controlled research environments where curated corpora and offline or local workflows can matter more than broad general-purpose chat behavior.
Core stack
- Document ingestion and text chunking
- Sentence-transformer embeddings
- FAISS vector indexing
- Retrieval workflows
- Streamlit interface
What I built
- Led the project concept and a substantial share of implementation
- Built ingestion and chunking workflows for curated scientific material
- Implemented vector indexing and retrieval logic for lab-facing querying
- Helped shape the project around practical internal research use
Availability
This was an MSDS capstone project and is summarized here as representative work. The page focuses on the architecture and intent rather than exposing a raw code dump.