Skip to content

harshhh28/hia

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

80 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

🩺 HIA (Health Insights Agent)

AI Agent to analyze blood reports and provide detailed health insights.

Features | Tech Stack | Installation | Project Structure | Contributing | Author

Usage Demo

🌟 Features

  • Agent-based architecture
    • Analysis Agent: Report analysis with in-context learning from previous analyses and a built-in knowledge base
    • Chat Agent: RAG-powered follow-up Q&A over your report (FAISS + HuggingFace embeddings)
  • Multi-model cascade via Groq with automatic fallback (primary β†’ secondary β†’ tertiary β†’ fallback)
  • Chat sessions: Create multiple analysis sessions; each session stores report, analysis, and follow-up messages in Supabase
  • Report sources: Upload your own PDF or use the built-in sample report for quick testing
  • PDF handling: Upload up to 20MB, max 50 pages; validation for file type and medical-report content
  • Daily analysis limit: Configurable cap (default 15/day) with countdown in the sidebar
  • Secure auth: Supabase Auth (sign up / sign in), session validation, and configurable session timeout
  • Session history: View, switch, and delete past sessions; report text persisted for follow-up chat across reloads
  • Modern UI: Responsive Streamlit app with sidebar session list, user greeting, and real-time feedback

πŸ› οΈ Tech Stack

  • Frontend: Streamlit (1.42+)
  • AI / LLM
    • Report analysis: Groq with multi-model fallback via ModelManager
      • Primary: meta-llama/llama-4-maverick-17b-128e-instruct
      • Secondary: llama-3.3-70b-versatile
      • Tertiary: llama-3.1-8b-instant
      • Fallback: llama3-70b-8192
    • Follow-up chat: RAG with LangChain, HuggingFace embeddings (all-MiniLM-L6-v2), FAISS vector store, and Groq (llama-3.3-70b-versatile)
  • Database: Supabase (PostgreSQL)
    • Tables: users, chat_sessions, chat_messages
  • Auth: Supabase Auth, Gotrue
  • PDF: PDFPlumber (text extraction), filetype (file validation)
  • Libraries: LangChain, LangChain Community, LangChain HuggingFace, LangChain Text Splitters, sentence-transformers, FAISS (CPU)

πŸš€ Installation

Requirements πŸ“‹

  • Python 3.8+
  • Streamlit 1.42+
  • Supabase account
  • Groq API key
  • PDFPlumber, filetype

Getting Started πŸ“

  1. Clone the repository:
git clone https://github.com/harshhh28/hia.git
cd hia
  1. Install dependencies:
pip install -r requirements.txt
  1. Required environment variables (in .streamlit/secrets.toml):
SUPABASE_URL = "your-supabase-url"
SUPABASE_KEY = "your-supabase-key"
GROQ_API_KEY = "your-groq-api-key"
  1. Set up Supabase database schema:

The application uses three tables: users, chat_sessions, and chat_messages. Use the SQL script at public/db/script.sql to create them.

database schema

(You can turn off email confirmation on signup in Supabase: Authentication β†’ Providers β†’ Email β†’ Confirm email.)

  1. Run the application:
streamlit run src\main.py

πŸ“ Project Structure

hia/
β”œβ”€β”€ requirements.txt
β”œβ”€β”€ README.md
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ main.py                 # Application entry point; chat UI and session flow
β”‚   β”œβ”€β”€ auth/
β”‚   β”‚   β”œβ”€β”€ auth_service.py     # Supabase auth, sessions, chat message persistence
β”‚   β”‚   └── session_manager.py # Session init, timeout, create/delete chat sessions
β”‚   β”œβ”€β”€ components/
β”‚   β”‚   β”œβ”€β”€ analysis_form.py    # Report source (upload/sample), patient form, analysis trigger
β”‚   β”‚   β”œβ”€β”€ auth_pages.py       # Login / signup pages
β”‚   β”‚   β”œβ”€β”€ footer.py           # Footer component
β”‚   β”‚   β”œβ”€β”€ header.py           # User greeting
β”‚   β”‚   └── sidebar.py          # Session list, new session, daily limit, logout
β”‚   β”œβ”€β”€ config/
β”‚   β”‚   β”œβ”€β”€ app_config.py       # App name, limits (upload, pages, analysis, timeout)
β”‚   β”‚   β”œβ”€β”€ prompts.py          # Specialist prompts for report analysis
β”‚   β”‚   └── sample_data.py      # Sample blood report for "Use Sample PDF"
β”‚   β”œβ”€β”€ services/
β”‚   β”‚   └── ai_service.py       # Analysis + chat entry points; vector store caching
β”‚   β”œβ”€β”€ agents/
β”‚   β”‚   β”œβ”€β”€ analysis_agent.py   # Report analysis, rate limits, knowledge base, in-context learning
β”‚   β”‚   β”œβ”€β”€ chat_agent.py       # RAG pipeline (embeddings, FAISS, query contextualization)
β”‚   β”‚   └── model_manager.py   # Groq multi-model cascade and fallback
β”‚   └── utils/
β”‚       β”œβ”€β”€ validators.py       # Email, password, PDF file and content validation
β”‚       └── pdf_extractor.py   # PDF text extraction and validation
β”œβ”€β”€ public/
β”‚   └── db/
β”‚       β”œβ”€β”€ script.sql          # Supabase schema (users, chat_sessions, chat_messages)
β”‚       └── schema.png          # Schema diagram

πŸ‘₯ Contributing

Contributions are welcome! Please read our Contributing Guidelines for details on how to submit pull requests, the development workflow, coding standards, and more.

We appreciate all contributions, from reporting bugs and improving documentation to implementing new features.

πŸ‘¨β€πŸ’» Contributors

Thanks to all the amazing contributors who have helped improve this project!

Avatar Name GitHub Role Contributions PR(s) Notes
harshhh28 avatar Harsh Gajjar harshhh28 Project Creator & Maintainer Core implementation, Documentation N/A Lead Developer
gaurav98095 avatar Gaurav gaurav98095 Contributor DB Schema, bugs #1, #5, #6, #7 Database Design, bugs

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™‹β€β™‚οΈ Author

Created by Harsh Gajjar

About

Hia (Health Insights Agent) - AI Agent to analyze blood reports and provide detailed health insights.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages