AI-Agent-RAG-and-MCP
This project combines an AI agent with Retrieval-Augmented Generation (RAG), pairing information retrieval with text generation. Implemented in Python, it covers data acquisition, processing, and response generation, and is a practical project for developers who want hands-on practice with natural language processing and machine learning.
Context-Aware Assistant
An AI assistant with RAG (Retrieval-Augmented Generation) capabilities, content classification, and external API integration through MCP (Model Context Protocol).
🧠 Built using the ReACT (Reason + Act) framework for agent decision-making.
📽️ Watch the Demo on Loom
Features
- Static RAG: Retrieves relevant information from multiple collections of documents
- Hybrid Search: Combines vector search and keyword search for better results
- Content Classification: Automatically classifies content using zero-shot learning
- External API Integration: Connects to TheSportsDB API for live sports data
- LangChain Integration: Uses LangChain for document loading and text splitting
- Modern Frontend: React-based UI for interacting with the assistant
- Containerized Deployment: Docker and Docker Compose setup for easy deployment
System Architecture
```
┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│  Frontend   │────▶│ Backend API │────▶│  ChromaDB   │
│   (React)   │     │  (FastAPI)  │     │ (Vector DB) │
└─────────────┘     └──────┬──────┘     └─────────────┘
                           │
                           ▼
                    ┌─────────────┐     ┌─────────────┐
                    │ MCP Server  │────▶│ TheSportsDB │
                    │  (FastAPI)  │     │     API     │
                    └─────────────┘     └─────────────┘
```
Project Structure
```
context_aware_assistant/
├── data/                  # Data directories for different collections
│   ├── broadcast_transcripts/
│   ├── production_metadata/
│   ├── technical_docs/
│   └── industry_news/
├── mcp_server/            # MCP server for external API integration
│   └── main.py
├── frontend/              # React frontend
│   ├── src/
│   │   ├── components/
│   │   │   ├── SearchBar.jsx
│   │   │   └── ResultsDisplay.jsx
│   │   ├── App.jsx
│   │   ├── main.jsx
│   │   └── index.css
│   ├── index.html
│   ├── package.json
│   └── vite.config.js
├── whoosh_index/          # Directory for the Whoosh keyword search index
├── ingest.py              # Data processing and embedding script
├── retriever.py           # Retrieval logic for vector and keyword search
├── classifier.py          # Content classification using zero-shot learning
├── agent.py               # Agent logic and LLM integration
├── main.py                # Main backend API
├── requirements.txt       # Python dependencies
├── .env                   # Environment variables
├── Dockerfile.backend     # Dockerfile for the backend
├── Dockerfile.mcp         # Dockerfile for the MCP server
├── Dockerfile.frontend    # Dockerfile for the frontend
└── docker-compose.yml     # Docker Compose configuration
```
Getting Started
Prerequisites
- Python 3.9+
- Node.js 18+
- Docker and Docker Compose (for containerized deployment)
Local Development Setup
Clone the repository:

```bash
git clone <repository-url>
cd context_aware_assistant
```
Set up the Python virtual environment:

**Use a dedicated venv (recommended).** Create a fresh virtual environment in the project root:

```bash
python -m venv .venv
```

Activate it:

```bash
# Windows (PowerShell)
.\.venv\Scripts\Activate.ps1

# Windows (cmd.exe)
.\.venv\Scripts\activate.bat

# macOS/Linux
source .venv/bin/activate
```

Install the requirements:

```bash
pip install --upgrade pip
pip install -r requirements.txt
```
Set up environment variables:

Create a `.env` file in the project root with the following configuration:

```env
# API Keys
SPORTS_DB_API_KEY="3"                      # API key for the TheSportsDB custom MCP server
OPENAI_API_KEY="your_openai_api_key_here"

# Database Configuration
CHROMA_HOST=localhost
CHROMA_PORT=8000

# Server Configuration
MCP_SERVER_PORT=8001
MAIN_SERVER_PORT=8002

# LLM Configuration
OPENAI_MODEL=gpt-3.5-turbo
```

- Replace `your_openai_api_key_here` with your actual OpenAI API key
- Note: the sports data API key `"3"` is already set for the TheSportsDB custom MCP server
Run ChromaDB:

```bash
docker run -d -p 8000:8000 --name chroma_db chromadb/chroma
```
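Optionally, confirm the container is reachable before ingesting. A minimal check with the `chromadb` Python client:

```python
import chromadb

# Connect to the ChromaDB container started above.
client = chromadb.HttpClient(host="localhost", port=8000)

# heartbeat() returns a nanosecond timestamp when the server is healthy.
print(client.heartbeat())
```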
Process and index data:

```bash
python ingest.py
```
Start the MCP server:

```bash
uvicorn mcp_server.main:app --reload --port 8001
```
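For orientation, `mcp_server/main.py` wraps TheSportsDB behind FastAPI routes. The sketch below shows the general shape; the route name and parameter are assumptions, but `searchteams.php` is TheSportsDB's real team-search endpoint:

```python
import os

import httpx
from fastapi import FastAPI

app = FastAPI()
SPORTS_DB_API_KEY = os.getenv("SPORTS_DB_API_KEY", "3")

@app.get("/team")  # hypothetical route; see mcp_server/main.py for the real ones
async def search_team(name: str):
    # Proxy TheSportsDB's team search so the agent never calls it directly.
    url = f"https://www.thesportsdb.com/api/v1/json/{SPORTS_DB_API_KEY}/searchteams.php"
    async with httpx.AsyncClient() as client:
        resp = await client.get(url, params={"t": name})
        resp.raise_for_status()
    return resp.json()
```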
Start the backend API:

```bash
uvicorn main:app --reload --port 8002
```
Set up and run the frontend:

```bash
cd frontend
npm install
npm run dev
```
Docker Deployment
To deploy the entire application using Docker Compose:
```bash
docker-compose up -d
```
This will start all services:
- ChromaDB on port 8000
- MCP Server on port 8001
- Backend API on port 8002
- Frontend on port 80
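The `docker-compose.yml` ties those services together roughly as follows (a sketch rather than the actual file; the service names and build contexts are assumptions based on the Dockerfiles listed in the project structure):

```yaml
services:
  chroma_db:
    image: chromadb/chroma
    ports:
      - "8000:8000"
  mcp_server:
    build:
      context: .
      dockerfile: Dockerfile.mcp
    ports:
      - "8001:8001"
  backend:
    build:
      context: .
      dockerfile: Dockerfile.backend
    ports:
      - "8002:8002"
    depends_on:
      - chroma_db
      - mcp_server
  frontend:
    build:
      context: .
      dockerfile: Dockerfile.frontend
    ports:
      - "80:80"
    depends_on:
      - backend
```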
Usage
- Open your browser and navigate to `http://localhost:3000` (for local development) or `http://localhost` (for Docker deployment)
- Enter your query in the search bar
- The assistant will retrieve relevant information, classify it, fetch live data if needed, and generate a response
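You can also query the backend without the UI. The endpoint path and payload below are assumptions for illustration; check `main.py` for the actual route and schema:

```python
import requests

# Hypothetical endpoint and payload; see main.py for the real route.
resp = requests.post(
    "http://localhost:8002/query",
    json={"query": "Who won the last Arsenal match?"},
    timeout=60,
)
print(resp.json())
```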
Adding Your Own Data
To add your own data to the system:
Place your text files, JSON files, or other supported formats in the appropriate data subdirectory:

- `data/broadcast_transcripts/` for broadcast-related content
- `data/production_metadata/` for production metadata
- `data/technical_docs/` for technical documentation
- `data/industry_news/` for news articles
Run the ingestion script to process and index the data:

```bash
python ingest.py
```
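Conceptually, the ingestion flow is load, split, embed, index. A condensed sketch under assumed choices (`.txt` sources, the `all-MiniLM-L6-v2` sentence-transformers model); the actual `ingest.py` may differ:

```python
import chromadb
from langchain_community.document_loaders import DirectoryLoader, TextLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from sentence_transformers import SentenceTransformer

client = chromadb.HttpClient(host="localhost", port=8000)
embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)

for name in ["broadcast_transcripts", "production_metadata",
             "technical_docs", "industry_news"]:
    docs = DirectoryLoader(f"data/{name}", glob="**/*.txt",
                           loader_cls=TextLoader).load()
    chunks = splitter.split_documents(docs)
    if not chunks:
        continue  # e.g. the placeholder collections that are still empty
    texts = [c.page_content for c in chunks]
    collection = client.get_or_create_collection(name)
    collection.add(
        documents=texts,
        embeddings=embedder.encode(texts).tolist(),
        ids=[f"{name}-{i}" for i in range(len(texts))],
    )
```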
Customization
- OpenAI Model: You can change the OpenAI model in the `.env` file by updating the `OPENAI_MODEL` variable (e.g., `"gpt-4"` for better quality)
- OpenAI Parameters: Adjust `temperature`, `max_tokens`, and other parameters in the `run_query` function in `agent.py`
- Embedding Model: The embedding model can be changed in `ingest.py` and `retriever.py`
- Classification Labels: Update the `DEFAULT_LABELS` in `classifier.py` to customize content classification (see the sketch after this list)
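For reference, zero-shot classification works along these lines; the model and labels here are illustrative assumptions, and the real `DEFAULT_LABELS` live in `classifier.py`:

```python
from transformers import pipeline

# Hypothetical labels; the real ones are DEFAULT_LABELS in classifier.py.
DEFAULT_LABELS = ["sports", "broadcast production", "technical", "news"]

classifier = pipeline("zero-shot-classification",
                      model="facebook/bart-large-mnli")

result = classifier("The new encoder firmware fixes the HDR handshake bug.",
                    candidate_labels=DEFAULT_LABELS)
print(result["labels"][0], result["scores"][0])  # top label and its score
```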
Implementation Plan
The project follows a phased implementation approach:
Phase 1: Foundation & Data Preparation (Static RAG)
- Project setup with Python environment and dependencies
- ChromaDB setup for vector database
- Data collection and preparation for the four collections
- Data processing and embedding using sentence transformers
- Indexing data into ChromaDB
Phase 2: Core Service Development (MCP & Retrieval)
- MCP server development for sports data integration
- Retrieval logic implementation with vector search
- Hybrid retrieval with Whoosh for keyword search fallback (see the sketch after this list)
- Content classification implementation
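To make the hybrid step concrete, here is a simplified sketch of merging vector hits from ChromaDB with keyword hits from the Whoosh index; the `content` field name and the merge policy are assumptions, and the real logic is in `retriever.py`:

```python
import chromadb
from whoosh import index
from whoosh.qparser import QueryParser

def hybrid_search(query: str, collection_name: str, k: int = 5) -> list[str]:
    # Vector search against ChromaDB.
    client = chromadb.HttpClient(host="localhost", port=8000)
    collection = client.get_collection(collection_name)
    hits = collection.query(query_texts=[query], n_results=k)
    docs = list(hits["documents"][0])

    # Keyword fallback via the Whoosh index (a stored "content" field is assumed).
    ix = index.open_dir("whoosh_index")
    with ix.searcher() as searcher:
        parsed = QueryParser("content", ix.schema).parse(query)
        for hit in searcher.search(parsed, limit=k):
            if hit["content"] not in docs:
                docs.append(hit["content"])
    return docs
```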
Phase 3: Agent Logic & LLM Integration
- Agent logic design and implementation
- LLM setup and integration
- Synthesis step implementation for generating responses (sketched below)
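The synthesis step hands the retrieved context to the LLM. A minimal sketch with the OpenAI client; the prompt wording and function shape are assumptions, and the project's actual logic lives in `run_query` in `agent.py`:

```python
import os

from openai import OpenAI

client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

def synthesize(query: str, context_chunks: list[str]) -> str:
    # Ground the answer in the retrieved chunks (the RAG synthesis step).
    context = "\n\n".join(context_chunks)
    response = client.chat.completions.create(
        model=os.getenv("OPENAI_MODEL", "gpt-3.5-turbo"),
        temperature=0.2,
        messages=[
            {"role": "system",
             "content": "Answer using only the provided context."},
            {"role": "user",
             "content": f"Context:\n{context}\n\nQuestion: {query}"},
        ],
    )
    return response.choices[0].message.content
```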
Phase 4: API & Frontend
- Backend API development with FastAPI
- React frontend development
- Connection between frontend and backend
Phase 5: Deployment & Observability
- Dockerization of services
- Docker Compose setup
- Logging implementation (sketched after this list)
- Metrics collection (optional)
- Testing
- Kubernetes deployment (optional)
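For the logging item, something as small as the standard-library setup below is enough to start with; this is illustrative, not the project's actual configuration:

```python
import logging

# Shared logging setup for the FastAPI services (illustrative).
logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(name)s - %(message)s",
)
logger = logging.getLogger("context_aware_assistant")
logger.info("service started")
```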
Note: Due to time constraints, no actual data was used for the production metadata or industry news collections. The system is set up to handle these data types, but they are currently empty placeholders.