VecMed-MCP

Name: VecMed-MCP
Availability: InStock
Author: David Qu

VecMed-MCP is a Jupyter Notebook project aimed at establishing a Milvus vector database. It includes functionalities for setting up the database schema, downloading PubMed data, testing search functionalities, and experimenting with summarizing search results. The use of Docker simplifies the environment setup, making data management efficient.

GitHub

GitHub Stars

User Rating

Not Rated

Favorites

Views

Forks

Issues

README

VecMed-MCP: Milvus Vector Database for Medical Data

Project Overview

47c3a52e549d46a9bbdee82e38fe4b79~tplv-5jbd59dj06-image

This repository provides tools to establish and manage a Milvus vector database for medical data, specifically designed for rare disease research. It includes scripts for database initialization, data ingestion, search functionality, LLM-based result summarization, and scheduled updates.

File Descriptions

File Name	Purpose
`docker-compose.yml`	Used to initialize MilvusDB
`setup_milvus.py`	Formulates the database schema
`download_pubmed_tomilvusdb.py`	Handles timely updates (currently set to 30 days)
`search_milvusdb.py`	Tests search functionality in MilvusDB
`llm_process_search_result.py`	Experiments with summarizing search results using LLM
`updating_milvusdb.log`	Records database updating operations

ATTU WebUI

The ATTU WebUI provides a visual interface to:

View all database records
Manage collections and schemas
No authentication required

Current Collection: pubmed_rare_disease_db contains over 160,000 PubMed records related to rare diseases.

Environment Configuration

Launch Milvus with Docker Compose

docker compose up -d

Install Required Dependencies

pip install marshmallow==3.20.1 -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install Flask -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install pymilvus -U -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install "mcp[cli]" -i https://pypi.tuna.tsinghua.edu.cn/simple

Set Up ATTU WebUI

docker pull zilliz/attu:v2.4.4
docker run -d --name attu -p 8000:3000 -e MILVUS_URL=192.168.10.199:19530 zilliz/attu:v2.4.4

Workflow

The general workflow can be adapted for various database types beyond medical articles:

Build your custom Milvus database with a designed collection schema using the main folder code (steps 1-5)
Download required data and store it in your vector database (see download_pubmed_2015-2025 subfolder)
Launch the MCP server with HTTP API service or modify to use stdin transport (see pubmed-mcp-server subfolder)
Set up timer-based database updates using download_pubmed_to_milvusdb_2.py
Integrate the MCP server into your agent/LLM/workflow (example integration with Dify workflow provided)

Scheduled Database Updates

Add a New Cron Job

crontab -e

View Existing Cron Jobs

crontab -l

Author

David Qu
Undergraduate Researcher | AI Algorithm Engineer
University of Toronto Scarborough - Department of Computer Science
📧 davidsz.qu@mail.utoronto.ca

Author Information

David Qu

Undergraduate AI Researcher/Engineer, interested in LLM and CV.

Toronto, Canada

GitHub

Followers

Repositories

Gists

Total Contributions