ai-assistant

Name: ai-assistant
Availability: InStock
Author: dchayes27

Local AI Assistant with Voice Capabilities - MCP Integration, Ollama LLM, Whisper STT, and Real-time Streaming

GitHub Website Docs

GitHub Stars

User Rating

Not Rated

Favorites

Views

Forks

Issues

README

Local AI Assistant

A comprehensive local AI assistant with voice capabilities, persistent memory, and MCP (Model Context Protocol) integration.

Overview

This project implements a fully local AI assistant that can:

Process voice input using OpenAI Whisper
Generate responses using Ollama-hosted language models
Speak responses using text-to-speech
Maintain conversation history and context
Integrate with external tools via MCP servers
Provide both GUI and API interfaces

Architecture

Core Components

core/: Main application logic and AI model integration
- Ollama LLM integration
- Whisper speech-to-text processing
- TTS (Text-to-Speech) engine
- Conversation management
memory/: Persistent storage and context management
- SQLite database for conversation history
- Vector storage for semantic search
- Context retrieval and management
mcp_server/: Model Context Protocol server
- Tool integration framework
- External service connections
- Custom tool implementations
gui/: User interface components
- Gradio-based web interface
- Voice input/output controls
- Conversation display
scripts/: Utility and setup scripts
- Installation helpers
- Model download scripts
- Database initialization

Requirements

Python 3.9+
Ollama (for local LLM hosting)
FFmpeg (for audio processing)
SQLite3

Installation

Clone this repository
Install Python dependencies:
```
pip install -r requirements.txt
```
Install Ollama and download your preferred models
Run the setup script:
```
python scripts/setup.py
```

Usage

Start the API server:

python -m core.main

Launch the GUI:

python -m gui.app

Run as MCP server:

python -m mcp_server.server

Configuration

Configuration is managed through environment variables and config files. See config.example.yaml for available options.

Development

Project Structure

ai-assistant/
├── core/
│   ├── __init__.py
│   ├── main.py
│   ├── llm.py
│   ├── speech.py
│   └── tts.py
├── memory/
│   ├── __init__.py
│   ├── database.py
│   ├── context.py
│   └── models.py
├── mcp_server/
│   ├── __init__.py
│   ├── server.py
│   └── tools.py
├── gui/
│   ├── __init__.py
│   ├── app.py
│   └── components.py
├── scripts/
│   ├── setup.py
│   └── download_models.py
├── tests/
├── requirements.txt
├── README.md
└── .gitignore

License

MIT License - See LICENSE file for details

Author Information

dchayes27

GitHub

Followers

Repositories

Gists

Total Contributions

Related MCPs

mem0

41755

Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python

chatgpt-on-wechat

39485

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

Python

PDFMathTranslate

29531

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/MCP/Docker/Zotero

Python