GitHubスター
0
ユーザー評価
未評価
フォーク
0
イシュー
0
閲覧数
1
お気に入り
0
Web Automation MCP Server
An MCP (Model Context Protocol) server that enables Claude Code CLI to interact with Cloud Desktop, ChatGPT, and Gemini web interfaces through browser automation.
Features
- Multi-Service Support: Interact with Cloud Desktop, ChatGPT, and Gemini
- Browser Automation: Uses Playwright for reliable web automation
- Session Persistence: Maintains browser sessions across requests
- Response Extraction: Automatically extracts and returns responses
- Conversation Management: Clear conversation history when needed
Installation
- Clone the repository:
cd ~/web-automation-mcp
- Install dependencies:
npm install
- Build the project:
npm run build
Configuration
Add the server to your Claude MCP settings (~/Library/Application Support/Claude/claude_desktop_config.json
):
{
"mcpServers": {
"web-automation": {
"command": "node",
"args": ["/Users/YOUR_USERNAME/web-automation-mcp/dist/index.js"]
}
}
}
Usage
Natural Language Support
The MCP server now supports natural language queries through the web_automation_query
tool:
# Ask ChatGPT a question
"Ask ChatGPT: What is quantum computing?"
# Send to Gemini
"Send to Gemini: Explain machine learning"
# Clear conversation
"Clear ChatGPT conversation"
# Get latest response
"Get the latest response from Gemini"
# Browser control
"Initialize browser in visible mode"
"Close browser"
Available Tools
web_automation_query - Process natural language queries
query
: string - Natural language description of what you want to do
initialize_browser - Initialize the browser
headless
: boolean (default: false) - Run in headless mode
send_to_service - Send input to a service and get response
service
: "clouddesktop" | "chatgpt" | "gemini"content
: string - The message to send
get_latest_response - Get the latest response from a service
service
: "clouddesktop" | "chatgpt" | "gemini"
clear_conversation - Clear conversation history
service
: "clouddesktop" | "chatgpt" | "gemini"
cleanup_browser - Close all browser windows
Example Usage in Claude Code
Natural language examples:
# Ask ChatGPT
Use web_automation_query with query: "Ask ChatGPT about the history of computing"
# Send to Gemini
Use web_automation_query with query: "Send to Gemini: write a Python function to sort a list"
# Clear conversation
Use web_automation_query with query: "Clear ChatGPT conversation"
Direct tool usage:
# Initialize browser (visible mode)
Use the initialize_browser tool with headless: false
# Send a message to ChatGPT
Use the send_to_service tool with service: "chatgpt" and content: "What is the capital of France?"
# Clean up when done
Use the cleanup_browser tool
Important Notes
Manual Login Required: You must manually log in to each service in the browser window the first time
Service URLs: Update the URLs in the handler files if they change:
src/handlers/cloudDesktop.ts
- Update Cloud Desktop URLsrc/handlers/chatgpt.ts
- ChatGPT URLsrc/handlers/gemini.ts
- Gemini URL
Selectors: The CSS selectors in handlers may need updates if the services change their UI
Development
To run in development mode:
npm run dev
To modify service handlers, edit the files in src/handlers/
.
Troubleshooting
- Login Required Error: Open the browser in non-headless mode and log in manually
- Selector Not Found: The service UI may have changed - update selectors in the handler
- Timeout Errors: Increase timeout values in the handlers for slower connections
License
MIT