MCP Servers are shaping the future of software. MCPMark is a comprehensive, stress-testing benchmark and a collection of diverse, verifiable tasks designed to evaluate model capabilities in real-world MCP use.
GitHubスター
119
ユーザー評価
未評価
お気に入り
0
閲覧数
10
フォーク
5
イシュー
2
Evaluation Systems Organization
3
フォロワー
4
リポジトリ
Gist
貢献数
An MCP server that autonomously evaluates web applications.
MCP Servers are shaping the future of software. MCPMark is a comprehensive, stress-testing benchmark designed to evaluate model and agent capabilities in real-world MCP use.
Verify that any MCP server is running the intended and untampered code via hardware attestation.