Stop guessing whether your AI tools work
AgentJury tests MCP servers and agent skills the way agents actually use them — with fuzzy inputs, weird edge cases, and no hand-holding. See which tools pass and which break.
How it works
Tool goes into a sandbox
Isolated Docker container. No internet access. Resource-limited. Your tool runs exactly like it would in production, minus the ability to phone home.
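As a rough illustration of what such a sandbox could look like, the sketch below builds a `docker run` command with networking disabled and resource caps applied. The flags are standard Docker CLI options. The specific limits, the `SandboxLimits` and `docker_run_command` names, and the image name are assumptions made for the example, not AgentJury's actual configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SandboxLimits:
    """Hypothetical resource caps; the real values are not stated on this page."""
    memory: str = "512m"
    cpus: str = "1.0"
    pids: int = 128


def docker_run_command(image: str, limits: SandboxLimits = SandboxLimits()) -> list[str]:
    """Build a `docker run` argv for an isolated, resource-limited container."""
    return [
        "docker", "run", "--rm",
        "--network", "none",               # no network access from inside the container
        "--memory", limits.memory,         # hard memory cap
        "--cpus", limits.cpus,             # CPU quota
        "--pids-limit", str(limits.pids),  # cap on the number of processes
        "--read-only",                     # read-only root filesystem
        "--cap-drop", "ALL",               # drop all Linux capabilities
        image,
    ]


if __name__ == "__main__":
    # The image name is a placeholder for illustration only.
    print(" ".join(docker_run_command("example/mcp-server:latest")))
```

Building the command as a list rather than a shell string avoids quoting problems if it is later passed to `subprocess.run`.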
Five test agents hammer it
Security fuzzing, 100-call reliability runs, "can an agent figure this out" tests, and cross-framework compatibility checks. All automated, all recorded.
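To make the idea of a 100-call reliability run concrete, here is a minimal sketch that calls a tool repeatedly and tallies failures by exception type. It is an assumption about how such a run might be structured, not AgentJury's harness. `reliability_run` and `make_flaky_tool` are hypothetical names, and the flaky tool is a stand-in for a real MCP tool call.

```python
from collections import Counter
from typing import Any, Callable


def reliability_run(call: Callable[[], Any], runs: int = 100) -> dict:
    """Invoke `call` repeatedly and tally successes and failures by exception type."""
    if runs <= 0:
        raise ValueError("runs must be positive")
    failures: Counter = Counter()
    successes = 0
    for _ in range(runs):
        try:
            call()
            successes += 1
        except Exception as exc:  # a fuller harness would also capture logs and timings
            failures[type(exc).__name__] += 1
    return {
        "runs": runs,
        "pass_rate": successes / runs,
        "failures": dict(failures),
    }


def make_flaky_tool(fail_every: int) -> Callable[[], str]:
    """Hypothetical stand-in tool that raises on every `fail_every`-th call."""
    state = {"calls": 0}

    def tool() -> str:
        state["calls"] += 1
        if state["calls"] % fail_every == 0:
            raise TimeoutError("simulated timeout")
        return "ok"

    return tool


if __name__ == "__main__":
    print(reliability_run(make_flaky_tool(fail_every=10)))
```

Grouping failures by type keeps the summary small while still pointing at the failure mode, which is the kind of evidence a score could link back to.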
You get data, not opinions
Test results with evidence. Failure modes. Compatibility matrix. Every score links to the logs that produced it. No hand-waving.
Recently tested
mcp-server-time
MCP: A Model Context Protocol server providing tools for time queries and timezone conversions for LLMs
mcp-fetch-server
MCP: An MCP server offering simple HTTP fetch functionality
@modelcontextprotocol/server-filesystem
MCP: MCP server for filesystem access
@perplexity-ai/mcp-server
MCP: Real-time web search, reasoning, and research through Perplexity's API
@antv/mcp-server-chart
MCP: A Model Context Protocol server for generating charts using AntV. This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools.
@notionhq/notion-mcp-server
MCP: Official MCP server for Notion API
tavily-mcp
MCP: MCP server for advanced web search using Tavily
chrome-devtools-mcp
MCP: MCP server for Chrome DevTools
@circleci/mcp-server-circleci
MCP: A Model Context Protocol (MCP) server implementation for CircleCI, enabling natural language interactions with CircleCI functionality through MCP-enabled clients
@modelcontextprotocol/server-sequential-thinking
MCP: MCP server for sequential thinking and problem solving
@postman/postman-mcp-server
MCP: A simple MCP server to operate on the Postman API
mcp-server-qdrant
MCP: MCP server for retrieving context from a Qdrant vector database
107+ tools tested. MCP servers and agent skills. Every score links to its test data.
Built an MCP server or skill? Test it.
Submit your GitHub URL and get a verdict in minutes. Scores, compatibility matrix, and security analysis — all automated.
Submit for testing