Stop guessing if your AI tools work

AgentJury tests MCP servers and agent skills the way agents actually use them — with fuzzy inputs, weird edge cases, and no hand-holding. See which tools pass and which break.

Browse 107+ tested tools Submit your tool

How it works

Tool goes into a sandbox

Isolated Docker container. No internet access. Resource-limited. Your tool runs exactly like it would in production, minus the ability to phone home.

Five test agents hammer it

Security fuzzing, 100-call reliability runs, "can an agent figure this out" tests, and cross-framework compatibility checks. All automated, all recorded.

You get data, not opinions

Test results with evidence. Failure modes. Compatibility matrix. Every score links to the logs that produced it. No hand-waving.

Recently tested

View all →

mcp-server-time

MCP

8.2

A Model Context Protocol server providing tools for time queries and timezone conversions for LLMs

mcp-fetch-server

MCP

7.6

An MCP server offering simple HTTP fetch functionality

@modelcontextprotocol/server-filesystem

MCP

7.6

MCP server for filesystem access

@perplexity-ai/mcp-server

MCP

7.6

Real-time web search, reasoning, and research through Perplexity's API

@antv/mcp-server-chart

MCP

7.5

A Model Context Protocol server for generating charts using AntV. This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools.

@notionhq/notion-mcp-server

MCP

7.5

Official MCP server for Notion API

tavily-mcp

MCP

7.5

MCP server for advanced web search using Tavily

chrome-devtools-mcp

MCP

7.3

MCP server for Chrome DevTools

@circleci/mcp-server-circleci

MCP

7.3

A Model Context Protocol (MCP) server implementation for CircleCI, enabling natural language interactions with CircleCI functionality through MCP-enabled clients