๐
LEADERBOARD
Observability
Monitoring, logging, and tracing for AI systems
18tools ranked
Observability Rankings
Ranked by overall ToolRoute Score across all benchmark dimensions
| Rank | Tool Name | ToolRoute Score | Output | Reliability | Efficiency | Cost | Trust | Stars |
|---|---|---|---|---|---|---|---|---|
| ๐ฅ | Datadog MCPOfficial | 84.0 | 82.0 | 88.0 | 75.0 | 55.0 | 92.0 | 1,500 |
| ๐ฅ | Grafana MCP | 80.0 | 78.0 | 84.0 | 80.0 | 90.0 | 88.0 | 2,200 |
| ๐ฅ | HoneyHiveOfficial | 79.0 | 78.0 | 80.0 | 80.0 | 60.0 | 82.0 | 500 |
| #4 | Lunary | 77.0 | 78.0 | 76.0 | 84.0 | 90.0 | 76.0 | 1,500 |
| #5 | Literal AIOfficial | 77.0 | 76.0 | 78.0 | 80.0 | 65.0 | 80.0 | 800 |
| #6 | Langfuse | 50.2 | 84.0 | 82.0 | 84.0 | 90.0 | 10.0 | 23,255 |
| #7 | OpenLLMetry | 49.1 | 80.0 | 78.0 | 84.0 | 95.0 | 10.0 | 6,923 |
| #8 | Helicone | 48.8 | 80.0 | 80.0 | 86.0 | 88.0 | 10.0 | 5,259 |
| #9 | PortkeyOfficial | 48.0 | 82.0 | 82.0 | 86.0 | 70.0 | 10.0 | 10,930 |
| #10 | OpenLIT | 47.5 | 76.0 | 74.0 | 86.0 | 95.0 | 10.0 | 2,286 |
| #11 | LangSmithOfficial | 47.0 | 86.0 | 84.0 | 82.0 | 55.0 | 10.0 | 805 |
| #12 | AgentOps | 46.7 | 80.0 | 76.0 | 84.0 | 85.0 | 10.0 | 5,371 |
| #13 | Phospho | 44.9 | 76.0 | 74.0 | 82.0 | 90.0 | 10.0 | 439 |
| #14 | Log10Official | 43.1 | 76.0 | 78.0 | 82.0 | 70.0 | 10.0 | 96 |
| #15 | AWS MCPOfficial | 9.2 | 9.0 | 9.0 | 8.5 | 7.8 | 9.2 | 8,463 |
| #16 | Cloudflare MCPOfficial | 8.8 | 8.8 | 9.0 | 8.5 | 8.0 | 9.2 | 3,536 |
| #17 | Sentry MCPOfficial | 8.3 | 8.5 | 8.5 | 8.0 | 8.0 | 8.2 | 598 |
| #18 | PagerDuty MCP | 7.3 | 7.2 | 7.5 | 7.5 | 7.5 | 7.0 | 750 |
Score Guide
9.0+ Exceptional
8.0+ Excellent
7.0+ Good
6.0+ Fair
<6.0 Below Average
Contribute Benchmark Data
Help improve these rankings by submitting real-world telemetry. Contributors earn routing credits for every data point.