eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.

python AI & Machine Learning by hidai25 ★ 61 Last commit: 2 months, 4 weeks ago

Anthropic OpenAI

Links:

GitHub

Glama

◐

Assessed visibility — 4/3 applicable dimensions scored

✓ Schema Quality ✓ Protocol — Reliability ✓ Docs & Maintenance ✓ Security Hygiene — Schema Interpretability

Schema Quality

42% weight

Protocol Compliance

N/A

Local server

Reliability

N/A

Local server

Docs & Maintenance

25% weight

Security Hygiene

33% weight

Score History

Category Trends

Static Analysis

Metric	Score	Rating
Schema Completeness	100	Good
Description Quality	90	Good
Documentation Coverage	100	Good
Maintenance Pulse	84	Good
Dependency Health	75	Good
License Clarity	100	Good
Version Hygiene	90	Good

Analyzed 2 months, 4 weeks ago

Embed Badge

Add this to your README to display your MCP Scoreboard grade:

[![MCP Score](https://www.mcpscoreboard.com/badge/26f00103-42c4-4578-b57d-b2b3f74b1aec.svg)](https://www.mcpscoreboard.com/server/26f00103-42c4-4578-b57d-b2b3f74b1aec/)