75/100 (B)
eval-view
Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.
Anthropic
OpenAI
Assessed visibility: 4/3 applicable dimensions scored
✓ Schema Quality
✓ Protocol
— Reliability
✓ Docs & Maintenance
✓ Security Hygiene
— Schema Interpretability
| Dimension | Score | Weight / Note |
|---|---|---|
| Schema Quality | 96 | 42% weight |
| Protocol Compliance | N/A | Local server |
| Reliability | N/A | Local server |
| Docs & Maintenance | 90 | 25% weight |
| Security Hygiene | 83 | 33% weight |
Static Analysis
| Metric | Score | Rating |
|---|---|---|
| Schema Completeness | 100 | Good |
| Description Quality | 90 | Good |
| Documentation Coverage | 100 | Good |
| Maintenance Pulse | 84 | Good |
| Dependency Health | 75 | Good |
| License Clarity | 100 | Good |
| Version Hygiene | 90 | Good |
Analyzed 2 months, 4 weeks ago