Capability Profile
One number hides a model's shape; a profile shows it.
Key Insight
This project runs one model through eight benchmarks spanning different skills — knowledge, math, code, instruction-following — and draws the results as a single radar chart.
Why This Matters
A model that is strong at math can be weak at following instructions, and a per-skill profile reveals those trade-offs that one averaged score would bury — exactly what you need when picking a model for a specific job.