REVAL Political bias leaderboard

Leaderboard Docs GitHub

Getting started

  • Install
  • Run your first eval
  • Viewing reports

Concepts

  • Methodology
  • Rubrics & metrics
  • Test cases

Reference

  • Providers & models
  • Config reference
  • CLI reference

Roadmap

  • Upcoming features

Docs

Everything you need to run REVAL locally, understand the rubrics and metrics, and read the roadmap.

Get started locally

Clone, install, configure .env, run your first eval.

Run your first eval

Command-line walkthrough with real model flags and outputs.

Methodology

What REVAL measures, why it matters, and what it deliberately does not measure.

Rubrics & metrics

Figure treatment, issue framing, scoring formulas, and the 0.85 similarity threshold.

Test cases

54 entries across US and India, five categories, and the schema that enforces them.

Providers & models

Bedrock, Anthropic, OpenAI, MiniMax, Ollama — and how to register a new model.

What's next

Roadmap & upcoming features

Generated by reval leaderboard build

github.com/krishnakartik1/reval