⚖️ LegalEvalHub
Home Tasks
Aggregate Leaderboards ▼
All Leaderboards
LegalBench (Full) LegalBench (Reasoning) HousingQA (Knowledge) HousingQA (Statute Comprehension)
Resources FAQ GitHub

  • LegalEvalHub: Code for this website. See here for instructions on how to contribute tasks, leaderboards, and evaluation runs.
  • legal-eval-harness: A harness for evaluating LLMs on legal tasks.
  • legal-ml-datasets: A collection of legal datasets for machine learning.
  • HELM: A framework for evaluating LLMs on legal tasks, run by Stanford CRFM.

© 2025 LegalEvalHub.

GitHub | FAQ | Contribute | Contact