maud_initial_matching_rights_period_(cor)
- Task Description: A multiple-choice task in which the model must select the answer that best describes how long the initial matching rights period in the merger agreement lasts if the board changes its recommendation.
- Task Type: 7-way classification
- Document Type: merger agreement
- Number of Samples: 159
- Input Length Range: 90-1630 tokens
- Evaluation Metrics: accuracy (maximize), balanced_accuracy (maximize), f1_macro (maximize), f1_micro (maximize), valid_predictions_ratio (maximize)
- Tags: corporate law, interpretation
- Paper: LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
- Dataset Download: https://hazyresearch.stanford.edu/legalbench/
7 submissions
Rank | Model | accuracy | balanced_accuracy | f1_macro | f1_micro | valid_predictions_ratio | Date | Results |
---|---|---|---|---|---|---|---|---|
1 | meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | 0.601 | 0.433 | 0.429 | 0.601 | 1.000 | 2025-07-25 | View |
2 | meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | 0.532 | 0.366 | 0.378 | 0.532 | 1.000 | 2025-08-03 | View |
3 | gpt-4o-mini | 0.323 | 0.249 | 0.262 | 0.323 | 1.000 | 2025-07-02 | View |
4 | claude-3-5-haiku-20241022 | 0.272 | 0.244 | 0.122 | 0.272 | 1.000 | 2025-08-01 | View |
5 | gpt-4.1-nano | 0.114 | 0.083 | 0.118 | 0.114 | 1.000 | 2025-07-03 | View |
6 | google/gemma-2-27b-it | 0.108 | 0.133 | 0.165 | 0.108 | 1.000 | 2025-07-24 | View |
7 | claude-3-haiku-20240307 | 0.044 | 0.025 | 0.038 | 0.044 | 1.000 | 2025-07-28 | View |
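
The metric columns in the table can be reproduced from a list of gold labels and a list of model predictions. The sketch below is a minimal pure-Python illustration, not the leaderboard's actual scoring code. It rests on several assumptions: the function name `classification_metrics`, the placeholder labels `"A"` to `"G"`, and the choice to count an invalid prediction as a miss for the gold class without charging it to any predicted class are all hypothetical. Macro F1 is averaged over classes that appear in either the gold labels or the valid predictions, which may differ from the convention used by the benchmark.

```python
from collections import Counter


def classification_metrics(gold, pred, labels):
    """Compute leaderboard-style metrics for single-label classification.

    gold:   list of true labels
    pred:   list of predicted labels; anything not in `labels` is invalid
    labels: the allowed label set (7 labels for this task)
    """
    if not gold or len(gold) != len(pred):
        raise ValueError("gold and pred must be non-empty and of equal length")

    allowed = set(labels)
    tp, fp, fn = Counter(), Counter(), Counter()
    valid = 0
    for g, p in zip(gold, pred):
        is_valid = p in allowed
        valid += is_valid
        if is_valid and p == g:
            tp[g] += 1
        else:
            fn[g] += 1          # the gold class was missed
            if is_valid:
                fp[p] += 1      # assumption: invalid outputs hit no class

    n = len(gold)
    total_tp = sum(tp.values())
    total_fp = sum(fp.values())
    total_fn = sum(fn.values())

    # Balanced accuracy: unweighted mean of per-class recall over gold classes.
    gold_classes = sorted(set(gold))
    balanced = sum(tp[c] / (tp[c] + fn[c]) for c in gold_classes) / len(gold_classes)

    # Macro F1: unweighted mean of per-class F1 over classes seen in gold or pred.
    def f1(c):
        denom = 2 * tp[c] + fp[c] + fn[c]
        return 2 * tp[c] / denom if denom else 0.0

    seen = sorted(set(gold) | {p for p in pred if p in allowed})
    f1_macro = sum(f1(c) for c in seen) / len(seen)

    # Micro F1: F1 computed from counts pooled across all classes.
    micro_denom = 2 * total_tp + total_fp + total_fn
    f1_micro = 2 * total_tp / micro_denom if micro_denom else 0.0

    return {
        "accuracy": total_tp / n,
        "balanced_accuracy": balanced,
        "f1_macro": f1_macro,
        "f1_micro": f1_micro,
        "valid_predictions_ratio": valid / n,
    }


if __name__ == "__main__":
    # Hypothetical toy data; "A".."G" stand in for the seven answer choices.
    scores = classification_metrics(
        gold=["A", "A", "B", "C"],
        pred=["A", "B", "B", "C"],
        labels=list("ABCDEFG"),
    )
    for name, value in scores.items():
        print(f"{name}: {value:.3f}")
```

When every prediction is valid, each error is one false positive and one false negative, so micro F1 reduces to accuracy. That is consistent with the table, where `accuracy` and `f1_micro` match in every row and `valid_predictions_ratio` is 1.000 throughout. Balanced accuracy and macro F1 weight all classes equally, which is why they can sit well below accuracy when a model does better on the more frequent answer choices.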