maud_initial_matching_rights_period_(cor)

Task Description: This is a multiple-choice task in which the model must select the answer that best characterizes the merger agreement regarding the duration of the initial matching rights period if the board changes its recommendation.
Task Type: 7-way classification
Document Type: merger agreement
Number of Samples: 159
Input Length Range: 90-1630 tokens
Evaluation Metrics: accuracy (maximize), balanced_accuracy (maximize), f1_macro (maximize), f1_micro (maximize), valid_predictions_ratio (maximize)
Tags: corporate law, interpretation
Paper: LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Dataset Download: https://hazyresearch.stanford.edu/legalbench/

7 submissions

Rank	Model	accuracy	balanced_accuracy	f1_macro	f1_micro	valid_predictions_ratio	Date	Results
1	meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo	0.601	0.433	0.429	0.601	1.000	2025-07-25	View
2	meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo	0.532	0.366	0.378	0.532	1.000	2025-08-03	View
3	gpt-4o-mini	0.323	0.249	0.262	0.323	1.000	2025-07-02	View
4	claude-3-5-haiku-20241022	0.272	0.244	0.122	0.272	1.000	2025-08-01	View
5	gpt-4.1-nano	0.114	0.083	0.118	0.114	1.000	2025-07-03	View
6	google/gemma-2-27b-it	0.108	0.133	0.165	0.108	1.000	2025-07-24	View
7	claude-3-haiku-20240307	0.044	0.025	0.038	0.044	1.000	2025-07-28	View