CourtPressGER: A German Court Decision to Press Release Summarization Dataset
By: Sebastian Nagl , Mohamed Elganayni , Melanie Pospisil and more
Potential Business Impact:
Helps courts explain rulings clearly to everyone.
Official court press releases from Germany's highest courts present and explain judicial rulings to the public, as well as to expert audiences. Prior NLP efforts emphasize technical headnotes, ignoring citizen-oriented communication needs. We introduce CourtPressGER, a 6.4k dataset of triples: rulings, human-drafted press releases, and synthetic prompts for LLMs to generate comparable releases. This benchmark trains and evaluates LLMs in generating accurate, readable summaries from long judicial texts. We benchmark small and large LLMs using reference-based metrics, factual-consistency checks, LLM-as-judge, and expert ranking. Large LLMs produce high-quality drafts with minimal hierarchical performance loss; smaller models require hierarchical setups for long judgments. Initial benchmarks show varying model performance, with human-drafted releases ranking highest.
Similar Papers
Summarisation of German Judgments in conjunction with a Class-based Evaluation
Computation and Language
Helps lawyers quickly understand long legal papers.
What Are the Facts? Automated Extraction of Court-Established Facts from Criminal-Court Opinions
Computation and Language
Helps computers understand crime details from court papers.
On-Premise AI for the Newsroom: Evaluating Small Language Models for Investigative Document Search
Information Retrieval
Helps reporters find facts faster and safer.