Instructions to use TrustHLT/ModernBERT-large-madon-arg-detection with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TrustHLT/ModernBERT-large-madon-arg-detection with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="TrustHLT/ModernBERT-large-madon-arg-detection")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("TrustHLT/ModernBERT-large-madon-arg-detection") model = AutoModelForSequenceClassification.from_pretrained("TrustHLT/ModernBERT-large-madon-arg-detection") - Notebooks
- Google Colab
- Kaggle
ModernBERT-large-madon-arg-detection
This model is a fine-tuned version of ModernBERT-large for Czech legal argument detection. It was introduced in the paper Mining Legal Arguments to Study Judicial Formalism.
The model is part of the MADON project, which focuses on detecting and classifying judicial reasoning in Czech court decisions. This specific model corresponds to Task 1 in the paper: detecting whether a paragraph in a legal decision is argumentative or non-argumentative.
Model Description
The model was adapted to the Czech legal domain through continued pretraining on a corpus of over 300,000 court decisions and fine-tuned on the MADON dataset. In the paper's evaluation, this model achieved a Balanced F1 score of 82.6% for argument detection.
- Paper: Mining Legal Arguments to Study Judicial Formalism
- Repository: TrustHLT/MADON
- Task: Binary text classification (argumentative vs. non-argumentative)
- Language: Czech
Usage
You can use this model for presence classification of Czech legal arguments using the transformers library:
from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline
model = AutoModelForSequenceClassification.from_pretrained("TrustHLT/ModernBERT-large-madon-arg-detection")
tokenizer = AutoTokenizer.from_pretrained("TrustHLT/ModernBERT-large-madon-arg-detection")
pipe = pipeline("text-classification", model=model, tokenizer=tokenizer)
text = "This is a legal paragraph" # Replace with Czech legal text
print(pipe(text))
Citation
If you find this model useful, please cite:
@article{madon2025,
title={Mining Legal Arguments to Study Judicial Formalism},
author={Anonymous},
journal={arXiv preprint arXiv:2512.11374},
year={2025}
}
- Downloads last month
- 4