CESSC
causal and non-causal sentence classification dataset and model
CESSC provides a curated dataset and a fine-tuned BERT-based model for binary classification of causal and non-causal sentences within social science texts. The work is connected to the paper (Norouzi et al., 2025).
- Published article
- Fine-tuned BERT model on Hugging Face
- Dataset on Hugging Face
- Dataset on GitHub
- Source code
1,000 manually annotated sentences, supplementary machine-labeled sentences, scripts for model fine-tuning and evaluation, and benchmark results.