Anthony Hughes, Ning Ma, Nikolaos Aletras

Investigating Privacy Preservation of Language Models in Legal Text Summarization: A Preliminary Study

Rubrik: Conference Article 3
Jahrgang 182 (2026) / Heft 1, S. 73-82 (10)
Publiziert 05.06.2026
DOI 10.1628/jite-2026-0010
inkl. gesetzl. MwSt.
  • Artikel PDF
  • lieferbar
  • 10.1628/jite-2026-0010
Beschreibung
Language models (LMs) have shown outstanding performance in legal text summarization. It is crucial that personally identifying information (PII) included in the source document should not leak in the summary. Prior efforts have mostly focused on studying how LMs may inadvertently leak PII from training data. However, to what extent LMs can provide privacy-preserving summaries given a non-private source legal document remains under-explored. In this paper, we perform a preliminary empirical study on privacy-preservation in legal text summarization showing that LMs often cannot prevent PII leakage in summaries.