Fair Text Classification via Transferable Representations

Bibliographic Details
Title: Fair Text Classification via Transferable Representations
Authors: Leteno, Thibaud, Perrot, Michael, Laclau, Charlotte, Gourru, Antoine, Gravier, Christophe
Publication Year: 2025
Collection: Computer Science
Subject Terms: Computer Science - Machine Learning, Computer Science - Computation and Language
More Details: Group fairness is a central research topic in text classification, where reaching fair treatment between sensitive groups (e.g., women and men) remains an open challenge. We propose an approach that extends the use of the Wasserstein Dependency Measure for learning unbiased neural text classifiers. Given the challenge of distinguishing fair from unfair information in a text encoder, we draw inspiration from adversarial training by inducing independence between representations learned for the target label and those for a sensitive attribute. We further show that Domain Adaptation can be efficiently leveraged to remove the need for access to the sensitive attributes in the dataset we cure. We provide both theoretical and empirical evidence that our approach is well-founded.
Comment: arXiv admin note: text overlap with arXiv:2311.12689
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2503.07691
Accession Number: edsarx.2503.07691
Database: arXiv
More Details
Description not available.