IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Machine learning models have made incredible progress, but they still struggle when applied to examples from unseen domains. This study focuses on a specific problem of domain generalization, where a model is trained on one source domain and tested on multiple target domains that are unseen during training. We propose IMO: Invariant features Masks for Out-of-Distribution text classification, to achieve OOD generalization by learning invariant features. During training, IMO would learn sparse mask layers to remove irrelevant features for prediction, where the remaining features keep invariant. Additionally, IMO has an attention module at the token level to focus on tokens that are useful for prediction. Our comprehensive experiments show that IMO substantially outperforms strong baselines in terms of various evaluation metrics and settings.
Original languageEnglish
Title of host publicationProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
EditorsMiruna Clinciu, Bing Liu, Zhiyu Zoey Chen, Chen Liang
Place of PublicationKerrville TX USA
PublisherAssociation for Computational Linguistics (ACL)
Pages2625–2639
Number of pages15
Volume1
ISBN (Electronic)9798891760943
DOIs
Publication statusPublished - 2024
EventAnnual Meeting of the Association of Computational Linguistics 2024 - Bangkok, Thailand
Duration: 11 Aug 202416 Aug 2024
Conference number: 62nd
https://aclanthology.org/2024.acl-long.0/ (Proceedings)
https://2024.aclweb.org/ (Website)
https://aclanthology.org/volumes/2024.findings-acl/ (Proceedings (Findings))
https://aclanthology.org/volumes/2024.acl-long/ (Proceedings)

Conference

ConferenceAnnual Meeting of the Association of Computational Linguistics 2024
Abbreviated titleACL 2024
Country/TerritoryThailand
CityBangkok
Period11/08/2416/08/24
Internet address

Cite this