Conflict-aware Pseudo Labeling via Optimal Transport for entity alignment

Qijie Ding, Daokun Zhang, Jie Yin

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Entity alignment aims to discover unique equivalent entity pairs with the same meaning across different knowledge graphs (KGs). Existing models have focused on projecting KGs into a latent embedding space so that inherent semantics between entities can be captured for entity alignment. However, the adverse impacts of alignment conflicts have been largely overlooked during training, thereby limiting the entity alignment performance. To address this issue, we propose a novel Conflict-aware Pseudo Labeling via Optimal Transport model (CPL-OT) for entity alignment. The key idea is to iteratively pseudo-label alignment pairs empowered with conflict-aware optimal transport (OT) modeling to boost the precision of entity alignment. CPL-OT is composed of two key components - entity embedding learning with global-local aggregation and iterative conflict-aware pseudo labeling - that mutually reinforce each other. To mitigate alignment conflicts during pseudo labeling, we propose to use optimal transport as an effective means to warrant one-to-one entity alignment between two KGs with the minimal overall transport cost. Extensive experiments on benchmark datasets validate the superiority of CPL-OT over state-of-the-art baselines under both settings with and without prior alignment seeds.

Original languageEnglish
Title of host publicationProceedings - 22nd IEEE International Conference on Data Mining, ICDM 2022
EditorsXingquan Zhu, Sanjay Ranka, My T. Thai, Takashi Washio, Xindong Wu
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages915-920
Number of pages6
ISBN (Electronic)9781665450997
ISBN (Print)9781665451000
DOIs
Publication statusPublished - 2022
EventIEEE International Conference on Data Mining 2022 - Orlando, United States of America
Duration: 28 Nov 20221 Dec 2022
Conference number: 22nd
https://ieeexplore.ieee.org/xpl/conhome/10027565/proceeding (Proceedings)
https://icdm22.cse.usf.edu/ (Website)

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
PublisherIEEE, Institute of Electrical and Electronics Engineers
Volume2022-November
ISSN (Print)1550-4786
ISSN (Electronic)2374-8486

Conference

ConferenceIEEE International Conference on Data Mining 2022
Abbreviated titleICDM 2022
Country/TerritoryUnited States of America
CityOrlando
Period28/11/221/12/22
Internet address

Keywords

  • entity alignment
  • knowledge graph
  • optimal transport
  • pseudo labeling

Cite this