Modelling zeros in blockmodelling

Laurence A.F. Park, Mohadeseh Ganji, Emir Demirovic, Jeffrey Chan, Peter Stuckey, James Bailey, Christopher Leckie, Rao Kotagiri

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Blockmodelling is the process of determining community structure in a graph. Real graphs contain noise and so it is up to the blockmodelling method to allow for this noise and reconstruct the most likely role memberships and role relationships. Relationships are encoded in a graph using the absence and presence of edges. Two objects are considered similar if they each have edges to a third object. However, the information provided by missing edges is ambiguous and therefore can be measured in different ways. In this article, we examine the effect of the choice of block metric on blockmodelling accuracy and find that data relationships can be position based or set based. We hypothesise that this is due to the data containing either Hamming noise or Jaccard noise. Experiments performed on simulated data show that when no noise is present, the accuracy is independent of the choice of metric. But when noise is introduced, high accuracy results are obtained when the choice of metric matches the type of noise.

Original languageEnglish
Title of host publication26th Pacific-Asia Conference, PAKDD 2022 Chengdu, China, May 16–19, 2022 Proceedings, Part II
EditorsJoão Gama, Tianrui Li, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng
Place of PublicationCham Switzerland
PublisherSpringer
Pages187-198
Number of pages12
ISBN (Electronic)9783031059360
ISBN (Print)9783031059353
DOIs
Publication statusPublished - 2022
EventPacific-Asia Conference on Knowledge Discovery and Data Mining 2022 - Chengdu, China
Duration: 16 May 202219 May 2022
Conference number: 26th
https://link.springer.com/book/10.1007/978-3-031-05936-0 (Proceedings)
http://www.pakdd.net/ (Website)

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume13281
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferencePacific-Asia Conference on Knowledge Discovery and Data Mining 2022
Abbreviated titlePAKDD 2022
Country/TerritoryChina
CityChengdu
Period16/05/2219/05/22
Internet address

Cite this