Graph stream classification using labeled and unlabeled graphs

Shirui Pan, Xingquan Zhu, Chengqi Zhang, Philip S. Yu

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

53 Citations (Scopus)

Abstract

Graph classification is becoming increasingly popular due to the rapidly rising applications involving data with structural dependency. The wide spread of the graph applications and the inherent complex relationships between graph objects have made the labels of the graph data expensive and/or difficult to obtain, especially for applications involving dynamic changing graph records. While labeled graphs are limited, the copious amounts of unlabeled graphs are often easy to obtain with trivial efforts. In this paper, we propose a framework to build a stream based graph classification model by combining both labeled and unlabeled graphs. Our method, called gSLU, employs an ensemble based framework to partition graph streams into a number of graph chunks each containing some labeled and unlabeled graphs. For each individual chunk, we propose a minimum-redundancy subgraph feature selection module to select a set of informative subgraph features to build a classifier. To tackle the concept drifting in graph streams, an instance level weighting mechanism is used to dynamically adjust the instance weight, through which the subgraph feature selection can emphasize on difficult graph samples. The classifiers built from different graph chunks form an ensemble for graph stream classification. Experiments on real-world graph streams demonstrate clear benefits of using minimum-redundancy subgraph features to build accurate classifiers. By employing instance level weighting, our graph ensemble model can effectively adapt to the concept drifting in the graph stream for classification.

Original languageEnglish
Title of host publicationICDE 2013 - 29th International Conference on Data Engineering
Pages398-409
Number of pages12
DOIs
Publication statusPublished - 15 Aug 2013
Externally publishedYes
EventIEEE International Conference on Data Engineering 2013 - Sofitel Hotel, Brisbane, Australia
Duration: 8 Apr 201312 Apr 2013
Conference number: 29th
http://www.icde2013.org/
https://ieeexplore.ieee.org/xpl/conhome/6530811/proceeding (Proceedings)

Conference

ConferenceIEEE International Conference on Data Engineering 2013
Abbreviated titleICDE 2013
Country/TerritoryAustralia
CityBrisbane
Period8/04/1312/04/13
OtherThe annual ICDE conference addresses research issues in designing, building, managing, and evaluating advanced data-intensive systems and applications. It is a leading forum for researchers, practitioners, developers, and users to explore cutting-edge ideas and to exchange techniques, tools, and experiences. We invite the submission of original research contributions and industry papers, as well as proposals for workshops, panels, tutorials, and demonstrations.
Internet address

Cite this