This paper presents a novel schema mediation approach, called XMiner, for mining a mediated schema from a set of XML schemas. XMiner addresses three main mediation problems resulting from the heterogeneous source schemas: nesting discrepancy, backward paths and schema discrepancy. It discovers frequent concepts and paths, which are used to construct the mediated schema. XMiner exploits structural context, forward/backward paths, and label semantics to preserve the hierarchical structure as the best as possible while avoiding information loss. Experimental evaluations on real and synthetic datasets show that XMiner offers acceptable performance for large-scale scenarios.
|Title of host publication||Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008|
|Number of pages||4|
|Publication status||Published - 2008|
|Event||2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008 - Sydney, NSW, Australia|
Duration: 9 Dec 2008 → 12 Dec 2008
|Conference||2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008|
|Period||9/12/08 → 12/12/08|