A high performance integrated web data warehousing

Xuan Thi Dung, Wenny Rahayu, David Taniar

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Over the years, we have seen a significant number of integration techniques for data warehouses to support web integrated data. However, the existing works focus extensively on the design concept. In this paper, we focus on the performance of a web database application such as an integrated web data warehousing using a well-defined and uniform structure to deal with web information sources including semi-structured data such as XML data, and documents such as HTML in a web data warehouse system. By using a case study, our implementation of the prototype is a web manipulation concept for both incoming sources and result outputs. Thus, the system not only can be operated through the web, it can also handle the integration of web data sources and structured data sources. Our main contribution is the performance evaluation of an integrated web data warehouse application which includes two tasks. Task one is to perform a verification of the correctness of integrated data based on the result set that is retrieved from the web integrated data warehouse system using complex and OLAP queries. The result set is checked against the result set that is retrieved from the existing independent data source systems. Task two is to measure the performance of OLAP or complex query by investigating source operation functions used by these queries to retrieve the data. The information of source operation functions used by each query is obtained using the TKPROF utility.

Original languageEnglish
Pages (from-to)95-109
Number of pages15
JournalTertiary Education and Management
Volume10
Issue number1
DOIs
Publication statusPublished - 2004

Keywords

  • Integrated web data warehouse performance
  • Performance evaluation of web complex query

Cite this