What's the deal - Identifying online bargains

John Cuzzola, Dragan Gašević, Ebrahim Bagheri

Research output: Chapter in Book/Report/Conference proceeding > Conference Paper > Research > peer-review


The Internet is home to an ever-increasing array of products and services available to the general consumer. This trend has given rise to a unique category of internet search in which bargain seekers have gravitated towards deal collection databases, in part because traditional internet search engines do not perform well in this domain. Unfortunately, these deal databases are costly to maintain because they rely heavily on human participation to populate them. This has led to an interest in the development of this class of internet search. Our research focuses on leveraging machine learning and natural language processing to develop a semi-supervised Web page classifier specific to this problem. We describe the design of our classifier with respect to the machine learning model chosen and the training features selected. We compare our model's effectiveness in classifying deal versus non-deal Web pages against other popular machine learning models such as decision trees, support vector machines, and neural networks. Our results show that our proposed model performed the best given the features that were extracted for model training and testing.
Original language: English
Title of host publication: The Web 2013
Subtitle of host publication: Proceedings of the First Australasian Web Conference (AWC 2013), Adelaide, Australia, 31 January - 3 February 2012
Editors: Helen Ashman, Quan Z. Sheng, Andrew Trotman
Place of Publication: Sydney NSW Australia
Publisher: Australian Computer Society Inc
Number of pages: 5
ISBN (Print): 9781921770296
Publication status: Published - 2013
Externally published: Yes
Event: Australasian Web Conference 2013 - Adelaide, Australia
Duration: 29 Jan 2013 to 1 Feb 2013
Conference number: 1st

Publication series

Name: Conferences in Research and Practice in Information Technology Series
Publisher: Australian Computer Society Inc.


Conference: Australasian Web Conference 2013
Abbreviated title: AWC 2013


  • natural language processing
  • classification
  • Naive Bayes
  • deals
  • products
  • web page classification
