Abstract
The Internet is home to an ever-increasing array of products and services available to the general consumer. This trend has given rise to a distinct category of internet search in which bargain seekers have congregated around deal collection databases, partly because traditional internet search engines do not perform well in this domain. Unfortunately, these deal databases are costly to maintain because they rely heavily on human participation to populate them. This has led to interest in developing this class of internet search. Our research focuses on leveraging machine learning and natural language processing to develop a semi-supervised Web page classifier specific to this problem. We describe the design of our classifier with respect to the machine learning model chosen and the training features selected. We compare our model's effectiveness in classifying deal versus non-deal Web pages against other popular machine learning models such as decision trees, support vector machines, and neural networks. Our results show that our proposed model performed best given the features that were extracted for model training and testing.
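The abstract does not specify the model or the features, but the keyword list for this record names Naive Bayes. As a rough illustration of what deal versus non-deal classification could look like, the following is a minimal sketch of a multinomial Naive Bayes text classifier with Laplace smoothing. It is fully supervised, so it does not reproduce the semi-supervised approach of the paper. The class name `NaiveBayes`, the whitespace tokenizer, and the four training snippets are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokens stand in for the paper's features.
    return text.lower().split()


class NaiveBayes:
    """Hypothetical multinomial Naive Bayes with Laplace (add-alpha) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.class_counts = Counter()            # documents seen per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            self.class_counts[label] += 1
            for word in tokenize(text):
                self.word_counts[label][word] += 1
                self.vocab.add(word)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            denom = total_words + self.alpha * len(self.vocab)
            for word in tokenize(text):
                if word in self.vocab:  # words never seen in training are skipped
                    count = self.word_counts[label][word]
                    score += math.log((count + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy, made-up training data (not from the paper).
train_texts = [
    "save 50% off today only coupon code",
    "limited time discount free shipping sale",
    "company history and mission statement",
    "contact our support team for help",
]
train_labels = ["deal", "deal", "non-deal", "non-deal"]

model = NaiveBayes().fit(train_texts, train_labels)
print(model.predict("discount coupon sale"))
print(model.predict("contact support team"))
```

A real system would extract features from full Web pages rather than short strings, and a semi-supervised variant would additionally exploit unlabeled pages; neither is shown here.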
| Original language | English |
|---|---|
| Title of host publication | The Web 2013 |
| Subtitle of host publication | Proceedings of the First Australasian Web Conference (AWC 2013), Adelaide, Australia, 31 January - 3 February 2012 |
| Editors | Helen Ashman, Quan Z. Sheng, Andrew Trotman |
| Place of Publication | Sydney NSW Australia |
| Publisher | Australian Computer Society Inc |
| Pages | 69-73 |
| Number of pages | 5 |
| ISBN (Print) | 9781921770296 |
| Publication status | Published - 2013 |
| Externally published | Yes |
| Event | Australasian Web Conference 2013 - Adelaide, Australia. Duration: 29 Jan 2013 → 1 Feb 2013. Conference number: 1st. https://cs.adelaide.edu.au/~awc2013/ |
Publication series
| Name | Conferences in Research and Practice in Information Technology Series |
|---|---|
| Publisher | Australian Computer Society Inc. |
| Volume | 144 |
Conference
| Conference | Australasian Web Conference 2013 |
|---|---|
| Abbreviated title | AWC 2013 |
| Country/Territory | Australia |
| City | Adelaide |
| Period | 29/01/13 → 1/02/13 |
Keywords
- natural language processing
- classification
- Naive Bayes
- deals
- products
- web page classification