Data Preparation

Zahraa S. Abdallah, Lan Du, Geoffrey I. Webb

Research output: Chapter in Book/Report/Conference proceeding > Encyclopaedia / Dictionary Entry > Other

Abstract

Before data can be analyzed, they must be organized into an appropriate form. Data preparation is the process of manipulating and organizing data prior to analysis.
Data preparation is typically an iterative process of manipulating raw data, which is often unstructured and messy, into a more structured and useful form that is ready for further analysis. The whole preparation process consists of a series of major activities (or tasks) including data profiling, cleansing, integration, and transformation.
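The four activities named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the entry itself: the records, the field names (`id`, `age`, `city`, `income`), the function names, and the specific choices of mean imputation and min-max scaling. It uses only the Python standard library.

```python
from statistics import mean

# Hypothetical raw records; names and values are invented for illustration.
raw = [
    {"id": 1, "age": "34", "city": " Boston "},
    {"id": 2, "age": "", "city": "boston"},
    {"id": 2, "age": "", "city": "boston"},  # exact duplicate of the row above
    {"id": 3, "age": "51", "city": "Melbourne"},
]
incomes = {1: 52000.0, 3: 61000.0}  # second hypothetical source, keyed by id


def profile(rows):
    """Profiling: count missing (empty or None) values per field."""
    fields = rows[0].keys()
    return {f: sum(1 for r in rows if r[f] in ("", None)) for f in fields}


def cleanse(rows):
    """Cleansing: drop exact duplicates, normalize city text, parse ages."""
    seen, out = set(), []
    for r in rows:
        key = tuple(sorted(r.items()))
        if key in seen:
            continue
        seen.add(key)
        age = int(r["age"]) if r["age"] else None
        out.append({"id": r["id"], "age": age, "city": r["city"].strip().title()})
    return out


def integrate(rows, income_by_id):
    """Integration: left-join income from the second source on id."""
    return [{**r, "income": income_by_id.get(r["id"])} for r in rows]


def transform(rows):
    """Transformation: mean-impute missing ages, then min-max scale to [0, 1]."""
    known = [r["age"] for r in rows if r["age"] is not None]
    fill = mean(known)
    ages = [r["age"] if r["age"] is not None else fill for r in rows]
    lo, hi = min(ages), max(ages)
    span = (hi - lo) or 1  # avoid division by zero when all ages are equal
    return [{**r, "age_scaled": (a - lo) / span} for r, a in zip(rows, ages)]


missing = profile(raw)
prepared = transform(integrate(cleanse(raw), incomes))
```

In practice the process is iterative, as the abstract notes: the profiling result (here, two missing `age` values) would typically inform which cleansing and transformation steps are applied, and the steps may be revisited after inspecting `prepared`.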
Original language: English
Title of host publication: Encyclopedia of Machine Learning and Data Mining
Editors: Claude Sammut, Geoffrey I. Webb
Place of Publication: Boston, MA
Publisher: Humana Press
Pages: 318-327
Number of pages: 10
ISBN (Print): 978-1-4899-7502-7
DOIs: https://doi.org/10.1007/978-1-4899-7502-7_62-1
Publication status: Published - 2016

Cite this

Abdallah, Z. S., Du, L., & Webb, G. I. (2016). Data Preparation. In C. Sammut, & G. I. Webb (Eds.), Encyclopedia of Machine Learning and Data Mining (pp. 318-327). Humana Press. https://doi.org/10.1007/978-1-4899-7502-7_62-1