Parametric optimization in data mining incorporated with GA-based search

Ling Tan, David Taniar, Kate A Smith

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    1 Citation (Scopus)


    A number of parameters must be specified for a data-mining algorithm. Default values of these parameters are given and generally accepted as ‘good’ estimates for any data set. However, data mining models are known to be data dependent, and so are for their parameters. Default values may be good estimates, but they are often not the best parameter values for a particular data set. A tuned set of parameter values is able to produce a data-mining model of better classification and higher prediction accuracy. However parameter search is known to be expensive. This paper investigates GA-based heuristic techniques in a case study of optimizing parameters of back-propagation neural network classifier. Our experiments show that GA-based optimization technique is capable of finding a better set of parameter values than random search. In addition, this paper extends the island-model of Parallel GA (PGA) and proposes a VC-PGA, which communicates globally fittest individuals to local population with reduced communication overhead. Our result shows that GA-based parallel heuristic optimization technique provides a solution to large parametric optimization problems.
    Original languageEnglish
    Title of host publicationComputational Science – ICCS 2002
    Subtitle of host publicationInternational Conference Amsterdam, The Netherlands, April 21-24, 2002 Proceedings, Part I
    EditorsPeter M.A. Sloot, C.J. Kenneth Tan, Jack J. Dongarra, Alfons G. Hoekstra
    Place of PublicationBerlin Germany
    Number of pages10
    ISBN (Print)3540435913
    Publication statusPublished - 2002
    EventInternational Conference on Computational Science 2002 - Amsterdam, Netherlands
    Duration: 21 Apr 200224 Apr 2002 (Proceedings)

    Publication series

    NameLecture Notes in Computer Science
    ISSN (Print)0302-9743


    ConferenceInternational Conference on Computational Science 2002
    Abbreviated titleICCS 2002
    Internet address

    Cite this