Bayesian regularization of neural networks

Frank Burden, David Winkler

Research output: Chapter in Book/Report/Conference proceedingChapter (Book)Other

312 Citations (Scopus)

Abstract

Bayesian regularized artificial neural networks (BRANNs) are more robust than standard back-propagation nets and can reduce or eliminate the need for lengthy cross-validation. Bayesian regularization is a mathematical process that converts a nonlinear regression into a "well-posed" statistical problem in the manner of a ridge regression. The advantage of BRANNs is that the models are robust and the validation process, which scales as O(N 2) in normal regression methods, such as back propagation, is unnecessary. These networks provide solutions to a number of problems that arise in QSAR modeling, such as choice of model, robustness of model, choice of validation set, size of validation effort, and optimization of network architecture. They are difficult to overtrain, since evidence procedures provide an objective Bayesian criterion for stopping training. They are also difficult to overfit, because the BRANN calculates and trains on a number of effective network parameters or weights, effectively turning off those that are not relevant. This effective number is usually considerably smaller than the number of weights in a standard fully connected back-propagation neural net. Automatic relevance determination (ARD) of the input variables can be used with BRANNs, and this allows the network to "estimate" the importance of each input. The ARD method ensures that irrelevant or highly correlated indices used in the modeling are neglected as well as showing which are the most important variables for modeling the activity data. This chapter outlines the equations that define the BRANN method plus a flowchart for producing a BRANN-QSAR model. Some results of the use of BRANNs on a number of data sets are illustrated and compared with other linear and nonlinear models. 

Original languageEnglish
Title of host publicationArtificial Neural Networks
Subtitle of host publicationMethods and Applications
EditorsDavid J. Livingstone
Pages25-44
Number of pages20
Volume458
ISBN (Electronic)9781603271011
DOIs
Publication statusPublished - 2008
Externally publishedYes

Publication series

NameMethods in Molecular Biology
Volume458
ISSN (Print)10643745

Keywords

  • artificial neural network
  • automatic relevance determination (ARD)
  • Bayesian regularizsation
  • early stopping algorithm
  • overtraining
  • QSAR

Cite this