Phys. Rev. E 72, 026117 (2005) [11 pages]

Efficient training of multilayer perceptrons using principal component analysis

Download: PDF (162 kB) or Buy this Article (Use Article Pack) Export: BibTeX or EndNote (RIS)

Christoph Bunzmann1, Michael Biehl2, and Robert Urbanczik1
1Institut für Theoretische Physik, Universität Würzburg Am Hubland, D-97074 Würzburg, Germany
2Institute for Mathematics and Computing Science, University Groningen, P.O. Box 800, 9700 AV Groningen, The Netherlands

Received 15 March 2005; published 16 August 2005

A training algorithm for multilayer perceptrons is discussed and studied in detail, which relates to the technique of principal component analysis. The latter is performed with respect to a correlation matrix computed from the example inputs and their target outputs. Typical properties of the training procedure are investigated by means of a statistical physics analysis in models of learning regression and classification tasks. We demonstrate that the procedure requires by far fewer examples for good generalization than traditional online training. For networks with a large number of hidden units we derive the training prescription which achieves, within our model, the optimal generalization behavior.


©2005 The American Physical Society

URL: http://link.aps.org/doi/10.1103/PhysRevE.72.026117
DOI: 10.1103/PhysRevE.72.026117
PACS: 84.35.+i, 89.90.+n, 64.60.−i, 87.19.La

[ Abstract  |  Previous article  |  Next article  |  Issue 2 ]