Discovering Solutions with Low Kolmogorov Complexity and High Generalization Capability

Jürgen Schmidhuber*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

22 Scopus citations

Abstract

Many machine learning algorithms aim at finding "simple" rules to explain training data. The expectation is: the "simpler" the rules, the better the generalization on test data (-_ Occam's razor). Most practical implementations, however, use measures for "simplicity" that lack the power, universality and elegance of those based on Kolmogorov complexity and SolomonofT's algorithmic probability. Likewise, most previous approaches (especially those of the "Bayesian" kind) suffer from the problem of choosing appropriate priors. This paper addresses both issues. It first reviews some basic concepts of algorithmic complexity theory relevant to machine learning, and how the Solomonoff-Levin distribution (or universal prior) deals with the prior problem. The universal prior leads to a probabilistic method for finding "algorithmically simple" problem solutions with high generalization capability. The method is based on Levin complexity (a time-bounded extension of Kolmogorov complexity) and inspired by Levin's optimal universal search algorithm. With a given problem, solution candidates are computed by efficient "self-sizing" programs that influence their own runtime and storage size. The probabilistic search algorithm finds the "good" programs (the ones quickly computing algorithmically probable solutions fitting the training data). Experiments focus on the task of discovering "algorithmically simple" neural networks with low Kolmogorov complexity and high generalization capability. These experiments demonstrate that the method, at least with certain toy problems where it is computationally feasible, can lead to generalization results unmatchable by previous neural net algorithms.

Original languageEnglish (US)
Title of host publicationProceedings of the 12th International Conference on Machine Learning, ICML 1995
EditorsArmand Prieditis, Stuart Russell
PublisherMorgan Kaufmann Publishers, Inc.
Pages488-496
Number of pages9
ISBN (Electronic)1558603778, 9781558603776
StatePublished - 1995
Event12th International Conference on Machine Learning, ICML 1995 - Tahoe City, United States
Duration: Jul 9 1995Jul 12 1995

Publication series

NameProceedings of the 12th International Conference on Machine Learning, ICML 1995

Conference

Conference12th International Conference on Machine Learning, ICML 1995
Country/TerritoryUnited States
CityTahoe City
Period07/9/9507/12/95

Bibliographical note

Publisher Copyright:
© ICML 1995.All rights reserved

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Discovering Solutions with Low Kolmogorov Complexity and High Generalization Capability'. Together they form a unique fingerprint.

Cite this