Notes on a New Philosophy of Empirical Science
Daniel Burfoot

TL;DR
This book proposes a philosophy of empirical science based on lossless data compression, suggesting theories are scientific if they can compress data effectively, thus unifying principles across fields like computer vision, linguistics, and machine learning.
Contribution
It introduces a novel methodology linking data compression to scientific theories, offering a new demarcation criterion and a way to reformulate various fields as empirical sciences.
Findings
Compression-based theories can effectively model natural data.
Large datasets justify complex models without overfitting.
The approach offers a unified framework for evaluating scientific theories.
Abstract
This book presents a methodology and philosophy of empirical science based on large scale lossless data compression. In this view a theory is scientific if it can be used to build a data compression program, and it is valuable if it can compress a standard benchmark database to a small size, taking into account the length of the compressor itself. This methodology therefore includes an Occam principle as well as a solution to the problem of demarcation. Because of the fundamental difficulty of lossless compression, this type of research must be empirical in nature: compression can only be achieved by discovering and characterizing empirical regularities in the data. Because of this, the philosophy provides a way to reformulate fields such as computer vision and computational linguistics as empirical sciences: the former by attempting to compress databases of natural images, the latter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Time Series Analysis and Forecasting · Computability, Logic, AI Algorithms
