The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions
Alejandro H. Artiles, Martin Weiss, Levin Brinkmann, Iyad Rahwan, Bernhard Sch\"olkopf, Christopher Pal, Hugo Larochelle, Anirudh Goyal, Nasim Rahaman

TL;DR
This paper introduces a framework to generate scientifically plausible research directions that are unlikely to be proposed by current communities, expanding exploration into novel yet coherent scientific ideas.
Contribution
It presents a method to sample 'alien' research directions by decomposing literature into idea atoms and modeling coherence and community availability, enabling broader and novel scientific exploration.
Findings
Sampler explores 3.5-7 times broader idea space than baselines.
Generated ideas match or surpass baseline quality in evaluations.
Framework separates scientific plausibility from community constraints.
Abstract
Scientific discovery is constrained not only by what is true, but by what is cognitively available to the researchers currently exploring a field. Many directions are coherent in light of the literature yet unlikely to be proposed because no existing community occupies the right combination of concepts, methods, and intuitions. Modern language models inherit this bias, recombining high-density regions of the literature when prompted for novel ideas. We introduce a framework that targets the complementary region, which we call the alien space of science, where directions are plausible under the structure of existing knowledge but unlikely under the distribution of existing researchers. Our method first decomposes papers into granular conceptual units and clusters them into a shared vocabulary of idea atoms. It then learns two complementary models over this vocabulary. A coherence model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Text Readability and Simplification
