A framework for large-scale distributed AI search across disconnected heterogeneous infrastructures
Lars Kotthoff, Tom Kelsey, Martin McCaffery

TL;DR
This paper introduces a flexible, robust framework for large-scale distributed AI search across heterogeneous, disconnected infrastructures, enabling scalable solutions to complex computational problems without requiring dedicated or homogeneous hardware.
Contribution
The framework uniquely handles disconnected, heterogeneous systems, ensuring robustness and minimal effort loss, and demonstrates scalability to previously infeasible computational problems.
Findings
Successfully solved a challenging open problem in computational mathematics.
Framework scales efficiently to large, complex problems.
Ensures robustness and minimal effort loss even after infrastructure failures.
Abstract
We present a framework for a large-scale distributed eScience Artificial Intelligence search. Our approach is generic and can be used for many different problems. Unlike many other approaches, we do not require dedicated machines, homogeneous infrastructure or the ability to communicate between nodes. We give special consideration to the robustness of the framework, minimising the loss of effort even after total loss of infrastructure, and allowing easy verification of every step of the distribution process. In contrast to most eScience applications, the input data and specification of the problem is very small, being easily given in a paragraph of text. The unique challenges our framework tackles are related to the combinatorial explosion of the space that contains the possible solutions and the robustness of long-running computations. Not only is the time required to finish the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Data Management and Algorithms · Scientific Computing and Data Management
