A Hybrid Heuristic Framework for Resource-Efficient Querying of Scientific Experiments Data
Mayank Patel, Minal Bhise

TL;DR
This paper introduces RAW-HF, a resource-aware hybrid framework that significantly reduces query execution time and resource utilization for scientific datasets by optimizing resource usage and workload management.
Contribution
The paper presents a novel lightweight hybrid framework, RAW-HF, that improves resource efficiency and reduces query processing time for scientific data workloads.
Findings
Over 90% reduction in workload execution time for SDSS dataset.
26% reduction in CPU utilization and 25% in IO resource use.
Memory utilization improved by 33% compared to existing techniques.
Abstract
Scientific experiments and modern applications are generating large amounts of data every day. Most organizations utilize In-house servers or Cloud resources to manage application data and workload. The traditional database management system (DBMS) and HTAP systems spend significant time & resources to load the entire dataset into DBMS before starting query execution. On the other hand, in-situ engines may reparse required data multiple times, increasing resource utilization and data processing costs. Additionally, over or under-allocation of resources also increases application running costs. This paper proposes a lightweight Resource Availability &Workload aware Hybrid Framework (RAW-HF) to optimize querying raw data by utilizing existing finite resources efficiently. RAW-HF includes modules that help optimize the resources required to execute a given workload and maximize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Data Management and Algorithms · Advanced Database Systems and Queries
