A Loop-Based Methodology for Reducing Computational Redundancy in Workload Sets
Elie M. Shaccour, Mohammad M. Mansour

TL;DR
This paper introduces a loop-centric methodology that reduces workload set size by half through analyzing core loops, constructing signature vectors, and synthesizing representative micro-benchmarks, thereby decreasing simulation time for processor performance profiling.
Contribution
It presents a novel framework for extracting and analyzing core loops, and a heuristic for reducing workload redundancy, improving simulation efficiency for processor design.
Findings
Workload set size reduced by 50%
Main workload characteristics preserved
Automated tool for methodology implementation
Abstract
The design of general purpose processors relies heavily on a workload gathering step in which representative programs are collected from various application domains. Processor performance, when running the workload set, is profiled using simulators that model the targeted processor architecture. However, simulating the entire workload set is prohibitively time-consuming, which precludes considering a large number of programs. To reduce simulation time, several techniques in the literature have exploited the internal program repetitiveness to extract and execute only representative code segments. Existing so- lutions are based on reducing cross-program computational redundancy or on eliminating internal-program redundancy to decrease execution time. In this work, we propose an orthogonal and complementary loop- centric methodology that targets loop-dominant programs by exploiting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
