Shisha: Online scheduling of CNN pipelines on heterogeneous architectures
Pirah Noor Soomro, Mustafa Abduljabbar, Jeronimo Castrillon, Miquel, Peric\`as

TL;DR
Shisha is an online scheduling method for CNN pipelines on heterogeneous chiplet architectures, optimizing workload distribution quickly and effectively, significantly outperforming traditional exploration algorithms in convergence time.
Contribution
Shisha introduces a fast online scheduling approach for CNNs on chiplet architectures, improving exploration speed and solution quality over existing heuristics.
Findings
Shisha reduces exploration time by approximately 35 times.
Shisha produces better scheduling solutions than other heuristic algorithms.
The approach effectively handles heterogeneity in compute and memory bandwidth.
Abstract
Chiplets have become a common methodology in modern chip design. Chiplets improve yield and enable heterogeneity at the level of cores, memory subsystem and the interconnect. Convolutional Neural Networks (CNNs) have high computational, bandwidth and memory capacity requirements owing to the increasingly large amount of weights. Thus to exploit chiplet-based architectures, CNNs must be optimized in terms of scheduling and workload distribution among computing resources. We propose Shisha, an online approach to generate and schedule parallel CNN pipelines on chiplet architectures. Shisha targets heterogeneity in compute performance and memory bandwidth and tunes the pipeline schedule through a fast online exploration technique. We compare Shisha with Simulated Annealing, Hill Climbing and Pipe-Search. On average, the convergence time is improved by ~35x in Shisha compared to other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Neural Network Applications · Advanced Memory and Neural Computing
