Exploration of Cryptocurrency Mining-Specific GPUs in AI Applications: A Case Study of CMP 170HX
Xing Kangwei

TL;DR
This paper demonstrates that modifying NVIDIA CMP 170HX GPUs by disabling specific instructions can significantly enhance their AI computational performance, offering a cost-effective and environmentally friendly hardware reuse strategy.
Contribution
It introduces a novel method of reusing mining GPUs for AI tasks by disabling certain instructions, validated through extensive benchmarking and hardware analysis.
Findings
FP32 performance exceeds 15 times original capability
Inference performance in large language models triples
Recycling GPUs reduces electronic waste and costs
Abstract
This study systematically tests a computational power reuse scheme proposed by the open source community disabling specific instruction sets (Fused Multiply Add instructions) through CUDA source code modifications on the NVIDIA CMP 170HX platform. Experimental results validate the effectiveness of this approach, partially restoring the GPU's computational capabilities in artificial intelligence (AI) tasks. Performance evaluations using open-source GPU benchmarks (OpenCL benchmark, mixbench) and AI benchmarks (LLAMA-benchmark) reveal that its FP32 floating-point performance exceeds 15 times the original capability, while inference performance for certain precision levels in large language models surpasses threefold improvements. Furthermore, based on hardware architecture analysis, this paper proposes theoretical conjectures for further improving computational utilization through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Recycling and Waste Management Techniques · Green IT and Sustainability
