A Bytecode-based Approach for Smart Contract Classification
Chaochen Shi, Yong Xiang, Robin Ram Mohan Doss, Jiangshan Yu, Keshav, Sood, Longxiang Gao

TL;DR
This paper introduces a bytecode-based smart contract classification model that overcomes limitations of NLP methods, enabling effective, source-code independent classification with improved performance and resistance to adversarial attacks.
Contribution
The paper presents a novel bytecode-based classification approach using feature selection and ensemble learning, addressing source code availability and security vulnerabilities of NLP methods.
Findings
The model classifies smart contracts without source code.
It outperforms baseline models in accuracy.
It demonstrates robustness against adversarial attacks.
Abstract
With the development of blockchain technologies, the number of smart contracts deployed on blockchain platforms is growing exponentially, which makes it difficult for users to find desired services by manual screening. The automatic classification of smart contracts can provide blockchain users with keyword-based contract searching and helps to manage smart contracts effectively. Current research on smart contract classification focuses on Natural Language Processing (NLP) solutions which are based on contract source code. However, more than 94% of smart contracts are not open-source, so the application scenarios of NLP methods are very limited. Meanwhile, NLP models are vulnerable to adversarial attacks. This paper proposes a classification model based on features from contract bytecode instead of source code to solve these problems. We also use feature selection and ensemble learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Imbalanced Data Classification Techniques
MethodsFeature Selection
