Embedding with Large Language Models for Classification of HIPAA Safeguard Compliance Rules
Md Abdur Rahman, Md Abdul Barek, ABM Kamrul Islam Riad, Md Mostafizur, Rahman, Md Bajlur Rashid, Smita Ambedkar, Md Raihan Miaa, Fan Wu, Alfredo, Cuzzocrea, Sheikh Iqbal Ahamed

TL;DR
This paper demonstrates that using multilingual BERT embeddings significantly improves the classification accuracy of HIPAA safeguard compliance rules in mHealth app code patterns, aiding developers in secure application development.
Contribution
It introduces the application of multilingual BERT embeddings for classifying HIPAA rules, outperforming traditional Word2Vec and existing methods in accuracy.
Findings
Logistic Regression achieved 99.95% accuracy.
SVM and Random Forest also achieved over 99% accuracy.
The approach outperforms existing classification methods.
Abstract
Although software developers of mHealth apps are responsible for protecting patient data and adhering to strict privacy and security requirements, many of them lack awareness of HIPAA regulations and struggle to distinguish between HIPAA rules categories. Therefore, providing guidance of HIPAA rules patterns classification is essential for developing secured applications for Google Play Store. In this work, we identified the limitations of traditional Word2Vec embeddings in processing code patterns. To address this, we adopt multilingual BERT (Bidirectional Encoder Representations from Transformers) which offers contextualized embeddings to the attributes of dataset to overcome the issues. Therefore, we applied this BERT to our dataset for embedding code patterns and then uses these embedded code to various machine learning approaches. Our results demonstrate that the models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBorder Security and International Relations · Digital and Cyber Forensics
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Adam · Attention Dropout · Dropout · Weight Decay · Dense Connections · Logistic Regression · Layer Normalization · Residual Connection · Linear Warmup With Linear Decay
