Text Similarity from Image Contents using Statistical and Semantic Analysis Techniques
Sagar Kulkarni, Sharvari Govilkar, Dhiraj Amin

TL;DR
This paper presents a system for detecting plagiarism in image contents like figures and graphs by combining statistical and semantic analysis techniques, improving accuracy over existing methods.
Contribution
It introduces a novel approach that integrates statistical algorithms with semantic models such as LSA, BERT, and WordNet for effective image content plagiarism detection.
Findings
Semantic algorithms outperform statistical methods in accuracy
Combining multiple techniques enhances detection efficiency
The system effectively detects plagiarized image content
Abstract
Plagiarism detection is one of the most researched areas among the Natural Language Processing(NLP) community. A good plagiarism detection covers all the NLP methods including semantics, named entities, paraphrases etc. and produces detailed plagiarism reports. Detection of Cross Lingual Plagiarism requires deep knowledge of various advanced methods and algorithms to perform effective text similarity checking. Nowadays the plagiarists are also advancing themselves from hiding the identity from being catch in such offense. The plagiarists are bypassed from being detected with techniques like paraphrasing, synonym replacement, mismatching citations, translating one language to another. Image Content Plagiarism Detection (ICPD) has gained importance, utilizing advanced image content processing to identify instances of plagiarism to ensure the integrity of image content. The issue of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · WordPiece · Residual Connection · Linear Layer · Softmax · Dense Connections
