Criminal Networks Analysis in Missing Data scenarios through Graph Distances
Annamaria Ficara, Lucia Cavallaro, Francesco Curreri, Giacomo Fiumara,, Pasquale De Meo, Ovidiu Bagdasar, Wei Song, Antonio Liotta

TL;DR
This study evaluates how incomplete data affects criminal network analysis by applying spectral and matrix distances to real networks, revealing that small data gaps can significantly distort network understanding.
Contribution
It introduces a method to quantify the impact of missing data on criminal networks using spectral and matrix distances, providing insights into data robustness.
Findings
Networks remain largely understandable with 10% edge removal.
Removing 2% of nodes causes significant network misinterpretation.
Spectral distances effectively measure data incompleteness impact.
Abstract
Data collected in criminal investigations may suffer from: (i) incompleteness, due to the covert nature of criminal organisations; (ii) incorrectness, caused by either unintentional data collection errors and intentional deception by criminals; (iii) inconsistency, when the same information is collected into law enforcement databases multiple times, or in different formats. In this paper we analyse nine real criminal networks of different nature (i.e., Mafia networks, criminal street gangs and terrorist organizations) in order to quantify the impact of incomplete data and to determine which network type is most affected by it. The networks are firstly pruned following two specific methods: (i) random edges removal, simulating the scenario in which the Law Enforcement Agencies (LEAs) fail to intercept some calls, or to spot sporadic meetings among suspects; (ii) nodes removal, that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
