Predicting 3D Motion from 2D Video for Behavior-Based VR Biometrics
Mingjun Li, Natasha Kholgade Banerjee, Sean Banerjee

TL;DR
This paper introduces a Transformer-based method to predict 3D motion of VR controllers from 2D body joint data captured by external cameras, enhancing user authentication accuracy in VR systems.
Contribution
It presents a novel approach that uses 2D joint data to predict 3D controller trajectories, addressing limitations of current on-device tracking for VR biometrics.
Findings
Achieves a minimum EER of 0.025 in authentication.
Reduces EER by up to 0.040 compared to prior methods.
Enhances 3D motion understanding for VR biometric security.
Abstract
Critical VR applications in domains such as healthcare, education, and finance that use traditional credentials, such as PIN, password, or multi-factor authentication, stand the chance of being compromised if a malicious person acquires the user credentials or if the user hands over their credentials to an ally. Recently, a number of approaches on user authentication have emerged that use motions of VR head-mounted displays (HMDs) and hand controllers during user interactions in VR to represent the user's behavior as a VR biometric signature. One of the fundamental limitations of behavior-based approaches is that current on-device tracking for HMDs and controllers lacks capability to perform tracking of full-body joint articulation, losing key signature data encapsulated by the user articulation. In this paper, we propose an approach that uses 2D body joints, namely shoulder, elbow,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
