PCIE_Interaction Solution for Ego4D Social Interaction Challenge

Kanokphan Lertniphonphan; Feng Chen; Junda Xu; Fengbu Lan; Jun Xie; Tao Zhang; Zhepeng Wang

arXiv:2505.24404·cs.CV·June 2, 2025

TL;DR

This paper introduces the PCIE_Interaction solution for the Ego4D Social Interaction Challenge, combining face quality enhancement, ensemble methods, and audio-visual fusion to improve social interaction detection accuracy.

Contribution

It presents a novel approach that fuses visual and audio cues with face quality assessment for social interaction detection in egocentric videos.

Findings

01 Achieved 0.81 mAP on the LAM task

02 Achieved 0.71 mAP on the TTM task

03 Fused audio and visual cues for the TTM task, weighted by a visual quality score

Abstract

This report presents our team's PCIE_Interaction solution for the Ego4D Social Interaction Challenge at CVPR 2025, addressing both the Looking At Me (LAM) and Talking To Me (TTM) tasks. The challenge requires accurate detection of social interactions between subjects and the camera wearer, with LAM relying exclusively on face crop sequences and TTM combining speaker face crops with synchronized audio segments. In the LAM track, we employ face quality enhancement and ensemble methods. For the TTM task, we extend visual interaction analysis by fusing audio and visual cues, weighted by a visual quality score. Our approach achieved 0.81 and 0.71 mean average precision (mAP) on the LAM and TTM challenge leaderboards, respectively. Code is available at https://github.com/KanokphanL/PCIE_Ego4D_Social_Interaction.
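The abstract does not spell out how the ensemble or the quality-weighted fusion is computed, so the following is only a minimal sketch under stated assumptions: the ensemble is taken to be a plain average of per-frame scores, and the visual quality score is taken to be a value in [0, 1] used as a convex weight between the visual and audio branches. The function names `ensemble_average` and `quality_weighted_fusion`, and all numbers, are hypothetical and are not taken from the authors' repository.

```python
import numpy as np


def ensemble_average(model_scores):
    """Average per-frame scores across models (assumed ensemble rule)."""
    return np.mean(np.asarray(model_scores, dtype=float), axis=0)


def quality_weighted_fusion(visual_score, audio_score, quality):
    """Blend visual and audio scores per frame.

    `quality` is an assumed face-quality score in [0, 1]: a sharp face
    crop trusts the visual branch, a poor crop falls back to audio.
    """
    q = np.clip(np.asarray(quality, dtype=float), 0.0, 1.0)
    v = np.asarray(visual_score, dtype=float)
    a = np.asarray(audio_score, dtype=float)
    return q * v + (1.0 - q) * a


# Illustrative values only: two visual models, three frames.
visual = ensemble_average([[0.9, 0.2, 0.6], [0.7, 0.4, 0.6]])
audio = np.array([0.5, 0.8, 0.1])
quality = np.array([1.0, 0.0, 0.5])
fused = quality_weighted_fusion(visual, audio, quality)
```

With a quality of 1.0 the fused score equals the visual score, and with a quality of 0.0 it equals the audio score. The actual solution may use a learned or nonlinear weighting instead.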

Code & Models

Repositories

kanokphanl/pcie_ego4d_social_interaction
none · Official

Taxonomy

Topics: Impact of Technology on Adolescents · Technology Use by Older Adults · Multimedia Communication and Technology