Enhancing Public Speaking Skills in Engineering Students Through AI
Amol Harsh, Brainerd Prince, Siddharth Siddharth, Deepan Raj Prabakar Muthirayan, Kabir S Bhalla, Esraaj Sarkar Gupta, Siddharth Sahu

TL;DR
This paper presents an AI-driven multi-modal assessment system that provides personalized feedback on public speaking skills in engineering students by analyzing speech, facial expressions, gestures, and coherence, aiming to improve communication training.
Contribution
The study develops a novel multi-modal AI model that fuses verbal and non-verbal cues to deliver scalable, personalized feedback for public speaking training, surpassing previous separate assessment systems.
Findings
AI feedback moderately aligns with expert evaluations
Gemini Pro outperforms other LLMs in assessment accuracy
The system enables repeated, scalable practice for students
Abstract
This research-to-practice full paper was inspired by the persistent challenge in effective communication among engineering students. Public speaking is a necessary skill for future engineers as they have to communicate technical knowledge with diverse stakeholders. While universities offer courses or workshops, they are unable to offer sustained and personalized training to students. Providing comprehensive feedback on both verbal and non-verbal aspects of public speaking is time-intensive, making consistent and individualized assessment impractical. This study integrates research on verbal and non-verbal cues in public speaking to develop an AI-driven assessment model for engineering students. Our approach combines speech analysis, computer vision, and sentiment detection into a multi-modal AI system that provides assessment and feedback. The model evaluates (1) verbal communication…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Communication in Education and Healthcare · Intelligent Tutoring Systems and Adaptive Learning
