A CHAID Based Performance Prediction Model in Educational Data Mining
M. Ramaswami, R. Bhaskaran

TL;DR
This paper develops a CHAID-based predictive model for student performance in Indian higher secondary education, identifying key factors influencing academic success and outperforming other models in accuracy.
Contribution
It introduces a novel CHAID-based model for predicting student performance using real educational data from India, with an emphasis on identifying influential factors.
Findings
CHAID model achieved satisfactory accuracy
Key factors influencing performance identified
Model outperformed other predictive models
Abstract
The performance in higher secondary school education in India is a turning point in the academic lives of all students. As this academic performance is influenced by many factors, it is essential to develop predictive data mining model for students' performance so as to identify the slow learners and study the influence of the dominant factors on their academic performance. In the present investigation, a survey cum experimental methodology was adopted to generate a database and it was constructed from a primary and a secondary source. While the primary data was collected from the regular students, the secondary data was gathered from the school and office of the Chief Educational Officer (CEO). A total of 1000 datasets of the year 2006 from five different schools in three different districts of Tamilnadu were collected. The raw data was preprocessed in terms of filling up missing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Imbalanced Data Classification Techniques · Educational Technology and Assessment
