Leveraging Prompts in LLMs to Overcome Imbalances in Complex Educational Text Data
Jeanne McClure, Machi Shimmei, Noboru Matsuda, Shiyan Jiang

TL;DR
This study demonstrates that using assertion-based prompts with Large Language Models significantly improves their ability to handle imbalanced and complex educational data, especially in understanding nuanced student responses.
Contribution
The paper introduces a novel assertion-based prompt engineering approach for LLMs, enhancing their performance on complex educational datasets compared to traditional models.
Findings
LLMs with assertions outperform traditional ML models in cognitive engagement classification.
Performance improves by up to 32% in F1-score for minority classes.
Targeted assertions increase LLM performance by approximately 12%.
Abstract
In this paper, we explore the potential of Large Language Models (LLMs) with assertions to mitigate imbalances in educational datasets. Traditional models often fall short in such contexts, particularly due to the complexity and nuanced nature of the data. This issue is especially prominent in the education sector, where cognitive engagement levels among students show significant variation in their open responses. To test our hypothesis, we utilized an existing technology for assertion-based prompt engineering through an 'Iterative - ICL PE Design Process' comparing traditional Machine Learning (ML) models against LLMs augmented with assertions (N=135). Further, we conduct a sensitivity analysis on a subset (n=27), examining the variance in model performance concerning classification metrics and cognitive engagement levels in each iteration. Our findings reveal that LLMs with assertions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
