Automatic deductive coding in discourse analysis: an application of large language models in learning analytics
Lishan Zhang, Han Wu, Xiaoshan Huang, Tengfei Duan, Hanxiang Du

TL;DR
This paper evaluates the effectiveness of large language models, especially GPT with prompt engineering, in automating deductive coding in discourse analysis, showing superior performance over traditional methods.
Contribution
It introduces a novel application of GPT and prompt engineering for automatic deductive coding, demonstrating improved accuracy with limited training data.
Findings
GPT with prompt engineering outperforms traditional and BERT-like methods
Large language models require fewer training samples for accurate coding
Detailed prompt structures enhance GPT's coding performance
Abstract
Deductive coding is a common discourse analysis method widely used by learning science and learning analytics researchers for understanding teaching and learning interactions. It often requires researchers to manually label all discourses to be analyzed according to a theoretically guided coding scheme, which is time-consuming and labor-intensive. The emergence of large language models such as GPT has opened a new avenue for automatic deductive coding to overcome the limitations of traditional deductive coding. To evaluate the usefulness of large language models in automatic deductive coding, we employed three different classification methods driven by different artificial intelligence technologies, including the traditional text classification method with text feature engineering, BERT-like pretrained language model and GPT-like pretrained large language model (LLM). We applied these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Residual Connection · Cosine Annealing · Byte Pair Encoding · Softmax · Dropout · Attention Dropout · Dense Connections
