Multi-Stage Evolutionary Model Merging with Meta Data Driven Curriculum Learning for Sentiment-Specialized Large Language Modeling
Keito Inoshita, Xiaokang Zhou, Akira Kawai

TL;DR
This paper introduces MEM-MCL, a hybrid learning approach that merges expert models via evolutionary algorithms and employs curriculum learning based on task difficulty to improve sentiment analysis in large language models.
Contribution
The study presents a novel multi-stage evolutionary merging method combined with meta-data driven curriculum learning for sentiment-specific large language modeling.
Findings
MEM-MCL outperforms conventional LLMs in sentiment analysis tasks.
The approach improves accuracy across multiple subtasks.
Curriculum learning enhances knowledge extraction from LLMs.
Abstract
The emergence of large language models (LLMs) has significantly transformed natural language processing (NLP), enabling more generalized models to perform various tasks with minimal training. However, traditional sentiment analysis methods, which focus on individual tasks such as sentiment classification or aspect-based analysis, are not practical for real-world applications that usually require handling multiple tasks. While offering flexibility, LLMs in sentiment-specific tasks often fall short of the required accuracy. Techniques like fine-tuning and evolutionary model merging help integrate models into a unified framework, which can improve the learning performance while reducing computational costs. The use of task meta-data and curriculum learning to optimize learning processes remains underexplored, while sentiment analysis is a critical task in NLP that requires high accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Computational and Text Analysis Methods
