SAG: Style-Aligned Article Generation via Model Collaboration

Chenning Xu; Fangxun Shu; Dian Jin; Jinghao Wei; Hao Jiang

arXiv:2410.03137·cs.CL·October 7, 2024

SAG: Style-Aligned Article Generation via Model Collaboration

Chenning Xu, Fangxun Shu, Dian Jin, Jinghao Wei, Hao Jiang

PDF

Open Access

TL;DR

This paper introduces a collaborative training framework combining large and small language models to generate styled articles, achieving state-of-the-art results and improved style consistency while reducing hallucinations.

Contribution

It proposes a novel method that leverages LLMs and SLMs together, including a self-improvement technique and a new benchmark for style-aligned article generation.

Findings

01

Achieves 0.78 higher ROUGE-L score than GPT-4.

02

Improves BLEU-4 score by 0.55 over baseline.

03

Maintains low hallucination rate.

Abstract

Large language models (LLMs) have increased the demand for personalized and stylish content generation. However, closed-source models like GPT-4 present limitations in optimization opportunities, while the substantial training costs and inflexibility of open-source alternatives, such as Qwen-72B, pose considerable challenges. Conversely, small language models (SLMs) struggle with understanding complex instructions and transferring learned capabilities to new contexts, often exhibiting more pronounced limitations. In this paper, we present a novel collaborative training framework that leverages the strengths of both LLMs and SLMs for style article generation, surpassing the performance of either model alone. We freeze the LLMs to harness their robust instruction-following capabilities and subsequently apply supervised fine-tuning on the SLM using style-specific data. Additionally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling

MethodsAttention Is All You Need · Dense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings