Instruction Embedding: Latent Representations of Instructions Towards Task Identification
Yiwei Li, Jiayi Shi, Shaoxiong Feng, Peiwen Yuan, Xinglin Wang, Boyuan, Pan, Heda Wang, Yao Hu, Kan Li

TL;DR
This paper introduces instruction embeddings and a benchmark for their evaluation, demonstrating their effectiveness in task identification and downstream instruction-related tasks for improving LLM alignment.
Contribution
It presents a new instruction embedding concept, constructs the IEB benchmark, and proposes the PIE method to enhance task-focused representations.
Findings
PIE outperforms other embeddings in task category identification
Instruction embeddings improve downstream instruction-related tasks
The approach enhances LLM alignment with human instructions
Abstract
Instruction data is crucial for improving the capability of Large Language Models (LLMs) to align with human-level performance. Recent research LIMA demonstrates that alignment is essentially a process where the model adapts instructions' interaction style or format to solve various tasks, leveraging pre-trained knowledge and skills. Therefore, for instructional data, the most important aspect is the task it represents, rather than the specific semantics and knowledge information. The latent representations of instructions play roles for some instruction-related tasks like data selection and demonstrations retrieval. However, they are always derived from text embeddings, encompass overall semantic information that influences the representation of task categories. In this work, we introduce a new concept, instruction embedding, and construct Instruction Embedding Benchmark (IEB) for its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Innovative Teaching and Learning Methods
MethodsSoftmax · Attention Is All You Need · ALIGN
