Mockingbird: How does LLM perform in general machine learning tasks?

Haoyu Jia; Yoshiki Obinata; Kento Kawaharazuka; Kei Okada

arXiv:2508.04279·cs.LG·August 7, 2025

Mockingbird: How does LLM perform in general machine learning tasks?

Haoyu Jia, Yoshiki Obinata, Kento Kawaharazuka, Kei Okada

PDF

TL;DR

This paper introduces Mockingbird, a framework that adapts large language models to general machine learning tasks by instructing them to role-play and self-reflect, demonstrating promising but limited results compared to domain-specific feedback.

Contribution

The paper presents a novel framework, Mockingbird, for leveraging LLMs in general machine learning tasks through role-playing and self-reflection techniques.

Findings

01

LLMs can perform general machine learning tasks with acceptable results.

02

Self-reflection alone does not surpass domain-specific feedback.

03

Mockingbird demonstrates scalability across multiple tasks.

Abstract

Large language models (LLMs) are now being used with increasing frequency as chat bots, tasked with the summarizing information or generating text and code in accordance with user instructions. The rapid increase in reasoning capabilities and inference speed of LLMs has revealed their remarkable potential for applications extending beyond the domain of chat bots to general machine learning tasks. This work is conducted out of the curiosity about such potential. In this work, we propose a framework Mockingbird to adapt LLMs to general machine learning tasks and evaluate its performance and scalability on several general machine learning tasks. The core concept of this framework is instructing LLMs to role-play functions and reflect on its mistakes to improve itself. Our evaluation and analysis result shows that LLM-driven machine learning methods, such as Mockingbird, can achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.