Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Yu Zhao, Huifeng Yin, Bo Zeng, Hao Wang, Tianqi Shi, Chenyang Lyu,, Longyue Wang, Weihua Luo, Kaifu Zhang

TL;DR
Marco-o1 advances open reasoning models capable of handling open-ended, complex real-world problems by integrating Chain-of-Thought fine-tuning, Monte Carlo Tree Search, and reflection mechanisms, extending reasoning beyond standard-answer domains.
Contribution
It introduces Marco-o1, a reasoning model that combines multiple strategies to generalize effectively to open-ended, less structured domains.
Findings
Effective in open-ended reasoning tasks
Outperforms baseline models in complex problem-solving
Demonstrates adaptability to real-world scenarios
Abstract
Currently OpenAI o1 sparks a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding -- which are well-suited for reinforcement learning (RL) -- but also places greater emphasis on open-ended resolutions. We aim to address the question: ''Can the o1 model effectively generalize to broader domains where clear standards are absent and rewards are challenging to quantify?'' Marco-o1 is powered by Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and innovative reasoning strategies -- optimized for complex real-world problem-solving tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AIDC-AI/Marco-o1model· 388 dl· ♡ 712388 dl♡ 712
- 🤗QuantFactory/Marco-o1-GGUFmodel· 85 dl· ♡ 985 dl♡ 9
- 🤗cortexso/marco-o1model· 74 dl74 dl
- 🤗UnstableLlama/Marco-o1-exl2model· 2 dl2 dl
- 🤗c01zaut/Marco-o1-rk3588-1.1.2model· 1 dl1 dl
- 🤗c01zaut/Marco-o1-rk3588-1.1.4model· 1 dl1 dl
- 🤗doshisha-mil/llm-jp-13b-OpenMathInstruct-2-v1model· 1 dl1 dl
- 🤗doshisha-mil/llm-jp-13b-OpenMathInstruct_2_v1.1model
- 🤗RichardErkhov/AIDC-AI_-_Marco-o1-4bitsmodel· 6 dl6 dl
- 🤗RichardErkhov/AIDC-AI_-_Marco-o1-8bitsmodel· 8 dl8 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Business Process Modeling and Analysis · Multi-Agent Systems and Negotiation
