MiroFlow is the open-source agent framework that maximizes any model's agent performance, and proves it across 9+ benchmarks with reproducible results.
Plug in GPT-5, Claude, MiroThinker, Kimi, DeepSeek, or any OpenAI-compatible model. Same tools. Same environment. Better results.
- [2026-03]: MiroFlow 1.7 + MiroThinker 1.7: Major release with Web Application interface (FastAPI + React), comprehensive verifier system for benchmark evaluation, and expanded LLM support including Kimi K2.5 and GPT-5.
Previous Updates
- [2025-09-15]: MiroFlow v0.3: Enhanced codebase architecture and significantly improved benchmark performance, boosting GPT-5's prediction accuracy for future events by 11%. MiroFlow now ranks #1 in the future prediction benchmark. See FutureX.
- [2025-08-27]: MiroFlow v0.2: Achieves state-of-the-art performance across multiple agentic benchmarks, including HLE (27.2%), HLE-Text-Only (29.5%), BrowseComp-EN (33.2%), BrowseComp-ZH (47.1%), and xBench-DeepSearch (72.0%).
- [2025-08-26]: Released GAIA Validation Trace (73.94% pass@1) and Gradio Demo for local deployment.
- [2025-08-08]: MiroFlow v0.1: Complete open-source release of the research agent framework.
- Model-Agnostic Performance: Plug in any LLM (GPT-5, Claude, MiroThinker, Kimi K2.5, DeepSeek) and get better agent performance through smart rollback, iterative reasoning, and optimized tool orchestration.
- Comprehensive Benchmarking: Supports 9+ benchmarks including FutureX, GAIA, HLE, xBench-DeepSearch, BrowseComp, and more.
- One-Line Model Switching: Change provider_class and model_name in YAML. Same tools, same prompts, same environment.
- Standardized Evaluation: Fair model comparison with identical infrastructure. The framework is the constant; the model is the variable.
- Automated Multi-Run Evaluation: Parallel runs with statistical aggregation (mean, std dev, min/max). Every result reproducible from config to score.
- Skill System: Define agent skills via SKILL.md, with no code changes needed.
- Agent Graph: Compose multi-agent workflows with hierarchical graphs.
- Web Application: FastAPI + React interface out of the box.
- Plugin Architecture: Use the @register decorator to extend the framework without touching core code.
- Zero-Code Prompts: YAML + Jinja2 templates.
- Cost-Effective: Single RTX 4090 with open-source MiroThinker.
Benchmark results will be updated after comprehensive testing with v1.7. See the full Model Comparison for details.
Follow our detailed guides to reproduce any result in our Benchmarks Documentation.
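When comparing your own multi-run results against reported numbers, the aggregation listed under Automated Multi-Run Evaluation (mean, std dev, min/max) can be reproduced by hand. The sketch below assumes each run yields a single accuracy score and uses the sample standard deviation; `aggregate_runs` is a hypothetical helper, not part of MiroFlow, and the framework may compute its statistics differently.

```python
import statistics
from typing import Dict, Sequence


def aggregate_runs(scores: Sequence[float]) -> Dict[str, float]:
    """Summarize per-run scores with mean, sample std dev, min, and max."""
    if not scores:
        raise ValueError("at least one run score is required")
    return {
        "runs": len(scores),
        "mean": statistics.fmean(scores),
        # Sample standard deviation is undefined for one run; report 0.0.
        "std": statistics.stdev(scores) if len(scores) > 1 else 0.0,
        "min": min(scores),
        "max": max(scores),
    }


# Illustrative scores only; these are not benchmark results.
summary = aggregate_runs([0.70, 0.74, 0.72])
print(summary)
```

Reporting the spread alongside the mean makes it easier to judge whether a difference between two models exceeds run-to-run noise.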
# 1. Clone and setup
git clone https://github.com/MiroMindAI/miroflow && cd miroflow
uv sync
# 2. Configure API keys (only OPENAI_API_KEY is required for this example)
cp .env.template .env
# Edit .env and set OPENAI_API_KEY (used by GPT-5 in the default quickstart config)
# 3. Run your first task
bash scripts/test_single_task.sh \
--config config/agent_quickstart.yaml \
--task-question "What is the first country listed in the XLSX file that have names starting with Co?" \
--file-path data/FSI-2023-DOWNLOAD.xlsx

Expected output: \boxed{Congo Democratic Republic}
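The expected answer above is wrapped in \boxed{...}. If you capture the command's output in a script, a small helper can pull the final answer out of the text. This is a minimal sketch that assumes the answer appears in the last \boxed{...} of the output; `extract_boxed` is a hypothetical name and is not MiroFlow's verifier.

```python
from typing import Optional


def extract_boxed(text: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in `text`, or None."""
    marker = "\\boxed{"
    start = text.rfind(marker)
    if start == -1:
        return None
    depth = 1  # we are inside the opening brace of \boxed{
    chars = []
    for ch in text[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close brace found; nested braces are preserved.
                return "".join(chars).strip()
        chars.append(ch)
    return None  # braces never balanced


answer = extract_boxed("Final answer: \\boxed{Congo Democratic Republic}")
print(answer)
```

Counting brace depth rather than using a simple regular expression keeps nested LaTeX such as \boxed{\frac{1}{2}} intact.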
Switch models in one line: same tools, same prompts, different LLM.
# GPT-5
llm:
  provider_class: GPT5OpenAIClient
  model_name: gpt-5

# Claude 3.7 Sonnet
llm:
  provider_class: ClaudeAnthropicClient
  model_name: claude-3-7-sonnet-20250219

# MiroThinker (open-source, self-hosted)
llm:
  provider_class: MiroThinkerSGLangClient
  model_name: mirothinker-v1.5

See full documentation for web app setup, more examples, and configuration options.
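The llm blocks above differ only in provider_class and model_name. To generate per-model configs programmatically for a comparison, one option is to copy a base config and override just those two keys. The sketch below works on plain dictionaries; `with_llm` and the `tools` entries are hypothetical and do not reflect MiroFlow's actual config schema.

```python
import copy
from typing import Any, Dict


def with_llm(base: Dict[str, Any], provider_class: str, model_name: str) -> Dict[str, Any]:
    """Return a deep copy of `base` with only the llm provider and model changed."""
    cfg = copy.deepcopy(base)  # never mutate the shared base config
    llm = dict(cfg.get("llm", {}))
    llm.update({"provider_class": provider_class, "model_name": model_name})
    cfg["llm"] = llm
    return cfg


base = {
    "llm": {"provider_class": "GPT5OpenAIClient", "model_name": "gpt-5"},
    "tools": ["search", "code_sandbox"],  # hypothetical tool names
}

claude = with_llm(base, "ClaudeAnthropicClient", "claude-3-7-sonnet-20250219")
print(claude["llm"], claude["tools"] == base["tools"])
```

Everything outside the llm block stays identical across variants, which matches the idea that the framework is the constant and the model is the variable.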
If you find our work helpful, please consider citing:
MiroThinker (Model & Method)
@article{miromind2025mirothinker,
title={MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling},
author={MiroMind Team and Bai, Song and Bing, Lidong and Chen, Carson and Chen, Guanzheng and Chen, Yuntao and Chen, Zhe and Chen, Ziyi and Dong, Xuan and others},
journal={arXiv preprint arXiv:2511.11793},
year={2025}
}

MiroFlow (Framework)
@article{miromind2026miroflow,
title={MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks},
author={Su, Shiqian and Xing, Sen and Dong, Xuan and Zhong, Muyan and Wang, Bin and Zhu, Xizhou and Chen, Yuntao and Wang, Wenhai and Deng, Yue and Zhu, Pengxiang and others},
journal={arXiv preprint arXiv:2602.22808},
year={2026}
}

