Reasoning models lack atomic thought โ๏ธ
Unlike humans using independent units, they store full histories๐ค
Introducing Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1 !
The best part? It's plugs in for ANY framework ๐
1/5
Jiayi Zhang
353 posts
Ph.D. student @HKUSTGuangZhou, Researcher @MetaGPT_, Cofounder of OpenManus, previously at RUC, Lenovo Research AI Lab, Zhipu AI.
- Replying to @didiforxWant to try AOT? ๐ฆ Code: github.com/qixucen/atom ๐ Paper: arxiv.org/abs/2502.12018 Huge thanks to main author @SteamedBun18755, co-authors @ZhaoyangYu22356, @AlexanderWu0, and the amazing @metagpt community for their support! โค๏ธ Follow us for more exciting updates! 5/5
- No fortress, purely open ground. Manus ๐. We open-sourced its core feature in 2 hours after dinner. Check it out ๐: github.com/mannaandpoem/Oโฆ 1/4
00:00
00:00 - Replying to @didiforx and @SteamedBun18755How does AOT work? โ๏ธ For each reasoning step: 1. Decompose the question into DAG 2. Contract the subquestions into a NEW simpler question 3. Iterate until reaching an atomic question Just like a Markov process: each new question depends only on the previous state! ๐ฏ 3/5
- Replying to @didiforx and @SteamedBun18755Why do we need atomic thoughts? ๐ค ALL current reasoning approaches, both models (o3, R1...) and frameworks (CoT, ToT, GoT...) suffer from the same issue: keeping full reasoning histories. This leads to: Computationally expensive ๐ฐ Prone to interference ๐ซ 2/5
- Replying to @didiforxThe power of a plug-in ๐ AOT works with any approach: o3, R1, CoT, ToT, GoT, Self-Consistency, or Agentic Workflow. It simplifies inputs while preserving solution quality ๐ก Try integrating AOT into your favorite approach! 4/5
- Replying to @doteyๅฎ็่ๅธ๏ผๆไปฌๅผๆบไบไธ็Openmanus๏ผ่ฝๅฎ็ฐไธ้จๅๅ่ฝใ github.com/mannaandpoem/Oโฆ
- It's actually a pity that we got no enough time to maintain OpenManus during the past 3 months. But the better news is that we will build a formal open-source community for OpenManus at the end of this month.
- Replying to @Comed_Ai_n and @SteamedBun18755we have open sourced it in
- No labels? รvO help you! Excited to share our new paper: Self-Supervised Prompt Optimization (arxiv.org/abs/2502.06855) ๐ฅ Key features: รvO: Output vs Output - no labels/human feedback needed! 99% cost reduction ($0.15) SOTA performance with just 3 examples 1/5
- Excited to share my first ICLR Oral Paper! Special thanks to @isaac_jinyu @ZhaoyangYu22356 @SteamedBun18755 @AlexanderWu0๐ AFLOW accepted for #ICLR2025 Oral! ๐งEasy to use for closed & open tasks! ๐Low inference costs with DeepSeek vs. larger models! โจPromising research: Automatic Agentic Workflow/Systems! Paper: openreview.net/pdf?id=z5uVAKwโฆ Code: github.com/geekan/MetaGPTโฆ
- Replying to @LQGWarpSpeed and @SteamedBun18755Maybe a repost will speed up the process hhh ๐
- Important Announcement โ We have noticed that certain accounts are impersonating our team and claiming to launch a token called "$OpenManus". We hereby declare: 1. OpenManus is a legitimate project developed by the MetaGPT team 2. We have NEVER issued any cryptocurrencies
- Text-to-SQL woes? Reasoning models stumble in zero-shot tasks ๐ Enter Alpha-SQL โ our breakthrough boosts 7B LLMs by 15-20%, topping GPT-4o SOTA and even reasoning models on BIRD! ๐ Test Time Scaling still shines. How we nailed it ๐: 1/5





















