Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs (toward a better DeepSeek R1)
- February 3, 2025
Yue Wang, Qiuzhi Liu, Jiahao Xu, Tian Liang, Xingyu Chen, Zhiwei He, Linfeng Song, Dian Yu, Juntao Li, Zhuosheng Zhang,…
Paper Without Code - A working code is worth a thousand words, so let LLM do it for you! “Paper without code” is an AI project to provide prototype implementations for classic and trending AI papers by using LLM code generation.
Yue Wang, Qiuzhi Liu, Jiahao Xu, Tian Liang, Xingyu Chen, Zhiwei He, Linfeng Song, Dian Yu, Juntao Li, Zhuosheng Zhang,…
Saleh Momeni, Sahisnu Mazumder, Zixuan Ke, Bing Liu The paper, “In-context Continual Learning Assisted by an External Continual Learner,” introduces…
Tom Schaul The paper “Boundless Socratic Learning with Language Games” by Tom Schaul offers a fascinating theoretical exploration of recursive…
Huanjin Yao, Jiaxing Huang, Wenhao Wu, Jingyi Zhang, Yibo Wang, Shunyu Liu, Yingjie Wang, Yuxin Song, Haocheng Feng, Li Shen,…
Tianhao Wu, Janice Lan, Weizhe Yuan, Jiantao Jiao, Jason Weston, Sainbayar Sukhbaatar The research paper “THINKING LLMs: GENERAL INSTRUCTION FOLLOWING…
Liyi Chen, Panrong Tong, Zhongming Jin, Ying Sun, Jieping Ye, Hui Xiong The research paper “Plan-on-Graph: Self-Correcting Adaptive Planning of…
Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths The paper “Cognitive Architectures for Language Agents” by Theodore R.…
Alberto Alfarano, François Charton, Amaury Hayat The paper “Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers”…
Konstantina Christakopoulou, Shibl Mourad, Maja Matarić The paper “Agents Thinking Fast and Slow: A Talker-Reasoner Architecture” from Deepmind presents an…
Xunjian Yin, Xinyi Wang, Liangming Pan, Xiaojun Wan, William Yang Wang The paper “GÖDEL AGENT: A SELF-REFERENTIAL FRAMEWORK FOR AGENTS…