RoboBallet Google DeepMind RoboBallet: Planning for multirobot reaching with graph neural networks and reinforcement learning ... Papers #deepmind #google #robotballet 4 weeks ago
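Shanghai Jiao Tong University's IPADS Lab open-sources MobiAgent Anyone can train their own Agent: Shanghai Jiao Tong University open-sources a full-stack on-device Agent toolchain that outperforms GPT-5 in real-world scenarios! September 10, 2025 Open your phone and let an AI Agent automatically handle chores such as ordering takeout, booking hotels, and shopping online; this is becoming... Papers #IPADS #MobiAgent #Shanghai Jiao Tong University 1 month ago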
Trends – Artificial Intelligence "AI Trends": the AI report from the "Queen of the Internet" (340 pages) May 30, 2025 Mary Meeker / Jay Simons / Daegwon Chae / Alexander Krey Part 1: individuals and... Papers 6 months ago
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Coll... Papers 6 months ago
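Illustrated LLM RLHF Series: PPO Principles and Source Code Walkthrough That Anyone Can Follow [by Zhihu] I wrote this article around this time last year. My main goal then was to help readers with no RL background grasp the code quickly and intuitively, so they could start training and modifying it. Because everything started from "intuition", many statements were imprecise, so recently I wrote... Papers 6 months ago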
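Preparing-for-the-Intelligence-Explosion Over the past few years, we have witnessed rapid advances in artificial intelligence: from writing poetry and answering questions to drawing and programming, from conversational chat to legal analysis, the pace of AI's growth is simply stunning... Papers 6 months ago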
Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications Papers 6 months ago
Explaining Deep Neural Networks Oana-Maria Camburu Linacre College University of Oxford A thesis sub... Papers 6 months ago
ORCA: An Open-Source, Reliable, Cost-Effective, Anthropomorphic Robotic Hand for Uninterrupted Dexterous Task Learning ORCA_An_Open-Source_Reliable_Cost-Effective_Anthropomorphic_R... Papers 6 months ago
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Hao Sun, Zile Qiao†, Jiayan Guo†, Xuanbo Fan, Yingyan Hou, Yong Jiang, Pengjun Xie, Yan Zhang†, Fei H... Papers 6 months ago