2025-12-31
DeepSeek‑AI
mHC:流形约束超连接
mHC: Manifold-Constrained Hyper-Connections
Zhenda Xie, Yixuan Wei, Huanqi Cao 等
2025-12-20
Andrej Karpathy
2025 年 LLM 年度回顾
2025 LLM Year in Review
Andrej Karpathy
2025-12-19
Anthropic
Bloom:用于自动化行为评估的开源工具
Bloom: an open source tool for automated behavioral evaluations
Isha Gupta 等
2025-12-19
Google DeepMind
分布式 AGI 的安全性
Distributional AGI Safety
Nenad Tomašev、Matija Franklin、Julian Jacobs 等
2025-12-18
OpenAI
监测可监测性:评估思维链(CoT)监控
Monitoring Monitorability
Melody Y. Guan†、Miles Wang†、Micah Carroll† 等 12 位作者
2025-12-17
OpenAI
前沿科学:评估 AI 执行专家级科学任务的能力
FRONTIER SCIENCE: EVALUATING AI’S ABILITY TO PERFORM EXPERT-LEVEL SCIENTIFIC TASKS
Miles Wang*, Joy Jiao, Neil Chowdhury, Ethan Chang, Tejal Patwardhan
2025-12-11
OpenAI
GPT‑5 系统卡更新:GPT‑5.2
Update to GPT‑5 System Card: GPT‑5.2
OpenAI
2025-12-08
OpenAI
企业级人工智能现状
The State of Enterprise AI | 2025 Report
OpenAI
2025-12-04
OpenRouter & a16z
AI 现状:基于 OpenRouter 的一百万亿 Token 实证研究
State of AI: An Empirical 100 Trillion Token Study with OpenRouter
OpenRouter & Andreessen Horowitz (a16z)
2025-12-02
Anthropic
AI 如何改变 Anthropic 的工作方式
How AI is transforming work at Anthropic
Saffron Huang, Bryan Seethor, Esin Durmus, Kunal Handa, Miles McCain, Michael Stern, Deep Ganguli
2025-12-02
DeepSeek‑AI
DeepSeek‑V3.2:推动开源大语言模型前沿
DeepSeek‑V3.2: Pushing the Frontier of Open Large Language Models
DeepSeek‑AI
2025-12-02
UC Berkeley / Stanford / IBM Research
生产环境中的智能体评估
Measuring Agents in Production
Melissa Z. Pan, Negar Arabzadeh, Sara Hooker, Amir Houmansadr, David Glukhov, Matei Zaharia, Joseph E. Gonzalez
2025-12-01
Anthropic
AI 智能体发现 460 万美元区块链智能合约漏洞
AI agents find $4.6M in blockchain smart contract exploits
Winnie Xiao, Cole Killian, Henry Sleight, Alan Chan, Nicholas Carlini, Alwin Peng
2025-11-26
NeurIPS
公布 NeurIPS 2025 最佳论文奖
Announcing the NeurIPS 2025 Best Paper Awards
Communications Chairs 2025
2025-11-20
OpenAI
使用 GPT-5 加速科学的早期实验
Early science acceleration experiments with GPT-5
Sébastien Bubeck, Christian Coester, Ronen Eldan, Timothy Gowers 等 14 位作者
2025-11-18
Google / UCSB / NYU
预算感知的工具使用实现有效的智能体规模化
Budget-Aware Tool-Use Enables Effective Agent Scaling
Tengxiao Liu, Zifeng Wang 等
2025-10-21
DeepSeek‑AI
DeepSeek‑OCR:上下文光学压缩
DeepSeek‑OCR: Contexts Optical Compression
DeepSeek‑AI
2025-09-18
NeurIPS
NeurIPS 2025 接收论文:中文+英文互动检索(本地数据)
NeurIPS 2025 Accepted Papers (Interactive, Local Data)
数据源:NeurIPS / OpenReview / PaperCopilot
2025-04-17
OpenAI
智能体构建实用指南
A Practical Guide to Building Agents
OpenAI
2024-04-08
Stanford OVAL
使用大语言模型从零开始辅助撰写类维基百科文章(STORM)
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Yijia Shao, Yucheng Jiang, Theodore A. Kanell, Peter Xu, Omar Khattab, Monica S. Lam