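Latest AI Industry News
The MiniMax-M2.5 model weights have been released on Hugging Face, with an API service also available; developers can download and use them directly.
MiniMax has released the M2.5 model, which reaches SOTA level in coding, agent tool calling, and office scenarios. The model was trained with large-scale RL in real-world environments and offers architecture-level programming ability and efficient search and reasoning. SGLang provides Day-0 support.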
AMA with MiniMax, ask us anything (r/LocalLLaMA): https://www.reddit.com/r/LocalLLaMA/comments/1r3t775/ama_with_minimax_ask_us_anything/
So I am using an R(2+1)D with Kinetics-400 weights to train a classifier on two sets of videos. The problem is that one of the two classes has all videos of the same resolution and fps, forcing the model to learn those features instead of...
I thought the reviewing period should have started yesterday, but it still says "You have no assigned papers. Please check again after the paper assignment process is complete."
I released a new version of my side project: SoproTTS. A 135M parameter TTS model trained for ~$100 on 1 GPU, running ~20× real-time on a base MacBook M3 CPU. v1.5 highlights (on CPU):...
I'm currently making a baseline autoencoder for this super freaking huge hyperspectral image dataset I have. It's a really big pain to work with and to get decent results, and I had to basically pull all stops including...
This post details my exploration for a "stable stack" for streaming deep RL (ObGD, SparseInit, LayerNorm, and online normalization) using 433,000 observations of real, non-stationary SSH attack traffic.
We evaluated 22 model configurations across different effort/thinking levels on Deep Research Bench (169 web research tasks, human-verified answers). For two of the most capable models, higher effort settings scored worse.
I’m reviewing for ICML (Policy A, where LLM use is not allowed) and noticed that in my assigned batch, if you copy/paste the full PDF text into a text editor, every single paper contains prompt-injection style instructions embedded...
I've been working on CloudRouter, a skill + CLI that gives coding agents like Claude Code and Codex the ability to start cloud VMs and GPUs. When an agent writes code, it usually needs to start a dev server, run tests, open a browser to verify its work. Today that all happens on your local...
I'm not worried about AI job loss
Previously: An AI agent published a hit piece on me - https://news.ycombinator.com/item?id=46990729 - Feb 2026 (916 comments). AI agent opens a PR write a blogpost to shames the maintainer who...
Open source is not about you (2018)
OpenAI has deleted the word 'safely' from its mission
CBP signs Clearview AI deal to use face recognition for 'tactical targeting'
Zed editor switching graphics lib from blade to wgpu
MinIO repository is no longer maintained
Custom Kernels for All from Codex and Claude
GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze research at scale.
How OpenAI built a real-time access system combining rate limits, usage tracking, and credits to power continuous access to Sora and Codex.
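As a rough illustration of how rate limits, usage tracking, and credits can fit together, here is a minimal sketch. It is not OpenAI's design: the `Account` class, its fields, and the fall-back-to-credits policy are all hypothetical, chosen only to show one plausible way the three pieces interact.

```python
from dataclasses import dataclass, field


@dataclass
class Account:
    """Hypothetical account: a fixed-window rate limit backed by prepaid credits."""
    limit_per_window: int          # free requests allowed per window (assumed policy)
    window_seconds: float          # window length in seconds
    credits: int                   # prepaid credits used once the limit is hit
    window_start: float = 0.0
    used_in_window: int = 0
    usage_log: list = field(default_factory=list)  # (time, cost, source) tuples

    def request(self, now: float, cost: int = 1) -> str:
        # Start a fresh window once the current one has expired.
        if now - self.window_start >= self.window_seconds:
            self.window_start = now
            self.used_in_window = 0

        if self.used_in_window + cost <= self.limit_per_window:
            # Still inside the rate limit: no credits are spent.
            self.used_in_window += cost
            source = "rate_limit"
        elif self.credits >= cost:
            # Limit exhausted: fall back to credits so access continues.
            self.credits -= cost
            source = "credits"
        else:
            source = "denied"

        # Usage tracking: record every decision, including denials.
        self.usage_log.append((now, cost, source))
        return source


acct = Account(limit_per_window=2, window_seconds=60, credits=1)
results = [acct.request(now=t) for t in (0, 1, 2, 3, 61)]
print(results)
```

Time is passed in explicitly as `now` so the behavior is deterministic and easy to test; a real service would read a clock and persist this state.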
Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration.
A new preprint shows GPT-5.2 proposing a new formula for a gluon amplitude, later formally proved and verified by OpenAI and academic collaborators.
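The NVIDIA GTC golden ticket sweepstakes closes on February 15. Winners receive VIP seating at Jensen Huang's keynote, a DGX Spark prize, and an invitation to a happy hour event at NVIDIA headquarters.
The Google Chrome team has released an early preview of the WebMCP protocol. Websites can expose structured action interfaces to AI agents through declarative HTML forms and an imperative JS API, replacing traditional DOM manipulation. It suits scenarios such as customer service, e-commerce, and booking.
v0, from Vercel, has introduced a smart planning feature that automatically detects complex tasks and plans a strategy so that the first build is accurate. It is particularly suited to complex applications and agent development.
ElevenLabs showcased the ElevenCreative platform at its London summit. It integrates voice, music, sound effects, image, and video generation to help brands quickly produce studio-quality ad content with AI.
Hailuo AI showcased a Mulan-themed video made by creator Cat Jin with Hailuo, in a striking classical Chinese style of armored cavalry and clashing weapons.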
vLLM published performance data for DeepSeek R1 on NVIDIA GB300: 22.5K TGS prefill and 3K TGS decode per GPU, an 8x prefill improvement over the Hopper architecture. DeepSeek V3.2 runs on just 2 GPUs.
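A quick back-of-the-envelope reading of these figures, assuming TGS means tokens per GPU per second and treating the rounded 8x figure as an exact ratio. The Hopper number is derived here, not reported, and the 45,000-token prompt is an arbitrary illustration.

```python
# Reported figures (per GPU, GB300); values are rounded in the source.
PREFILL_TGS = 22_500               # prefill tokens per GPU per second
DECODE_TGS = 3_000                 # decode tokens per GPU per second
PREFILL_SPEEDUP_VS_HOPPER = 8      # assumed to be an exact ratio


def implied_hopper_prefill_tgs() -> float:
    """Hopper prefill throughput implied by the 8x claim (derived, not reported)."""
    return PREFILL_TGS / PREFILL_SPEEDUP_VS_HOPPER


def seconds_for_tokens(tokens: int, tgs: float) -> float:
    """Time for one GPU to process `tokens` at a steady throughput `tgs`."""
    return tokens / tgs


hopper_tgs = implied_hopper_prefill_tgs()
prefill_time = seconds_for_tokens(45_000, PREFILL_TGS)   # hypothetical prompt size
print(f"Implied Hopper prefill: {hopper_tgs} TGS")
print(f"Prefill time for 45,000 tokens on one GB300 GPU: {prefill_time} s")
```

These are steady-state estimates only. Real latency also depends on batching, parallelism, and sequence length, none of which the item specifies.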