全部 今日 本周 本月
2026-02-13

MiniMax-M2.5 模型上线 Hugging Face 开源社区

MiniMax-M2.5 模型权重已发布到 Hugging Face,同时提供 API 服务,开发者可直接下载使用。

大模型
@_akhaliq 阅读 →

LMSys:MiniMax-M2.5 模型发布,编程与 Agent 能力达 SOTA

MiniMax 发布 M2.5 模型,在编码、Agent 工具调用和办公场景中达到 SOTA 水平。该模型通过大规模真实环境 RL 训练,具备架构级编程能力和高效搜索推理,SGLang 已提供 Day-0 支持。

大模型
@lmsysorg 阅读 →

AMA with MiniMax — Ask Us Anything!

<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1r3t775/ama_with_minimax_ask_us_anything/"> <img...

研究
Reddit r/LocalLLaMA 阅读 →

[D] Teaching AI to Reason With Just 13 Parameters

<!-- SC_OFF --><div class="md"><p><em>Made with</em> <a href="https://paperglide.net/"><em>Paperglide</em></a> <em>✨ — digest research papers faster</em></p> <p><strong>TL;DR:</strong>...

产品发布
Reddit r/MachineLearning 阅读 →

[D] How do your control video resolution and fps for a R(2+1)D model?

<!-- SC_OFF --><div class="md"><p>So I am using a R(2+1)D with kinetics 400 weights to train a classifier on two sets of videos. The problem is that one of the two classes has all videos of the same resolution and fps, forcing the model to learn those features instead of...

大模型
Reddit r/MachineLearning 阅读 →

[D] Has anyone received their ICML papers to review yet?

<!-- SC_OFF --><div class="md"><p>I thought the reviewing period should have started yesterday, but it still says "You have no assigned papers. Please check again after the paper assignment process is complete." </p> </div><!-- SC_ON...

研究
Reddit r/MachineLearning 阅读 →

[P] SoproTTS v1.5: A 135M zero-shot voice cloning TTS model trained for ~$100 on 1 GPU, running...

<!-- SC_OFF --><div class="md"><p>I released a new version of my side project: SoproTTS</p> <p>A 135M parameter TTS model trained for ~$100 on 1 GPU, running ~20× real-time on a base MacBook M3 CPU.</p> <p>v1.5 highlights (on CPU):</p>...

产品发布
Reddit r/MachineLearning 阅读 →

[R] Has anyone experimented with MHC on traditional autoencoders/convolutional architectures?

<!-- SC_OFF --><div class="md"><p>I'm currently making a baseline autoencoder for this super freaking huge hyperspectral image dataset I have. It's a really big pain to work with and to get decent results, and I had to basically pull all stops including...

大模型
Reddit r/MachineLearning 阅读 →

[D] Benchmarking Deep RL Stability Capable of Running on Edge Devices

<!-- SC_OFF --><div class="md"><p>This post details my exploration for a "stable stack" for streaming deep RL (ObGD, SparseInit, LayerNorm, and online normalization) using 433,000 observations of real, non-stationary SSH attack traffic.</p>...

研究
Reddit r/MachineLearning 阅读 →

[R] Higher effort settings reduce deep research accuracy for GPT-5 and Gemini Flash 3

<!-- SC_OFF --><div class="md"><p>We evaluated 22 model configurations across different effort/thinking levels on Deep Research Bench (169 web research tasks, human-verified answers). For two of the most capable models, higher effort settings scored worse. </p>...

研究
Reddit r/MachineLearning 阅读 →

[D] ICML: every paper in my review batch contains prompt-injection text embedded in the PDF

<!-- SC_OFF --><div class="md"><p>I’m reviewing for ICML (Policy A, where LLM use is not allowed) and noticed that in my assigned batch, if you copy/paste the full PDF text into a text editor, every single paper contains prompt-injection style instructions embedded...

研究
Reddit r/MachineLearning 阅读 →

Show HN: Skill that lets Claude Code/Codex spin up VMs and GPUs

I&#x27;ve been working on CloudRouter, a skill + CLI that gives coding agents like Claude Code and Codex the ability to start cloud VMs and GPUs.<p>When an agent writes code, it usually needs to start a dev server, run tests, open a browser to verify its work. Today that all happens on your local...

产品发布
Hacker News 阅读 →

I'm not worried about AI job loss

I'm not worried about AI job loss

行业
Hacker News 阅读 →

The "AI agent hit piece" situation clarifies how dumb we are acting

Previously:<p><i>An AI agent published a hit piece on me</i> - <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46990729">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46990729</a> - Feb 2026 (916 comments)<p><i>AI agent opens a PR write a blogpost to shames the maintainer who...

行业
Hacker News 阅读 →

Open source is not about you (2018)

Open source is not about you (2018)

开源
Hacker News 阅读 →

OpenAI has deleted the word 'safely' from its mission

OpenAI has deleted the word 'safely' from its mission

行业
Hacker News 阅读 →

CBP signs Clearview AI deal to use face recognition for 'tactical targeting'

CBP signs Clearview AI deal to use face recognition for 'tactical targeting'

行业
Hacker News 阅读 →

Zed editor switching graphics lib from blade to wgpu

Zed editor switching graphics lib from blade to wgpu

芯片
Hacker News 阅读 →

MinIO repository is no longer maintained

MinIO repository is no longer maintained

开源
Hacker News 阅读 →

Custom Kernels for All from Codex and Claude

Custom Kernels for All from Codex and Claude

大模型
Hugging Face Blog 阅读 →

Scaling social science research

GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze research at scale.

研究
OpenAI Blog 阅读 →

Beyond rate limits: scaling access to Codex and Sora

How OpenAI built a real-time access system combining rate limits, usage tracking, and credits to power continuous access to Sora and Codex.

行业
OpenAI Blog 阅读 →

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration.

产品发布
OpenAI Blog 阅读 →

GPT-5.2 derives a new result in theoretical physics

A new preprint shows GPT-5.2 proposing a new formula for a gluon amplitude, later formally proved and verified by OpenAI and academic collaborators.

大模型
OpenAI Blog 阅读 →

NVIDIA GTC 2026 金票抽奖倒计时:VIP 席位观看黄仁勋演讲

NVIDIA GTC 大会金票抽奖截止 2 月 15 日,获奖者可获得黄仁勋主题演讲 VIP 座席、DGX Spark 奖品以及 NVIDIA 总部欢乐时光活动邀请。

活动
@nvidia 阅读 →

宝玉:Google Chrome 发布 WebMCP 早期预览,让网站主动为 AI Agent 暴露工具接口

Google Chrome 团队发布 WebMCP 协议早期预览,网站可通过声明式 HTML 表单和命令式 JS API 向 AI Agent 提供结构化操作接口,取代传统 DOM 操作方式,适用于客服、电商、预订等场景。

产品发布
@dotey 阅读 →

v0 新增智能规划功能:复杂应用和 Agent 构建更精准

Vercel 旗下 v0 推出智能规划功能,可自动检测复杂任务并进行策略规划,确保首次构建即准确到位,尤其适合复杂应用和 Agent 开发场景。

产品发布
@v0 阅读 →

ElevenLabs Summit:ElevenCreative 让营销团队一天内完成从创意到完整广告

ElevenLabs 在伦敦峰会展示 ElevenCreative 平台,整合语音、音乐、音效、图像和视频生成,帮助品牌用 AI 快速制作工作室品质的广告内容。

活动
@elevenlabsio 阅读 →

海螺 AI 创作者用 Hailuo 重新演绎花木兰

海螺 AI 展示创作者 Cat Jin 使用 Hailuo 制作的花木兰主题视频作品,铁骑金戈的中国古典风格令人惊艳。

行业
@Hailuo_AI 阅读 →

vLLM 实测 DeepSeek R1 在 GB300 上的惊人性能:预填充提速 8 倍

vLLM 发布 DeepSeek R1 在 NVIDIA GB300 上的性能数据:单 GPU 预填充 22.5K TGS、解码 3K TGS,相比 Hopper 架构预填充提升 8 倍。DeepSeek V3.2 仅需 2 块 GPU 即可运行。

芯片
@vllm_project 阅读 →