<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Large Models on 墨然</title><link>https://moran.is-a.dev/categories/%E5%A4%A7%E6%A8%A1%E5%9E%8B/</link><description>Recent content in Large Models on 墨然</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Mon, 15 Dec 2025 18:22:00 +0800</lastBuildDate><atom:link href="https://moran.is-a.dev/categories/%E5%A4%A7%E6%A8%A1%E5%9E%8B/atom.xml" rel="self" type="application/rss+xml"/><item><title>Don't Judge LLMs by Leaderboards Alone: The 30-Question “Pop Quiz” I Wrote for Them</title><link>https://moran.is-a.dev/posts/llm-evaluation-playbook/</link><pubDate>Mon, 15 Dec 2025 18:22:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-evaluation-playbook/</guid><description>A leaderboard is like the average score on a health checkup report. What really matters is which questions the model will get wrong in your own business.</description></item><item><title>Inference Serving Is More Than the Model: Three Small Lessons I Learned the Hard Way</title><link>https://moran.is-a.dev/posts/llm-serving-basics/</link><pubDate>Sun, 30 Nov 2025 14:06:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-serving-basics/</guid><description>When users feel that “the model is unstable,” the real cause is often the gateway, the queue, and the timeout policy quietly fighting one another.</description></item><item><title>Fine-Tuning, RAG, or Prompting? How a “Decision Tree” Saved Me</title><link>https://moran.is-a.dev/posts/llm-finetune-rag-prompt/</link><pubDate>Thu, 06 Nov 2025 09:28:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-finetune-rag-prompt/</guid><description>Don't start out by wanting to “train a model that understands me better.” Often what you lack is not a smarter model but a clearer requirement.</description></item><item><title>Why the Same Sentence Is Faster the Second Time: I Finally Understand KV Cache</title><link>https://moran.is-a.dev/posts/llm-kv-cache/</link><pubDate>Wed, 22 Oct 2025 20:18:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-kv-cache/</guid><description>KV Cache sounds like black magic, but it is really more like “pages you have already turned don't need to be turned again.”</description></item><item><title>About the Context Window: How I Keep LLMs from “Forgetting Too Fast”</title><link>https://moran.is-a.dev/posts/llm-context-window/</link><pubDate>Fri, 12 Sep 2025 10:05:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-context-window/</guid><description>I used to think the model simply had a “bad memory.” Only later did I realize that much of the forgetting came from the messy content I was feeding it.</description></item></channel></rss>