<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Inference Serving on 墨然</title><link>https://moran.is-a.dev/tags/%E6%8E%A8%E7%90%86%E6%9C%8D%E5%8A%A1/</link><description>Recent content in Inference Serving on 墨然</description><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sun, 30 Nov 2025 14:06:00 +0800</lastBuildDate><atom:link href="https://moran.is-a.dev/tags/%E6%8E%A8%E7%90%86%E6%9C%8D%E5%8A%A1/atom.xml" rel="self" type="application/rss+xml"/><item><title>Inference Serving Is More Than the Model: Three Small Lessons from My Own Pitfalls</title><link>https://moran.is-a.dev/posts/llm-serving-basics/</link><pubDate>Sun, 30 Nov 2025 14:06:00 +0800</pubDate><guid>https://moran.is-a.dev/posts/llm-serving-basics/</guid><description>When users feel that "the model is unstable," the real cause is often the gateway, queue, and timeout policies quietly fighting each other.</description></item></channel></rss>