# “龙虾款”切换本地模型指南

<span class="md-plain">零刻预装 OpenClaw+ 本地模型的机型是通过 llama.cpp 运行本地模型，支持自行新增或删除本地模型。</span>

<span class="md-plain">本教程以零刻 GTR9Pro 「预装 OpenClaw」机型为例，实际上手操作切换本地模型，只需按如下步骤操作即可。</span>

> <span class="md-plain">本教程仅适用于预装OpenClaw+本地模型的零刻产品</span>

<div cid="n3" class="md-hr md-end-block" id="bkmrk-" mdtype="hr" tabindex="-1">---

</div><span class="md-pair-s">**<span class="md-plain">1. 下载模型</span>**</span>

<span class="md-plain">llama.cpp 使用 GGUF 格式的模型文件，建议通过以下两种方式下载模型：</span>

1. <span class="md-plain">Hugging Face（需</span><span class="md-pair-s ">**<span class="md-plain">科学上网</span>**</span><span class="md-plain">）</span>
2. <span class="md-meta-i-c  md-link">[<span class="md-plain">ModelScope</span>](https://doc.bee-link.com.cn/www.modelscope.cn/models)</span><span class="md-plain">（魔塔社区）</span>

<span class="md-plain">这里我们使用魔塔社区下载 </span><span class="md-pair-s" spellcheck="false">`Qwen3.6-35B-A3B-UD-Q8_K_XL.gguf`</span><span class="md-plain"> 模型，这是一个高阶 8 位量化版本，精度损耗极低，兼顾优质推理能力与合理显存占用，适配本地部署日常使用。</span>

> <span class="md-plain">请根据主机实际配置选择合适的模型。</span>

<div cid="n3" class="md-hr md-end-block" id="bkmrk--1" mdtype="hr" tabindex="-1"></div>[![下载模型.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/raVXZOCgNkVV2yhG-cVjmox6YEZ.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/raVXZOCgNkVV2yhG-cVjmox6YEZ.png)

<span class="md-plain">下载好以后，执行以下命令将模型剪切至本地模型目录：</span>

```
sudo mv /home/用户名/下载/Qwen3.6-35B-A3B-UD-Q8_K_XL.gguf /opt/models/
```

> <span class="md-plain">注：在终端输入密码默认不会显示，正常输入后回车执行即可 </span>

<span class="md-plain">剪切后验证是否剪切成功，执行：</span>

```
sudo ls /opt/models/
```

<span class="md-plain">输出结果中包含文件（Qwen3.6-35B-A3B-UD-Q8\_K\_XL.gguf），说明剪切成功。</span>

[![移动模型.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/Tgu9whUjjBWuQ21P-igy9RT61Ov.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/Tgu9whUjjBWuQ21P-igy9RT61Ov.png)

<span class="md-pair-s ">**<span class="md-plain">2. 编辑 llama 启动脚本</span>**</span>

<span class="md-plain">保存好本地模型后，需要手动编辑 llama 启动脚本，执行：</span>

```
sudo nano /usr/local/bin/start-llama.sh
```

<span class="md-plain">修改 </span><span class="md-pair-s" spellcheck="false">`MODEL`</span><span class="md-plain"> 字段，将后面双引号中的模型名称改为新的模型名称，其他不用修改。</span>

<span class="md-plain">编辑完成后，按下 </span><span class="md-pair-s" spellcheck="false">`Ctrl+X`</span><span class="md-plain"> - </span><span class="md-pair-s" spellcheck="false">`Y`</span><span class="md-plain"> - </span><span class="md-pair-s" spellcheck="false">`回车`</span><span class="md-plain"> ，保存退出编辑器。</span>

[![编辑脚本.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/FCgZer7p6H1rYGa8-zYsP4vnUYO.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/FCgZer7p6H1rYGa8-zYsP4vnUYO.png)

<span class="md-pair-s ">**<span class="md-plain">3. 验证新模型是否启用</span>**</span>

<span class="md-plain">编辑并保存好 llama 启动脚本后，重启一下系统：</span>

```
reboot
```

<span class="md-plain">重启后打开网页 </span><span class="md-pair-s" spellcheck="false">`127.0.0.1:8080`</span><span class="md-plain"> ，可以看到显示的模型名称为 </span><span class="md-pair-s" spellcheck="false">`Qwen3.6-35B-A3B-UD-Q8_K_XL.gguf`</span><span class="md-plain">，说明模型切换成功，打个招呼确认模型能否正常使用，得到回应后说明成功了。</span>

[![验证模型.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/XjsJC8qtW24jbUUy-kZM8fXASAJ.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/XjsJC8qtW24jbUUy-kZM8fXASAJ.png)

<span class="md-pair-s">**<span class="md-plain">4. OpenClaw 切换新模型</span>**</span>

<span class="md-plain">新模型就绪后，还需要到 OpenClaw 中切换默认模型，打开终端执行：</span>

```
openclaw config
```

<span class="md-plain">选择 </span><span class="md-pair-s" spellcheck="false">`Local`</span><span class="md-plain"> - </span><span class="md-pair-s" spellcheck="false">`Model`</span>

<span class="md-plain">然后选择 </span><span class="md-pair-s" spellcheck="false">`vLLM`</span><span class="md-plain">；</span>

<span class="md-plain">vLLM base URL 修改为 </span><span class="md-pair-s" spellcheck="false">`http://127.0.0.1:8080/v1`</span><span class="md-plain">；</span>

<span class="md-plain">vLLM API Key 填写 </span><span class="md-pair-s" spellcheck="false">`sk-local`</span><span class="md-plain">（可以随意输入）</span>

<span class="md-plain">vLLM model 填写 </span><span class="md-pair-s" spellcheck="false">`Qwen3.6-35B-A3B-UD-Q8_K_XL.gguf`</span>

<span class="md-plain">然后回车，再按一次回车即可。</span>

[![配置openclaw.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/TsgqIV4nP1gD8PoR-openclaw.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/TsgqIV4nP1gD8PoR-openclaw.png)

<span class="md-plain">这样就配置完成了，再移动到 </span><span class="md-pair-s" spellcheck="false">`Continue`</span><span class="md-plain"> 并按下回车，结束配置。</span>

[![完成配置openclaw.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/yHZKitudunxnxSkL-openclaw.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/yHZKitudunxnxSkL-openclaw.png)

<span class="md-plain">结束配置后，还需要重启 OpenClaw Gateway 以应用修改，执行：</span>

```
openclaw gateway restart
```

<span class="md-plain">重启后即可以新模型使用 OpenClaw。</span>

[![测试openclaw.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/w8ZeMPEeLgZ9wJQL-openclaw.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/w8ZeMPEeLgZ9wJQL-openclaw.png)

<div cid="n165" class="md-hr md-end-block" id="bkmrk--10" mdtype="hr" tabindex="-1">---

</div><span class="md-pair-s md-expand">**<span class="md-plain">多模型切换</span>**</span>

<span class="md-plain">配置好新模型后，原先的模型如果没有取消勾选，会自动以备用模型配置，可以通过以下命令查询可用模型列表：</span>

```
openclaw models list
```

[![可用模型列表.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/7ynrTjCE6s1HgQPf-Wpw2YCzBBK.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/7ynrTjCE6s1HgQPf-Wpw2YCzBBK.png)

<span class="md-plain">在终端中可以切换默认模型，无需重启 Gateway，执行：</span>

```
openclaw models set 模型全称
```

[![切换默认模型.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/6vKBDYJnd3eWxwfQ-HFWb6PTjFK.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/6vKBDYJnd3eWxwfQ-HFWb6PTjFK.png)

<span class="md-plain">如需临时切换模型，可以在与 OpenClaw 的对话中回复：</span><span class="md-pair-s" spellcheck="false">`/model 模型全称`</span><span class="md-plain"> 快速切换，无需重启。</span>

[![临时切换模型.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/rZGCyxGAAsL6WuME-GyKm6j7uUD.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/rZGCyxGAAsL6WuME-GyKm6j7uUD.png)

---

**<span class="md-plain">查看显存占用</span>**

<span class="md-plain">首先需要下载工具，在终端执行：</span>

```
sudo apt install mesa-utils
```

<span class="md-plain">安装成功后，再次执行：</span>

```
glxinfo | grep -i "video memory\|vram"
```

<span class="md-plain">即可查看当前显存情况</span>

[![剩余显存.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/tUaYvQwRTbb1xU7h-2Z8G7s7Mbg.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/tUaYvQwRTbb1xU7h-2Z8G7s7Mbg.png)

> <span class="md-plain">Video Memory：总显存容量</span>
> 
> <span class="md-plain">Currently available dedicated video memory：剩余可用显存</span>

<span class="md-plain md-expand">如图，目前剩余可用显存约为 54G，说明 GTR9Pro 运行该模型后仍有大量余裕，且运行速度完全足够日常使用。</span>

<div cid="n73" class="md-hr md-end-block" id="bkmrk--16" mdtype="hr" tabindex="-1">---

</div><span class="md-pair-s md-expand">**<span class="md-plain">关闭思考模式</span>**</span>

<span class="md-plain">Qwen3.6-35B-A3B 模型支持关闭思考模式。</span>

<span class="md-plain">如需关闭思考模式（可提高回复速度），可以在最底下的 "--n-gpu-layers 99 </span>"<span class="md-plain"> 的下方新增一条 </span><span class="md-pair-s" spellcheck="false">`--chat-template-kwargs '{"enable_thinking":false}'`</span><span class="md-plain">，如图</span>

[![关闭思考模式.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/XeeSg9sq4Q2MfMkW-ZMqzdyHMjs.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/XeeSg9sq4Q2MfMkW-ZMqzdyHMjs.png)

<span class="md-plain">编辑完成后，按下 </span><span class="md-pair-s" spellcheck="false">`Ctrl+X`</span><span class="md-plain"> - </span><span class="md-pair-s" spellcheck="false">`Y`</span><span class="md-plain"> - </span><span class="md-pair-s" spellcheck="false">`回车`</span><span class="md-plain md-expand"> ，保存退出编辑器。</span>

<span class="md-plain">然后重启系统即可。</span>

[![验证关闭思考.png](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/scaled-1680-/sJEwimp481jVfR23-vpDZu65hHE.png)](https://doc.bee-link.com.cn/uploads/images/gallery/2026-05/sJEwimp481jVfR23-vpDZu65hHE.png)