[过时]从零开始搭建一个 Git 管理的 AstrBot 配置仓库

2026.7.12 编辑：

使用体验是，Astrbot 的 Agent 能力相当差劲，不适合作为团队共用的 Agent。并且实际上想把它做成配置式的应用也相当麻烦。除非仅作为学习探索目的，否则不建议实际使用。

最近想把 AstrBot 当成一个长期可维护的团队机器人入口：后续它要接飞书、Gitea、沙箱、lark-cli、tea、自定义插件和 Skills。如果这些东西都只堆在某台机器的 data/ 目录里，迟早会变成不可复现的手工状态。

所以第一步不是立刻做复杂自动化，而是先从 0 开始做一个可以被 Git 管理的 AstrBot 配置项目。本文先记录最小版本：创建独立仓库，用 Docker Compose 跑通 AstrBot WebUI，并明确哪些文件应该进 Git，哪些运行态文件应该忽略。

目标

我希望最终得到这样的结构：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
astrbot-config/
  compose.yml
  .env.example
  .gitignore
  README.md

  data/
    cmd_config.json
    mcp_server.json
    config/
    plugins/
    skills/

  ops/

这里的重点是：data/ 不是整体入库，而是只把可复现、可审查的配置放进 Git。

应该进入 Git 的内容包括：

data/cmd_config.json
data/config/
data/plugins/
data/skills/
data/mcp_server.json
自己维护的插件、Skills、配置模板

不应该进入 Git 的内容包括：

.env
data/dist/
data/dashboard.zip
data/*.db
data/knowledge_base/
data/temp/
data/workspaces/
data/attachments/
data/site-packages/

初始化项目

新项目放在：

1
/home/veno/Projects/astrbot-config

先创建目录和 Git 仓库：

1
2
3
4
5
cd ~/Projects
mkdir astrbot-config
cd astrbot-config
git init
git branch -m main

目录骨架：

1
2
3
4
5
6
mkdir -p data/config data/plugins data/skills ops
touch data/.gitkeep \
  data/config/.gitkeep \
  data/plugins/.gitkeep \
  data/skills/.gitkeep \
  ops/.gitkeep

Docker Compose

compose.yml 使用官方镜像：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
services:
  astrbot:
    image: ${ASTRBOT_IMAGE:-soulter/astrbot:latest}
    container_name: ${ASTRBOT_CONTAINER_NAME:-astrbot-config}
    restart: unless-stopped
    security_opt:
      - no-new-privileges:true
    ports:
      - "${ASTRBOT_WEB_PORT:-6185}:6185"
      - "${ASTRBOT_ONEBOT_PORT:-6199}:6199"
    environment:
      - TZ=${TZ:-Asia/Shanghai}
    volumes:
      - ./data:/AstrBot/data:z
      - /etc/localtime:/etc/localtime:ro

.env.example：

1
2
3
4
5
ASTRBOT_IMAGE=soulter/astrbot:latest
ASTRBOT_CONTAINER_NAME=astrbot-config
ASTRBOT_WEB_PORT=6185
ASTRBOT_ONEBOT_PORT=6199
TZ=Asia/Shanghai

实际运行时复制一份 .env，但 .env 不进 Git：

1
cp .env.example .env

Git Ignore

.gitignore 的核心是排除运行态文件：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
.env
*.log

data/dist/
data/temp/
data/attachments/
data/webchat/
data/site-packages/
data/knowledge_base/
data/workspaces/
data/plugin_data/
data/dashboard.zip
data/*.db
data/*.db-*
data/t2i_templates/

# Keep config-managed areas tracked.
!data/
!data/config/
!data/plugins/
!data/skills/

这样 AstrBot 首次启动时生成的一堆数据库、WebUI 静态资源、知识库文件不会污染配置仓库。

清掉旧实例

因为这是一个从 0 开始的新部署，所以旧 AstrBot 容器直接作废。

先查看容器：

1
docker ps -a --format '{{.ID}} {{.Names}} {{.Image}} {{.Ports}} {{.Status}}'

当时发现旧容器 astrbot 占用了 6185：

1
astrbot soulter/astrbot:latest 0.0.0.0:6185->6185/tcp Up ...

移除旧 AstrBot：

1
docker rm -f astrbot

我没有动 napcat，因为它不是 AstrBot 本体。

启动新实例

启动：

1
docker compose up -d

第一次启动时踩到一个小坑：因为前一次端口冲突已经创建了容器，后续虽然配置正确，但实际端口没有发布出来。解决方式是强制重建：

1
docker compose up -d --force-recreate

查看状态：

1
docker ps --format '{{.Names}} {{.Ports}} {{.Status}}'

最终看到：

1
astrbot-config 0.0.0.0:6185->6185/tcp, 0.0.0.0:6199->6199/tcp Up

日志里确认 WebUI ready：

1
2
3
AstrBot v4.24.2 WebUI is ready
Local: http://localhost:6185
Default username/password: astrbot / astrbot

访问地址：

1
http://localhost:6185

默认账号密码：

1
astrbot / astrbot

处理生成的配置

AstrBot 会在 data/ 下生成主配置：

1
2
data/cmd_config.json
data/mcp_server.json

其中 data/cmd_config.json 里会出现运行时生成的 dashboard.jwt_secret。这个值不适合进 Git，所以提交前我把它清空：

1
2
3
"dashboard": {
  "jwt_secret": ""
}

容器以 root 写入挂载目录时，本机用户可能无法直接修改文件。可以通过容器把 /AstrBot/data 的所有权改回来：

1
docker exec astrbot-config chown -R 1000:1000 /AstrBot/data

然后再清理 cmd_config.json。

初始提交

确认 Git 状态：

1
git status --short --ignored

应该只看到 .env、数据库、WebUI 静态资源等被忽略，配置文件和骨架目录可提交。

提交：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
git add .env.example .gitignore README.md compose.yml \
  data/.gitkeep \
  data/config/.gitkeep \
  data/plugins/.gitkeep \
  data/skills/.gitkeep \
  data/cmd_config.json \
  data/mcp_server.json \
  ops/.gitkeep

git commit -m "chore: initialize astrbot docker config repo"

当前提交：

1
8a4a00d chore: initialize astrbot docker config repo

WebUI 初始配置

启动跑通之后，我在 AstrBot WebUI 里做了第一次人工配置：

修改了初始 Dashboard 用户名和密码
在模型提供商里创建了 DeepSeek API 来源
启用了 deepseek/deepseek-v4-pro 作为模型
新建了一个配置文件，展示名称为 Feishu - Flow Project
暂时还没有接入飞书机器人、沙箱、CLI、Skills 或插件

这些操作反映到仓库里后，git status 看到：

1
2
3
 M data/cmd_config.json
?? data/config/abconf_99578339-b87b-41b4-97a9-5c4af491f6c2.json
?? data/skills.json

data/cmd_config.json 的主要变化是：

dashboard.username 从 astrbot 改成了 veno
dashboard.password 更新为新的密码哈希
dashboard.jwt_secret 被运行时重新生成
新增 provider_sources[0]，来源 ID 为 deepseek
新增 provider[0]，模型 ID 为 deepseek/deepseek-v4-pro

其中 DeepSeek API key 目前由 AstrBot WebUI 直接写进了 data/cmd_config.json：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
{
  "provider_sources": [
    {
      "provider": "deepseek",
      "type": "openai_chat_completion",
      "api_base": "https://api.deepseek.com/v1",
      "id": "deepseek",
      "enable": true
    }
  ]
}

这里我最后选择了更朴素的方案：把这个仓库明确定位为私有配置仓库，并让 data/cmd_config.json 继续作为 AstrBot 运行时事实来源进入 Git。

原因是 AstrBot WebUI 会直接写这个 JSON，当前还不能确认它是否稳定支持从 .env 或模板安全回填 provider key、Dashboard 密码哈希和 jwt_secret。如果强行做模板渲染，反而可能引入“配置看起来可复现，但实际启动时被覆盖或漏注入”的风险。

因此现阶段的安全边界是：

私有 Git 仓库可以包含真实运行配置
不配置公开 remote，不把配置 diff 贴到公开聊天或博客里
如果未来要公开仓库，先轮换 provider key 和 Dashboard 凭据，再清理 Git 历史
真要提高 secret 安全等级时，再引入 SOPS 或 git-crypt，而不是先做脆弱的自动回填

另外，新建的配置文件实际落在：

1
data/config/abconf_99578339-b87b-41b4-97a9-5c4af491f6c2.json

这个 JSON 文件本身没有直接写入 Feishu - Flow Project 字符串。也就是说，AstrBot 对“配置文件展示名”和 abconf_*.json 文件之间的映射，可能不完全靠这个 JSON 文件表达。后续如果要做严格 GitOps，需要继续确认配置文件命名和展示名的持久化位置。

私有配置提交

确认采用私有仓库策略后，我把 WebUI 产生的真实配置提交进 Git：

1
e75b5fd chore: track astrbot webui config privately

这次提交包含：

data/cmd_config.json：Dashboard 用户、DeepSeek 来源和启用模型等真实运行配置
data/config/abconf_99578339-b87b-41b4-97a9-5c4af491f6c2.json：WebUI 新建配置文件对应的 JSON
data/skills.json：当前为空的 Skills 注册状态
README.md：补充私有仓库安全边界说明

提交后工作区是干净的，并且当前没有配置 Git remote。

Shipyard Neo 沙箱与 Feishu CLI

下一步我按 AstrBot 文档接入沙箱能力。文档里的关键点是：Computer Use Runtime 可以设为 sandbox，沙箱驱动选择 Shipyard Neo；Bay 默认监听 8114，AstrBot 通过 Shipyard Neo API Endpoint 和 Shipyard Neo Access Token 连接。

这次采用本机独立 Shipyard Neo 部署，而不是把 Bay 塞进 AstrBot 容器：

Shipyard Neo 部署目录：/home/veno/Projects/shipyard-neo-deploy
Bay endpoint：http://bay:8114，供 AstrBot 通过 Docker 网络访问
AstrBot computer_use_runtime：sandbox
AstrBot sandbox.booter：shipyard_neo
AstrBot shipyard_neo_profile：python-lark
Bay token：按文档写入 shipyard_neo_access_token，继续由私有配置仓库管理

Fedora 上还遇到一个实际问题：Bay 需要访问 Docker socket 来创建 sandbox 容器，但 SELinux 会拦截默认 bind mount。由于 Bay 挂 Docker socket 本身就已经是高权限组件，我在 Bay service 上显式加了：

1
2
security_opt:
  - label:disable

这不是为了让普通业务容器放宽隔离，而是为了让 Bay 这个 Docker 控制面组件在 Fedora 上能按设计工作。

Feishu CLI 和 Skills 按 larksuite/cli 官方仓库处理：

本机先执行官方推荐的 npx skills add larksuite/cli -g -y，安装官方 lark skills
AstrBot 仓库中同步了官方安装器发现的 26 个 lark skills 到 data/skills/
自定义 Shipyard Neo ship 镜像继承 ghcr.io/astrbotdevs/shipyard-neo-ship:latest
镜像内执行官方推荐的 npx /cli install 安装 lark-cli
额外加入本机已有的静态 tea 二进制

最终验证：Bay 能创建 python-lark sandbox，并且在真实 sandbox 内执行：

1
lark-cli --version && tea --version

返回 lark-cli version 1.0.42 和 tea 0.12.0。

CLI 授权状态持久化

安装 lark-cli 和 tea 之后，还有一个更实际的问题：Agent 每次要用飞书或 Gitea 能力时，不应该重新思考“去哪登录、token 放哪、下次还在不在”。这件事不能交给 Skill 本身解决。Skill 只应该描述工具怎么用，真正的权限边界仍然应该在飞书应用权限、用户授权、Gitea token scope 和 sandbox 运行时配置上。

一开始我以为只要把 HOME 和 XDG_CONFIG_HOME 指到 /workspace，就能靠 Shipyard Neo Cargo 保存 CLI 登录态。后面继续查文档和实际验证后发现，这个判断只对了一半：managed cargo 会随着 sandbox 删除而释放，不能作为长期账号凭证存储。真正适合保存 CLI 凭证的是 external cargo。

最终部署改成了这样：

镜像层只安装工具：lark-cli、tea 和对应 Skills
Skill 层只提供调用说明，不写入任何 token
Shipyard Neo 增加显式 profile：python-lark
python-lark 继续使用同一份工具镜像，并设置 HOME=/workspace、XDG_CONFIG_HOME=/workspace/.config
Bay 中创建一个 external cargo，专门持久化 /workspace
AstrBot 创建 sandbox 时传入固定 cargo_id

对应的 Shipyard Neo profile 是：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
- id: python-lark
  description: "Python sandbox with lark-cli and tea, using persistent CLI config"
  image: "shipyard-neo-ship-lark:latest"
  runtime_type: ship
  runtime_port: 8123
  capabilities:
    - filesystem
    - shell
    - python
  env:
    HOME: "/workspace"
    XDG_CONFIG_HOME: "/workspace/.config"

这里一开始还有一个 AstrBot 侧的小补丁：当前 AstrBot 的 ShipyardNeoBooter 只传 profile 和 ttl，不会传 cargo_id。后来证明这个方向不稳，因为 WebUI 更新 AstrBot 时会覆盖运行时源码。最终方案改为 Bay 侧 profile -> cargo 映射：AstrBot 仍然只传 profile=python-lark，Bay 在创建 sandbox 时自动补上对应的 external cargo。

这个方案的边界比较清楚：

token 不进镜像
token 不进 Skill
token 不出现在博客或公开日志里
CLI 凭证落在 external cargo 的 /workspace/.config
AstrBot 容器不再改源码，Git 只记录 Bay 侧配置和可复现的部署代码

持久化验证也做过一次：在 external cargo 里写入测试文件，删除 sandbox 后重新创建，文件仍然存在。这说明凭证目录不再依赖某个临时 sandbox 生命周期。

随后完成了两个 CLI 的登录和验证：

1
2
3
4
lark_configured=true
lark_auth_status_ok=true
lark_auth_list_ok=true
tea_authenticated=true

其中 tea 已能访问目标 Gitea 实例，账号验证为 veno。Lark 侧使用脱敏验证方式，只检查 config show、auth status、auth list 的退出码，不打印任何 app secret、access token 或用户授权内容。

到这里，AstrBot 在后续调用 lark-cli 和 tea 时，不需要每次重新设计授权流程。模型只需要按 Skill 使用命令；具体能读写什么，由飞书服务端权限、用户授权范围、Gitea token scope 和 external cargo 中的当前登录态共同决定。

WebUI 更新与源码 Patch

这里后来又踩到一个和 WebUI 更新有关的坑。

最初为了让 AstrBot 读取 ASTRBOT_SHIPYARD_NEO_CARGO_ID，我把两个 patch 文件直接以只读 bind mount 的方式挂到了容器里的源码路径：

1
2
/AstrBot/astrbot/core/computer/booters/shipyard_neo.py
/AstrBot/astrbot/core/computer/computer_client.py

这样运行时没问题，但从 WebUI 更新 AstrBot 时会失败：

1
[Errno 30] Read-only file system: '/AstrBot/astrbot/core/computer'

原因很直接：WebUI 更新器会尝试覆盖 AstrBot 本体源码，而源码目录下有只读挂载点。即使只挂了两个文件，更新过程遇到这些路径也会被文件系统拒绝。

第一版修复方式是不要把 patch 文件直接挂到源码路径，而是挂到独立目录，并在容器启动时复制进去：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
command:
  - sh
  - -c
  - |
    cp /AstrBot/local-patches/astrbot/core/computer/booters/shipyard_neo.py /AstrBot/astrbot/core/computer/booters/shipyard_neo.py
    cp /AstrBot/local-patches/astrbot/core/computer/computer_client.py /AstrBot/astrbot/core/computer/computer_client.py
    exec python main.py
volumes:
  - ./data:/AstrBot/data:z
  - ./patches/astrbot:/AstrBot/local-patches/astrbot:ro,z

这样 patch 来源仍然是只读的、可 Git 管理的，但 /AstrBot/astrbot/core/computer 本身恢复为容器内可写目录。WebUI 更新 AstrBot 时不会再被 bind mount 卡住；容器重启后也会重新应用本地 patch。

修复后验证：

1
computer_dir_writable=true

但这个方案后面又暴露了第二层问题：WebUI 更新 AstrBot 之后，源码文件会被更新器重新覆盖。容器没有重建时，command 里的复制动作不会再次执行，于是运行中的 AstrBot 又退回了“不传 cargo_id”的原版逻辑。

现象很典型：机器人调用 lark-cli 时又提示未配置，甚至重新发起 lark-cli config init。这并不是 external cargo 里的凭证丢了，而是 AstrBot 创建 sandbox 时没有再把 fixed cargo 传给 Bay，结果模型落进了新的 managed cargo。

第二版改成了启动时最小文本 patch，并启动后台 watcher 周期性补打 patch。它能止血，但本质上还是在 AstrBot 容器里改源码：

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
command:
  - sh
  - -c
  - |
    python /AstrBot/local-patches/apply-astrbot-cargo-patch.py
    (
      while sleep 10; do
        python /AstrBot/local-patches/apply-astrbot-cargo-patch.py || true
      done
    ) &
    exec python main.py
volumes:
  - ./data:/AstrBot/data:z
  - ./patches:/AstrBot/local-patches:ro,z

这个脚本只做三件事：

给 ShipyardNeoBooter.__init__ 增加 cargo_id
调用 BayClient.create_sandbox() 时传入 cargo_id
在 computer_client.py 中从 shipyard_neo_cargo_id 或 ASTRBOT_SHIPYARD_NEO_CARGO_ID 读取 cargo

它比整文件覆盖更适合 WebUI 更新场景，但仍然不适合作为长期方案。真正的问题不是 AstrBot 是否能读某个环境变量，而是“创建 sandbox 时该由谁决定 cargo”。这个决定更应该放在 Bay 这个 sandbox 控制面里。

最终我把方案改到了 Bay 侧：

1
2
3
4
5
6
cargo:
  root_path: "/var/lib/bay/cargos"
  default_size_limit_mb: 1024
  mount_path: "/workspace"
  profile_defaults:
    python-lark: "ws-fd07bebac9ae"

Bay 的逻辑变成：

如果请求显式带了 cargo_id，尊重请求
如果请求没带 cargo_id，先按 profile 查 cargo.profile_defaults
命中默认 cargo 时，用该 external cargo 创建 sandbox
绑定 fixed cargo 的 profile 跳过 warm pool，因为 warm sandbox 自带 managed cargo，不能重绑

于是 AstrBot 侧可以恢复成原生容器：

1
2
3
4
5
environment:
  - TZ=${TZ:-Asia/Shanghai}
volumes:
  - ./data:/AstrBot/data:z
  - /etc/localtime:/etc/localtime:ro

这次验证结论：

1
2
3
4
bay_default_cargo_applied=true
lark_auth_status_ok=true
lark_auth_list_ok=true
tea_authenticated=true

所以问题不是凭证持久化无效，而是原来的 patch 生命周期不够强。fixed external cargo 仍然有效；现在由 Bay 根据 profile 稳定选择 cargo，不再依赖 AstrBot 源码 patch。

不同配置文件使用不同账号

接下来还有一个更重要的设计问题：如果不同 AstrBot 配置文件要使用不同的 Gitea 账号和飞书账号，账号状态应该绑在哪里？

当前 AstrBot 配置文件可以各自设置 shipyard_neo_profile，所以短期可以做多个 Shipyard Neo profile，例如 python-lark、python-feishu-flow、python-gitea-work。每个 profile 可以复用同一份工具镜像，但在 Bay 的 cargo.profile_defaults 里绑定不同 external cargo。这样每个账号组都有自己的 /workspace/.config，CLI 配置不会互相踩。

更干净的长期方案是把账号组作为部署资源管理：

在 Shipyard Neo 中为每个账号组创建 external cargo
在 AstrBot 配置文件里用 shipyard_neo_profile 表达账号组
由 Bay 根据 profile 选择对应 cargo_id
创建 sandbox 时传入 cargo_id
lark-cli 和 tea 继续把配置写到 /workspace/.config

这样账号状态就从“某个临时 sandbox 的副产物”变成“某个 AstrBot 配置文件绑定的持久化 workspace”。例如：

1
2
3
default config       -> cargo-default       -> /workspace/.config
feishu-flow config   -> cargo-feishu-flow   -> /workspace/.config
gitea-work config    -> cargo-gitea-work    -> /workspace/.config

现在已经有了 Bay 侧 profile -> cargo 映射，所以单账号组持久化已经跑通。后续要支持多账号组，只需要继续增加 profile 和 external cargo 映射，而不是继续把所有账号都塞进同一个 cargo。

如果不想继续改 AstrBot，最稳妥的替代方案仍然是为不同账号组拆成不同 AstrBot 实例：每个实例有自己的配置仓库、自己的 Bay profile 或 Bay 部署、自己的 CLI 登录态。这比较笨，但隔离边界清晰，出问题时也容易回滚。

当前结果

现在我有了一个新的、干净的 AstrBot 配置仓库：

1
/home/veno/Projects/astrbot-config

它已经能通过 Docker Compose 启动 AstrBot，并且接上了可持久化 CLI 凭证的 Shipyard Neo sandbox。

下一步再继续往这个仓库里加：

用 Git 管理的全局 Skills
插件内置 Skills
自研 Gitea / 飞书同步插件
可同步的机器人记忆、规则和状态映射

第一阶段先到这里：先让 AstrBot 以一个可复现的 GitOps 形态跑起来，并让后续工具调用不再重复处理基础授权问题。