Docs/1.1 (#1723)

quanru · yuyutaotao · web-flow · commit 3e87f45c80d9 · 2026-01-05T16:25:51.000+08:00
* docs(site): 1.1 changelog

* fix(docs): yaml script examples for Android automation

* docs(core): update changelog

* chore(docs): remove outdated v1.0 section from changelog

---------

Co-authored-by: yutao <yutao.tao@bytedance.com>
diff --git a/apps/site/docs/en/automate-with-scripts-in-yaml.mdx b/apps/site/docs/en/automate-with-scripts-in-yaml.mdx
@@ -264,13 +264,11 @@ android:
 tasks:
   - name: Launch Settings app
     flow:
-      - launch:
-          uri: com.android.settings
+      - launch: com.android.settings
 
   - name: Open webpage
     flow:
-      - launch:
-          uri: https://www.example.com
+      - launch: https://www.example.com
 ```
 
 ### The `ios` part
diff --git a/apps/site/docs/en/changelog.mdx b/apps/site/docs/en/changelog.mdx
@@ -1,5 +1,50 @@
 # Changelog
 
+## v1.1 - `aiAct` deep thinking and extensible MCP SDK
+
+v1.1 optimizes model planning capabilities and MCP extensibility, making automation more stable in complex scenarios while providing more flexible solutions for enterprise MCP service deployments.
+
+### `aiAct` can enable deep thinking (deepThink)
+
+When deep thinking is enabled in `aiAct`, the model will interpret intent more thoroughly and optimize its planning results. This is suited for complex forms, multi-step flows, and similar scenarios. It improves accuracy but increases planning latency.
+
+Currently supported: Qwen3-vl on Alibaba Cloud and Doubao-vision on Volcano Engine. See [Model strategy](./model-strategy) for details.
+
+Example usage:
+
+```typescript
+await agent.aiAct('If the UI shows an "Add shipping address" button, expand the existing "Shipping address" list and select the last item', { deepThink: true });
+```
+
+### MCP extension and SDK exposure
+
+Developers can use the MCP SDK exposed by Midscene to flexibly deploy a public MCP service. This capability applies to Agent instances on any platform.
+
+Typical application scenarios:
+- Run MCP on an enterprise intranet to control private device pools
+- Package Midscene capabilities as internal microservices for multiple teams
+- Extend custom automation toolchains
+
+See documentation: [MCP Services](./mcp)
+
+### Chrome extension improvements
+- Fixed potential event loss during recording, improving recording stability
+- Optimized coordinate passing in `describeElement` for better element description accuracy
+
+### CLI and configuration enhancements
+- **File parameter support**: Fixed a CLI issue where the `--files` parameter was not handled properly when `--config` was also specified; the two can now be combined
+- **Dynamic configuration**: Fixed Playground not reading the `MIDSCENE_REPLANNING_CYCLE_LIMIT` environment variable properly
+
+### iOS Agent compatibility improvements
+- Optimized the `getWindowSize` method to automatically fall back to the legacy endpoint when the newer API is unavailable, improving compatibility across WebDriverAgent versions
+
+### Report and Playground improvements
+- Fixed an issue where the report wasn't properly initialized before screen properties were accessed
+- Fixed abnormal behavior of the stop function in Playground
+- Improved error handling during video export to avoid crashes caused by frame cancellation
+
+Thanks to contributors: @FriedRiceNoodles
+
 ## v1.0 - Midscene v1.0 is here!
 
 Midscene v1.0 is here! Try it out today and see how it can help you automate your workflows.
diff --git a/apps/site/docs/en/mcp.mdx b/apps/site/docs/en/mcp.mdx
@@ -121,7 +121,9 @@ Add the Midscene Android MCP server (`@midscene/android-mcp`) in your MCP client
 
 ## Implement your own MCP
 
-If you want to integrate Midscene tools into your own MCP service, you can use the `mcpKitForAgent` function to get tool definitions without starting a full MCP server.
+If you want to integrate Midscene tools into your own MCP service, you can use the `mcpKitForAgent` function to get tool definitions and expose your own MCP service as needed.
+
+The tools provided by `mcpKitForAgent` include screenshots and every Action in the Action Space.
 
 ### Using mcpKitForAgent
 
diff --git a/apps/site/docs/zh/automate-with-scripts-in-yaml.mdx b/apps/site/docs/zh/automate-with-scripts-in-yaml.mdx
@@ -266,13 +266,11 @@ android:
 tasks:
   - name: Launch Settings app
     flow:
-      - launch:
-          uri: com.android.settings
+      - launch: com.android.settings
 
   - name: Open webpage
     flow:
-      - launch:
-          uri: https://www.example.com
+      - launch: https://www.example.com
 ```
 
 ### The `ios` part
diff --git a/apps/site/docs/zh/changelog.mdx b/apps/site/docs/zh/changelog.mdx
@@ -1,4 +1,49 @@
 # Changelog
+ 
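+## v1.1 - `aiAct` deep thinking and extensible MCP SDK
+
+v1.1 improves model planning capabilities and MCP extensibility, making automation more stable in complex scenarios while offering more flexible options for enterprise MCP service deployments.
+
+### `aiAct` can enable deep thinking (deepThink)
+
+When deep thinking is enabled in `aiAct`, the model interprets user intent more thoroughly and optimizes its planning results, which suits complex forms, multi-step flows, and similar scenarios. It improves accuracy but also increases planning time.
+
+Currently supported: Qwen3-vl on Alibaba Cloud and Doubao-vision on Volcano Engine. See [Model strategy](./model-strategy) for details.
+
+Example usage: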
+
+```typescript
+await agent.aiAct('If the UI shows an "Add shipping address" button, expand the existing "Shipping address" list and select the last item', { deepThink: true });
+```
+
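+### MCP extension and SDK exposure
+
+Developers can use the MCP SDK exposed by Midscene to flexibly deploy their own public MCP service. This capability applies to Agent instances on any platform.
+
+Typical application scenarios:
+- Run MCP on an enterprise intranet to control private device pools
+- Package Midscene capabilities as internal microservices for multiple teams
+- Extend custom automation toolchains
+
+See documentation: [MCP Services](./mcp)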
+
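+### Chrome extension improvements
+- Fixed potential event loss during recording, improving recording stability
+- Optimized coordinate passing in `describeElement` for better element description accuracy
+
+### CLI and configuration enhancements
+- **File parameter support**: Fixed a CLI issue where the `--files` parameter was not handled properly when `--config` was also specified; the two can now be combined
+- **Dynamic configuration**: Fixed Playground not reading the `MIDSCENE_REPLANNING_CYCLE_LIMIT` environment variable properly
+
+### iOS Agent compatibility improvements
+- Optimized the `getWindowSize` method to automatically fall back to the legacy endpoint when the newer API is unavailable, improving compatibility across WebDriverAgent versions
+
+### Report and Playground improvements
+- Fixed an issue where the report wasn't properly initialized before screen properties were accessed
+- Fixed abnormal behavior of the stop function in Playground
+- Improved error handling during video export to avoid crashes caused by frame cancel
+
+Thanks to contributors: @FriedRiceNoodles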
 
 ## v1.0 - Midscene v1.0 is officially released!
 
diff --git a/apps/site/docs/zh/mcp.mdx b/apps/site/docs/zh/mcp.mdx
@@ -119,10 +119,11 @@ open report_file_name.html
 }
 ```
 
-
 ## Implement your own MCP
 
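-If you want to integrate Midscene tools into your own MCP service, you can use the `mcpKitForAgent` function to get tool definitions without starting a full MCP server.
+If you want to integrate Midscene tools into your own MCP service, you can use the `mcpKitForAgent` function to get tool definitions and then expose your own MCP service as needed.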
+
+The tools provided by `mcpKitForAgent` include screenshots and every Action in the Action Space.
 
 ### Using mcpKitForAgent