|
1 | 1 | # Changelog |
2 | 2 |
|
| 3 | +## v1.1 - `aiAct` deep thinking and extensible MCP SDK |
| 4 | + |
| 5 | +v1.1 improves model planning and MCP extensibility, making automation more stable in complex scenarios and giving enterprise MCP service deployments more flexibility. |
| 6 | + |
| 7 | +### `aiAct` can enable deep thinking (deepThink) |
| 8 | + |
| 9 | +When deep thinking is enabled in `aiAct`, the model interprets intent more thoroughly and refines its planning results. This suits complex forms, multi-step flows, and similar scenarios. It improves accuracy at the cost of higher planning latency. |
| 10 | + |
| 11 | +Currently supported: Qwen3-vl on Alibaba Cloud and Doubao-vision on Volcano Engine. See [Model strategy](./model-strategy) for details. |
| 12 | + |
| 13 | +Example usage: |
| 14 | + |
| 15 | +```typescript |
| 16 | +await agent.aiAct('If the UI shows an "Add shipping address" button, expand the existing "Shipping address" list and select the last item', { deepThink: true }); |
| 17 | +``` |
| 18 | + |
| 19 | +### MCP extension and SDK exposure |
| 20 | + |
| 21 | +Developers can use the MCP SDK exposed by Midscene to flexibly deploy a public MCP service. This capability applies to Agent instances on any platform. |
| 22 | + |
| 23 | +Typical application scenarios: |
| 24 | +- Run MCP inside an enterprise intranet to control private device pools |
| 25 | +- Package Midscene capabilities as internal microservices for multiple teams |
| 26 | +- Extend custom automation toolchains |
| 27 | + |
| 28 | +See documentation: [MCP Services](./mcp) |
| 29 | + |
| 30 | +### Chrome extension improvements |
| 31 | +- Fixed potential event loss during recording, improving recording stability |
| 32 | +- Optimized coordinate passing in `describeElement` for better element description accuracy |
| 33 | + |
| 34 | +### CLI and configuration enhancements |
| 35 | +- **File parameter support**: Fixed a CLI issue where the `--files` parameter was not handled properly when `--config` was specified; the two can now be combined |
| 36 | +- **Dynamic configuration**: Fixed an issue where Playground did not read the `MIDSCENE_REPLANNING_CYCLE_LIMIT` environment variable correctly |
| 37 | + |
| 38 | +### iOS Agent compatibility improvements |
| 39 | +- Optimized the `getWindowSize` method to fall back automatically to the legacy endpoint when the newer API is unavailable, improving compatibility across WebDriverAgent versions |
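The fallback behavior described above can be illustrated with a minimal sketch. It is not Midscene's actual implementation: the `WindowSize` type, the `SizeFetcher` type, and the function name `getWindowSizeWithFallback` are hypothetical, and the two fetchers stand in for whatever calls the iOS Agent makes to the newer and legacy WebDriverAgent endpoints.

```typescript
// Hypothetical result shape; not taken from Midscene's source.
interface WindowSize {
  width: number;
  height: number;
}

// Hypothetical fetcher type: any async call that returns a window size.
type SizeFetcher = () => Promise<WindowSize>;

// Try the newer endpoint first; if it throws, fall back to the legacy one.
async function getWindowSizeWithFallback(
  fetchFromNewApi: SizeFetcher,
  fetchFromLegacyApi: SizeFetcher,
): Promise<WindowSize> {
  try {
    return await fetchFromNewApi();
  } catch (_err) {
    // Assumption: a failure here means the newer API is unavailable
    // (for example, on an older WebDriverAgent build).
    return fetchFromLegacyApi();
  }
}
```

Passing the fetchers as parameters keeps the fallback logic independent of any specific endpoint, so it can be exercised with stubs instead of a real device.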
| 40 | + |
| 41 | +### Report and Playground improvements |
| 42 | +- Fixed an issue where the report wasn't properly initialized before screen properties were accessed |
| 43 | +- Fixed abnormal behavior of the stop function in Playground |
| 44 | +- Improved error handling during video export to avoid crashes caused by frame cancellation |
| 45 | + |
| 46 | +Thanks to contributors: @FriedRiceNoodles |
| 47 | + |
3 | 48 | ## v1.0 - Midscene v1.0 is here! |
4 | 49 |
|
5 | 50 | Midscene v1.0 is here! Try it out today and see how it can help you automate your workflows. |