Skip to content

Commit f48e1a6

Browse files
authored
Update README.md
1 parent 77bb1b2 commit f48e1a6

File tree

1 file changed

+10
-10
lines changed

1 file changed

+10
-10
lines changed

README.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -20,19 +20,10 @@ ome
2020

2121
## Key Features
2222
- **Compatibility**: Designed for various multimodal models.
23-
- **Integration**: Currently integrated with **GPT-4o, o1,th Gemini Pro Vision, Claude 3 and LLaVa.**
23+
- **Integration**: Currently integrated with **GPT-4o, o1, Gemini Pro Vision, Claude 3 and LLaVa.**
2424
- **Future Plans**: Support for additional models.
2525

26-
## Ongoing Development
27-
At [HyperwriteAI](https://www.hyperwriteai.com/), we are developing Agent-1-Vision a multimodal model with more accurate click location predictions.
28-
29-
## Agent-1-Vision Model API Access
30-
We will soon be offering API access to our Agent-1-Vision model.
31-
32-
If you're interested in gaining access to this API, sign up [here](https://othersideai.typeform.com/to/FszaJ1k8?typeform-source=www.hyperwriteai.com).
33-
3426
## Demo
35-
3627
https://github.com/OthersideAI/self-operating-computer/assets/42594239/9e8abc96-c76a-46fb-9b13-03678b3c67e0
3728

3829

@@ -61,6 +52,15 @@ operate
6152

6253
## Using `operate` Modes
6354

55+
#### Try OpenAI models
56+
57+
The default model for the project is gpt-4o. Which is run by simply typing `operate`. To try running OpenAI's new `o1` model, use the command below.
58+
59+
```
60+
operate -m o1-with-ocr
61+
```
62+
63+
6464
### Multimodal Models `-m`
6565
An additional model is now compatible with the Self Operating Computer Framework. Try Google's `gemini-pro-vision` by following the instructions below.
6666

0 commit comments

Comments
 (0)