This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Commit f3024c3

removed tensorrt-llm and made general improvements to all docs

1 parent ab88347

File tree

17 files changed: +314 -345 lines

docs/docs/architecture/cortexrc.mdx

Lines changed: 0 additions & 1 deletion

@@ -34,7 +34,6 @@ You can configure the following parameters in the `.cortexrc` file:
 | `apiServerPort` | Port number for the Cortex.cpp API server. | `39281` |
 | `logFolderPath` | Path to the folder where logs are located. | User's home folder. |
 | `logLlamaCppPath` | The llama-cpp engine log file path. | `./logs/cortex.log` |
-| `logTensorrtLLMPath` | The tensorrt-llm engine log file path. | `./logs/cortex.log` |
 | `logOnnxPath` | The onnxruntime engine log file path. | `./logs/cortex.log` |
 | `maxLogLines` | The maximum number of log lines written to the file. | `100000` |
 | `checkedForUpdateAt` | The last time updates were checked. | `0` |
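To make the table concrete, here is a minimal sketch of a `.cortexrc` after this change, assuming the file keeps the flat YAML key-value layout the table implies; the values shown are the documented defaults.

```sh
# Sketch: inspect the config; keys and values mirror the table above.
# The home-folder location of .cortexrc is an assumption.
cat ~/.cortexrc
# apiServerPort: 39281
# logFolderPath: /home/<user>
# logLlamaCppPath: ./logs/cortex.log
# logOnnxPath: ./logs/cortex.log    # logTensorrtLLMPath no longer applies
# maxLogLines: 100000
# checkedForUpdateAt: 0
```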

docs/docs/architecture/data-folder.mdx

Lines changed: 5 additions & 12 deletions

@@ -51,18 +51,11 @@ it typically follows the structure below:
 ├── cortex.db
 ├── engines/
 │   ├── cortex.llamacpp/
-│   │   ├── deps/
-│   │   │   ├── libcublasLt.so.12
-│   │   │   └── libcudart.so.12
-│   │   └── linux-amd64-avx2-cuda-12-0/
-│   │   └── ...
-│   └── cortex.tensorrt-llm/
-│   ├── deps/
-│   │   └── ...
-│   └── linux-cuda-12-4/
-│   └── v0.0.9/
-│   ├── ...
-│   └── libtensorrt_llm.so
+│   ├── deps/
+│   │   ├── libcublasLt.so.12
+│   │   └── libcudart.so.12
+│   └── linux-amd64-avx2-cuda-12-0/
+│   └── ...
 ├── files
 ├── logs/
 │   ├── cortex-cli.log
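A quick, hedged way to confirm the flattened engine layout shown above; the `~/cortexcpp` data-folder path is an assumption, so substitute your configured data folder.

```sh
# Sketch: after the change, engines/ holds only the llama.cpp engine
# pieces (layout mirrors the tree above; data-folder path is assumed)
ls ~/cortexcpp/engines
# cortex.llamacpp/  deps/  linux-amd64-avx2-cuda-12-0/
```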

docs/docs/basic-usage/index.mdx

Lines changed: 2 additions & 3 deletions

@@ -36,7 +36,7 @@ curl --request DELETE \
 
 ## Engines
 Cortex currently supports a general Python Engine for highly customised deployments and
-3 specialized ones for different multi-modal foundation models: llama.cpp, ONNXRuntime and TensorRT-LLM.
+2 specialized ones for different multi-modal foundation models: llama.cpp and ONNXRuntime.
 
 By default, Cortex installs `llama.cpp` as its main engine, as it can be used on most laptops,
 desktop environments and operating systems.
@@ -58,8 +58,7 @@ curl --request GET \
 "name": "linux-amd64-avx2-cuda-12-0",
 "version": "v0.1.49"
 }
-],
-"tensorrt-llm": []
+]
 }
 ```
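To see the trimmed response shape end to end, here is a hedged example of the List Engines call edited above; the `/v1/engines` path is the endpoint referenced by these docs and `39281` is the documented default port.

```sh
# Sketch: list engines; the response no longer carries a "tensorrt-llm" key
curl --request GET \
  --url http://127.0.0.1:39281/v1/engines
```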

docs/docs/capabilities/models/index.mdx

Lines changed: 2 additions & 9 deletions

@@ -3,15 +3,9 @@ title: Model Overview
 description: The Model section overview
 ---
 
-:::warning
-🚧 Cortex.cpp is currently under active development. Our documentation outlines the intended behavior
-of Cortex, which may not yet be fully implemented in the codebase.
-:::
-
 Models in Cortex are used for inference purposes (e.g., chat completion, embedding, etc.) after they
 have been downloaded locally. Currently, we support different engines including `llama.cpp` with the
-GGUF model format, TensorRT-LLM for optimized inference on NVIDIA hardware, and ONNX for edge or
-different model deployments.
+GGUF model format, and ONNX for edge or different model deployments.
 
 In the future, you will also be able to run remote models (like OpenAI GPT-4 and Claude 3.5 Sonnet) via
 Cortex. Support for OpenAI and Anthropic engines is under development and will be available soon.
@@ -27,7 +21,6 @@ can facilitate the following:
 Cortex supports multiple model formats, and each format requires a specific engine to run:
 - GGUF - run with `llama-cpp` engine
 - ONNX - run with `onnxruntime` engine
-- TensorRT-LLM - run with `tensorrt-llm` engine
 
 Within the Python Engine (currently under development), you can run models in other formats
 
@@ -45,6 +38,6 @@ These models are ready to be downloaded and you can check them out at the link above
 
 Built-in models are made available across the following variants:
 
-- **By format**: `gguf`, `onnx`, and `tensorrt-llm`
+- **By format**: `gguf` and `onnx`
 - **By Size**: `7b`, `13b`, and more.
 - **By quantization method**: `q4`, `q8`, and more.
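As a hedged illustration of how the remaining variants surface in practice; the model ids below are illustrative, not confirmed hub entries, and assume a `cortex pull` download command.

```sh
# Sketch: pull a built-in model by format variant (ids are illustrative)
cortex pull llama3.1:gguf
cortex pull llama3.1:onnx
```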

docs/docs/capabilities/models/model-yaml.mdx

Lines changed: 0 additions & 5 deletions

@@ -6,11 +6,6 @@ description: The model.yaml
 import Tabs from "@theme/Tabs";
 import TabItem from "@theme/TabItem";
 
-:::warning
-🚧 Cortex is currently under active development. Our documentation outlines the intended behavior of
-Cortex, which may not yet be fully implemented in the codebase.
-:::
-
 Cortex uses a `model.yaml` file to specify the configuration desired for each model. Models can be downloaded
 from the Cortex Model Hub or Hugging Face repositories. Once downloaded, the model data is parsed and stored
 in the `models` directory.
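For orientation, a heavily hedged sketch of where a downloaded model's `model.yaml` lands; the data-folder path and the fields shown are assumptions, not the authoritative schema.

```sh
# Sketch: a parsed model.yaml in the models directory (fields illustrative)
cat ~/cortexcpp/models/<model>/model.yaml
# model: <model-id>
# engine: llama-cpp    # or onnxruntime, matching the model format
# ...
```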

docs/docs/cli/config.mdx

Lines changed: 35 additions & 10 deletions

@@ -9,6 +9,11 @@ import TabItem from "@theme/TabItem";
 
 # `cortex config`
 
+:::warning
+At the moment, the `cortex config` command only supports a few configurations. More
+configurations will be added soon.
+:::
+
 This command allows you to update server configurations such as CORS and Allowed Headers.
 
 ## Usage
@@ -65,14 +70,34 @@ This command returns all server configurations.
 For example, it returns the following:
 
 ```
-+-------------------------------------------------------------------------------------+
-| Config name | Value |
-+-------------------------------------------------------------------------------------+
-| allowed_origins | http://localhost:39281 |
-+-------------------------------------------------------------------------------------+
-| allowed_origins | http://127.0.0.1:39281/ |
-+-------------------------------------------------------------------------------------+
-| cors | true |
-+-------------------------------------------------------------------------------------+
++-----------------------+-------------------------------------+
+| Config name | Value |
++-----------------------+-------------------------------------+
+| allowed_origins | http://localhost:39281 |
++-----------------------+-------------------------------------+
+| allowed_origins | http://127.0.0.1:39281 |
++-----------------------+-------------------------------------+
+| allowed_origins | http://0.0.0.0:39281 |
++-----------------------+-------------------------------------+
+| cors | true |
++-----------------------+-------------------------------------+
+| huggingface_token | |
++-----------------------+-------------------------------------+
+| no_proxy | example.com,::1,localhost,127.0.0.1 |
++-----------------------+-------------------------------------+
+| proxy_password | |
++-----------------------+-------------------------------------+
+| proxy_url | |
++-----------------------+-------------------------------------+
+| proxy_username | |
++-----------------------+-------------------------------------+
+| verify_host_ssl | true |
++-----------------------+-------------------------------------+
+| verify_peer_ssl | true |
++-----------------------+-------------------------------------+
+| verify_proxy_host_ssl | true |
++-----------------------+-------------------------------------+
+| verify_proxy_ssl | true |
++-----------------------+-------------------------------------+
 
-```
+```
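A hedged companion to the listing above: the subcommand name for dumping all configurations is assumed from this page's "returns all server configurations" description, so verify it with `cortex config -h`.

```sh
# Sketch: print the full configuration table shown above
# (subcommand name is an assumption; check `cortex config -h`)
cortex config status
```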

docs/docs/cli/engines/index.mdx

Lines changed: 16 additions & 11 deletions

@@ -9,8 +9,8 @@ import TabItem from "@theme/TabItem";
 
 This command allows you to manage various engines available within Cortex.
 
-
 **Usage**:
+
 <Tabs>
 <TabItem value="MacOs/Linux" label="MacOs/Linux">
 ```sh
@@ -24,26 +24,25 @@ This command allows you to manage various engines available within Cortex.
 </TabItem>
 </Tabs>
 
-
 **Options**:
 
 | Option | Description | Required | Default value | Example |
 |-------------------|-------------------------------------------------------|----------|---------------|-----------------|
 | `-h`, `--help` | Display help information for the command. | No | - | `-h` |
 {/* | `-vk`, `--vulkan` | Install Vulkan engine. | No | `false` | `-vk` | */}
 
----
-# Subcommands:
+
 ## `cortex engines list`
+
 :::info
 This CLI command calls the following API endpoint:
 - [List Engines](/api-reference#tag/engines/get/v1/engines)
 :::
-This command lists all the Cortex's engines.
-
 
+This command lists all of Cortex's engines.
 
 **Usage**:
+
 <Tabs>
 <TabItem value="MacOs/Linux" label="MacOs/Linux">
 ```sh
@@ -58,6 +57,7 @@ This command lists all the Cortex's engines.
 </Tabs>
 
 For example, it returns the following:
+
 ```
 +---+--------------+-------------------+---------+----------------------------+---------------+
 | # | Name | Supported Formats | Version | Variant | Status |
@@ -66,18 +66,19 @@ For example, it returns the following:
 +---+--------------+-------------------+---------+----------------------------+---------------+
 | 2 | llama-cpp | GGUF | 0.1.34 | linux-amd64-avx2-cuda-12-0 | Ready |
 +---+--------------+-------------------+---------+----------------------------+---------------+
-| 3 | tensorrt-llm | TensorRT Engines | | | Not Installed |
-+---+--------------+-------------------+---------+----------------------------+---------------+
 ```
 
 ## `cortex engines get`
+
 :::info
 This CLI command calls the following API endpoint:
 - [Get Engine](/api-reference#tag/engines/get/v1/engines/{name})
 :::
+
 This command returns an engine detail defined by an engine `engine_name`.
 
 **Usage**:
+
 <Tabs>
 <TabItem value="MacOs/Linux" label="MacOs/Linux">
 ```sh
@@ -92,18 +93,19 @@ This command returns an engine detail defined by an engine `engine_name`.
 </Tabs>
 
 For example, it returns the following:
+
 ```
 +-----------+-------------------+---------+-----------+--------+
 | Name | Supported Formats | Version | Variant | Status |
 +-----------+-------------------+---------+-----------+--------+
 | llama-cpp | GGUF | 0.1.37 | mac-arm64 | Ready |
 +-----------+-------------------+---------+-----------+--------+
 ```
+
 :::info
 To get an engine name, run the [`engines list`](/docs/cli/engines/list) command.
 :::
 
-
 **Options**:
 
 | Option | Description | Required | Default value | Example |
@@ -114,16 +116,18 @@ To get an engine name, run the [`engines list`](/docs/cli/engines/list) command.
 
 
 ## `cortex engines install`
+
 :::info
 This CLI command calls the following API endpoint:
 - [Init Engine](/api-reference#tag/engines/post/v1/engines/{name}/init)
 :::
+
 This command downloads the required dependencies and installs the engine within Cortex. Currently, Cortex supports two engines:
 - `llama-cpp`
 - `onnxruntime`
-- `tensorrt-llm`
 
 **Usage**:
+
 <Tabs>
 <TabItem value="MacOs/Linux" label="MacOs/Linux">
 ```sh
@@ -133,7 +137,6 @@ This command downloads the required dependencies and installs the engine within
 <TabItem value="Windows" label="Windows">
 ```sh
 cortex.exe engines install [options] <engine_name>
-
 ```
 </TabItem>
 </Tabs>
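With `tensorrt-llm` gone, installing the two remaining engines follows directly from the usage line above.

```sh
# Install the two engines this page still documents
cortex engines install llama-cpp
cortex engines install onnxruntime
```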
@@ -150,6 +153,7 @@ This command downloads the required dependencies and installs the engine within
 This command uninstalls the engine within Cortex.
 
 **Usage**:
+
 <Tabs>
 <TabItem value="MacOs/Linux" label="MacOs/Linux">
 ```sh
@@ -164,6 +168,7 @@ This command uninstalls the engine within Cortex.
 </Tabs>
 
 For Example:
+
 ```bash
 ## Llama.cpp engine
 cortex engines uninstall llama-cpp
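A quick follow-up to the uninstall example: re-running the documented `engines list` subcommand should show a table without a tensorrt-llm row.

```sh
# Verify the remaining engine set after this change
cortex engines list
```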
