* transfer files in DockerGuides from rst to md
* add some dividing lines
* adjust the title hierarchy in docker_cpp_xpu_quickstart.md
* restore
* switch to the correct branch
* small change
---------
Co-authored-by: ATMxsp01 <[email protected]>
docs/mddocs/DockerGuides/docker_cpp_xpu_quickstart.md (+29 -26)
@@ -1,16 +1,16 @@
-##Run llama.cpp/Ollama/Open-WebUI on an Intel GPU via Docker
+# Run llama.cpp/Ollama/Open-WebUI on an Intel GPU via Docker
 
 ## Quick Start
 
 ### Install Docker
 
 1. Linux Installation
 
-Follow the instructions in this [guide](https://ipex-llm.readthedocs.io/en/latest/doc/LLM/DockerGuides/docker_windows_gpu.html#linux) to install Docker on Linux.
+Follow the instructions in this [guide](./docker_windows_gpu.md#linux) to install Docker on Linux.
 
 2. Windows Installation
 
-For Windows installation, refer to this [guide](https://ipex-llm.readthedocs.io/en/latest/doc/LLM/DockerGuides/docker_windows_gpu.html#install-docker-desktop-for-windows).
+For Windows installation, refer to this [guide](./docker_windows_gpu.md#install-docker-desktop-for-windows).
@@ ... @@
 Choose one of the following methods to start the container:
 
-To map the `xpu` into the container, you need to specify `--device=/dev/dri` when booting the container. Select the device you are running(device type:(Max, Flex, Arc, iGPU)). And change the `/path/to/models` to mount the models. `bench_model` is used to benchmark quickly. If want to benchmark, make sure it on the `/path/to/models`
+<details>
+<summary>For <strong>Linux</strong>:</summary>
 
-.. code-block:: bash
+To map the `xpu` into the container, you need to specify `--device=/dev/dri` when booting the container. Select the device you are running on (device type: Max, Flex, Arc, or iGPU), and change `/path/to/models` to mount the models. `bench_model` is used for quick benchmarking; if you want to benchmark, make sure the model file is under `/path/to/models`.
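For concreteness, here is a minimal sketch of the Linux `docker run` invocation the added paragraph describes. The image tag, container name, and benchmark model file are placeholders, not taken from this diff:

```bash
# Placeholders: substitute the real image tag and your own paths/model file.
export DOCKER_IMAGE=intelanalytics/ipex-llm-inference-cpp-xpu:latest

# --device=/dev/dri maps the Intel GPU (xpu) into the container;
# -v mounts the local model directory; bench_model names a GGUF file
# under /path/to/models to use for a quick benchmark run.
sudo docker run -itd \
    --net=host \
    --device=/dev/dri \
    -v /path/to/models:/models \
    -e bench_model="mistral-7b-v0.1.Q4_0.gguf" \
    --name=ipex-llm-llamacpp-container \
    $DOCKER_IMAGE
```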
@@ ... @@
-To map the `xpu` into the container, you need to specify `--device=/dev/dri` when booting the container. And change the `/path/to/models` to mount the models. Then add `--privileged` and map the `/usr/lib/wsl` to the docker.
+<details>
+<summary>For <strong>Windows</strong>:</summary>
 
-.. code-block:: bash
+To map the `xpu` into the container, you need to specify `--device=/dev/dri` when booting the container, and change `/path/to/models` to mount the models. Then add `--privileged` and map `/usr/lib/wsl` into the container.
 
 After the container is booted, you could get into the container through `docker exec`.
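A corresponding sketch for Windows (Docker Desktop on WSL2), with the same placeholder names as the Linux sketch above; `--privileged` and the `/usr/lib/wsl` mount expose the WSL GPU driver stack to the container, and `docker exec` then attaches a shell:

```bash
# Same placeholders as the Linux sketch; --privileged and the /usr/lib/wsl
# mount are the Windows/WSL-specific additions described above.
sudo docker run -itd \
    --net=host \
    --device=/dev/dri \
    --privileged \
    -v /usr/lib/wsl:/usr/lib/wsl \
    -v /path/to/models:/models \
    --name=ipex-llm-llamacpp-container \
    $DOCKER_IMAGE

# Once the container is up, open an interactive shell inside it:
sudo docker exec -it ipex-llm-llamacpp-container /bin/bash
```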
@@ -126,7 +129,7 @@ llama_print_timings: eval time = xxx ms / 31 runs ( xxx ms per
 llama_print_timings: total time = xxx ms / xxx tokens
 ```
 
-Please refer to this [documentation](https://ipex-llm.readthedocs.io/en/latest/doc/LLM/Quickstart/llama_cpp_quickstart.html) for more details.
+Please refer to this [documentation](../Quickstart/llama_cpp_quickstart.md) for more details.
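The `llama_print_timings` lines above are the tail end of a llama.cpp run. As a rough illustration only (the model path and prompt are assumptions; the flags are standard llama.cpp CLI options), such output comes from an invocation like:

```bash
# -m selects the GGUF model, -p sets the prompt, -n caps generated tokens,
# and -ngl 999 offloads all layers to the GPU; run inside the container.
./main -m /models/mistral-7b-v0.1.Q4_0.gguf \
       -p "Once upon a time, there existed a little girl" \
       -n 32 \
       -ngl 999
```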
 
 ### Running Ollama serving with IPEX-LLM on Intel GPU
@@ -194,13 +197,13 @@ Sample output:
 ```
 
-Please refer to this [documentation](https://ipex-llm.readthedocs.io/en/latest/doc/LLM/Quickstart/ollama_quickstart.html#pull-model) for more details.
+Please refer to this [documentation](../Quickstart/ollama_quickstart.md#4-pull-model) for more details.
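As a small usage sketch of the pull step that the linked section covers (the model tag is only an example, and `./ollama` assumes you run it from the directory holding the binary):

```bash
# Fetch a model into the running ollama server's store with the standard
# `ollama pull` subcommand; replace llama2 with the model you want.
./ollama pull llama2
```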
 
 ### Running Open WebUI with Intel GPU
 
 Start the ollama and load the model first, then use the open-webui to chat.
-If you have difficulty accessing the huggingface repositories, you may use a mirror, e.g. add export HF_ENDPOINT=https://hf-mirror.combefore running bash start.sh.
+If you have difficulty accessing the huggingface repositories, you may use a mirror, e.g. add `export HF_ENDPOINT=https://hf-mirror.com` before running `bash start.sh`.
 
 ```bash
 cd /llm/scripts/
 bash start-open-webui.sh
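To make the mirror hint above concrete: the `export` just needs to happen in the same shell session, before the start script runs, e.g.:

```bash
# Optional mirror for environments where huggingface.co is hard to reach.
export HF_ENDPOINT=https://hf-mirror.com
cd /llm/scripts/
bash start-open-webui.sh
```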
@@ -218,4 +221,4 @@ INFO: Uvicorn running on http://0.0.0.0:8080 (Press CTRL+C to quit)
 
-For how to log-in or other guide, Please refer to this [documentation](https://ipex-llm.readthedocs.io/en/latest/doc/LLM/Quickstart/open_webui_with_ollama_quickstart.html) for more details.
+For how to log in and other guidance, please refer to this [documentation](../Quickstart/open_webui_with_ollama_quickstart.md) for more details.
docs/mddocs/DockerGuides/docker_pytorch_inference_gpu.md (+31 -38)
@@ -2,16 +2,12 @@
 
 We can run PyTorch Inference Benchmark, Chat Service and PyTorch Examples on Intel GPUs within Docker (on Linux or WSL).
 
-```eval_rst
-.. note::
-
-   The current Windows + WSL + Docker solution only supports Arc series dGPU. For Windows users with MTL iGPU, it is recommended to install directly via pip install in Miniforge Prompt. Refer to `this guide <https://ipex-llm.readthedocs.io/en/latest/doc/LLM/Quickstart/install_windows_gpu.html>`_.
-
-```
+> [!NOTE]
+> The current Windows + WSL + Docker solution only supports Arc series dGPU. For Windows users with an MTL iGPU, it is recommended to install directly via `pip install` in a Miniforge Prompt. Refer to [this guide](../Quickstart/install_windows_gpu.md).
 
 ## Install Docker
 
-Follow the [Docker installation Guide](./docker_windows_gpu.html#install-docker) to install docker on either Linux or Windows.
+Follow the [Docker installation Guide](./docker_windows_gpu.md#install-docker) to install Docker on either Linux or Windows.