You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| Model | Model link | Example link | Verified Platforms |
117
+
|:--|:--|:--|:--|
118
+
| LLaMA 2 |[meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
119
+
| LLaMA 3 |[meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
120
+
| LLaMA 3.2 |[meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct), [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
121
+
| Qwen 2 |[Qwen/Qwen2-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2-1.5B-Instruct), [Qwen/Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
122
+
| Qwen 2.5 |[Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Lunar Lake |
123
+
||[Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
124
+
| GLM-Edge |[THUDM/glm-edge-1.5b-chat](https://huggingface.co/THUDM/glm-edge-1.5b-chat), [THUDM/glm-edge-4b-chat](https://huggingface.co/THUDM/glm-edge-4b-chat)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
125
+
| MiniCPM |[openbmb/MiniCPM-1B-sft-bf16](https://huggingface.co/openbmb/MiniCPM-1B-sft-bf16), [openbmb/MiniCPM-2B-sft-bf16](https://huggingface.co/openbmb/MiniCPM-2B-sft-bf16)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Meteor Lake, Lunar Lake, Arrow Lake |
126
+
| Baichuan 2 |[baichuan-inc/Baichuan2-7B-Chat](https://huggingface.co/baichuan-inc/Baichuan2-7B-Chat)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md#2-run-optimized-models-experimental)| Lunar Lake |
127
+
| MiniCPM-Llama3-V-2_5 |[openbmb/MiniCPM-Llama3-V-2_5](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/Multimodal/README.md#2-run-optimized-models-experimental)| Lunar Lake |
128
+
| MiniCPM-V-2_6 |[openbmb/MiniCPM-V-2_6](https://huggingface.co/openbmb/MiniCPM-V-2_6)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/Multimodal/README.md#2-run-optimized-models-experimental)| Lunar Lake |
129
+
| Bce-Embedding-Base-V1 |[maidalun1020/bce-embedding-base_v1](https://huggingface.co/maidalun1020/bce-embedding-base_v1)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/Multimodal/README.md#2-run-optimized-models-experimental)| Lunar Lake |
130
+
| Speech_Paraformer-Large |[iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch](https://www.modelscope.cn/models/iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/Multimodal/README.md#2-run-optimized-models-experimental)| Lunar Lake |
130
131
131
132
132
133
> [!TIP]
@@ -138,14 +139,15 @@ IPEX-LLM also provides C++ API for running Hugging Face `transformers` models.
138
139
139
140
Refer to the following table for examples of verified models:
| Model | Model link | Example link | Verified Platforms |
143
+
|:--|:--|:--|:--|
144
+
| LLaMA 2 |[meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
145
+
| LLaMA 3 |[meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
146
+
| LLaMA 3.2 |[meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct), [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
147
+
| Qwen 2 |[Qwen/Qwen2-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2-1.5B-Instruct), [Qwen/Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
148
+
| Qwen 2.5 |[Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Lunar Lake |
149
+
||[Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
150
+
| MiniCPM |[openbmb/MiniCPM-1B-sft-bf16](https://huggingface.co/openbmb/MiniCPM-1B-sft-bf16), [openbmb/MiniCPM-2B-sft-bf16](https://huggingface.co/openbmb/MiniCPM-2B-sft-bf16)|[link](../../../python/llm/example/NPU/HF-Transformers-AutoModels/LLM/CPP_Examples)| Meteor Lake, Lunar Lake, Arrow Lake |
149
151
150
152
> [!TIP]
151
153
> You could refer to [here](../../../python/llm/example/NPU/HF-Transformers-AutoModels) for full IPEX-LLM examples on Intel NPU.
Please refer to [Quick Start](../../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#c-api) for details about verified platforms.
16
+
15
17
## 0. Prerequisites
16
18
For `ipex-llm` NPU support, please refer to [Quick Start](../../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#install-prerequisites) for details about the required preparations.
Please refer to [Quick Start](../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#python-api) for details about verified platforms.
25
+
24
26
## 0. Prerequisites
25
27
For `ipex-llm` NPU support, please refer to [Quick Start](../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#install-prerequisites) for details about the required preparations.
Please refer to [Quick Start](../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#python-api) for details about verified platforms.
15
+
14
16
## 0. Prerequisites
15
17
For `ipex-llm` NPU support, please refer to [Quick Start](../../../../../../docs/mddocs/Quickstart/npu_quickstart.md#install-prerequisites) for details about the required preparations.
0 commit comments