Skip to content

Commit 19f8935

Browse files
Update README.md
Updated to make it very clear that no llama models are being provided at this time.
1 parent 41c2fc6 commit 19f8935

File tree

1 file changed

+9
-7
lines changed

1 file changed

+9
-7
lines changed

SQFT/README.md

+9-7
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,9 @@ We have released several foundation models (sparse or sparse-and-quantized) for
1717
|-----------------------------------------------------------------------------------|----------|------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------|
1818
| [Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) | 50% | [IntelLabs/sqft-mistral-7b-v0.3-50-base](https://huggingface.co/IntelLabs/sqft-mistral-7b-v0.3-50-base) | [IntelLabs/sqft-mistral-7b-v0.3-50-base-gptq](https://huggingface.co/IntelLabs/sqft-mistral-7b-v0.3-50-base-gptq) |
1919
| [Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) | 50% | [IntelLabs/sqft-phi-3-mini-4k-50-base](https://huggingface.co/IntelLabs/sqft-phi-3-mini-4k-50-base) | [IntelLabs/sqft-phi-3-mini-4k-50-base-gptq](https://huggingface.co/IntelLabs/sqft-phi-3-mini-4k-50-base-gptq) |
20-
| [Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) | 50% | [IntelLabs/sqft-llama-3-8b-50-base]() | [IntelLabs/sqft-llama-3-8b-50-base-gptq]() |
21-
`*` **Llama-3 models are under review**
20+
| [Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) | 50% | IntelLabs/sqft-llama-3-8b-50-base<sup>*</sup> | IntelLabs/sqft-llama-3-8b-50-base-gptq<sup>*</sup> |
21+
22+
<sup>*</sup> *Llama-3 models are currently under internal review and will be released soon.*
2223

2324
[//]: # (https://huggingface.co/IntelLabs/sqft-llama-3-8b-50-base)
2425
[//]: # (https://huggingface.co/IntelLabs/sqft-llama-3-8b-50-base-gptq)
@@ -333,10 +334,11 @@ lm_eval --model hf \
333334

334335
| Base Model | Task | Method | Fine-tuned Model |
335336
|------------------------------------------------------------------------------------------------|--------|-----------------------|-------------------------------------------------------------------------------------------------------------------------------------|
336-
| [sqft-llama-3-8b-50-base]() | GSM8K | SQFT + SparsePEFT | [sqft-llama-3-8b-50-gptq-gsm8k-heu-adapter]() |
337-
| [sqft-llama-3-8b-50-base-gptq]() | GSM8K | SQFT | [sqft-sparsepeft-llama-3-8b-50-gsm8k-heu]() |
338-
| [sqft-llama-3-8b-50-base-gptq]() | GSM8K | SQFT + QA-SparsePEFT | [sqft-qa-sparsepeft-llama-3-8b-50-gptq-gsm8k-heu]() |
339-
`*` **Llama-3-8B fine-tuned models are under review**
337+
| sqft-llama-3-8b-50-base<sup>*</sup> | GSM8K | SQFT + SparsePEFT | sqft-llama-3-8b-50-gptq-gsm8k-heu-adapter |
338+
| sqft-llama-3-8b-50-base-gptq | GSM8K | SQFT | sqft-sparsepeft-llama-3-8b-50-gsm8k-heu |
339+
| sqft-llama-3-8b-50-base-gptq | GSM8K | SQFT + QA-SparsePEFT | sqft-qa-sparsepeft-llama-3-8b-50-gptq-gsm8k-heu |
340+
341+
<sup>*</sup> *Llama-3 models are currently under internal review and will be released soon.*
340342

341343
[//]: # (https://huggingface.co/IntelLabs/sqft-llama-3-8b-50-base)
342344

@@ -360,4 +362,4 @@ If you find SQFT's code and paper helpful, please kindly cite:
360362
year = {2024},
361363
url = {}
362364
}
363-
```
365+
```

0 commit comments

Comments
 (0)