Replies: 1 comment
-
Egads! llama works (llama-13b), although I had to do the following to get it to work:

Now I want to find where the model is loaded in the PY code to try an experiment.
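If it helps anyone else hunting for the load site: assuming the code goes through transformers' `from_pretrained` (a guess on my part, not something confirmed above), you can monkeypatch it to print a stack trace the moment the model loads, which points straight at the file and line doing the loading.

```python
# Hedged debugging sketch: wrap AutoModelForCausalLM.from_pretrained so the
# load prints who called it. Paste near the top of the entry script.
import traceback
from transformers import AutoModelForCausalLM

_orig = AutoModelForCausalLM.from_pretrained.__func__  # unwrap the classmethod

def _spy(cls, *args, **kwargs):
    print("from_pretrained called with:", args, kwargs)
    traceback.print_stack()  # the frames above this one are the load site
    return _orig(cls, *args, **kwargs)

AutoModelForCausalLM.from_pretrained = classmethod(_spy)
```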
-
I've been haunting the GitHub A1111 forums, getting performance on my 4090 from 13 it/s to 39, then 42, and today to 51 it/s using torch.compile(). It's easy if you have an i9-13900K. I'm less interested in pretty pictures than in the potential of ChatGPT-like things, so I want to give llama a try.
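For anyone wondering what the torch.compile() trick amounts to: it's essentially a one-line wrap around the model (PyTorch 2.0+). Whether it helps llama as much as it helped Stable Diffusion is exactly what I want to find out. A minimal sketch, with placeholder paths:

```python
# Minimal torch.compile() sketch (PyTorch >= 2.0). The path is a placeholder;
# 13B in fp16 is ~26GB, so a 24GB 4090 needs 4-bit weights or CPU offload --
# this only shows the shape of the wrapping.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "models/llama-13b-hf",       # substitute wherever the weights live
    torch_dtype=torch.float16,
    device_map="auto",           # needs the `accelerate` package
)

# The first forward pass triggers compilation and is slow; later calls
# reuse the compiled graph and should run noticeably faster.
model = torch.compile(model)
```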
I'm trying to get this thing running to see what it can do and whether I can find anything to make it faster. First step, per the instructions: download the 252GB HFv2 conversion. It seems, according to the instructions, that I need that AND one of the 4-bit files. I grabbed 13B for my first test.
25% downloaded so far and another hour to go.
Cloned the repo, pip installed the requirements, and now I'm just waiting to do my first run.
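While I wait, here's roughly what I expect the first smoke test to look like: a plain fp16 load via transformers. Paths and exact classes are my guesses; the repo may well wrap this differently.

```python
# Hedged first-run sketch. device_map="auto" (needs `accelerate`) spills
# whatever the 4090's 24GB can't hold into CPU RAM; 13B fp16 is ~26GB,
# which is exactly why the 4-bit files exist.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_dir = "models/llama-13b-hf"  # wherever the HFv2 conversion landed

tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForCausalLM.from_pretrained(
    model_dir, torch_dtype=torch.float16, device_map="auto"
)

inputs = tokenizer("The meaning of life is", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```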
One problem: I found a way to grab the six(?) json files matching llama-13b-hf, but I couldn't find how to grab the json files for llama-13b-hf-int4. I'll just copy them from the other dir and hope it works; see the sketch below.
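What I mean by copying them over, assuming my local directory layout (adjust the paths to taste):

```python
# Copy the config/tokenizer sidecar files from the fp16 dir to the int4 dir.
# Grabbing *.model too, since the SentencePiece tokenizer.model is usually
# needed alongside the json files.
import shutil
from pathlib import Path

src = Path("models/llama-13b-hf")       # has the json files
dst = Path("models/llama-13b-hf-int4")  # missing them
dst.mkdir(parents=True, exist_ok=True)

for pattern in ("*.json", "*.model"):
    for f in src.glob(pattern):
        shutil.copy2(f, dst / f.name)
```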