Running demo code on V100 #8

Hi,
Congrats on the great work!
I only have V100 GPUs available to me.
Is there a way to run your inference/demo code (e.g., without flash attention), and if so, how?
Many thanks in advance!

Comments
@zhang-tao-whu Check the issues. It seems you need to modify the code to remove flash attention.
@lxtGH @zhang-tao-whu Thanks a lot. Do you have any hints on how to remove flash attention? Can it be done by passing a parameter? I've been looking around and trying things out, but it still does not work.
Maybe you can try setting use_flash_attn=False when loading the model.
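A minimal sketch of what this could look like, assuming a standard transformers loading path (the model path and other kwargs are placeholders; only use_flash_attn=False is confirmed in this thread):

```python
# Sketch only: checkpoint path and extra kwargs are placeholders, not
# confirmed by this thread. Only use_flash_attn=False comes from the
# discussion above.
import torch
from transformers import AutoModel, AutoTokenizer

path = "path/to/released/checkpoint"  # placeholder

model = AutoModel.from_pretrained(
    path,
    torch_dtype=torch.float16,  # V100 (Volta) has no bfloat16 support
    use_flash_attn=False,       # ask the remote modeling code to skip flash-attn
    trust_remote_code=True,
).eval().cuda()

tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
```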
Please let me know whether it works well on V100.
Hi, I have also tried setting use_flash_attn=False, but it still does not work on V100: there is still an error message saying the modeling file requires flash_attn.
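For reference, a hedged workaround sketch (an assumption, not something the maintainers confirm in this thread): if the downloaded modeling file imports flash_attn unconditionally, registering a stub module before calling from_pretrained can get past that import; with use_flash_attn=False the stubbed functions should never actually be called.

```python
# Hedged workaround sketch, not from this repo: register a dummy
# flash_attn module so an unconditional `import flash_attn` (or
# `from flash_attn import flash_attn_func`) in the downloaded modeling
# file does not raise ImportError on GPUs where flash-attn cannot be
# installed (e.g. V100). Run this BEFORE from_pretrained().
import sys
import types

stub = types.ModuleType("flash_attn")
# These attribute names are guesses at what the modeling file might
# import; with use_flash_attn=False they should never be called.
stub.flash_attn_func = None
stub.flash_attn_varlen_func = None
sys.modules["flash_attn"] = stub
```

Which attributes need stubbing depends on what the modeling file actually imports.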
I see. Let me check it.
Yes, I did the same as per @HarborYuan's comment and got the same error as @Ruining0916 described above.