Question #6

fakerybakery · 2023-12-31T19:54:46Z

Hi,
Thanks for releasing this code. Does this codebase decrease the size of the model (ie file size, required VRAM)?
Thank you!

dkmisra · 2024-01-03T04:15:19Z

This is one of the feature of LASER but currently the code doesn't do that. This is because we do SVD on a mxn W matrix which gives us U, S, V matrices of size m x m, mxn and nxn respectively, but then we take the top-k dimensions and multiply them back giving back a mxn low-rank approximation of W. To save memory, we should not multiple these matrices back but instead store them as 3 separate matrices of size mxk, kxk, kxn where k is the required low-rank.

The code needs to be modified so that instead of using the modified W parameter we trigger matrix multiplication with 3 separate matrices. This will cut down memory time albeit it will increase the number of sequential steps.

I am adding this as a feature request and we should be able to support this.

dkmisra · 2024-01-04T05:50:18Z

Related to #9

dkmisra self-assigned this Jan 3, 2024

dkmisra added the enhancement New feature or request label Jan 3, 2024

Mihaiii mentioned this issue Jan 3, 2024

Mistral Support #4

Open

dkmisra mentioned this issue Jun 30, 2024

After your code is saved, the size of the weights is the same as the pre-trained ones, and no memory is saved. What is the reason for this? #28

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question #6

Question #6

fakerybakery commented Dec 31, 2023

dkmisra commented Jan 3, 2024 •

edited

Loading

dkmisra commented Jan 4, 2024

Question #6

Question #6

Comments

fakerybakery commented Dec 31, 2023

dkmisra commented Jan 3, 2024 • edited Loading

dkmisra commented Jan 4, 2024

dkmisra commented Jan 3, 2024 •

edited

Loading