RPC offloading uses a local model copy #9740
Replies: 6 comments 4 replies
-
It may be tricky because the backend interface has no concept of files. You could try caching the calls to the I would suggest first making structs for all the RPC protocol commands to make the code more readable and avoid mistakes. It could also be a good way to introduce yourself to the code. |
Beta Was this translation helpful? Give feedback.
-
This is what is going on during initialization on the 1Gbit network: |
Beta Was this translation helpful? Give feedback.
-
i think what will be best is like download whole model on both side and then just loading specific number of layer the network tells you to. |
Beta Was this translation helpful? Give feedback.
-
@alfrentgen any updates? |
Beta Was this translation helpful? Give feedback.
-
no one fking care here :\ |
Beta Was this translation helpful? Give feedback.
-
I will try to implement @slaren's idea, you can follow #10095 for details |
Beta Was this translation helpful? Give feedback.
-
Hello!
First of all, I would like to say that the RPC server is a greate feature. Thank you.
However, it takes too long to offload a part of the model using 100Mbit connection. I am thinking of buying 1Gbit switch now )
But, I think it would be better to have on option in RPC server to specify a model file copy stored on the local storage. So, it can download its part of the model faster from the storage, avoiding network transmission on init stage.
Another idea is to have cache in the RPC server. The cache can also be stored locally either in RAM or on the disk. It looks more complicated than the first idea.
I can try to implement RPC offloading with the local model copy. But I would like to get some hints on where to start looking in the code.
Beta Was this translation helpful? Give feedback.
All reactions