Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Environment selection in documentation #2

Closed

Conversation

anandhu-eng
Copy link

No description provided.

anandhu-eng and others added 13 commits May 21, 2024 18:43
Created initial file for network

Added arg node

Added arg port

Incorporated tqdm

Code clean
Class call made proper

Set offload folder

Update backend.py

added offload state dict

Update offload conditions

Initial commit gptj network

Update backend.py

Update run.py

Fixed datatypes

set debug=True

debug

included statement for error checking

bug fix

Update request format

fixed format

Update backend.py

Update backend.py
Instead of processing the encoded tensors, texts are being passed, which is encoded in server.

Delete backend.py

File name changed

Delete run.py
Accepts single text query

Changed return variable

Updated response datatype
Renamed cache variable

f-string error fix

Update network_SUT.py

Semaphore declaration refined

Check for semaphore globally

Update size to GB

Semaphore initialisation bug fixed

Bug fix in getting model mem size

Updated KV_cache formula

Added semaphore support
Update README.md
@anandhu-eng anandhu-eng closed this Jun 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants