Commit 9daf368

Add explicit model sampling parameters to requests
Author: sd109
1 parent ee0c0eb, commit 9daf368

1 file changed: +6 -0 lines changed

web-app-utils/example_app_vanilla.py

Lines changed: 6 additions & 0 deletions
@@ -29,6 +29,12 @@ def inference(message, history):
         "prompt": prompt.format(prompt=context),
         "stream": True,
         "max_tokens": 1000,
+        # Parameters requested by HU
+        "sampling_params": {
+            "temperature": 0.7,
+            "top_p": 0.4,
+            "top_k": 40,
+        }
     }
     response = requests.post(
         f"{backend_url}/generate", headers=headers, json=pload, stream=True
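
The hunk above hard-codes the sampling values inside the request payload. As a minimal sketch only, the same payload could be built by a small helper so the values are easier to adjust per request. The helper name `build_payload` and the range checks are assumptions for illustration and are not part of this commit; the key names and default values are taken from the diff, and whether the backend expects them nested under `"sampling_params"` is determined by the backend, not by this sketch.

```python
import json


def build_payload(prompt_text, temperature=0.7, top_p=0.4, top_k=40, max_tokens=1000):
    """Build the JSON body for the /generate request (hypothetical helper)."""
    # The range checks below are assumptions based on the usual meaning of
    # these sampling parameters; the commit itself performs no validation.
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if not 0 < top_p <= 1:
        raise ValueError("top_p must be in the interval (0, 1]")
    if top_k < 1:
        raise ValueError("top_k must be a positive integer")
    return {
        "prompt": prompt_text,
        "stream": True,
        "max_tokens": max_tokens,
        # Same keys and default values as the lines added in this commit.
        "sampling_params": {
            "temperature": temperature,
            "top_p": top_p,
            "top_k": top_k,
        },
    }


if __name__ == "__main__":
    # Print the payload instead of sending it, so the sketch runs offline.
    print(json.dumps(build_payload("Hello"), indent=2))
```

In `inference`, the resulting dictionary would take the place of `pload` in the existing `requests.post(..., json=pload, stream=True)` call.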
