Skip to content

Commit 206f25e

Browse files
committed
Bump to exllamav2 0.2.3 and fix for API changes
1 parent cb09821 commit 206f25e

File tree

2 files changed

+4
-2
lines changed

2 files changed

+4
-2
lines changed

backend/sessions.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -533,7 +533,9 @@ def generate(self, data):
533533
gen_settings = gen_settings,
534534
token_healing = p_healing,
535535
abort_event = abort_event,
536-
banned_strings = banned_strings
536+
banned_strings = banned_strings,
537+
filters = gen_settings.filters,
538+
filter_prefer_eos = gen_settings.filters
537539
)
538540
if abort_event.is_set():
539541
abort_event.clear()

requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
torch>=2.1.0
22
pynvml
3-
exllamav2>=0.2.0
3+
exllamav2>=0.2.3
44
Flask>=2.3.2
55
waitress>=2.1.2

0 commit comments

Comments
 (0)