-
Notifications
You must be signed in to change notification settings - Fork 101
Issues: huggingface/swift-transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How to use customized tokenizer?
tokenization
related to tokenizers
#140
opened Nov 11, 2024 by
cch1219
Edge tokenization issues: Unicode parsing
tokenization
related to tokenizers
#116
opened Aug 19, 2024 by
pcuenca
Option to use Related to the Swift -> Hugging Face Hub integration
~/.cache/huggingface/hub
for downloaded models
hub
#102
opened Jul 4, 2024 by
DePasqualeOrg
Tokenizer behavior is different from Python transformers
bug
Something isn't working
tokenization
related to tokenizers
#96
opened May 8, 2024 by
shinichy
Convert OpenELM to float16 Core ML
good first issue
Good for newcomers
modelling
related to CoreML/Transformers
#95
opened Apr 30, 2024 by
pcuenca
Tokenizer force unwraps an optional that's nil for Gemma
tokenization
related to tokenizers
#88
opened Apr 10, 2024 by
ViRo3
SplitPreTokenizer with invert true returning array with empty string
tokenization
related to tokenizers
#55
opened Mar 6, 2024 by
davidkoski
Whisper normalization for evals
tokenization
related to tokenizers
#47
opened Feb 7, 2024 by
pcuenca
Tokenizers: handle related to tokenizers
special_tokens
tokenization
#31
opened Dec 24, 2023 by
pcuenca
Support for embedding models (BGE, GTE etc)
good first issue
Good for newcomers
#22
opened Nov 27, 2023 by
michaeljelly
Allow discrete sequence lengths (enumerated input shapes)
modelling
related to CoreML/Transformers
#10
opened Aug 8, 2023 by
pcuenca
Optimization: cache past key-values
enhancement
New feature or request
modelling
related to CoreML/Transformers
#9
opened Aug 8, 2023 by
pcuenca
Support for encoder-decoder models like T5 and Flan
tokenization
related to tokenizers
#8
opened Aug 8, 2023 by
pcuenca
Tokenizer models: port Unigram and WordPiece
tokenization
related to tokenizers
#5
opened Aug 8, 2023 by
pcuenca
Tokenizers: additional Normalizers, PreTokenizers, PostProcessors
good first issue
Good for newcomers
tokenization
related to tokenizers
#4
opened Aug 8, 2023 by
pcuenca
ProTip!
Add no:assignee to see everything that’s not assigned.