The padding token choice #1577
In the function `preprocess_packed_supervised_dataset`, the code uses -100 as the padding value for positions that should be ignored.
Answered by
hiyouga
Nov 21, 2023
https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html
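For context on the linked page: `torch.nn.CrossEntropyLoss` takes an `ignore_index` argument whose default is -100, and targets equal to that value do not contribute to the loss or its gradient. Since real token ids are non-negative, -100 cannot collide with a vocabulary entry. The snippet below is a minimal pure-Python sketch of that masking behavior, not PyTorch's implementation; the `cross_entropy` helper and the toy logits are made up for illustration.

```python
import math

# Default value of `ignore_index` in torch.nn.CrossEntropyLoss.
IGNORE_INDEX = -100


def cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Hypothetical helper: mean cross-entropy over non-ignored positions."""
    total, count = 0.0, 0
    for row, label in zip(logits, labels):
        if label == ignore_index:
            continue  # padded or masked position: contributes nothing
        # log-sum-exp with the max subtracted for numerical stability
        m = max(row)
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[label]
        count += 1
    # With no valid positions the mean is undefined.
    return total / count if count else float("nan")


# Three positions over a toy 3-token vocabulary; the last one is padding.
logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3], [0.0, 0.0, 0.0]]
labels_padded = [0, 1, IGNORE_INDEX]

loss_padded = cross_entropy(logits, labels_padded)
loss_unpadded = cross_entropy(logits[:2], labels_padded[:2])
```

The two losses agree because the padded position is skipped in both the sum and the averaging denominator, which is why label padding with -100 does not change the training signal.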
Answer selected by
JianhuiWei7