-
Notifications
You must be signed in to change notification settings - Fork 267
Issues: linkedin/Liger-Kernel
[RFC] Liger FlexChunkLoss: Alignment and Distillation loss
#371
opened Nov 8, 2024 by
shivam15s
Open
22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
The convergence test Something isn't working
test_mini_models_with_logits
is failing with the latest transformers
bug
#543
opened Jan 27, 2025 by
Tcc0403
revert_liger_kernel_to_xxx
can't revert LigerCrossEntropyLoss for transformers>=4.46.1
bug
#542
opened Jan 27, 2025 by
Tcc0403
Memory Optimization with Liger Kernel Shows Limited Effect on larger Model (more than 7B)
#517
opened Jan 8, 2025 by
dyyoungg
LigerFusedLinearCrossEntropyLoss
Causes Training Loss to Diverge After Reaching ~8
#512
opened Jan 4, 2025 by
penghui-yang
Extending Liger-Kernel Optimizations to Encoder Models Like BER
#500
opened Dec 26, 2024 by
pengzhangzhi
error when run Good for newcomers
sh run_qwen.sh
good first issue
#487
opened Dec 18, 2024 by
CharlesJhonson
Potential Optimization for Preference Training with Prefix Sharing
#476
opened Dec 13, 2024 by
austin362667
ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)
#401
opened Nov 20, 2024 by
shivam15s
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.