You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are conflicts between JobConfig and ModelArgs. Specfically, if we
let ModelArgs arguments be in JobConfig, then users have to control
these arguments via toml files or command line arguments. However, for
some flext_attn configurations, the requirement doesn't make sense as
some models have the desired settings.
This PR removes these configurations from JobConfig and uses model
flavor to control whether flex_attn should be enabled or not.
0 commit comments