You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The main conflict is coming from the fact that differential heads are essentially doubled in size, so that they can be split in half, later. Infini-Attention was not designed to handle this, and thus fails here.
I tried to fix, but could not find an elegant solution, which left the math intact. Would love some help here.
The text was updated successfully, but these errors were encountered:
You can reproduce the errors like this:
The main conflict is coming from the fact that differential heads are essentially doubled in size, so that they can be split in half, later. Infini-Attention was not designed to handle this, and thus fails here.
I tried to fix, but could not find an elegant solution, which left the math intact. Would love some help here.
The text was updated successfully, but these errors were encountered: