How to implement a 4D attention mask within FlashAttention?

<img width="480" height="410" alt="Image" src="https://github.com/user-attachments/assets/57488f01-dbe1-4c94-8de3-8b47f18db5a1" />

As shown in the figure, I would like to know how to implement such a 4D attention mask, i.e. interleaved attention where some tokens use bidirectional attention and others use causal attention, in FlashAttention.
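
To make the question concrete, here is a minimal sketch in plain Python of the mask pattern I have in mind. The function name `build_interleaved_mask` and the `span_ids` labeling scheme are hypothetical, chosen only for illustration. I am assuming that tokens inside the same bidirectional span attend to each other fully and that every other pair follows the usual causal rule. This only builds the dense boolean mask with shape (batch, 1, q_len, k_len); it does not call FlashAttention.

```python
def build_interleaved_mask(span_ids):
    """Build a dense 4D boolean mask, indexed as mask[batch][head][query][key].

    span_ids[b][t] is a hypothetical per-token label:
      0        -> ordinary causal token
      n (> 0)  -> token belongs to bidirectional span n
    True means the query position may attend to the key position.
    """
    mask = []
    for ids in span_ids:
        n = len(ids)
        rows = []
        for q in range(n):
            row = []
            for k in range(n):
                causal = k <= q  # standard lower-triangular rule
                same_span = ids[q] != 0 and ids[q] == ids[k]  # full attention inside a span
                row.append(causal or same_span)
            rows.append(row)
        mask.append([rows])  # singleton head dimension, shared by all heads
    return mask


# One sequence of 6 tokens: positions 2..4 form a bidirectional span.
mask = build_interleaved_mask([[0, 0, 1, 1, 1, 0]])
for row in mask[0][0]:
    print("".join("x" if allowed else "." for allowed in row))
```

A dense mask like this can be converted to a boolean tensor and passed as `attn_mask` to `torch.nn.functional.scaled_dot_product_attention`, where `True` means "attend". What I am asking is whether the same pattern can be expressed in FlashAttention without materializing the full mask.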