knotgrass/attention

This repository implements several types of attention modules in PyTorch, including:

  • Attention: the basic scaled dot-product attention module (a minimal sketch of the core computation follows this list).
  • Multi-Head Attention: performs attention on multiple "heads" of the input sequence, where each head has its own set of Q, K, V projections.
  • Multi-Query Attention: all query heads share a single key head and a single value head when attending to the input sequence.
  • Grouped-Query Attention: the query heads are divided into groups, and each group shares one key head and one value head.
  • Linformer: reduces the overall self-attention complexity from O(n²) to O(n) in both time and space by projecting the keys and values down to a fixed-length sequence.
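
All of these variants share the same core computation. Below is a minimal sketch of scaled dot-product attention; the shapes and names are illustrative assumptions, not this repository's API.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, head_dim)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    if mask is not None:
        # Block disallowed positions before the softmax.
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)  # attention distribution over the keys
    return weights @ v
```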

The multi-query and grouped-query attention modules are alternatives to multi-head attention with much lower memory-bandwidth requirements, and they have been used in many well-known models.
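
To illustrate why the memory footprint drops, here is a sketch of the grouped-query idea under the same illustrative assumptions: only a small number of K/V heads are stored, and each one is repeated so that every query head in its group attends to the same keys and values.

```python
import math
import torch

def grouped_query_attention(q, k, v):
    # q: (batch, n_heads, seq, head_dim); k, v: (batch, n_kv_heads, seq, head_dim)
    n_heads, n_kv_heads = q.size(1), k.size(1)
    assert n_heads % n_kv_heads == 0
    # Repeat each K/V head across its group of query heads; only the
    # n_kv_heads original K/V heads need to be kept in memory.
    group_size = n_heads // n_kv_heads
    k = k.repeat_interleave(group_size, dim=1)
    v = v.repeat_interleave(group_size, dim=1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    return torch.softmax(scores, dim=-1) @ v

# n_kv_heads == n_heads recovers multi-head attention;
# n_kv_heads == 1 recovers multi-query attention.
out = grouped_query_attention(
    torch.randn(2, 8, 16, 64),  # 8 query heads
    torch.randn(2, 2, 16, 64),  # 2 key heads -> groups of 4 query heads each
    torch.randn(2, 2, 16, 64),  # 2 value heads
)
```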

I implemented these modules in a simple, unoptimized way, with the goal of understanding how attention works. If you are looking for attention with better performance, use an optimized implementation instead.
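
One widely available option, offered here as an illustration rather than as this repository's recommendation, is PyTorch's built-in fused scaled dot-product attention, which dispatches to kernels such as FlashAttention when one is available:

```python
import torch
import torch.nn.functional as F

q, k, v = (torch.randn(1, 8, 128, 64) for _ in range(3))  # (batch, heads, seq, head_dim)
# Same math as the sketches above, but executed by a fused kernel.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```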

For more information, please see the original papers: Attention Is All You Need (Vaswani et al., 2017), Fast Transformer Decoding: One Write-Head Is All You Need (Shazeer, 2019), GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints (Ainslie et al., 2023), and Linformer: Self-Attention with Linear Complexity (Wang et al., 2020).
