Skip to content

zihling/ECE-285-ViT

Repository files navigation

ECE-285-ViT

Final project of ECE 285

TODO List

  • Build module :
    • patch embedding (need to be debugged and test) : divide the image and do the patch and position embedding, include the Linear Projection and Flatten-related work.
    • Transformer Encoder (need to be built from scratch): build 1 but reuse the block for 5~6 times
      • Norm
      • Multihead Attention
      • Norm
      • MLP
    • Head
  • Pretrain: We can use dataset ImageNet (same as ViT paper)
  • Train and test on our dataset

The structure

About

Final project of ECE 285

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •