Skip to content

Commit a7ef3a6

Browse files
yaolustas00
authored andcommitted
update
1 parent 185f7a5 commit a7ef3a6

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

resources/README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -52,8 +52,11 @@ The listing is in no particular order other than being grouped by the year.
5252

5353
- [MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs](https://arxiv.org/abs/2402.15627) - the paper covers various training issues and their resolution - albeit on models that are proprietary yet just as instructional/useful.
5454

55+
- Imbue's [From bare metal to a 70B model: infrastructure set-up and scripts](https://imbue.com/research/70b-infrastructure/) very detailed technical post covers many training-related issues that they had to overcome while training a proprietary 70B-param model.
5556

5657

58+
https://imbue.com/research/70b-infrastructure/
59+
5760

5861

5962
## Hardware setup logbooks

0 commit comments

Comments
 (0)