Fix model breakages #53

kevinwuTT · 2024-08-09T00:28:48Z

Model status for this PR:

⚠️ GPT-2 fixes

Unmark gpt2 and mnist test models to expect passing
Disable conversion from aten._to_copy
~~Pass device for all from_torch ops~~ Reverted because this conflicts with unsqueeze conversion
Replace aten.full op to a literal scalar for certain cases
Compare only Tensor types for dictionary outputs
Fails because:

Explicit casting of dtype to ttnn.bfloat16 for from_torch op makes GPT-2 fail. Removing it causes, Bloom, Llama, and Yolos to fail.

Failed to generate binaries for embeddings_tilize TT_THROW @ ../tt_metal/jit_build/build.cpp:396: tt::exception
info:
ncrisc build failed

MNIST fixes:

Replace aten.view with aten.reshape

⚠️ Falcon-7B status:

Fails because:

layer_norm conversion

TT_THROW @ ../tt_metal/impl/program/program.cpp:489: tt::exception
info:
Statically allocated circular buffers on core range [(x=0,y=0) - (x=0,y=0)] grow to 1611072 B which is beyond max L1 size of 1048576 B

Bloom and Llama fixes:

Add conversion for aten.min
Add exception to aten.eq conversion
Fix reusing ttnn data movement op if mixed with aten ops
Convert all inputs to ttnn.bfloat16 when moving data in
Skip unsqueeze transformation if last dim of input is not the same as…
Add exception to aten.expand conversion when last dimension of input …
Support list type arguments
Check layout change for ttnn reshape and embedding op
Freeze encoder for llama model

Yolos fixes

Add workaround for ttnn.permute when dim 0 is 1 for rank 3
Reconvert int64 types from metadata when mixing ttnn and aten ops
Check for valid page size for ops that decompose to ttnn.full
Delete aten.expand op if output has the exact same shape

General fixes:

Consolidate metadata during op conversion
Fix output type of aten.arange unit test to match output of original
Disable to_copy unit test to re-evaluate conversion
Lower pcc for addmm slightly
Change input shapes of some unit test to match exceptions in current …

… last dim of output

…is 1

…state of lowering

README.md

tests/models/gpt2/test_gpt2.py

tests/models/llama/test_llama.py

tests/lowering/tensor_manipulation/test_unsqueeze.py

tests/lowering/creation/test_to_copy.py

torch_ttnn/passes/lowering/to_tt_pass.py

This reverts commit 775fb9f.

kevinwuTT force-pushed the fix_model_breakages branch 2 times, most recently from b02d3cc to 08898b3 Compare August 12, 2024 22:12

kevinwuTT added 27 commits August 12, 2024 23:46

Consolidate metadata during op conversion

ed00a76

Unmark gpt2 and mnist test models to expect passing

c98a2ca

Disable conversion from aten._to_copy

82fce4d

Pass device for all from_torch ops

775fb9f

Replace aten.full op to a literal scalar for certain cases

a3c1cd7

Compare only Tensor types for dictionary outputs

ce5256e

Replace aten.view with aten.reshape

b6dbdf9

Unmark bloom, llama, and yolos from xfail

07b98bf

Add conversion for aten.min

3c7f316

Add exception to aten.eq conversion

786f66a

Fix reusing ttnn data movement op if mixed with aten ops

7103bc5

Convert all inputs to ttnn.bfloat16 when moving data in

18ec3b1

Skip unsqueeze transformation if last dim of input is not the same as…

5dbf801

… last dim of output

Add exception to aten.expand conversion when last dimension of input …

99d17d8

…is 1

Support list type arguments

f898985

Check layout change for ttnn reshape and embedding op

7531fd1

Freeze encoder for llama model

dcc3deb

Add workaround for ttnn.permute when dim 0 is 1 for rank 3

5c8433a

Reconvert int64 types from metadata when mixing ttnn and aten ops

221a8c7

Check for valid page size for ops that decompose to ttnn.full

7e8dee2

Delete aten.expand op if output has the exact same shape

31a1d92

Mark GPT-2 model as xfail

993c7d5

Update README with new model stats

5dc04a3

Fix output type of aten.arange unit test to match output of original

876ac5a

Disable to_copy unit test to re-evaluate conversion

18f089d

Lower pcc for addmm slightly

ff2daad

Change input shapes of some unit test to match exceptions in current …

02bc9b4

…state of lowering

kevinwuTT force-pushed the fix_model_breakages branch from e7f1936 to 02bc9b4 Compare August 12, 2024 23:46

kevinwuTT requested a review from ayerofieiev-tt August 12, 2024 23:49

kevinwuTT marked this pull request as ready for review August 12, 2024 23:49

ayerofieiev-tt reviewed Aug 12, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

ayerofieiev-tt reviewed Aug 13, 2024

View reviewed changes

tests/models/gpt2/test_gpt2.py Outdated Show resolved Hide resolved

ayerofieiev-tt reviewed Aug 13, 2024

View reviewed changes

tests/models/gpt2/test_gpt2.py Outdated Show resolved Hide resolved

kevinwuTT added 3 commits August 13, 2024 06:54

Fix page size validation for conversions involving ttnn.full ops

a6aa0c2

Update README

65a923b

Revert changes to GPT-2 since it isn't working in this PR

d08431e