[Refactor] Modular Integration Test Framework with DeepSeek-v3 Support #1431

wwwjn · 2025-07-21T20:12:53Z

Integration Tests Restructuring

Split tests into two sets:
1. Depth Test: Use llama3 model, to test all the main components of torchtitan are functioning as expected
2. Breath Test: As we are supporting more models in torchtitan core, setup parallelsim related tests for each model, to test model architecture / args related changes. Make sure the Integration test implementation is easy to extend to new models.
Moved integration test files from the root directory to a dedicated tests/integration_tests/ directory
Added a base configuration file base_config.toml for integration tests, as most of the train_configs shared 90% same settings
Remove "use_for_integration_test" field in train configs. Change to by default using "debugmodel" flavor for integration tests.

tianyu-l

Great initiative! Left some initial comments, let's discuss.

tests/integration_tests/base_config.toml

.github/workflows/integration_test_8gpu_core.yaml

tianyu-l · 2025-07-28T06:24:19Z

tests/integration_tests/integration_tests.py

+    parser.add_argument(
+        "--config_path",
+        default="./tests/integration_tests/base_config.toml",
+        help="Base config path for integration tests. This is the config that will be used as a base for all tests.",
+    )


do we need to expose this anyway?

I still keep this field for 2 reasons:

We want to leave a little bit flexibility, for developers to quickly run integration tests locally with different config. For example, I could run the features.py with a deepseek debug model config.

A file path is needed since we run intergration tests using command line, and we need to specify CONFIG_FILES={}

tests/integration_tests/integration_tests.py

tests/integration_tests/integration_tests_ft.py

tianyu-l

I suggest we make model tests flat, and reuse functions such as main, run_tests, etc. across all tests -- basically decouple control logic and data.

tianyu-l · 2025-08-02T01:44:47Z

README.md

@@ -4,7 +4,7 @@

 #### A PyTorch native platform for training generative AI models

-[![integration tests](https://github.com/pytorch/torchtitan/actions/workflows/integration_test_8gpu.yaml/badge.svg?branch=main)](https://github.com/pytorch/torchtitan/actions/workflows/integration_test_8gpu.yaml?query=branch%3Amain)
+[![integration tests](https://github.com/pytorch/torchtitan/actions/workflows/integration_test_8gpu_features.yaml/badge.svg?branch=main)](https://github.com/pytorch/torchtitan/actions/workflows/integration_test_8gpu.yaml?query=branch%3Amain)


I think the badge will show 8 GPU Integration Test - Core Features and I feel it's too verbose.

Also do we want to include two badges?

e.g. 8 GPU Feature Tests and 8 GPU Model Tests

tests/README.md

tianyu-l · 2025-08-02T02:01:13Z

tests/integration_tests/models.py

+
+
+@dataclass
+class OverrideDefinitions:


any reason we can't reuse this?

Oh yes I should. Let me refactor this PR a little bit to reuse the functions

tests/integration_tests/models.py

tianyu-l · 2025-08-02T02:05:23Z

tests/integration_tests/models.py

+    for model_name, test_list in test_case_dict.items():
+        for test_flavor in test_list:
+            # Filter by test_name if specified
+            if args.test_name != "all" and test_flavor.test_name != args.test_name:


what if different model_names have the same test_name? Do you run them all?

tests/integration_tests/models.py

tianyu-l · 2025-08-02T02:08:48Z

tests/integration_tests/features.py

@@ -601,39 +599,48 @@ def run_test(test_flavor: OverrideDefinitions, full_path: str, output_dir: str):


 def run_tests(args):


This code is repeatedly defined in every .py tests which I think is very unnecessary.
We only need to have separate .yaml files for GH actions.
We can have different files to host OverrideDefinitions.
But we don't need different files to host the same functions such as main, run_tests, etc.

wwwjn requested review from tianyu-l, fegin and wconstab as code owners July 21, 2025 20:12

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jul 21, 2025

wwwjn changed the title ~~Integration Tests Restructuring for extensible test cases.~~ Modular Integration Test Framework with DeepSeek-v3 Support Jul 21, 2025

wwwjn marked this pull request as draft July 21, 2025 20:15

wwwjn changed the title ~~Modular Integration Test Framework with DeepSeek-v3 Support~~ [WIP] Modular Integration Test Framework with DeepSeek-v3 Support Jul 25, 2025

wwwjn force-pushed the ci-refactor branch from dd623c8 to 4d078b6 Compare July 25, 2025 18:39

wwwjn marked this pull request as ready for review July 25, 2025 22:22

wwwjn changed the title ~~[WIP] Modular Integration Test Framework with DeepSeek-v3 Support~~ Modular Integration Test Framework with DeepSeek-v3 Support Jul 25, 2025

tianyu-l reviewed Jul 28, 2025

View reviewed changes

wwwjn changed the title ~~Modular Integration Test Framework with DeepSeek-v3 Support~~ [Refactor] Modular Integration Test Framework with DeepSeek-v3 Support Jul 29, 2025

wwwjn force-pushed the ci-refactor branch 2 times, most recently from 7b17cc8 to b8f7a7b Compare July 29, 2025 06:01

wwwjn added 16 commits July 31, 2025 15:17

add CI for torchtitan

cb18af5

lint

71593c6

refactor CI

5dc6f03

add integration test

05e45fc

refactor v1

858b33b

remove use_for_integration_test

1592ab7

rename

8656086

rebase v2

87bac09

refactor

028312d

change commandline

c9b83f8

change commandline

b369a66

fix parameter name for configs

0e233ce

temporarily disable pp tests

7621a3b

delete filename

6b608c4

refactor v2

a7cf26b

fix readme

81b34ab

wwwjn added 9 commits July 31, 2025 15:17

rebase

909cf55

rebase to main

60dbacd

fix test failures

892c9e9

lint

5241ceb

fix CI error

e3bb0f3

rebase

5a787c8

rebase

9960be6

change badge

9e4552f

rebase to main

5cf7850

wwwjn force-pushed the ci-refactor branch from f94d708 to 5cf7850 Compare July 31, 2025 22:20

tianyu-l requested changes Aug 2, 2025

View reviewed changes

fix readme

36d1ccd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Refactor] Modular Integration Test Framework with DeepSeek-v3 Support #1431

[Refactor] Modular Integration Test Framework with DeepSeek-v3 Support #1431

Uh oh!

wwwjn commented Jul 21, 2025

Uh oh!

tianyu-l left a comment

Uh oh!

Uh oh!

Uh oh!

tianyu-l Jul 28, 2025

Uh oh!

wwwjn Jul 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianyu-l left a comment

Uh oh!

tianyu-l Aug 2, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Uh oh!

wwwjn Aug 4, 2025

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Uh oh!

Uh oh!

		@@ -601,39 +599,48 @@ def run_test(test_flavor: OverrideDefinitions, full_path: str, output_dir: str):


		def run_tests(args):



		@dataclass
		class OverrideDefinitions:

[Refactor] Modular Integration Test Framework with DeepSeek-v3 Support #1431

Are you sure you want to change the base?

[Refactor] Modular Integration Test Framework with DeepSeek-v3 Support #1431

Uh oh!

Conversation

wwwjn commented Jul 21, 2025

Integration Tests Restructuring

Uh oh!

tianyu-l left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tianyu-l Jul 28, 2025

Choose a reason for hiding this comment

Uh oh!

wwwjn Jul 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianyu-l left a comment

Choose a reason for hiding this comment

Uh oh!

tianyu-l Aug 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Choose a reason for hiding this comment

Uh oh!

wwwjn Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tianyu-l Aug 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!