feat(refactor): improve usability #1306
Triggered via pull request
February 14, 2025 04:16
Status
Failure
Total duration
13m 10s
Artifacts
–
e2e_test.yaml
on: pull_request
training_4GPU
25s
training_8GPU_ISP
26s
training_8GPU_ISP_CKPT
25s
training_8GPU_4DP2PP_ZB
27s
Matrix: training_16GPU_4DP2TP2PP_FSP
Matrix: training_16GPU_4DP2TP2PP_MSP
Matrix: training_16GPU_4DP2TP2PP_MTP
Matrix: training_8GPU_4DP2PP
Matrix: training_8GPU_4DP2TP
Matrix: training_8GPU_4DP2TPSP
Matrix: training_llama2
Annotations
11 errors and 11 warnings
training_16GPU_4DP2TP2PP_FSP (t_cluster)
Process completed with exit code 143.
|
training_16GPU_4DP2TP2PP_MSP (t_cluster)
Process completed with exit code 143.
|
training_16GPU_4DP2TP2PP_MTP (t_cluster)
Process completed with exit code 143.
|
training_4GPU
Process completed with exit code 2.
|
training_8GPU_4DP2PP (t_cluster)
Process completed with exit code 2.
|
training_8GPU_4DP2PP_ZB
Process completed with exit code 143.
|
training_8GPU_4DP2TP (t_cluster)
Process completed with exit code 2.
|
training_8GPU_4DP2TPSP (t_cluster)
Process completed with exit code 143.
|
training_8GPU_ISP
Process completed with exit code 2.
|
training_8GPU_ISP_CKPT
Process completed with exit code 2.
|
training_llama2 (t_cluster)
Process completed with exit code 2.
|
training_16GPU_4DP2TP2PP_FSP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_16GPU_4DP2TP2PP_MSP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_16GPU_4DP2TP2PP_MTP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_4GPU
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_4DP2PP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_4DP2PP_ZB
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_4DP2TP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_4DP2TPSP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_ISP
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_8GPU_ISP_CKPT
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|
training_llama2 (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
|