Skip to content

Commit aadd15a

Browse files
author
Felipe Mello
committed
update configs
1 parent eb6a3a1 commit aadd15a

File tree

2 files changed

+11
-2
lines changed

2 files changed

+11
-2
lines changed

apps/sft/llama3_8b.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ training:
3737
split: "train[:95%]"
3838

3939
eval:
40-
eval_every_n_steps: 5 # (null = disabled)
40+
eval_every_n_steps: 50 # (null = disabled)
4141
max_eval_steps: null # Max batches per eval dataset (null = run until epoch completes)
4242
datasets:
4343
- path: "yahma/alpaca-cleaned"

apps/sft/qwen3_8b.yaml

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,16 @@ training:
3131
max_norm: 1.0
3232
steps: 1000
3333
compile: false
34-
dataset: "c4"
34+
datasets:
35+
- path: "yahma/alpaca-cleaned"
36+
split: "train[:95%]"
37+
38+
eval:
39+
eval_every_n_steps: 50 # (null = disabled)
40+
max_eval_steps: null # Max batches per eval dataset (null = run until epoch completes)
41+
datasets:
42+
- path: "yahma/alpaca-cleaned"
43+
split: "train[95%:]"
3544

3645
parallelism:
3746
data_parallel_replicate_degree: 1

0 commit comments

Comments
 (0)