1 parent ee14f5b commit f18f88b
machine/jobs/settings.yaml
@@ -7,7 +7,7 @@ default:
 parent_model_name: facebook/nllb-200-distilled-1.3B
 train_params:
   do_train: true
-  optim: adamw_torch
+  optim: apollo_adamw
   warmup_steps: 1000
   per_device_train_batch_size: 16
   gradient_accumulation_steps: 4