Add SmolVLA example training script #2647
base: main
Conversation
Pull request overview
This PR adds a new example training script for SmolVLA that demonstrates both fine-tuning from a pretrained checkpoint and training from scratch. The script is intended to help users understand how to train SmolVLA on their own datasets, similar to existing training examples for ACT and Diffusion policies.
Key Changes
- Adds comprehensive training example with detailed comments explaining configuration options
- Supports two training modes: fine-tuning from pretrained checkpoint (default) or training from scratch
- Includes optimizer and learning rate scheduler setup using SmolVLA's preset configurations (see the sketch after this list)
- Provides optional Hub push functionality with safety flag
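For readers new to the preset pattern, here is a rough sketch of how that optimizer/scheduler setup typically looks. The `build(...)` helpers and argument names are assumptions about lerobot's preset API rather than code copied from this PR:

```python
# Sketch only: assumes the SmolVLA config exposes optimizer/scheduler presets
# with build(...) helpers; adjust to the actual lerobot API if it differs.
optimizer_config = policy.config.get_optimizer_preset()
scheduler_config = policy.config.get_scheduler_preset()

optimizer = optimizer_config.build(policy.parameters())
# `training_steps` is assumed to be defined elsewhere in the script.
lr_scheduler = scheduler_config.build(optimizer, num_training_steps=training_steps)
```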
    loss, output_dict = policy.forward(batch)

    # Backward pass and optimization
    loss.backward()
Copilot AI · Dec 14, 2025
Missing gradient clipping before optimizer step. The SmolVLA config defines grad_clip_norm=10 (via get_optimizer_preset), but gradient clipping must be manually applied. Add torch.nn.utils.clip_grad_norm_(policy.parameters(), optimizer_config.grad_clip_norm) after loss.backward() and before optimizer.step().
Suggested change:

    loss.backward()
    torch.nn.utils.clip_grad_norm_(policy.parameters(), optimizer_config.grad_clip_norm)
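Put together, one inner training step with the suggested clipping would read roughly as below. This is a sketch assuming `optimizer`, `lr_scheduler`, and `optimizer_config` were built earlier from the SmolVLA presets; it is not the PR's exact code:

```python
import torch

# One training step with gradient clipping applied between backward() and step().
loss, output_dict = policy.forward(batch)

loss.backward()
torch.nn.utils.clip_grad_norm_(policy.parameters(), optimizer_config.grad_clip_norm)

optimizer.step()
lr_scheduler.step()
optimizer.zero_grad()
```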
    output_directory = Path("outputs/train/my_smolvla")
    output_directory.mkdir(parents=True, exist_ok=True)

    device = torch.device("cuda")  # or "cuda" or "cpu"
Copilot AI · Dec 14, 2025
The comment says 'or "cuda" or "cpu"' but the device is already set to "cuda", making this redundant. This should either say 'or "mps" or "cpu"' to match the other examples, or the device should be set to "mps" to be consistent with act_training_example.py and diffusion_training_example.py which use torch.device("mps").
Suggested change:

    device = torch.device("mps")  # or "cuda" or "cpu"
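A more portable alternative (illustrative only, not what the reviewer proposes) is to pick the best available backend at runtime:

```python
import torch

# Prefer CUDA, fall back to Apple MPS, then CPU.
if torch.cuda.is_available():
    device = torch.device("cuda")
elif torch.backends.mps.is_available():
    device = torch.device("mps")
else:
    device = torch.device("cpu")
```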
    # Optional: Push to Hugging Face Hub
    # Uncomment and update with your Hugging Face username
    push_to_hub = False  # Set to True to push to Hub
    hub_repo_id = "YOUR_HF_USERNAME/my_smolvla_so101"  # Replace with your repo ID
Copilot AI · Dec 14, 2025
Inconsistent dataset naming. The dataset ID uses 'svla_so100_pickplace' but line 192's comment refers to 'my_smolvla_so101' and line 205 mentions 'SO101 robot'. The documentation at docs/source/smolvla.mdx:40 confirms the dataset is 'svla_so100_pickplace'. The comments should consistently use SO100 to match the dataset, or clarify if SO101 is intentionally different.
| print("Training complete! Next steps:") | ||
| print("1. Test the model with: examples/tutorial/smolvla/using_smolvla_example.py") | ||
| print(f"2. Update model_id in the script to: {output_directory}") | ||
| print("3. Deploy on your SO101 robot!") |
Copilot AI · Dec 14, 2025
Inconsistent reference to robot type. The comment mentions 'SO101 robot' but the dataset being used is 'svla_so100_pickplace' (line 18). This should be 'SO100 robot' to match the dataset, or clarified if SO101 is a different robot model.
| print("3. Deploy on your SO101 robot!") | |
| print("3. Deploy on your SO100 robot!") |
    @@ -0,0 +1,206 @@
    from pathlib import Path
Copilot AI · Dec 14, 2025
Missing module-level docstring. Other training examples in this repository (act_training_example.py, diffusion_training_example.py) include a docstring at the top that describes what the script demonstrates. Consider adding a similar docstring such as: """This script demonstrates how to train SmolVLA Policy on a real-world dataset."""
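With that suggestion applied, the top of the file might look as follows; the `torch` import is an assumption about what the script needs, since only `pathlib` is visible in the diff:

```python
"""This script demonstrates how to train SmolVLA Policy on a real-world dataset."""

from pathlib import Path

import torch
```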
    dataset,
    batch_size=batch_size,
    shuffle=True,
    pin_memory=device.type == "cuda",
Copilot AI · Dec 14, 2025
Inconsistent pin_memory condition. This script uses device.type == "cuda" but other training examples (act_training_example.py:64, diffusion_training_example.py:65) use device.type != "cpu". The latter is more inclusive as it also covers MPS devices. Consider changing to device.type != "cpu" for consistency.
Suggested change:

    pin_memory=device.type != "cpu",
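In context, the DataLoader construction would then read roughly like this (a sketch; arguments not visible in the diff, such as `num_workers`, are assumed for illustration):

```python
from torch.utils.data import DataLoader

# DataLoader with the suggested pin_memory condition (covers CUDA and MPS).
dataloader = DataLoader(
    dataset,
    batch_size=batch_size,
    shuffle=True,
    pin_memory=device.type != "cpu",
    num_workers=4,  # assumed value, not from the PR
)
```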
    rename_map = {
        "observation.images.top": "observation.images.camera1",
        "observation.images.wrist": "observation.images.camera2",
    }
Copilot AI · Dec 14, 2025
Hardcoded rename_map is specific to the svla_so100_pickplace dataset and will not work with other datasets that have different camera keys. Consider adding a comment explaining this mapping is dataset-specific and may need adjustment, or checking if the pretrained model's camera keys match the dataset's keys before applying the rename.
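One way to surface that assumption is a short sanity check before the rename. This is an illustrative sketch, not code from the PR, and it assumes the dataset exposes its feature keys as a dict (e.g. `dataset.features`); adjust to the actual lerobot API if it differs:

```python
# NOTE: this mapping is specific to svla_so100_pickplace; adjust it for your dataset.
rename_map = {
    "observation.images.top": "observation.images.camera1",
    "observation.images.wrist": "observation.images.camera2",
}

missing = [key for key in rename_map if key not in dataset.features]
if missing:
    raise KeyError(
        f"rename_map keys not found in dataset: {missing}. "
        "Update rename_map to match your dataset's camera keys."
    )
```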
    },
    )
    else:
        print("Initializing new SmolVLA model from scratch...")
Copilot AI · Dec 14, 2025
This statement is unreachable.
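The branch is presumably dead because the fine-tune/from-scratch choice is hard-coded. One way to keep both paths exercisable is to drive the choice from an environment variable or CLI flag, sketched below; the flag name is illustrative, not part of the PR:

```python
import os

# Illustrative: choose the training mode without editing the script.
train_from_scratch = os.environ.get("TRAIN_FROM_SCRATCH", "0") == "1"

if not train_from_scratch:
    print("Fine-tuning from the pretrained SmolVLA checkpoint...")
    # policy = SmolVLAPolicy.from_pretrained(...)  # as in the PR
else:
    print("Initializing new SmolVLA model from scratch...")
    # policy = SmolVLAPolicy(config)  # assumed from-scratch path
```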
    hub_repo_id = "YOUR_HF_USERNAME/my_smolvla_so101"  # Replace with your repo ID

    if push_to_hub:
        print(f"\nPushing model to Hugging Face Hub: {hub_repo_id}...")
Copilot AI · Dec 14, 2025
This statement is unreachable.
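Similarly, the Hub push is gated by the constant `push_to_hub = False`, so the block never runs as written. A small argparse flag would make it reachable without editing the file; the flag names below are illustrative, not part of the PR:

```python
import argparse

# Illustrative CLI flags so the Hub push can actually be triggered.
parser = argparse.ArgumentParser()
parser.add_argument("--push-to-hub", action="store_true")
parser.add_argument("--hub-repo-id", default="YOUR_HF_USERNAME/my_smolvla_so101")
args = parser.parse_args()

if args.push_to_hub:
    print(f"\nPushing model to Hugging Face Hub: {args.hub_repo_id}...")
    # policy.push_to_hub(args.hub_repo_id)  # assumed push call
```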
What this does
Explain what this PR does. Feel free to tag your PR with the appropriate label(s).
This PR adds an example script for training SmolVLA, similar to the existing using_smolvla_example.py script.
How it was tested
Explain/show how you tested your changes.
I ran the script on Google Colab after installing lerobot and verified the training job started successfully.
How to checkout & try? (for the reviewer)
Place the train_smolvla.py script in /content and run !python train_smolvla.py.