llama4 video support #2942

awasthiabhijeet · 2025-12-22T20:42:02Z

Summary:
Adds video processing support to the Llama4 model by extending the existing vision encoder infrastructure to handle video content. It introduces video-specific special tokens (<|video|>, <|vid_start|>, <|vid_end|>, <|vid_frame_separator|>) in the tokenizer, implements a new transform_video() method that processes video clips as sequences of frames through the existing image transform pipeline, and registers a "video" encoder in the EarlyFusionModel that reuses the vision encoder while maintaining separate tokenization paths for images and videos.

(Used HF implementation as a reference to ensure consistent changes in _tokenizer.py)

Differential Revision: D89577119

meta-codesync · 2025-12-22T20:42:12Z

@awasthiabhijeet has exported this pull request. If you are a Meta employee, you can view the originating Diff in D89577119.

meta-codesync · 2025-12-22T21:45:07Z

@awasthiabhijeet has imported this pull request. If you are a Meta employee, you can view this in D89577119.

Summary: Adds video processing support to the Llama4 model by extending the existing vision encoder infrastructure to handle video content. It introduces video-specific special tokens (<|video|>, <|vid_start|>, <|vid_end|>, <|vid_frame_separator|>) in the tokenizer, implements a new transform_video() method that processes video clips as sequences of frames through the existing image transform pipeline, and registers a "video" encoder in the EarlyFusionModel that reuses the vision encoder while maintaining separate tokenization paths for images and videos. (Used HF implementation as a reference to ensure consistent changes in _tokenizer.py) Reviewed By: felipemello1 Differential Revision: D89577119 Pulled By: awasthiabhijeet

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 22, 2025

meta-codesync bot added fb-exported meta-exported labels Dec 22, 2025

pbontrager requested a review from felipemello1 December 22, 2025 21:13

felipemello1 approved these changes Dec 22, 2025

View reviewed changes

awasthiabhijeet force-pushed the export-D89577119 branch from 1b95438 to 3a2b526 Compare December 23, 2025 03:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llama4 video support #2942

llama4 video support #2942

Uh oh!

awasthiabhijeet commented Dec 22, 2025

Uh oh!

meta-codesync bot commented Dec 22, 2025

Uh oh!

meta-codesync bot commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

llama4 video support #2942

Are you sure you want to change the base?

llama4 video support #2942

Uh oh!

Conversation

awasthiabhijeet commented Dec 22, 2025

Uh oh!

meta-codesync bot commented Dec 22, 2025

Uh oh!

meta-codesync bot commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants