Jay Zhangjie Wu* Xuanchi Ren* Tianchang Shen Tianshi Cao Kai He
Yifan Lu Ruiyuan Gao Enze Xie Shiyi Lan Jose M. Alvarez
Jun Gao Sanja Fidler
Zian Wang Huan Ling*†
* equal contribution † corresponding author
📖 Project Page | 📑 Technical Report
TL;DR: ChronoEdit reframes image editing as a video generation task, using input and edited images as start/end frames to leverage pretrained video models with temporal consistency. A temporal reasoning stage introduces reasoning tokens to ensure physically plausible edits and visualize the editing trajectory.
@article{wu2025chronoedit,
title={ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation},
author={Wu, Jay Zhangjie and Ren, Xuanchi and Shen, Tianchang and Cao, Tianshi and He, Kai and Lu, Yifan and Gao, Ruiyuan and Xie, Enze and Lan, Shiyi and Alvarez, Jose M. and Gao, Jun and Fidler, Sanja and Wang, Zian and Ling, Huan},
journal={arXiv preprint arXiv:2510.04290},
year={2025}
}