This project combines SAM2, Florence2, and ProPainter for intelligent video object removal using text prompts. Simply describe which objects you want to remove (e.g "man, car, cap, basket"), and the AI will do the rest! 🪄
- 🎯 Text-guided object removal from videos
- 📹 Support for common video formats
- 🔄 Adjustable scale factor for memory optimization
- ⚡ GPU-accelerated processing
- ⏱️ Customizable processing duration
conda create -n video-inpainting python=3.11 -y
conda activate video-inpainting
pip install -r requirements.txt
To get started with the project, you'll need to download the required pretrained models and organize them as follows:
Download the SAM2 checkpoints from here: SAM2_PRETRAINED_MODELS
🗂️ Place the downloaded files in the following directory: ./checkpoints
Download the ProPainter checkpoints from here: PROPAINTER_PRETRAINED_MODELS
🗂️ Place the downloaded files in the following directory: ./weights
Run the Gradio interface:
python app.py
- CUDA-capable GPU with 16GB+ VRAM
- CUDA 12.1 or higher
- 16GB+ RAM
- Python 3.11