This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing".
- Dataset and Annotations
- Shot embeddings from SOTA audio and video models
- Downstream tasks
You can download all the labels in AVE using the following link. The annotation file is organized as follows,
{
"scene id (ex. xBr1UV3kWqA)":
{
"shot id (ex. 02a91bd7-feda-4844-8a85-b01d43437140)":
{
"clip-type": ["shot"],
"start-time": [51.05],
"end-time": [62.15],
"shot-size": ["medium"],
"shot-angle": ["low-angle"],
"shot-type": ["two-shot"],
"shot-motion": ["handheld"],
"shot-subject": ["human"],
"shot-location": ["int"],
"num-people": [2],
"sound-source": ["on-screen"]
},
},
}
We do not own the movie clips used in this work. However, they are publicly available on the MovieClips YouTube channel. Please follow the following steps to obtain the AVE dataset,
- You can download all the movie scene clips using the yt-dlp package and the scenes url file in the dataset folder.
python3 -m pip install -U yt-dlp
source download.sh
Note that geographic restriction might be an issue when downloading some of the videos depending on where you are located. We recommend you to use a VPN if such case happens. It is also possible that some of the videos may no longer be available in the future. If such incidents happen, feel free to send me an email ([email protected]) and I will gladly help.
- After downloading the movie scenes, the shot clips of each scene can be obtained using the FFmpeg module and the "start-time" and "end-time" labels of the shots in the annotation file.
sudo apt install ffmpeg
python scenes_to_shots.py
If you find our work to be useful, please don't forget to cite us.
@inproceedings{argaw2022anatomy,
Author = {Argaw, Dawit Mureja and Heilbron, Fabian Caba and Lee, Joon-Young and Woodson, Markus and Kweon, In So},
Title = {The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing},
Booktitle = {ECCV},
Year = {2022}}