This repository introduces VADB, a large-scale video aesthetics database, and VADB-Net, a novel video aesthetics scoring framework designed to evaluate the aesthetic quality of videos across multiple dimensions. Our work provides comprehensive resources for researchers and developers interested in video aesthetics analysis, computer vision, and multimedia content assessment.
- VADB dataset: Licensed under CC BY-NC 4.0 (commercial use prohibited).
- Code and models: Licensed under CC BY 4.0
The VADB dataset is publicly available on Hugging Face:
https://huggingface.co/datasets/BestiVictoryLab/VADB
The VADB dataset is licensed under CC BY-NC 4.0, which prohibits any commercial use. For commercial collaborations, please contact us.
It includes:
- 7,881 videos covering diverse video styles and content categories
- Detailed language comments for each video
- Aesthetic scores across 7-11 dimensions, comprehensively covering the aesthetic attribute features of videos
- Rich objective tags, annotating video shooting techniques and other objective dimensions
The pre-trained video encoder model can be obtained from Google Drive:
drive_link
This encoder extracts aesthetic feature vectors from videos and serves as the foundational component for all scoring models.
The repository is structured into three main components:
- Folder:
1TotalScore - Model: Predicts the overall aesthetic score of videos
- Usage:
cd 1TotalScore python 1TotalScore.py
- Folder:
2GeneralAttribute - Model: Evaluates general aesthetic attributes of videos
- Evaluation Dimensions:
- Composition
- Shot Size
- Lighting
- Visual Tone
- Color
- Depth of Field
- Usage:
cd 2GeneralAttribute python 2GeneralAttribute.py
- Folder:
3HumanAttribute - Model: Focuses on evaluating specific aesthetic attributes of human subjects
- Evaluation Dimensions:
- Expression
- Movement
- Costume
- Makeup
- Usage:
cd 3HumanAttribute python 3HumanAttribute.py
- Install the required dependencies (From CLIP):
conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0 pip install ftfy regex tqdm pip install opencv-python boto3 requests pandas
- Download the VADB dataset from Hugging Face:
git lfs install git clone https://huggingface.co/datasets/BestiVictoryLab/VADB
- Load the pre-trained video encoder (see Video Encoder).
- Run the scoring model suitable for your use case (see Scoring Models).

