[sana](https://github.com/NVlabs/Sana) is super fast comparing to SDXL, this would make vision-xl possible to handle 1080p videos