-
Notifications
You must be signed in to change notification settings - Fork 3
Release AnimeHair Training Dataset on Hugging Face #1
Description
Hi @hyz317 🤗
I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2509.21114.
The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance),
you can also claim the paper as yours which will show up on your public profile at HF, add Github and project page URLs.
It's great to see the CHARM model checkpoint and test cases already available on the Hugging Face Hub! I also noticed that your paper introduces the "AnimeHair" training dataset (37K high-quality anime hairstyles), which currently requires users to follow a multi-step process involving external data and scripts to reconstruct.
Would you be interested in hosting the fully processed AnimeHair training dataset directly on https://huggingface.co/datasets?
Hosting it directly on Hugging Face would significantly improve its visibility and discoverability, allowing researchers to easily access and use it with a simple load_dataset call. We can add tags to the dataset card so that people find it easier, link it to the paper page, etc.
For instance, people could do:
from datasets import load_dataset
dataset = load_dataset("your-hf-org-or-username/AnimeHair")If you're interested, here's a guide: https://huggingface.co/docs/datasets/loading.
Besides that, there's the dataset viewer which allows people to quickly explore the first few rows of the data in the browser.
Let me know if you're interested/need any guidance regarding this!
Kind regards,
Niels