“E621 Face Dataset”, Arfafax, 2020-02-18:

A tool for building a dataset of cropped faces from the [furry booru] e621 (NSFW; WikiFur description). The dataset was created by training a YOLOv3 network on annotated facial features from about 1,500 faces.

The total dataset includes ~186k faces. Rather than provide the cropped images, this repo contains CSV files with the bounding boxes of the detected features from my trained network, and a script to download the images from e621 and crop them based on these CSVs.
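As a rough illustration of the cropping step described above, the sketch below reads bounding boxes from CSV text and clips them to the image bounds. It is not the repo's `get_faces.py`: the column names (`url`, `xmin`, `ymin`, `xmax`, `ymax`), the sample rows, and the 256×256 image size are all assumptions made for the example, and the real CSV headers may differ.

```python
import csv
import io

# Hypothetical column names and rows; the real CSVs may use different headers.
SAMPLE_CSV = """url,xmin,ymin,xmax,ymax
https://example.invalid/a.jpg,10,20,110,140
https://example.invalid/b.jpg,-5,0,300,90
"""


def read_boxes(csv_text):
    """Parse CSV text into (url, (left, top, right, bottom)) tuples."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        box = tuple(int(float(row[k])) for k in ("xmin", "ymin", "xmax", "ymax"))
        rows.append((row["url"], box))
    return rows


def clamp_box(box, width, height):
    """Clip a box to the image bounds; return None if nothing remains."""
    left, top, right, bottom = box
    left, top = max(0, left), max(0, top)
    right, bottom = min(width, right), min(height, bottom)
    if right <= left or bottom <= top:
        return None  # the box lies entirely outside the image
    return (left, top, right, bottom)


boxes = read_boxes(SAMPLE_CSV)
# Assume every image is 256x256 for the sake of the example.
crops = [(url, clamp_box(box, 256, 256)) for url, box in boxes]
```

The clipped tuple is in (left, upper, right, lower) order, which is the order Pillow's `Image.crop` expects, so it could be passed to that method once the image has been downloaded.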

The CSVs also contain a subset of tags, which could potentially be used as labels to train a conditional GAN.
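One way the tags might be turned into conditioning labels is a multi-hot vector per face. The sketch below assumes each row's tags arrive as a single space-separated string; that separator and the sample tags are made up for illustration, not taken from the CSVs.

```python
# Hypothetical tag strings, one per face; the space separator is an assumption.
samples = ["canine wolf male", "feline female", "canine female"]


def build_vocab(tag_strings):
    """Map each distinct tag to a stable index (alphabetical order)."""
    tags = sorted({tag for s in tag_strings for tag in s.split()})
    return {tag: i for i, tag in enumerate(tags)}


def multi_hot(tag_string, vocab):
    """Encode one face's tags as a 0/1 vector over the vocabulary."""
    vec = [0] * len(vocab)
    for tag in tag_string.split():
        if tag in vocab:  # tags outside the vocabulary are ignored
            vec[vocab[tag]] = 1
    return vec


vocab = build_vocab(samples)
labels = [multi_hot(s, vocab) for s in samples]
```

Each vector in `labels` has one slot per distinct tag, so it could be fed to a conditional generator alongside the noise input.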

File: Description
get_faces.py: Script for downloading base e621 files and cropping them based on the coordinates in the CSVs.
faces_s.csv: CSV containing URLs, bounding boxes, and a subset of the tags for 90k cropped faces with rating=safe from e621.
features_s.csv: CSV containing the bounding boxes for 389k facial features with rating=safe from e621.
faces_q.csv: CSV containing URLs, bounding boxes, and a subset of the tags for 96k cropped faces with rating=questionable from e621.
features_q.csv: CSV containing the bounding boxes for 400k facial features with rating=questionable from e621.
[Figure: preview grid]