DAIR, 404 Media reports that "Largest Dataset Powering AI Images Removed After Discovery of Child Sexual Abuse Material" 🧵
However, in 2021, a preprint by @abebab, Vinay Uday Prabhu & Emmanuel Kahembwe found a number of issues in the dataset, including "troublesome and explicit images and text pairs of rape, pornography, malign stereotypes, racist and ethnic slurs, and other extremely problematic content."
The preprint can be found here: https://arxiv.org/abs/2110.01963
https://www.404media.co/laion-datasets-removed-stanford-csam-child-abuse/