Sam Hames

Nyah!

Large image datasets: A pyrrhic win for computer vision? | OpenReview

https://openreview.net/forum?id=s-e2zaAlG3I

A detailed investigation of the problematic content in just one of the large image databases commonly used for AI/ML research. It would be very surprising if these kinds of problems were not present in other databases used for this kind of research, especially as datasets grow to extremely large sizes.

Some press coverage of the fallout: https://www.theregister.com/2020/07/01/mit_dataset_removed/
