Data Provenance Initiative
Jump to navigation
Jump to search
= "Where does the data to build AI come from?" [1]
URL = https://www.dataprovenance.org/Multimodal_Data_Provenance.pdf
Description
By Melissa Heikkilä & Stephanie Arnett:
"The Data Provenance Initiative, a group of over 50 researchers from both academia and industry, wanted to fix that. They wanted to know, very simply: Where does the data to build AI come from? They audited nearly 4,000 public data sets spanning over 600 languages, 67 countries, and three decades. The data came from 800 unique sources and nearly 700 organizations.
Their findings, shared exclusively with MIT Technology Review, show a worrying trend: AI’s data practices risk concentrating power overwhelmingly in the hands of a few dominant technology companies."