AI image generators trained on 5 billion scraped images, including photographers' copyrighted work, face lawsuits but no injunctions, leaving photographers in legal limbo: their work trains competitors, but courts move too slowly to provide relief
Professional photographers and visual artists discovered that AI image generators, including Midjourney, Stable Diffusion, and DALL-E, were trained on datasets like LAION-5B, which contains 5 billion images scraped from the internet. Of the LAION-Aesthetics subset, 47% comes from stock photo and user-generated content sites such as Shutterstock, Getty Images, and Flickr. Getty Images identified over 15,000 of its own photos in the Stable Diffusion training dataset.

Why it matters: photographers' copyrighted work is used without consent or compensation to train AI systems, which then generate competing images at near-zero marginal cost. As commercial buyers switch from licensing stock photos to generating AI images, photographer licensing revenue declines (Getty Images' Creative segment revenue fell 4.5% in 2024). With less income, photographers cannot afford to keep producing the high-quality original work that AI systems depend on for future training data.

The structural root cause is that copyright law was designed for discrete acts of copying and distribution, not for statistical pattern extraction across billions of works. Courts must therefore adjudicate novel legal theories (is training 'fair use'?) through multi-year litigation while AI companies continue operating. The leaked Midjourney spreadsheet of 16,000 non-consenting artists illustrates how the industry treats photographer consent as an obstacle to route around rather than a right to respect.
Evidence
Andersen v. Stability AI, a class action, was filed in January 2023; U.S. District Judge William Orrick allowed the copyright claims to proceed in August 2024 (source: Artnet News).
The LAION-5B dataset contains 5 billion images; 47% of LAION-Aesthetics is sourced from stock photo and user-generated content sites (source: lawsuit filing cited by NYU JIPEL).
Getty Images filed a separate suit identifying 15,000+ of its photos in the Stable Diffusion dataset (source: Getty Images press materials).
A leaked Midjourney spreadsheet listing 16,000 non-consenting artists was published in January 2024 (source: The Art Newspaper, The Register).
Getty Images Creative segment revenue declined 4.5% year over year in full-year 2024, to $552.8 million (source: Getty Images Q4 2024 earnings report).