AUTO-UPDATED

Media Briefing: Another AI threat emerges for publishers: the third-party scraper

Publishers are raising alarms over a growing black market where third-party web scrapers harvest content to sell to major AI and enterprise firms without compensating original rights holders.

Key Points

  • Media executives report that smaller tech vendors are bypassing licensing agreements to scrape publisher content for sale to large AI and enterprise companies.
  • A recent report identified 21 vendors, including Firecrawl and Perplexity Sonar, selling scraped data to over 70 major firms like Apple, IBM, and Salesforce.
  • AI scraping activity grew at an average quarterly rate of 24.4% from Q2 2025 through the end of the year.
  • Many advanced scraping services actively circumvent bot-blocking mechanisms and can access full versions of paywalled articles.
  • Jonathon Oake, the global head of publisher operations at TikTok, is departing the company after nearly five years in the role.

Why it Matters

This trend highlights a significant revenue leakage for publishers as their intellectual property is monetized by intermediaries rather than the creators themselves. The inability to effectively block these scrapers creates a challenging environment where major corporations indirectly fund content theft, complicating future efforts to establish legitimate AI licensing marketplaces.
Digiday Published by Sara Guaglione
Read original