Inteligência Artificial

Cloudflare’s new policy pushes AI companies to pay for publishers’ content

Publicado porRedacao AIDaily
4 min de leitura
Autor na fonte original: Sarah Perez

Cloudflare is giving AI companies until September 15 to separate web crawlers used for search from those used for AI training and agents, or risk being blocked by default on many publisher sites.

Compartilhar:

Cloudflare has just issued the AI industry a new deadline to separate the web crawlers used for traditional search purposes, like Google Search, from those used for AI agents and training. Starting on September 15, 2026, Cloudflare’s default settings will block “mixed-use” crawlers from any pages that host ads, the company announced on Wednesday.

That means that the crawlers that blend search, agent use, and training will be blocked from crawling these sites by default, unless the site owner adjusts the settings otherwise. These changes to the defaults will apply to new Cloudflare customers, new sites set up by existing customers, and all existing free customers, the company says.

The move could impact how AI model providers are able to access web content for training purposes and to help power their agentic services.

Cloudflare points out that most website owners want their content to be discoverable via search and often through AI services as well, but they want protections against having their intellectual property given away for free.

Cloudflare specifically calls out the “world’s largest search engine” (clearly a Google reference!) as having access to about “2x more information” than other AI companies because the search giant makes it difficult for customers to remain discoverable without being used for AI.

Google has pushed back against this generalization in the past, noting that it provides a bot called Google Extended that lets site owners opt out of having their content used for training and AI products and services like Gemini Apps and Vertex API. Its use doesn’t impact a site’s inclusion in Google Search. However, the tech giant’s flagship Googlebot crawls for Search, including AI features like AI Overviews and AI Mode.

“Now that the majority of traffic on the Internet is non-human, we must go further and act faster so that a sustainable ecosystem can emerge,” said Cloudflare co-founder and CEO Matthew Prince in his announcement of the news, referring to the recent milestone where bots surpassed human traffic online for the first time. That shift was not expected to occur until next year.

“Cloudflare’s new tools and partnerships give website owners increased visibility and commercial opportunities and benefit AI companies that have bots with clear and transparent intent. We hope that our proposed default changes encourage mixed-use crawlers to separate out search from agent use and training,” Prince said.

While Cloudflare offers a number of products to help users launch their own AI systems , the company has also released a range of tools to give publishers more control over their content in the AI era. In recent years, Cloudflare launched tools to combat AI bots , including a marketplace that lets websites charge AI bots for scraping , dubbed Pay Per Crawl.

The latter is now also evolving into “Pay Per Use,” the company said, which will allow publishers to charge AI companies when their content creates value, not just when it’s fetched.

The change could also help conserve publishers’ bandwidth and compute resources for AI model providers, as Cloudflare’s data suggested that over 50% of crawl traffic from AI crawlers is spent re-fetching unchanged pages.

To put this into action, Cloudflare is initially working with two partners, Ceramic.ai and You.com. When a publisher opts in, they’re paid when their content appears in Ceramic’s AI search results or when You.com accesses a piece of their premium content.

Other AI companies can customize this model for how they work, Cloudflare says.

When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence.

Last chance to save up to $190 on TechCrunch Founder Summit. Join 1,000+ founders and VCs at all stages for real-world scaling insights and connections that move the needle. Savings end June 26, 11:59 p.m. PT .

Flipper Device’s new Busy Bar is a customizable display for productivity Ivan Mehta

Flipper Device’s new Busy Bar is a customizable display for productivity

Flipper Device’s new Busy Bar is a customizable display for productivity

Ford rehires ‘gray beard’ engineers after AI falls short Anthony Ha

Ford rehires ‘gray beard’ engineers after AI falls short

Ford rehires ‘gray beard’ engineers after AI falls short

Govee’s smart nugget ice maker makes every iced drink feel like a luxury Aisha Malik

Govee’s smart nugget ice maker makes every iced drink feel like a luxury

Govee’s smart nugget ice maker makes every iced drink feel like a luxury

Instagram is testing more ways to customize ‘Your Algorithm’ Anthony Ha

Instagram is testing more ways to customize ‘Your Algorithm’

Instagram is testing more ways to customize ‘Your Algorithm’

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on Kate Park

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on

FTC gives Musk the OK to acquire SpaceX alumni startup Mesh Marina Temkin

FTC gives Musk the OK to acquire SpaceX alumni startup Mesh

FTC gives Musk the OK to acquire SpaceX alumni startup Mesh

Corgi, the buzzy Y Combinator-backed insurance tech startup, says it didn’t steal an open source product Julie Bort

Corgi, the buzzy Y Combinator-backed insurance tech startup, says it didn’t steal an open source product

Corgi, the buzzy Y Combinator-backed insurance tech startup, says it didn’t steal an open source product

Pontos-chave

  • A Cloudflare impõe separação entre crawlers de busca e de IA, impactando acesso a dados.
  • A mudança pode estimular um debate sobre monetização de conteúdo digital no Brasil.
  • Empresas de IA precisarão adaptar suas práticas para evitar bloqueios em sites de editores.

Análise editorial

A nova política da Cloudflare representa um ponto de inflexão significativo para a relação entre empresas de IA e editores de conteúdo, especialmente em um cenário onde a proteção da propriedade intelectual se torna cada vez mais crítica. Para o setor de tecnologia brasileiro, essa mudança pode estimular um debate mais amplo sobre a monetização de conteúdo digital e o papel das plataformas intermediárias. A necessidade de separar crawlers de busca de crawlers de treinamento de IA pode levar empresas brasileiras a reconsiderar suas estratégias de visibilidade online, especialmente aquelas que dependem de tráfego gerado por mecanismos de busca e serviços de IA.

Além disso, a decisão da Cloudflare pode ter implicações diretas sobre como as empresas de IA acessam dados para treinamento. Com a possibilidade de bloqueio de crawlers mistos, as empresas que não se adaptarem a essa nova realidade podem enfrentar dificuldades em obter informações necessárias para o desenvolvimento de modelos de IA. Isso pode criar um cenário em que apenas as empresas que investem em práticas transparentes e éticas de coleta de dados consigam prosperar, o que, por sua vez, pode beneficiar a qualidade e a diversidade dos serviços de IA disponíveis no mercado.

Por fim, é importante observar como essa política será recebida por outras plataformas e empresas de tecnologia. A Cloudflare, ao destacar a questão do tráfego não humano, está sinalizando um movimento mais amplo em direção à regulamentação do uso de dados na era da IA. O Brasil, com sua crescente indústria de tecnologia e inovação, deve acompanhar de perto essas mudanças, pois elas podem influenciar a forma como startups e empresas estabelecidas operam no ecossistema digital. O que se espera agora é uma resposta do mercado e como as empresas brasileiras se posicionarão frente a essas novas exigências e desafios.

O que esta cobertura entrega

  • Atribuicao clara de fonte com link para a publicacao original.
  • Enquadramento editorial sobre relevancia, impacto e proximos desdobramentos.
  • Revisao de legibilidade, contexto e duplicacao antes da publicacao.

Fonte original:

TechCrunch AI

Sobre este artigo

Este artigo foi curado e publicado pelo AIDaily como parte da nossa cobertura editorial sobre desenvolvimentos em inteligência artificial. O conteúdo é baseado na fonte original citada abaixo, enriquecido com contexto e análise editorial. Ferramentas automatizadas podem auxiliar tradução e estruturação inicial, mas a decisão de publicar, a revisão factual e o enquadramento de contexto seguem responsabilidade editorial.

Saiba mais sobre nosso processo editorial