Baidu restricts Google and Bing from scraping content for AI training

Last updated: August 28, 2024 7:19 pm
Published August 28, 2024
Chinese internet search provider Baidu has updated its Wikipedia-like Baike service to stop Google and Microsoft Bing from scraping its content.

The change was spotted in the latest update to the Baidu Baike robots.txt file, which denies access to the Googlebot and Bingbot crawlers.
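
As a rough illustration of how a robots.txt rule of this kind works, the sketch below uses Python's standard urllib.robotparser module to evaluate a small rule set. The rules, the ExampleBot crawler name, and the example.com URL are assumptions made for illustration; they are not the actual contents of the Baidu Baike robots.txt file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; NOT the real Baidu Baike file.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /

User-agent: Bingbot
Disallow: /

User-agent: *
Allow: /
"""


def crawl_permissions(robots_txt, agents, url):
    """Return a mapping of crawler name -> whether it may fetch the URL."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


# ExampleBot and the URL are placeholders, not real crawler or site names.
result = crawl_permissions(
    ROBOTS_TXT,
    ["Googlebot", "Bingbot", "ExampleBot"],
    "https://example.com/item/1",
)
for agent, allowed in result.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Note that robots.txt is advisory: it tells well-behaved crawlers what the site owner permits, but it does not technically prevent a crawler from requesting a page.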

According to the Wayback Machine, the change took place on August 8. Previously, the Google and Bing search engines were allowed to index Baidu Baike's central repository, which includes almost 30 million entries, although some subdomains on the website were restricted.

Baidu's action comes amid increasing demand for the large datasets used to train artificial intelligence models and applications. It follows similar moves by other companies to protect their online content. In July, Reddit blocked various search engines, except Google, from indexing its posts and discussions. Google has a financial agreement with Reddit for data access to train its AI services.

According to sources, Microsoft considered over the past year restricting access to internet-search data for rival search engine operators, particularly those that used the data for chatbots and generative AI services.

Meanwhile, the Chinese Wikipedia, with its 1.43 million entries, remains available to search engine crawlers. A survey conducted by the South China Morning Post found that entries from Baidu Baike still appear in both Bing and Google searches. The search engines may be continuing to use older cached content.

The move comes as developers of generative AI around the world increasingly work with content publishers in a bid to access the highest-quality content for their projects. For instance, OpenAI recently signed an agreement with Time magazine to access its entire archive, dating back to the very first day of the magazine's publication over a century ago. A similar partnership was signed with the Financial Times in April.

Baidu's decision to restrict access to its Baidu Baike content for major search engines highlights the growing importance of data in the AI era. As companies invest heavily in AI development, the value of large, curated datasets has increased significantly. This has led to a shift in how online platforms manage access to their content, with many choosing to limit or monetise access to their data.

As the AI industry continues to evolve, it is likely that more companies will reassess their data-sharing policies, potentially leading to further changes in how information is indexed and accessed across the internet.

(Photo by Kelli McClintock)
