Thursday, 19 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Baidu restricts Google and Bing from scraping content for AI training
AI

Baidu restricts Google and Bing from scraping content for AI training

Last updated: August 28, 2024 7:19 pm
Published August 28, 2024
Share
Baidu restricts Google and Bing from scraping content for AI training
SHARE

Chinese language web search supplier Baidu has updated its Wikipedia-like Baike service to stop Google and Microsoft Bing from scraping its content material.

This transformation was noticed within the newest replace to the Baidu Baike robots.txt file, which denies entry to Googlebot and Bingbot crawlers.

In line with the Wayback Machine, the change passed off on August 8. Beforehand, Google and Bing engines like google have been allowed to index Baidu Baike’s central repository, which incorporates virtually 30 million entries, though some goal subdomains on the web site have been restricted.

This motion by Baidu comes amid growing demand for big datasets utilized in coaching synthetic intelligence fashions and functions. It follows comparable strikes by different corporations to guard their on-line content material. In July, Reddit blocked varied engines like google, besides Google, from indexing its posts and discussions. Google, like Reddit, has a monetary settlement with Reddit for knowledge entry to coach its AI companies.

In line with sources, previously yr, Microsoft thought-about proscribing entry to internet-search knowledge for rival search engine operators; this was most related for individuals who used the info for chatbots and generative AI companies.

In the meantime, the Chinese language Wikipedia, with its 1.43 million entries, stays out there to go looking engine crawlers. A survey carried out by the South China Morning Publish discovered that entries from Baidu Baike nonetheless seem on each Bing and Google searches. Maybe the various search engines proceed to make use of older cached content material.

Such a transfer is rising towards the background the place builders of generative AI around the globe are more and more working with content material publishers in a bid to entry the highest-quality content material for his or her initiatives. As an example, comparatively just lately, OpenAI signed an settlement with Time journal to entry the whole archive, courting again to the very first day of the journal’s publication over a century in the past. The same partnership was inked with the Monetary Instances in April.

See also  Military AI contracts awarded to Anthropic, OpenAI, Google, and xAI

Baidu’s choice to limit entry to its Baidu Baike content material for main engines like google highlights the rising significance of information within the AI period. As corporations make investments closely in AI growth, the worth of huge, curated datasets has considerably elevated. This has led to a shift in how on-line platforms handle entry to their content material, with many selecting to restrict or monetise entry to their knowledge.

Because the AI business continues to evolve, it’s possible that extra corporations will reassess their data-sharing insurance policies, probably resulting in additional adjustments in how data is listed and accessed throughout the web.

(Picture by Kelli McClintock)

See additionally: Google advances cell AI in Pixel 9 smartphones

Need to study extra about AI and large knowledge from business leaders? Take a look at AI & Big Data Expo going down in Amsterdam, California, and London. The excellent occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

Tags: ai, content material moderation, Google, microsoft, search engine

Source link

TAGGED: Baidu, Bing, content, Google, restricts, scraping, training
Share This Article
Twitter Email Copy Link Print
Previous Article VMware Cloud Foundation 9 Released, Accelerating Private Cloud Adoption VMware Cloud Foundation 9 Released, Accelerating Private Cloud Adoption
Next Article Recreating Innovation summit Secure your tickets for the Recreating Innovation summit
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

In the Spotlight… Mitsubishi Electric Video Interview

Within the newest episode of In The Highlight, Information Centre Evaluate sits down with Shahid…

May 4, 2025

five takeaways from the Munich auto show

The Govy AirCab two-seater electrical 'flying automotive', made by a subsidiary of Chinese language carmaker…

September 15, 2025

Lessons learned – from decisions to data

“Digital transformation turns into very ineffective in case you don’t have any focus,” explains Kamala…

May 8, 2024

Vertiv trends forecast sees intense focus on AI enablement and energy management

The proliferation of AI (as Vertiv predicted two years ago) along with the infrastructure and…

January 22, 2024

Vertiv to showcase latest high-capacity liquid cooling innovations

Vertiv will showcase the corporate’s latest merchandise and options at Knowledge Centre World (DCW) on…

March 7, 2025

You Might Also Like

Infosys AI implementation framework offers business leaders guidance
AI

Infosys AI implementation framework offers business leaders guidance

By saad
How financial institutions are embedding AI decision-making
AI

How financial institutions are embedding AI decision-making

By saad
Goldman Sachs deploys Anthropic systems with success
AI

Goldman Sachs deploys Anthropic systems with success

By saad
Alibaba Qwen is challenging proprietary AI model economics
AI

Alibaba Qwen is challenging proprietary AI model economics

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.