Thursday, 12 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Baidu restricts Google and Bing from scraping content for AI training
AI

Baidu restricts Google and Bing from scraping content for AI training

Last updated: August 28, 2024 7:19 pm
Published August 28, 2024
Share
Baidu restricts Google and Bing from scraping content for AI training
SHARE

Chinese language web search supplier Baidu has updated its Wikipedia-like Baike service to stop Google and Microsoft Bing from scraping its content material.

This transformation was noticed within the newest replace to the Baidu Baike robots.txt file, which denies entry to Googlebot and Bingbot crawlers.

In line with the Wayback Machine, the change passed off on August 8. Beforehand, Google and Bing engines like google have been allowed to index Baidu Baike’s central repository, which incorporates virtually 30 million entries, though some goal subdomains on the web site have been restricted.

This motion by Baidu comes amid growing demand for big datasets utilized in coaching synthetic intelligence fashions and functions. It follows comparable strikes by different corporations to guard their on-line content material. In July, Reddit blocked varied engines like google, besides Google, from indexing its posts and discussions. Google, like Reddit, has a monetary settlement with Reddit for knowledge entry to coach its AI companies.

In line with sources, previously yr, Microsoft thought-about proscribing entry to internet-search knowledge for rival search engine operators; this was most related for individuals who used the info for chatbots and generative AI companies.

In the meantime, the Chinese language Wikipedia, with its 1.43 million entries, stays out there to go looking engine crawlers. A survey carried out by the South China Morning Publish discovered that entries from Baidu Baike nonetheless seem on each Bing and Google searches. Maybe the various search engines proceed to make use of older cached content material.

Such a transfer is rising towards the background the place builders of generative AI around the globe are more and more working with content material publishers in a bid to entry the highest-quality content material for his or her initiatives. As an example, comparatively just lately, OpenAI signed an settlement with Time journal to entry the whole archive, courting again to the very first day of the journal’s publication over a century in the past. The same partnership was inked with the Monetary Instances in April.

See also  Perplexity just made AI research crazy cheap—what that means for the industry

Baidu’s choice to limit entry to its Baidu Baike content material for main engines like google highlights the rising significance of information within the AI period. As corporations make investments closely in AI growth, the worth of huge, curated datasets has considerably elevated. This has led to a shift in how on-line platforms handle entry to their content material, with many selecting to restrict or monetise entry to their knowledge.

Because the AI business continues to evolve, it’s possible that extra corporations will reassess their data-sharing insurance policies, probably resulting in additional adjustments in how data is listed and accessed throughout the web.

(Picture by Kelli McClintock)

See additionally: Google advances cell AI in Pixel 9 smartphones

Need to study extra about AI and large knowledge from business leaders? Take a look at AI & Big Data Expo going down in Amsterdam, California, and London. The excellent occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

Tags: ai, content material moderation, Google, microsoft, search engine

Source link

TAGGED: Baidu, Bing, content, Google, restricts, scraping, training
Share This Article
Twitter Email Copy Link Print
Previous Article VMware Cloud Foundation 9 Released, Accelerating Private Cloud Adoption VMware Cloud Foundation 9 Released, Accelerating Private Cloud Adoption
Next Article Recreating Innovation summit Secure your tickets for the Recreating Innovation summit
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

X agrees to halt use of certain EU data for AI chatbot training

Just lately, the European Union turned the centre stage of an information privateness controversy associated…

August 14, 2024

Corpay to Acquire Paymerang

Corpay, (NYSE: CPAY), an Atlanta, CA-based company funds firm, is to accumulate Paymerang, a Richmond, VA-based…

May 13, 2024

Samsung, SK Hynix Ink Deal to Supply Gear to Stargate

(Bloomberg) -- Samsung Electronics Firm and SK Hynix Firm have cast preliminary agreements to provide…

October 1, 2025

US military cloud no longer backed by Microsoft’s China team

Microsoft has stopped letting engineers primarily based in China present technical assist for US navy…

July 21, 2025

Nous Chat launches with access to Hermes 3-70B

Be a part of our every day and weekly newsletters for the newest updates and…

November 7, 2024

You Might Also Like

Lenovo's YouTube channel.
AI

FIFA World Cup 2026 will be the most AI-driven tournament ever. Here’s the proof

By saad
Wayve vehicle in London as the integration of physical AI into vehicles remains a primary objective for automakers looking to accelerate innovation.
AI

How physical AI integration accelerates vehicle innovation

By saad
New partnership to offer smart robots for dangerous environments
AI

New partnership to offer smart robots for dangerous environments

By saad
Software screenshot as virtual simulation data is driving the development of physical AI across corporate environments, led by initiatives like Ai2’s MolmoBot.
AI

Building physical AI with virtual simulation data

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.