Google gives itself permission to scrape Public Data to Train AI Models

364
SHARE

In a recent update to its privacy policy, Google has announced that its AI services, including Bard and Cloud AI, may now be trained on public data collected from the web. The revised policy effective July 1, 2023, says the tech giant can scrape public data on the web.

Enter Email to View Articles

Loading...

The move has drawn attention to the company’s practices and raised concerns regarding privacy and the use of copyrighted materials. 

The updated policy clarifies that Google uses publicly available information to train language models for services like Google Translate, with newer services like Bard also included. Google emphasizes its commitment to privacy principles and safeguards aligned with its AI Principles.

Google will utilize the information to enhance its services and develop new products and features for the benefit of users and the public. It acknowledges the use of publicly available information to train Google’s AI models, including Google Translate, Bard, and Cloud AI capabilities.

Scrape public data including copyrighted info

While the update provides additional clarity on the services trained using collected data, it leaves unanswered questions about how Google prevents copyrighted materials from being included in the data pool.