OpenAI Web Crawler GPTBot Privacy Controversy, Here’s How to Block It

0
291

These accusations have prompted the Federal Trade Commission to initiate an investigation into OpenAI’s compliance with consumer protection laws. This investigation aims to assess OpenAI’s efforts to rectify false or personal information revealed by its AI models, including ChatGPT.

While OpenAI is offering a potential avenue for improving AI models, it isn’t clear how the company trained its previous ChatGPT versions. The lack of transparency regarding AI model development methods and the potential privacy implications of data collection remains a significant concern.

As AI evolves, it’s crucial to establish robust safeguards, transparent practices, and ethical data usage to ensure both progress and privacy are maintained. And it’s important to understand that you can block it on your site.

Signup for the USA Herald exclusive Newsletter

How to Block OpenAI web crawler GPTBot

To execute the block, add GPTBot in your site’s robots.txt file and disallow it:

  • User-agent: GPTBot
  • Disallow: /

GPTBot can be identified with the following token:

  • User agent token: GPTBot
  • Full user-agent string: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; GPTBot/1.0; +https://openai.com/gptbot)

You can also customize access by adding the GPTBot Token in select parts of your site’s robot.txt file. Here’s an example:

  • User-agent: GPTBot
  • Allow: /directory-1/
  • Disallow: /directory-2/