These accusations have prompted the Federal Trade Commission to initiate an investigation into OpenAI’s compliance with consumer protection laws. This investigation aims to assess OpenAI’s efforts to rectify false or personal information revealed by its AI models, including ChatGPT.
While OpenAI is offering a potential avenue for improving AI models, it isn’t clear how the company trained its previous ChatGPT versions. The lack of transparency regarding AI model development methods and the potential privacy implications of data collection remains a significant concern.
As AI evolves, it’s crucial to establish robust safeguards, transparent practices, and ethical data usage to ensure both progress and privacy are maintained. And it’s important to understand that you can block it on your site.
How to Block OpenAI web crawler GPTBot
To execute the block, add GPTBot in your site’s robots.txt file and disallow it:
- User-agent: GPTBot
- Disallow: /
GPTBot can be identified with the following token:
- User agent token: GPTBot
- Full user-agent string: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; GPTBot/1.0; +https://openai.com/gptbot)
You can also customize access by adding the GPTBot Token in select parts of your site’s robot.txt file. Here’s an example:
- User-agent: GPTBot
- Allow: /directory-1/
- Disallow: /directory-2/