O ChatGPT it may even please us, mere internet users. But there are many companies upset with this new technology, to the point of blocking GPTBot, a robot from OpenAI, creator of ChatGPT, which collects web content from across the web.
According to a survey by Originality.ai, over 15 of the 100 most accessed websites on the internet blocked the robot. Some of the sites on the list are powerhouses and have thousands of hits daily.
see more
Google launches tool that improves security levels and…
No more complications: replace the HD with an SSD without having to reinstall the…
Some of them are:
In general terms, it is a way of protecting the copyright of the content on these sites.
According to a Reuters spokesperson, “Intellectual property is the lifeblood of our business and we need to protect the copyright of our content.” The comment was made to the report in The Guardian newspaper.
There is also another explanation: to prevent GPTBot from using the content of these domains to train and develop other Artificial Intelligences.
GPTBot is what is called a “crawler”. In other words, a robot that “crawls” around the internet collecting information and data. This is not a new technology. Google, Bing and other search engines also used it to index pages and display results quickly.
However, OpenAI wants to use crawlers to train its software. With this information, they could update the ChatGPT and make you even sharper and more competent.
GPTBot was announced in August 2023. Aware of the possible negative repercussions, OpenAI also presented all the necessary equipment so that websites could prevent their crawler from collecting their content.
Other crawlers were also blocked from the sites mentioned at the beginning of the article. Among them is CCBot, used for Common Crawl. The purpose of this tool is to create public, non-profit archives.
As a result, it is speculated that not only copyrights are at stake in companies' fight against AIs. One theory is that companies want users to access their content straight from the source – generating access and revenue for them, not the 3rd.
Graduated in Social Communication from the Federal University of Goiás. Passionate about digital media, pop culture, technology, politics and psychoanalysis.