Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

1. Someone asks chat gpt. No answer

2. Checks Google/SO and no answer

3. Figures it out themselves

4. Blogs about that

5. AIs that search pick it up first but eventually it ends up in training.

As well as AI teams paying for content for training.



And if people, like I'm doing, start to block AI access to their blogs?

Some studies show that the number of sites with AI blockers at their robots.txt has dramatically increased!

Right now, some companies are trying to ignore robots.txt, but after regulations...


Do you also block all google bots? Who's to say your data isn't scraped by the company but buys it from brokers who can? Robots.txt is theater.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: