AI bots everywhere. Does anyone have a good whitelist for robots.txt?

My niche little site, http://golfcourse.wiki, seems to be very popular with AI bots. They have basically become most of my traffic. Most of them follow robots.txt, and that's nice and all, but they are costing me a non-trivial amount of money.

I don't want to block most search engines. I don't want to block legitimate institutions like archive.org. Is there a whitelist that I could crib instead of pretty much having to update my robots file every damn day?
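A minimal sketch of the allowlist approach being asked about: name the crawlers you want, allow them, and disallow everything else with a catch-all rule. The user-agent tokens in `ALLOWED_AGENTS` are illustrative assumptions, not a vetted list, and the helper names `build_allowlist_robots` and `can_fetch` are hypothetical. This only helps against bots that actually honor robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Assumed allowlist: illustrative user-agent tokens, not a vetted or complete list.
ALLOWED_AGENTS = ["Googlebot", "Bingbot", "DuckDuckBot", "archive.org_bot"]


def build_allowlist_robots(allowed_agents):
    """Return robots.txt text that allows the named agents and disallows all others."""
    lines = []
    for agent in allowed_agents:
        # One group per allowed crawler, granting access to the whole site.
        lines += [f"User-agent: {agent}", "Allow: /", ""]
    # Catch-all group: any crawler not named above is asked to stay out.
    lines += ["User-agent: *", "Disallow: /"]
    return "\n".join(lines) + "\n"


def can_fetch(robots_txt, agent, url="/"):
    """Check how a compliant crawler with this user agent would read the rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


robots_txt = build_allowlist_robots(ALLOWED_AGENTS)

if __name__ == "__main__":
    print(robots_txt)
```

With this shape, a new AI crawler is excluded by default, so the file only needs editing when a crawler should be added to the allowlist, rather than every time a new bot shows up.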


Comments URL: https://news.ycombinator.com/item?id=42861047

Points: 15

# Comments: 8


Created 6mo ago | 29 Jan 2025, 04:50:08

