Smarter handling of random crawler traffic

sam · March 29, 2018, 3:51am

This is now complete thanks to @neil!

You can see “top crawlers” at https://yoursite/admin/reports/page_view_crawler_reqs

You can blacklist bad crawlers by setting blacklisted crawler user agents

Alternatively, if you wish to only allow particular crawlers, you can set the whitelist with the setting whitelisted crawler user agents

Topic		Replies	Views
Too many Crawlers, is that a problem? Data & reporting	6	2487	June 25, 2020
How to block all crawlers but Google's Feature	1	3966	July 21, 2019
Controlling Web Crawlers For a Site Site Management how-to	9	2144	September 14, 2024
How to protect myself from bots crawling my Discourse instance? Support	6	1578	January 17, 2022
MegaIndex bot did about 4,000 pageviews on one day Community	40	4433	December 2, 2023