爬虫被阻止，需要登录才能访问受限制页面

Isambard · 2024 年6 月 2 日 15:17

我的论坛上有一些分类需要特定的TL级别才能阅读。

Google 尝试抓取这些内容时会遇到错误。这些内容是否应该被 robots.txt 自动排除？

sam · 2024 年6 月 3 日 02:12

谷歌从哪里获取的链接，这些主题是否显示在站点地图中？

Isambard · 2024 年6 月 3 日 08:22

嗯。问得好。我看到帖子的规范 URL 不包含 category_id，因此无法轻松进行过滤。假设它不在站点地图中，如果 Google 在其他地方找到该链接，则没有简单的阻止方法，除非您将每个单独的 URL 包含在 robots.txt 中，但这并非明智之举。

system · 2024 年7 月 3 日 08:22

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

话题		回复	浏览量
Allow google access to logged in categories Support	9	589	2023 年9 月 11 日
Prevent Google from Accessing Specific categories Support	4	722	2024 年5 月 11 日
Need to find a solution to block indexing topics from a category Support	2	64	2024 年11 月 27 日
Category doesn't appear for search engine web spiders? Feature	3	1136	2014 年6 月 5 日
No-index for categories Support	0	363	2021 年4 月 7 日