Search engines now blocked from indexing non-canonical pages

sam · March 15, 2022, 10:33pm

We continued analyzing issues following this change and decided to roll it back per:

github.com/discourse/discourse

FEATURE: enable canonical url indexing

discourse:main ← discourse:enable_indexing_canonical

opened 10:30PM - 15 Mar 22 UTC

SamSaffron

+1 -1

We rolled out a change to disable canonical indexing.

The goal behind it was to limit how much of Google's crawl budget was spent scanning non-canonical topic links.

Since this change was applied, we rolled out two fixes that made the change unnecessary.

1. Topic RSS feeds are no longer followed, and links in the RSS feeds are not followed. E.g.: https://meta.discourse.org/t/search-engines-now-blocked-from-indexing-non-canonical-pages/218985.rss
2. Post RSS feeds now contain canonical links. E.g.: https://meta.discourse.org/posts.rss

Combined, these two changes mean crawlers no longer discover a large number of non-canonical links on Discourse sites.

This frees up crawl budget and means the site setting is no longer required. Site operators are still free to experiment with it; however, it is disabled by default.
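
For anyone who wants to spot-check the second fix on their own site, here is a rough Python sketch that scans an RSS document for topic links that look non-canonical. It rests on an assumption not stated above: that a canonical topic link has the form /t/<slug>/<topic_id>, and a non-canonical one appends a post number (/t/<slug>/<topic_id>/<post_number>). The sample feed, the forum.example.com host, and the function name are all made up for illustration; this is not how Discourse itself checks anything.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit

# Assumption: a non-canonical topic link carries a trailing post number,
# i.e. /t/<slug>/<topic_id>/<post_number>. A topic whose slug is purely
# numeric would be misclassified by this simple pattern.
NON_CANONICAL_PATH = re.compile(r"^/t/[^/]+/\d+/\d+/?$")

# Hypothetical feed, shaped like a minimal RSS 2.0 document.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Latest posts</title>
    <link>https://forum.example.com/posts</link>
    <item>
      <title>First topic</title>
      <link>https://forum.example.com/t/some-topic/123</link>
    </item>
    <item>
      <title>Reply in first topic</title>
      <link>https://forum.example.com/t/some-topic/123/4</link>
    </item>
  </channel>
</rss>"""


def non_canonical_links(rss_xml: str) -> list[str]:
    """Return every <link> in the feed whose path ends in a post number."""
    root = ET.fromstring(rss_xml)
    links = [el.text.strip() for el in root.iter("link") if el.text]
    return [
        link for link in links
        if NON_CANONICAL_PATH.match(urlsplit(link).path)
    ]


if __name__ == "__main__":
    # Only the link with the trailing /4 is reported.
    print(non_canonical_links(SAMPLE_FEED))
```

To run it against a real site, you would fetch the feed text yourself (for example from your site's posts.rss) and pass it to `non_canonical_links`; an empty list would suggest the feed only exposes canonical topic links under the assumption above.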
