From time to time we get requests to set the canonical URL of embedded topics to the URL of the blog post. I’ve created a pull request that does exactly that. It unconditionally uses the URL of the original blog post (embed_url) as canonical for the topic.
After reading those posts I’m not so sure about my solution anymore, so I’d like to get some feedback from you.
Should this be a configurable? Is there a good reason for keeping the current behavior of always using the topic’s URL as canonical?
Should the blog post’s URL only be used as canonical for N pages presented to the search bot? After all, only a certain amount of posts is embedded in the blog post. (N probably should be 1)
I’d appreciate your feedback on this. I’m sure there are lots of different use cases out there and I’d like to make an informed decision before I change anything that could affect search engines.
My thought is that if you are copying and re-posting content from a Blog post for any reason conversation or not, the original blog post is the original content and should be pointed to canonically as the original content.
Yes. And by default don’t enable the blog post’s URL as canonical. Let the customer set the switch. Otherwise this is going to change a lot of Search referral traffic all of a sudden.
IMHO, only the blog post linking top post should be made canonical. The responses and follow ups should not be.
With the WordPress plugin, sites can choose between publishing an excerpt, or publishing the full post to Discourse. Sites that are only publishing an excerpt might not want the canonical URL set to the blog post.
That’s not possible. Discourse presents topics as paginated content to crawlers. That’s why I suggested to change only the canonical of the first page.
Yeah, I’m going to make this a per host setting. @simon Will this work for the Wordpress plugin as well?
We just got to be careful here… this is a very sharp instrument. If for example wordpress is in “Top N” mode where it show only the best content we can end up setting a canonical to a page that does not have all the overlapping content, this is terrible signal to search engines and can be penalised heavily.
In fact, the whole “collapsing” of OP may make this a bad idea, the OP really should be a complete duplicate of the canonical page, so we may need a different technique there that collapses on client side.
I originally wanted to weigh in here and join in the calls for this feature, but after diving in a bit deeper I wanted to share what I learned about how this works (in case anyone missed it like I did initially!)
We’ve just embedded Discourse as the comment system for our blog and I had a little mini freak out when I clicked the “Show full post…” button and saw the whole blog copied without the correct canonical URL
After taking a few deep breaths I went into my “debug mode” and started checking the straight HTML response and checked how much of the post is actually there. As it turns out only the initial paragraph is included in the HTML and therefore this is all Google will see. Phew!
Having a second look at it, it makes perfect sense in the way the UX is laid out. I’m assuming the reason it’s hidden behind a button is because you want people to be able to read the full post and not affect SEO
I guess initially I was surprised that that “Show full Post…” wasn’t just a link to the original blog but I guess it’s an OK way to do it
تم تنفيذ هذه الميزة الآن باستخدام إعداد الموقع embed set canonical url. هذا الإعداد معطل افتراضيًا. عند تمكينه، يحدد عنوان URL الأساسي للمواضيع المضمنة إلى عنوان URL للمحتوى المضمن.
الميزة موجودة منذ فترة. سأكون فضوليًا للاستماع إلى أي مواقع قامت بتمكينها حول كيفية تأثيرها على ترتيبها في محركات البحث (SEO).
مرحبًا @simon، كنت أواجه صعوبة في العثور على حل لتعيين عناوين URL الأصلية (canonical URLs) للمواضيع المحددة في مجتمعي عندما صادفت هذا المنشور.
يبدو أن هذا الإعداد قد يقدم حلاً، لكنني لا أفهم ما هي “المواضيع المضمنة”. لقد حاولت البحث عنها في هذا المجتمع لكنني لم أجد أي شرح لها. ربما يكون هذا أمرًا أساسيًا جدًا. لكن هل يمكنك إخباري بما هي المواضيع المضمنة أو كيفية تضمين مواضيع في مجتمع Discourse؟
الموضوع المضمن هو موضوع تم تعيين خاصية embed_url فيه إلى عنوان URL لموقع خارجي. أنا على علم فقط بذلك عند نشر المواضيع على Discourse عبر واجهة برمجة التطبيقات (API). على سبيل المثال، يقوم كل من إضافة Discourse لـ WordPress ورمز تضمين Discourse JavaScript بإنشاء مواضيع مضمنة.
إذا كنت تنشر مواضيعك على Discourse من موقع خارجي، فإن هذا النهج سيكون منطقيًا. ومع ذلك، لن تتمكن من استخدام هذا النهج للمواضيع التي يتم إنشاؤها مباشرة على Discourse.