This feels a bit… imagineer-y to me. Ultimately you would be rebuilding Google’s PageRank, that is, the topics that get the most backlinks (from reputable, non spammy sources…) get elevated in the search results.
Because that’s sure as heck what happens when you search for “emoji” in Google.
I support adding weights to categories.
As a very minimum, I’d decrease the weight of a “chill” category - its entries bother new users quite often.
How about the obvious:
- incoming links
Or are these already being weighted in the existing algorithm?
Sure, finding the perfect formula might be a bit much imagineering, but I agree that the search results could be improved.
Would the new algorithm be something that users can turn off in advanced search in order to exit the collective search bubble?
[ ] Turn off SaffronRank
Nothing personal Sam, but to me “search for the word emoji … not providing much value” begs the question, “what exactly would you consider to be of value?”
Kind of depends on what exactly one is looking for, no?
IMHO successful results depends greatly on ones search foo, good UI, and good search features.
As far as I’m concerned as a somewhat veteran user as opposed to a newbie, Discourse search works great. I can “sort by”, go to advanced search, and eg. search for
and get more fine tuned sets of results.
I have the feeling what this discussion is more about is how to have search have something like Google “ads” where a site could promote certain topics for searches that use certain “keywords”
As a rough analogy, If I go to a restaurant wanting roast beef rare, I ask for roast beef rare. If I go to a restaurant knowing I want seafood, but I am undecided about what seafood exactly, I ask the waiter for his recommendation.
I think having something like an “Admin Like” could work, but I’m not sure how to go about tying it to specific words.
We have a similar situation with our Discourse instance. We use it for our Knowledge Base but also for our forum.
Just this week I had someone struggling with something and they were looking at a two year old post on the topic rather than our “official” KB article on the same subject.
I can support this if adding search weights to categories is super super easy, otherwise it feels like the wrong approach.
We need to build Google’s PageRank properly if we are going to bother at all, we definitely have the backlink data (internally) as we track all links between topics.
This has definitely been discussed internally and seems like the most logical next step. Most of @sam’s complaints seem to be around certain categories (like #howto and #howto:faq ) not showing up higher in results.
Topics with more internal backlinks showing up higher in search results, is also sensible for similar reasons… but might be harder to do than a blanket category bump that site owners can tweak to taste.
At our hockey forum we have a category for live commentary of matches. The threads can gather nearly a thousand posts during a match which lasts less than three hours. (Most comments have from 1 to 10 words.) There are tons of likes and common search terms appear in threads many times. Thus, threads of the live commentary category show up high in search results with most search terms. Afterwards, the threads have some value: you can relive games and sense the emotions of fans by reading a thread. But that is all they have to offer. And most of the time when you are searching, you are certainly not looking for fans’ emotions. This is a little problem for our forum’s search function.
To sum up, sometimes instead of bumping some category up in results, you may want to drag a category down in results.
I suggest deleting those chat topics after they are done. Similarly, I question the value of us carrying ancient resolved bug discussions around forever @sam.
A tiny handful of bugs might have relevant discussion years later, but 90% of them are just search land mines we are leaving around that will blow up anyone who happens to land on them.
Still I agree with the concept of de-prioritizing a category, I would consider de-prioritizing uncategorized on meta
Agree we can probably delete a bunch of land mines, especially ones that are messing up our top 100 search terms
Not saying deletion is the whole solution either … but surely you can see how carrying around a bunch of ancient resolved bugs forever and ever is ultimately a bad deal for everyone involved.
Totally just it gets super hard to do automatically
The deletionist is fine with deleting 10% good content to get rid of 90% bad
The preservationist is not good with deleting 0.05% good content
And everyone is somewhere on the deletionist/preservationist scale
While I think things like bug reports and game commentary do lose value over time and become irrelevant, I see no harm in keeping everything as long as it’s clearly labeled.
The idea of an archive might be the perfect balance between deleting and keeping obsolete topics. It gets old topics out of sight but they remain there for any future need (whatever that may be).
1- Create Archive(s) category
2- For every category in the forum - bugs / support / lounge etc - create a matching subcategory under Archive(s) automatically.
3- Add option in wrench menu to move post to archive (filed under correct subcategory automatically)
4- Remove archived posts from live search results by default
5- Add option in advanced search options to include Archives
Add header to archived posts letting readers know they’re in the archives
There’s already the ability to archive topics, so no need for a separate category:
Ha! While typing this I had a feeling this idea was
probably most likely mentioned here before with all the brains flowing around in these forums
Oh well, looks like I’m three years late to the party
I appreciate the fair suggestion, but it’s impossible. We certainly want to offer the option of reading old chats. Just having those threads at the top of search results is too much. Therefore having category weights would be yet another great improvement for us.
Keeping a median or mode of post times with the topic will be an indicator on how recent this topic is. Doing it on the last [n] posts will be a measure of how fresh the topic.
In addition to category up/down bumps, sort by freshness.
A much-linked topic may not be fresh at all. Just very relevant during a particular period of time in history. Other than faq’s ans howto’s, which can simply be bumped.
@tgxworld has done A LOT recently to improve the story here.
Smarter indexing (we clean up a lot of previous mess that used to live in indexes)
Per category search weights
Smarter relevance scoring (that takes into account the length of posts better)
So for the OP here we used to have:
We now have:
I find the new results a lot better.
We are not 100% done with the OP, it may make sense to do per-topic priority at some point and allow mods to promote topics.
But … instead of carrying this topic open for more years I feel we can close this off for now.
@tgxworld will do a writeup about the new admin knobs once he is done.