Plugin de nuvem de palavras para Discourse?

silvacarl · Fevereiro 25, 2021, 10:13pm

Is there a word cloud plugin for discourse?

Carl

awesomerobot · Fevereiro 26, 2021, 4:03am

There is not… is there a specific reason you’d like one? how would it be used?

silvacarl · Fevereiro 26, 2021, 5:11pm

if would be cool in two ways. one, a word cloud i could click on could then bring up all the topics that match that click on a word like “subscriber”.

two, you could display other types of searches like this, or top posters, or whatever you want.

probably could be something that runs in a cron job one a day or more often.

merefield · Fevereiro 27, 2021, 1:24pm

I thought this was a fun idea … so I created it*

It’s at a very early ‘just working’ stage and needs a lot of refinement and additional options and potentially some click functionality:

It adds a link on your Hamburger Menu.

be aware that currently it builds the word stats from all Posts, regardless of type and location. This could effectively act as a very-round-the-houses mild privacy leak (might need some additional safeguards to exclude words from posts in private areas). You have to be logged in to see it and access the data though … and the words are rendered as SVG’s … and it only shows the top x hundred words, so unlikely to be much of a concern to most sites. I’ll work on that to make it more secure, but this way the query runs very fast.

Enjoy.

*It leverages some pretty nifty existing libraries which I’ve credited in the repo. Shout out to @DiscourseMetrics whose query I leveraged.

silvacarl · Fevereiro 27, 2021, 8:25pm

very cool. i think you would also want to not include certain words in the word cloud?

merefield · Fevereiro 27, 2021, 10:46pm

Sure, it needs a whole load of sensible exclusions and the regexes need work to get rid of markdown formatting etc. whilst not making it overly complicated. This is just a start. I’ve just added some colour.

silvacarl · Fevereiro 27, 2021, 11:18pm

Just to be clear though it’s awesome lol

merefield · Fevereiro 28, 2021, 1:03pm

Added a localised list of ignore words:

which should make results a little more interesting …

I’ve also added a lot of sanitising logic, so the result is much better.

tobiaseigen · Fevereiro 28, 2021, 10:05pm

Nice! I like this effort. Nice job. If I could request features:

make the hamburger menu link optional (I like the idea of this being an easter egg)
create category setting, to only include selected categories
provide a category route so you can generate a word cloud of just one category and sub-categories, e.g. /wordcloud/category

Here’s how it looks on my neighborhood forum.

silvacarl · Março 1, 2021, 10:06pm

works well, need to fine tune it:

merefield · Março 2, 2021, 10:57am

Great feedback, thanks, and some good ideas!

Yes that sounds like a good approach. 3 metres deep in client work atm but will look at Category selection for next update.

merefield · Março 8, 2021, 3:02pm

Category selection is in:

FEATURE: restrict word stats to specific Categories · merefield/discourse-word-cloud@0777adc · GitHub

If you select no Category (default) you get a scan of all forum Posts (PMs and all). If you add just one Category, word stats are restricted to that etc.
As are humungous improvements to the regex’s ( ) which now clean up the ‘raws’ nicely and get rid of most if not all the Markdown.

NB Word stats are updated every hour now (which is probably still excessive, but for the time being makes it easier to checkout changes in Production as we go through a lot of initial code evolution).

NB#2 I’ve not yet considered other languages here beyond English (it’s certainly not tested). The current word manipulation may not work well in some languages. Suggestions & PR’s welcome.

tobiaseigen · Março 8, 2021, 6:27pm

Cool! Here’s an updated wordle just including the most relevant categories.

Mine is a small community and still fairly new. To be honest, though, the info presented in the wordle looks pretty but is not especially meaningful or useful. I guess it could be used as a visual in a retrospective topic about the community or something along those lines. Would be fun to see more examples of how people use this.

Some of the included words are common and meaningless, e.g. youd, off, got, add etc. I wonder if the “word cloud ignore portion” setting (which is 100 for me, the default) is doing its job? Or maybe there is another/better list of words to ignore?

merefield · Março 8, 2021, 6:33pm

Yeah, happy to consider a larger list (I’d found a 200 word list here, but deferred to wikipedia as a more ‘authoritative source’)

merefield · Março 9, 2021, 4:33pm

OK i’ve:

expanded the ignore list to 300 words, using a list I found here
enhanced the regex’s to strip out quotes (so the word ‘quote’ didn’t get featured so much!)
removed the arbitrary cull of the top ten remaining words which was redundant after adding the ignore list.

NB if there are still words you want to exclude, just add them to the beginning of:

like i’ve done here (eg. ‘ive’, ‘its’, ‘topic’, ‘post’)

to see the impact of any changes more quickly, simply re-trigger the job from Sidekiq:

That’s it for a while I suggest. I may create a dedicated Topic.

merefield · Março 9, 2021, 7:53pm

OK, you might like this:

Update: I’ve now simplified the ignore list arrangement so there’s no longer a setting for ‘portion’ of ignore list employed, you simply have to delete or add words to the ignore list using the native localised setting:

silvacarl · Maio 19, 2021, 12:39am

do we need to uninstall old version to get this?

merefield · Maio 20, 2021, 5:19am

You should only need to upgrade the plugin. Having issues?

silvacarl · Maio 20, 2021, 3:53pm

i apologize we figured it out.

merefield · Maio 20, 2021, 4:18pm

No problem at all

Tópico		Respostas	Visualizações
Word Cloud plugin Plugin	23	3153	23 de Outubro de 2024
Restrict users to post certain words per category Support	5	961	21 de Fevereiro de 2023
Use WP Discourse to publish posts from Wordpress to Discourse Administrators wordpress , video , how-to	46	5920	7 de Dezembro de 2024
Discourse Tag Cloud Theme component	25	3137	26 de Março de 2024
Wp-discourse-shortcodes plugin Extras	112	20414	23 de Outubro de 2025

Plugin de nuvem de palavras para Discourse?

Tópicos relacionados