Suggested Topics - Title & Content based Suggestions

Hi,

New here, so sorry if I’m beating a dead horse.

I agree with @sam that there is a rabbit hole, but on the other hand, topic modeling technology is now pretty mature, and pretty good off-the-shelf tools exist. A recent project of mine has analyzed ~ 5 million patent titles and abstracts; analyzing order of ~1000’s of topics on my spiffy new discourse site would be a piece of cake. Moreover my community might have energy to make it happen.

From the experts: I would like advice on whether I should be thinking of designing a plugin, or should I think in terms of messing with discourse source (which I have downloaded from github)?

Found this on scraping discourse topics with python, but haven’t got it to work yet. Something like it should allow me to pull the data offline, build the model, loadable for querying subsequently.

Most of the good tools are in python, FWIW…

4 Likes