How should category archiving work?

We have a request from a customer that I’ve been expecting for a while:

We want to remove the beta and alpha categories, and all topics in those categories

Meaning, there are a whole lot of posts in those alpha and beta categories that represent the state of their software at a much older point in time, such that all the topics there are basically irrelevant, forever. (This is also video game software, where after 3 years a game is forgotten forever and never played again, so old beta/alphas of games are especially irrelevant in the world of software.)

The intent of the “archive” action on a topic is to prepare it for eventual archiving. Now that may mean deleting it. Or it could mean moving it to some kind of historical long term offline archives. At any rate, an archive is something that is

  • not of practical current interest
  • might be useful very rarely to someone digging through long term history for obscure reasons
  • helpful to remove from the current active instance to make room for newer, more relevant current content

I believe archiving should be triggered either by:

  • archiving out all topics with a state of archived
  • archiving out all topics in a particular category

I’m not sure we can physically remove the posts and topics from the database without extreme trauma to our codebase, so perhaps the only alternatives are to

  1. delete (all our deletes are soft deletes) every topic in the category
  2. mark the category as archived, and have special handling for topics in archived categories

Then, produce an export file of the archived topics.

This also implies there is a way to selectively bring back a set of archived posts, or an archived category, which is probably way too hard.

I guess the simple thing to do, for now, is

  • just mark every topic in the category (or all archived topics) as deleted for now.
  • make sure we have an ‘archived’ category state (perhaps a date of archive) to look at later.

Any thoughts here?

「いいね!」 6

There is another option that can work quite well, make the category a staff only category.

This effectively “removes” all the posts in one very quick go. Its clean, has simple undo and very minimal side effects that are “staff only”.

That is the way they have it now (I actually checked for that earlier), and it is not what they want.

It is just unnecessary clutter for them at this point, a bunch of obsolete content.

maybe you could provider an admin setting for “archived categories” or some such - add category names there that you want to disappear forever.

nice aspect of that would be that it really removes all the posts in one go, and is reversible, and even the admins won’t be able to see the posts.

Should there not be some way for normal users who are not forum admins to access the archives somehow? Or would that be part of the “special handling”.

Perhaps if we made sure that the archive file was sanitized of secret information, then they could be put up for download. Even if we don’t want it to be searchable.

If the posts get actually removed from the database, we could eventually get a screen like this:

This topic has been archived.

If you want to view it, click below and give us a minute to dig it up.

[ View Archived Topic ] Download archive of Alpha (2013-2014) (0.5GB)

If the removal of the content is the point, then a soft delete like you mention (assuming hard deletes are a problem) should be applied on all of the content, and all of these topics & posts should be excluded from the default backup. Instead, when you do the archiving, they should be put in a separate “Archives” backup of their own.

If you just don’t want your searches getting polluted & slowed down with deprecated posts, and you don’t want archived categories cluttering up your category list, then a “do not search” flag on archived post and a special category that doesn’t show even in staff lists (but retrievable through admin panel somehow) would suffice. Essentially making all topics Unlisted, plus special treatment of the category visibility.

We’ve talked about similar things before:

「いいね!」 3

There may also be some mapping here for moving a category to a different Discourse instance, which we have at least one paying customer wanting to do

What we need:

  • Copy one category
  • Copy all of its sub-categories (10)
  • Copy all posts in these categories (<50)
  • Copy all members that belong to the custom group; keep current usernames, email and passwords of these users on the new forum

Same basic area of work, IMO. Splitting one Discourse into another is a logical thing to do, only difference I can see in this case is that instead of transferring the entire category to /dev/null we are copying it to another Discourse instance…

Anyway keep that in mind as we work on this @neil!

「いいね!」 4

Moving category spec is here:

「いいね!」 3

これはかなり古い話ですが、Ask Fedora の Discourse サイト自体が数年経ちつつあるため、この問題について考えています。そのフォーラムはエンドユーザーのトラブルシューティング用ですが、一部のトピックは時代を超えて有用であり(過去の議論を掘り起こすことで、それが新しい問題なのか、あるいはなぜそのような状態になったのかの経緯を理解するのに多少役立つこともあります)、最も古いトピックは Fedora Linux 29 時代のものです。Fedora Linux 36 のリリースを目前に控え、状況は大きく変化しています。

そこで、非常に古いリリース(当サイトの「迅速な変化」のペースに照らせば、3 年以上前のもの)のトピックについては、以下のような扱いにしたいと考えています:

  • サイト内検索結果から除外する(ただし、明示的に検索された場合を除く)
    • 類似トピックとして提案されることは、確実に避ける
  • Google 検索からも非表示にするか?(まだ確信はありません)
  • 返信を無効化するが…
  • 発見した人が何か意見を述べたい場合に、そのトピックを簡単に参照できるよう、「+ 新しいトピック」リンクを目立たせる

(実際、最後の点は「クローズド済み」の投稿にも適用できると良さそうです!)

「いいね!」 1

Discourseのスタッフは、閉じられたスレッドや役に立たない投稿を削除する際にかなり容赦ありません。探しているものが見つからないとたまに悲しくなりますが、頻繁ではありません。自動システム(ひどい間違いを犯す可能性があります)よりも、積極的なモデレーションの方が良いと思います。

「いいね!」 2