In a quasi-related issue, one of the WSJ columnists put Hardee's chatbot drive-through ordering system through 30 tests, and it apparently did a pretty good job; only 3 orders had to be referred to a human.
Can you link to the announcement?
It would give those of us out of the (hyper fast) loop a bit of context
Perfect, thank you @RGJ
It seems it's specifically about this commitment:
So I think this really is up to the companies to facilitate. But watermarking text is essentially impossible, as @merefield mentioned above.
What would you expect Discourse to do in this case, @MikeNolan? If a user simply copy-pastes AI-generated text, there is no way for Discourse to know about it (apart from running spam and AI detectors), so I don't really see how this specific agreement changes anything for now.
User-pasted AI-generated content is probably not something Discourse can do much about, as it is likely indistinguishable from human-generated content (aside from possibly being better written). But if you use an official Discourse AI plugin, perhaps Discourse can watermark or otherwise denote what it generates?
Ah in that way, yes, I can see how that makes sense
We've started work on this. For example, this very topic's summary is watermarked:
The summarization UI is the part that has gotten the most love, so it's where we're already close to the final form and have this set up. Others will follow.
Maybe a bit semantic, but two properties of digital watermarks are that they are hidden from the casual viewer and hard to remove.
I would think that OPEN acknowledgement of AI-generated content is important, both for text and for images.
Hidden digital signatures are more useful for things like image copyright enforcement.
I'm active on the Ugly Hedgehog photography forum, where whether AI-generated or AI-modified images qualify as photographs is a hotly debated topic. (Some AI-generated images have won photography contests.)
The problem we're discussing right now is that people with malicious intent will use AI to generate things, remove the acknowledgement, and try to pass it off as human-generated content. That implies the requirement of an origin "tag" that's hard to remove.
The intent isn't necessarily malicious, but it is less than honest.
Good luck finding a way to "tag" AI-generated text that can't be defeated by something as rudimentary as cut-and-paste.
Could zero-width characters be used for that?
No, those can easily be removed by passing the content through a filter that only keeps normal alphabetical characters. Watermarking text is very, very hard. You basically cannot do it at the character representation level.
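To illustrate, here's a quick Python sketch (purely my own toy example, not any real watermarking scheme) of hiding bits in zero-width characters, and how a one-line filter erases the "watermark" without touching the visible text:

```python
import re

# Toy watermark: piggyback one hidden bit on each space using
# zero-width characters (invisible in rendered text).
ZW = {"0": "\u200b", "1": "\u200c"}  # ZERO WIDTH SPACE / NON-JOINER

def embed_zero_width(text: str, bits: str) -> str:
    out, bit_iter = [], iter(bits)
    for ch in text:
        out.append(ch)
        if ch == " ":
            b = next(bit_iter, None)
            if b is not None:
                out.append(ZW[b])
    return "".join(out)

# ...and the trivial defeat: drop everything in the zero-width ranges.
def strip_zero_width(text: str) -> str:
    return re.sub(r"[\u200b-\u200f\u2060\ufeff]", "", text)

marked = embed_zero_width("the quick brown fox jumps", "1011")
assert marked != "the quick brown fox jumps"
assert strip_zero_width(marked) == "the quick brown fox jumps"
```

Any sanitizer, Unicode normalizer, or plain-text editor in the pipeline does the same thing by accident.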
This blog post from Scott Aaronson explains a bit about how it could work. Scroll down to the "My Projects at OpenAI" section. The method outlined there is copy/paste-proof, @MikeNolan.
Thanks, that's interesting:
My main project so far has been a tool for statistically watermarking the outputs of a text model like GPT. Basically, whenever GPT generates some long text, we want there to be an otherwise unnoticeable secret signal in its choices of words, which you can use to prove later that, yes, this came from GPT. We want it to be much harder to take a GPT output and pass it off as if it came from a human. This could be helpful for preventing academic plagiarism, obviously, but also, for example, mass generation of propaganda… Or impersonating someone's writing style in order to incriminate them. These are all things one might want to make harder, right?
…
So then to watermark, instead of selecting the next token randomly, the idea will be to select it pseudorandomly, using a cryptographic pseudorandom function, whose key is known only to OpenAI. That won't make any detectable difference to the end user, assuming the end user can't distinguish the pseudorandom numbers from truly random ones.
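For anyone curious how that works in practice, here's a rough Python sketch of the mechanism Aaronson describes. To be clear, this is my own illustration, not OpenAI's code: the key, the 4-token context window, and all the names are made up.

```python
import hashlib
import hmac
import math

SECRET_KEY = b"known-only-to-the-provider"  # illustrative stand-in

def prf(context: tuple, token: str) -> float:
    """Keyed pseudorandom value in [0, 1) for a (context, token) pair."""
    msg = ("|".join(context) + "||" + token).encode()
    digest = hmac.new(SECRET_KEY, msg, hashlib.sha256).digest()
    return int.from_bytes(digest[:8], "big") / 2**64

def watermarked_choice(context: tuple, candidates: dict) -> str:
    """candidates maps each token to the model's probability for it.
    Choosing argmax r**(1/p) (a 'Gumbel trick' variant) still samples
    exactly from the model's distribution, but the choice is now
    reproducible by anyone holding the key."""
    return max(candidates,
               key=lambda tok: prf(context, tok) ** (1 / candidates[tok]))

def detect_score(tokens: list) -> float:
    """Average -ln(1 - r) over the text. Ordinary text averages about
    1.0 per token; watermarked text scores noticeably higher because
    the chosen tokens systematically carry large r values."""
    total = sum(-math.log(1 - prf(tuple(tokens[max(0, i - 4):i]), tokens[i]))
                for i in range(1, len(tokens)))
    return total / max(1, len(tokens) - 1)
```

The signal lives in the word choices themselves, which is why cut-and-paste can't remove it; paraphrasing the entire text, on the other hand, would.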
One of my concerns about trying to identify AI-generated writing is that it will accidentally target well-written human-generated text.
Well-written human-generated text seems to be the exception on many forums. :sigh:
I just go back to motivation.
If you identify bad intent, ban or suspend.
If it's well-written, well-intentioned text with facts that bear out, leave it?
What if the user's first language is not English and they've used ChatGPT to refine their grammar?
btw, here's how I preface AI Topic Summaries:
(eek CSS tweak needed!)
OK, I'm concerned it could target my posts
I think so. I don't see a problem with people using AI to help compose posts, assuming there's an actual human being making the decision as to whether or not the AI-generated text is worthy of posting.
There are a host of tools that can help improve grammar; I don't know whether ChatGPT is better than the rest of the bunch.
Improving grammar is a somewhat different issue than generating "original" content, though. The AI engines are starting to be targeted by the content owners who want to be reimbursed for using their material to train the AI engine.