Convert image to text

vainaixr · September 20, 2022, 5:56am

People post screenshots, could there be a way to extract text from an image, and added at the bottom of the post

Jagster · September 20, 2022, 5:59am

Sure. Google OCR.

But not by Discourse. And I would guess such funtionality isn’t coming quite soon anyway

merefield · September 20, 2022, 6:27am

Suspect you’d have to create a plug-in either by authoring it yourself or engaging a freelancer marketplace

michaeld · September 20, 2022, 6:36am

See this plugin

Client (@csmu) never paid me BTW

Tris20 · February 1, 2023, 10:37am

Hey @michaeld

Quickly skimming this plugin, am I right that the images are sent to google servers for processing? What was the reasoning for this approach rather than using a ruby gem to process locally or on the server of the discourse instace? I’m interested in this topic, but submitting images out of house isn’t an option.

michaeld · February 1, 2023, 10:58am

Better performance, ease of maintenance, avoiding version dependencies on local installation.

I understand that this is not always an acceptable approach. A PR is welcome although the user should always be able to avoid a local dependency hell.

Tris20 · February 1, 2023, 12:16pm

Interesting. I guess this was mostly focussed on hand writing though right? If it was simply extracting text from an image, for example an error screenshot, then I guess a local gem might be accurate enough. I played with a python library for something like this a while ago and got reasonable results. Sometimes it was garbage, but the results would never be read by the community, only the search engine. If the user notcied something silly they could always modify the hidden text.

michaeld · February 1, 2023, 12:43pm

I don’t want reasonable results, I want excellent results.

Jagster · February 1, 2023, 12:52pm

There is no OCR that can offer excellent results. Even resonable can be hard to achieve — no matter what library is in use,

Ed_S · February 1, 2023, 10:54pm

Bear in mind OCR is often working on screen grabs, not on scans or photos. It still won’t be 100%, but it’s a good kind of text to try to recognise.

I note that Mastodon’s Web UI offers an OCR function in the dialogue where you can enter an image description for accessibility reasons. It might be that it runs server-side. Here’s what it looks like, after I clicked on “Detect text from picture”:

Tris20 · February 9, 2023, 3:27pm

Interesting. Looks like it has similar results to Tesseract. I wonder how the Mastodon tool handles images with graphics as well as text?

A noble goal Whilst I share the desire for excellent results, I’ll be happy with an 80% improvement

In the context I have in mind, the goal is to extract things like error messages from screenshots. For example, if a user has an error log in their terminal, the tendency is to just screencap it. Even if the result isn’t perfect, if it extracts about 80% of the text correctly, then someone searching for the error message, or another related piece of text has a far higher chance of finding the Topic, than if it was just the unsearchable image.

Topic		Replies	Views
Transcribing handwritten text in images plugin Marketplace	7	844	October 28, 2021
Ai plugin ocr support Feature ai	11	830	April 2, 2024
Autorecognize text in image for Alt-Text Feature	3	583	February 22, 2024
Adding a picture questions feature Feature ai	3	771	January 12, 2024
Imgur API for Images Feature	7	1482	September 7, 2020

Convert image to text

Related topics