Links with multiple dots aren’t linking right

Jagster · March 11, 2022, 10:31pm

If a link has more than one dot, composer don’t know how to handle it after second dot. Like this one:

https://www.sell.fi/sites/default/files/elainlaakarilehti/tieteelliset_artikkelit/kahkonen_t._et_al.canine_pancreatitis-_review.pdf

Bug or feature?

qwe_rfv · March 12, 2022, 6:48am

https://www.sell.fi/sites/default/files/elainlaakarilehti/tieteelliset_artikkelit/kahkonen_t._et_al.canine_pancreatitis-_review.pdf
I think you used emphasis effect writing canine_pancreatitis. like this text.

JammyDodger · March 12, 2022, 6:53am

The italics are because of the underscore after the second period. If you wrap the link in < > I think it displays properly:

https://www.sell.fi/sites/default/files/elainlaakarilehti/tieteelliset_artikkelit/kahkonen_t._et_al._canine_pancreatitis_-_review.pdf

<https://www.sell.fi/sites/default/files/elainlaakarilehti/tieteelliset_artikkelit/kahkonen_t._et_al._canine_pancreatitis_-_review.pdf>

Jagster · March 12, 2022, 7:18am

Every day something new. I didn’t know underscores does italic too.

So… it is feature, not bug

JammyDodger · March 12, 2022, 7:36am

Well, the auto-linking is getting confused by the multiple periods in the pdf link, and starts reading the following underscores as markdown, which might not be desirable?

But the escape <> wrap is a good workaround if you need it.

sam · March 12, 2022, 11:30pm

It is something that @Vitaly ia tracking, really depends on how common these issues are. If it is 1 in a million links it is not worth improving the linkifier, if it is 1 in 10 it certainly is

Jagster · March 13, 2022, 6:42am

It depends. If there is need to share scientific links and pdfs, as on my forum, it is really common covering almost every link. Otherwise, not so often but more than one dot is not so rare form either.

From my point of view — is there some reason why starting http:// or https:// and ending to a space is not enough, plus allowed fileforms of course?

sam · March 13, 2022, 6:58am

The rules are complex and evolving.

The linkifier also works without https:// prefix to add extra complexity.

I suspect that we could do something to relax the linkifiers behaviour for the https:// prefix case, but will leave Vitaly to weigh in.

Jagster · March 13, 2022, 7:04am

That is somehow my point

Well, I’m trying to teach users to use < and >. The biggest UX’ish issue here is Discourse starts to act in many cases too differently than users are expecting. But that is much deeper thing than just trying to remember when links work and when not.

pfaffman · March 13, 2022, 11:04am

That makes sense to me. I think it mount be ok to only linkify bare host names and require urls with a path to have http.

Vitaly · March 29, 2022, 6:32am

I’ve recorded test case but with high probability it’s the same emphasis-based issue as before.

In general, all links with protocol http(s):// can be fixed, and i start working on this very soon (really).

Fuzzy links have no realistic solution (and will open door to ReDoS). But, as i said many times, added value of this mode is not obvious, and it’s disabled by default because of many side effects. I did it only because other packages provided it.

sam · March 29, 2022, 7:19am

I think resolving the http(s):// case will give enormous amounts of mileage.

Vast majority of people are just copying a URL from the address bar of the browser, it certainly includes protocol.

Vitaly · March 29, 2022, 7:20am

Indeed. I don’t know real cases where links without protocol can be useful.

sam · March 29, 2022, 7:25am

It is usually for edge cases where users remember a domain and just want to type it in. Usually a path is not involved, a query param … almost never.

Eg:

Hey… did you check out discourse.org

Vitaly · April 22, 2022, 5:23pm

IMO forcing such users to type https:// for such links is more simple, than fighting side effects of fuzzy mode.

JammyDodger · June 20, 2022, 12:23pm

See also: Links broken with (at least) two underscores in URL

Topic		Replies	Views
Some links are misinterpreted Bug	7	691	August 8, 2022
Links with underbars surrounded by hyphens render incorrectly Bug	2	718	September 24, 2018
Links broken with (at least) two underscores in URL Bug	9	1997	June 15, 2022
A comma followed by an underscore in a URL results in defunct link Bug markdown-it-review	6	2772	June 26, 2017
URL auto-linking doesn't hyperlink if open paren precedes the protocol Feature markdown-it-review	6	2121	June 26, 2017

Links with multiple dots aren’t linking right

Related topics