URLs from emails are doubled


(Allen - Watchman Monitoring) #1

I see a lot of URLs duplicated… especially in people’s signatures, but one from today really got me.

Here’s how it looks in Discourse:

The plain text:

That=E2=80=99s a remote lock from iCloud. You can send a message when =
you remote lock it. You have to have Find My Mac turned on. =
https://support.apple.com/kb/ph2700?locale=3Den_GB =
<https://support.apple.com/kb/ph2700?locale=3Den_GB>

The raw HTML:

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dutf-8"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" =
class=3D"">That=E2=80=99s a remote lock from iCloud. You can send a =
message when you remote lock it. You have to have Find My Mac turned =
on.&nbsp;<a href=3D"https://support.apple.com/kb/ph2700?locale=3Den_GB" =
class=3D"">https://support.apple.com/kb/ph2700?locale=3Den_GB</a><div =
class=3D""><br class=3D""><div class=3D"">

Here’s how it rendered in Discourse:

That’s a remote lock from iCloud. You can send a message when you remote lock it. You have to have Find My Mac turned on. https://support.apple.com/kb/ph2700?locale=en_GB <https://support.apple.com/kb/ph2700?locale=en_GB>

I can provide the entire .eml file in a DM.

Can Discourse detect and de-duplicate this repeated content?
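
To make the request concrete, here is a minimal sketch of that kind of de-duplication. It assumes the plain-text part has already been decoded from quoted-printable, and that the repeat always takes the form `URL <URL>` with both copies identical. The name `dedupe_urls` is hypothetical and this is not how Discourse actually processes incoming email.

```python
import re

# Matches a bare URL followed by whitespace and the very same URL in angle brackets.
# Assumption: both copies are character-for-character identical.
DUPLICATE_URL = re.compile(r"(?P<url>https?://\S+)\s+<(?P=url)>")


def dedupe_urls(text: str) -> str:
    """Collapse 'URL <URL>' into a single 'URL' (hypothetical helper)."""
    return DUPLICATE_URL.sub(lambda match: match.group("url"), text)


sample = (
    "You have to have Find My Mac turned on. "
    "https://support.apple.com/kb/ph2700?locale=en_GB "
    "<https://support.apple.com/kb/ph2700?locale=en_GB>"
)
print(dedupe_urls(sample))
```

A bracketed URL that differs from the text before it is left alone, so a link whose target is not the same as its visible text would survive.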


(Jeff Atwood) #2

You mean specifically this is problematic:

<a href="https://support.apple.com/kb/ph2700?locale=en_GB" class="">
https://support.apple.com/kb/ph2700?locale=en_GB
</a>

(Allen - Watchman Monitoring) #3

Correct. The URL is duplicated.


(Allen - Watchman Monitoring) #4

I’m still seeing this happen…

Incoming message source:

w: website.net <http://website.net/>  e: andrew@website.net <mailto:andrew@website.net>

Expected post in Discourse:

w: website.net e: andrew@website.net

Actual post in Discourse:

w: website.net http://website.net/ e: andrew@website.net mailto:andrew@website.net

Happy to send the raw text & screenshots in a DM to whoever can look at this.
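
For what it's worth, the signature case is slightly different from the first example, because the visible text and the bracketed target are not identical (`website.net` vs `http://website.net/`, and the `mailto:` prefix). A rough sketch of one way to handle it, assuming it is safe to drop the bracketed target whenever it matches the label once the scheme and trailing slash are ignored. The function names are hypothetical, not anything in Discourse.

```python
import re

# A non-space label followed by whitespace and a target wrapped in angle brackets.
BRACKETED = re.compile(r"(?P<label>\S+)\s+<(?P<target>[^<>\s]+)>")


def _normalize(value: str) -> str:
    """Strip an http(s):// or mailto: prefix and a trailing slash, then lowercase."""
    value = re.sub(r"^(?:https?://|mailto:)", "", value, flags=re.IGNORECASE)
    return value.rstrip("/").lower()


def drop_redundant_targets(text: str) -> str:
    """Remove '<target>' when it only repeats the label before it (hypothetical helper)."""

    def replace(match: re.Match) -> str:
        if _normalize(match.group("label")) == _normalize(match.group("target")):
            return match.group("label")
        return match.group(0)  # label and target differ, so keep both

    return BRACKETED.sub(replace, text)


signature = "w: website.net <http://website.net/>  e: andrew@website.net <mailto:andrew@website.net>"
print(drop_redundant_targets(signature))
```

Anything where the label and the target genuinely differ, such as `docs <https://example.com>`, is left untouched.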