Removal of http:// and/or https:// from links inside posts

Started by pepf, February 10, 2018, 10:03:40 AM

pepf

Hello,
I am looking for a mod that automatically removes the http:// and/or https:// from links inside posts. I am not sure whether something like this already exists, but I cannot find one.

Thanks in advance for any information on this.

Peter

Steve

Are you trying to make the links unclickable, or do you still want them clickable but just not showing the http/https bit?
DO NOT pm me for support!

Antes


pepf

Thanks for answering.
The links should not be clickable, but above all I want them not to be spidered by search engines (SE). So it is actually the whole underlying code, "<href=http://...>", that should not be there, I suppose.

The reason is that many of the posted links point to target content that is removed after some time, while the textual and image content of the post is still in use. The SE should not 'see' and follow links to non-existent external content.

butch2k

Hmm, just a quick thought: why not use the word censor function to replace http:// with hxxp://, for instance?
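
To make the idea concrete, here is a minimal sketch of that kind of replacement in Python. It is only an illustration of the text substitution, not SMF's word censor; the function name `defang` is made up for this example, and handling https:// as well is an assumption beyond the http:// case mentioned above.

```python
import re

# Matches "http://" or "https://" at a word boundary, in any letter case.
SCHEME = re.compile(r"\bhttp(s?)://", re.IGNORECASE)

def defang(text: str) -> str:
    """Replace http:// with hxxp:// (and https:// with hxxps://)."""
    return SCHEME.sub(r"hxxp\1://", text)

print(defang("see http://a.example and https://b.example"))
```

Note that a plain search for "http://" alone would not touch "https://" links, so a real censor setup would likely need an entry for each scheme.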

Kindred

I don't think that's what he wants to do.

I think he wants to still have them be a LINK, but to use Schemaless URLs.

At this time, SMF does not support Schemaless URLs...  and there is no mod to do so either (which is why this is in mod requests, I assume) :D
Слaва
Украинi

Please do not PM, IM or Email me with support questions.  You will get better and faster responses in the support boards.  Thank you.

"Loki is not evil, although he is certainly not a force for good. Loki is... complicated."

butch2k

Quote from: Kindred on February 12, 2018, 10:04:50 AM
I don't think that's what he wants to do.

I think he wants to still have them be a LINK, but to use Schemaless URLs.

At this time, SMF does not support Schemaless URLs...  and there is no mod to do so either (which is why this is in mod requests, I assume) :D

Using the word censor to replace:

http:// with //
and
https:// with //

might work, no?

Illori

That would also affect images, which may break them in img tags, right?

Kindred

And butch... I don't believe that the word censor function operates inside BBC tags....


so

[url=https://www.simplemachines.org]https://www.simplemachines.org[/url]

would convert to

[url=https://www.simplemachines.org]//www.simplemachines.org[/url]


it would DISPLAY as schemaless, but not actually BE so...
(and yes, I suspect it would break the IMG BBC as well, as Illori points out.)

So, this mod request would have to change the way that SMF handles URLs in the URL tag, IMG tag and potentially other spots (quote tag?)
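
The difference between what is displayed and what the link actually points to can be sketched as follows. This is a rough Python model that assumes, as suggested above, that the censor skips anything inside the square brackets of a BBC tag; it is not SMF's actual code, and the function name `censor_outside_tags` is hypothetical.

```python
import re

# Anything in square brackets is treated as a BBC tag and left alone.
TAG = re.compile(r"(\[[^\]]*\])")
SCHEME = re.compile(r"https?://", re.IGNORECASE)

def censor_outside_tags(post: str) -> str:
    """Replace http(s):// with // everywhere except inside [..] tags."""
    parts = TAG.split(post)
    # With a capturing group, re.split puts the tags at the odd indices.
    return "".join(
        part if i % 2 else SCHEME.sub("//", part)
        for i, part in enumerate(parts)
    )

post = "[url=https://www.simplemachines.org]https://www.simplemachines.org[/url]"
print(censor_outside_tags(post))
```

Under that assumption the visible text loses its scheme while the target inside [url=...] keeps it, and the address between [img] and [/img] would be rewritten too, which is where images could break.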

butch2k

Quote from: Illori on February 12, 2018, 11:18:17 AM
That would also affect images, which may break them in img tags, right?

Indeed, it would work if the image host serves content over the same scheme as the forum, but might break if not.
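
A small Python sketch of why that is: a URL starting with // takes its scheme from the page it appears on. The host names below are placeholders, not real sites, and `resolve` is just a thin wrapper around the standard library's `urljoin` for illustration.

```python
from urllib.parse import urljoin

def resolve(page_url: str, reference: str) -> str:
    """Resolve a reference the way a browser would, relative to the page."""
    return urljoin(page_url, reference)

image = "//images.example/pic.png"

# The same scheme-less reference seen from an HTTPS and an HTTP forum page.
https_forum = resolve("https://forum.example/index.php?topic=1", image)
http_forum = resolve("http://forum.example/index.php?topic=1", image)

print(https_forum)
print(http_forum)
```

So on an HTTPS forum the image is requested over HTTPS, and if the image host only answers on plain HTTP the image would not load.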

butch2k

Quote from: pepf on February 12, 2018, 04:32:31 AM
The links should not be clickable, but above all I want them not to be spidered by search engines (SE). So it is actually the whole underlying code, "<href=http://...>", that should not be there, I suppose.

I'm wondering what he needs. Is it just a rel="nofollow" for the SE, or a link that is not clickable at all?
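
The two options would produce different markup, roughly as in this Python sketch. It only illustrates the HTML output and is not how SMF or any existing mod renders links; the function name `render_link` and its `clickable` parameter are made up for the example.

```python
from html import escape

def render_link(url: str, clickable: bool = True) -> str:
    """Return either a nofollow anchor or the bare, escaped URL text."""
    safe = escape(url, quote=True)
    if not clickable:
        # Plain text only: no <a> element, so nothing to click.
        return safe
    return f'<a href="{safe}" rel="nofollow">{safe}</a>'

print(render_link("http://example.com/page"))
print(render_link("http://example.com/page", clickable=False))
```

In the first case the link is still there and clickable, it just asks search engines not to follow it; in the second case only the text of the address remains.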

Kindred

Oh... I missed that second message.

He just wants nofollow or non-clickable links? Then this works exactly as he wants:

https://custom.simplemachines.org/mods/index.php?mod=2404

pepf

Thanks for answering.

Kindred is right, butch. This is not what I want.

The forum is indexed daily by Google, sometimes several times a day. As a high percentage of posted links will reach a dead end after some time (target content removed or changed), in the long run there will be lots of dead links. This is bad for SE ranking. I just do not want Google or other search engines to see these links as something they can follow. The visitors are content with just copying the links into the browser to find the content.

For example, in the past only hyperlinks could be clicked, but now even plain-text links that include the "http://" can be turned into active links. If the browser can do this, the spiders can do it too. Removing the http part was just meant to remove something that the SE may follow anyway, now or in the future.
The rel="nofollow" attribute is like the robots.txt file: it may or may not be observed by the SE. That a nofollow page is not shown in the SE search results does not mean the spider did not follow the link. But following a dead link may affect ranking in the long run. Currently I remove the http:// part manually, but it gets a bit tedious when there are many.
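
What I do by hand amounts to something like the following Python sketch, shown only to make the request concrete. It is not an SMF mod: the function name `strip_scheme` is invented, it works on plain post text only, and it does not deal with addresses inside [url=...] or [img] tags.

```python
import re

# "http://" or "https://" at a word boundary, in any letter case.
SCHEME = re.compile(r"\bhttps?://", re.IGNORECASE)

def strip_scheme(text: str) -> str:
    """Remove the leading http:// or https:// from every URL in the text."""
    return SCHEME.sub("", text)

print(strip_scheme("See http://example.com/a and https://example.org/b"))
```

A visitor can still copy the remaining address into the browser, which is all that is needed here.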

Anyway, thank you for the link to the Anti-Spam mod, I may use this for the time being, Kindred.
