I think I might have found another bug, and I don't know if I should change it or not. Usually I see \r\n\r\n in the database everywhere that someone makes a new paragraph. I just stumbled across the (wrong) HTML equivalent of <br /><br /> in the actual database, where I think it should be \r\n\r\n instead. Or vice versa. It looks like the convert missed something involving that, correct? What should be change, and what should it change to, if anything?
It doesn't appear to be related, but I made a post about a similar topic here:
http://www.simplemachines.org/community/index.php?topic=344103.msg2324495#msg2324495In that topic, I think the problem is in one of the functions in SSI.php, not in the actual operation of the forum, or the conversion process, like in this case. The similarities might just be coincidental.
UPDATE:
Now that I look closer at the database, it seems only the posts made with SMF originally, after the conversion, are using <br />. It appears \r\n was used in vBulletin, but never got converted to the SMF way. If that's correct, then my related post mentioned above suddenly becomes relevant.
<br /> is fine for line endings, but <br /><br /> ought to be replaced with <p></p> tags around the block that was ended with <br /><br />. That's a flaw, if not a bug per se, and is relevant to conversion only because the convert didn't keep everything consistent, flawed or otherwise.
UPDATE:
I found some more problems with the conversion. This bizarre undefined character, , shows up in one of the posts and breaks validation on this page:
http://www.livebusinesschat.com/smf/index.php/topic,16.0/all.htmlI don't know where it came from or why it's there, but I don't think it was there before the conversion. That's another thing the "find errors" function ought to find is undefined characters. It could be a sign of data corruption, although in this case it does not appear to be.
The other problem I have found looks like bb code that doesn't do anything like [ right ][/ right ] that probably ought to be removed in the conversion or settings changes, but wasn't. And then there's odd things like bbcode closing tags that don't seem to have an opening tag. I'm not certain, but it looks like some or all opening [ quote ] tags have been lost too. It looks like the converter tried to remove them, but only got half of it right, like [/ b ]. And one last problem seems to be bbcode that nothing gets done with it, like [ snapback ][/ snapback ]. All of those issues are visible here:
http://www.livebusinesschat.com/smf/index.php/topic,9.0/all.htmlHow do I get all of these problems fixed? I managed to fix the < and > html entities by hand, and it doesn't appear that I've accidentally screwed up the <br /> tags that SMF is gratuitously littering everywhere, but there weren't many of those, and all of the postings with the problem were from VB so there were no competing < and > characters to interfere with find and replace. I'm not sure how to fix these new problems, since I'm not sure what's going on with them.
Can I add here that after this experience, I have discovered that \r\n is much preferred over HTML tags in the database code since it makes it SO MUCH easier to find and fix problems without HTML everywhere. The HTML ought to be added after reading the data from the database.