[SMF Converter] vBulletin 3.5

N. N.:
That's what I would do, yes, replace the occurrences directly in a SQL backup file.

There is no easy way to redo the conversion now and correctly reimport the posts, members, etc. (It's not impossible, but it would take quite a lot of manual database work.)

Red G. Brown:
OK, since there are only a few occurrences, I should be able to manually check each replacement to make sure it's good before proceeding. Should I replace with HTML entities of the form &amp; ? Or should it be something like #039; ? I'm not sure what the difference is. I converted to UTF-8 from ISO 8859-1 (I think).

UPDATE:

I just had an idea. SMF should include this in its "check for errors" maintenance system. Right now it tells me there are no errors, when clearly there are. This check should be added so that such problems are at least identified in the future (repair is easy after that).

N. N.:
I just tested: I can reproduce the problem on a 3.6 database even with the latest converter, so rerunning it wouldn't help for now. Hopefully a fix can be considered for the next version of the converter.
Perhaps you're right that there are generally rather few cases, which would explain why it hasn't been reported yet and why we didn't know about this bug.
In any case, SMF stores &lt; and &gt; in the raw database data for any occurrences of '<' and '>', and needs to find them in that form in the database.
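For the manual fix on a backup, the replacement described above can be sketched roughly as follows. This is only an illustration in Python, not part of SMF or the converter: the function name `escape_angle_brackets` is made up, and it assumes `<br />` is the only literal tag that should survive in a message body.

```python
# Assumption: "<br />" is the only HTML tag that should stay literal in a body.
BR_TAG = "<br />"


def escape_angle_brackets(body: str) -> str:
    """Replace raw '<' and '>' with &lt; and &gt;, leaving '<br />' untouched."""
    # Split on the tag we want to keep, escape everything in between,
    # then put the tag back. Existing entities are not touched because
    # '&' is never rewritten here.
    parts = body.split(BR_TAG)
    escaped = [p.replace("<", "&lt;").replace(">", "&gt;") for p in parts]
    return BR_TAG.join(escaped)


if __name__ == "__main__":
    print(escape_angle_brackets("if a < b<br />then c > d"))
```

It would still be worth checking each changed row by hand, as suggested earlier in the thread, since this has only been thought through for plain message text.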

UPDATE:

--- Quote from: qwasty on October 23, 2009, 04:38:52 PM ---I should replace with HTML entities of the form &amp; ? Or, should it be something like #039; ? I'm not sure what the difference is. I converted to UTF-8 from ISO 8859-1 (I think).

--- End quote ---
Either is fine (it's simply the name, for those entities that have one, or the underlying numeric code). There is also a task in the forum's maintenance area that converts HTML entities to UTF-8 characters; it might be useful to run it afterwards.
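To illustrate that the named and numeric forms are interchangeable, here is a small Python sketch using the standard library's `html.unescape`. The helper name `decode_entity` is made up for the example, and this is not how SMF's maintenance task works internally; it only shows that both spellings decode to the same character.

```python
import html


def decode_entity(entity: str) -> str:
    """Decode a single HTML entity (named or numeric) to its character."""
    return html.unescape(entity)


if __name__ == "__main__":
    # Named form and numeric form of the same characters.
    for named, numeric in [("&lt;", "&#60;"), ("&gt;", "&#62;"), ("&amp;", "&#38;")]:
        print(named, numeric, decode_entity(named) == decode_entity(numeric))
    # The apostrophe is usually written in numeric form.
    print(repr(decode_entity("&#039;")))
```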

Red G. Brown:
I think I might have found another bug, and I don't know if I should change it or not. Usually I see \r\n\r\n in the database everywhere that someone starts a new paragraph. I just stumbled across the (wrong) HTML equivalent, <br /><br />, in the actual database, where I think it should be \r\n\r\n instead. Or vice versa. It looks like the converter missed something involving that, correct? What should be changed, and what should it be changed to, if anything?

It doesn't appear to be related, but I made a post about a similar topic here:

http://www.simplemachines.org/community/index.php?topic=344103.msg2324495#msg2324495

In that topic, I think the problem is in one of the functions in SSI.php, not in the actual operation of the forum, or the conversion process, like in this case. The similarities might just be coincidental.

UPDATE:

Now that I look closer at the database, it seems only the posts originally made with SMF, after the conversion, are using <br />. It appears \r\n was used in vBulletin but never got converted to the SMF way. If that's correct, then my related post mentioned above suddenly becomes relevant.

<br /> is fine for line endings, but <br /><br /> ought to be replaced with <p></p> tags around the block that it ended. That's a flaw, if not a bug per se, and it is relevant to the conversion only because the converter didn't keep everything consistent, flawed or otherwise.
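If the goal is only to make the converted vBulletin posts consistent with the posts written in SMF, the line-ending change could be sketched like this in Python. The function name `newlines_to_br` is hypothetical, and it assumes that `<br />` is the form wanted in every message body, which is my reading of the database rather than something confirmed here.

```python
import re

# Matches Windows (\r\n), old Mac (\r) and Unix (\n) line endings.
# "\r\n" comes first so a Windows line ending becomes one break, not two.
NEWLINE_RE = re.compile(r"\r\n|\r|\n")


def newlines_to_br(body: str) -> str:
    """Replace every raw line ending in a message body with '<br />'."""
    return NEWLINE_RE.sub("<br />", body)


if __name__ == "__main__":
    print(newlines_to_br("First paragraph.\r\n\r\nSecond paragraph."))
```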

UPDATE:

I found some more problems with the conversion. This bizarre undefined character, , shows up in one of the posts and breaks validation on this page:

http://www.livebusinesschat.com/smf/index.php/topic,16.0/all.html

I don't know where it came from or why it's there, but I don't think it was there before the conversion. Undefined characters are another thing the "find errors" function ought to detect. They could be a sign of data corruption, although in this case that does not appear to be the cause.
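A check like that could look roughly like the Python sketch below. It is only an idea of what such a scan might do, not anything SMF provides: the name `find_suspect_chars` and the choice of what counts as "suspect" (the Unicode replacement character, control characters other than tab and line endings, and unassigned, private-use or surrogate code points) are my assumptions.

```python
import unicodedata

# Whitespace control characters that are normal in message bodies.
ALLOWED_CONTROLS = {"\n", "\r", "\t"}
# Unicode general categories treated as suspect (assumption):
# Cc = control, Cn = unassigned, Co = private use, Cs = surrogate.
SUSPECT_CATEGORIES = {"Cc", "Cn", "Co", "Cs"}


def find_suspect_chars(body: str) -> list[tuple[int, str]]:
    """Return (position, code point) pairs for characters that look broken."""
    hits = []
    for index, char in enumerate(body):
        if char in ALLOWED_CONTROLS:
            continue
        is_replacement = char == "\ufffd"  # what decoders emit for bad bytes
        if is_replacement or unicodedata.category(char) in SUSPECT_CATEGORIES:
            hits.append((index, f"U+{ord(char):04X}"))
    return hits


if __name__ == "__main__":
    print(find_suspect_chars("fine text\x1fwith a stray control char"))
```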

The other problem I have found is BBCode that doesn't do anything, like [ right ][/ right ], which probably ought to have been removed by the conversion or by a settings change, but wasn't. Then there are odd things like BBCode closing tags that don't seem to have an opening tag. I'm not certain, but it looks like some or all opening [ quote ] tags have been lost too. It looks like the converter tried to remove some tags but only got half of it right, leaving things like [/ b ]. One last problem is BBCode that nothing gets done with, like [ snapback ][/ snapback ]. All of those issues are visible here:

http://www.livebusinesschat.com/smf/index.php/topic,9.0/all.html
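To track down the orphaned tags, something like the following Python sketch could list closing tags with no opener and openers that are never closed. The name `find_unbalanced_tags` is made up, the tag pattern is a simplification of real BBCode, and tags that legitimately have no closing form (for example [hr]) would be reported as unclosed, so the output needs a manual look.

```python
import re

# Simplified BBCode tag pattern: [name], [name=value], [name attr=value], [/name].
TAG_RE = re.compile(r"\[(/?)([a-zA-Z]+)(?:[ =][^\]]*)?\]")


def find_unbalanced_tags(body: str) -> list[str]:
    """Report closing tags without an opener and openers that are never closed."""
    stack = []      # names of currently open tags, innermost last
    problems = []
    for match in TAG_RE.finditer(body):
        is_closing = match.group(1) == "/"
        name = match.group(2).lower()
        if not is_closing:
            stack.append(name)
        elif name in stack:
            # Unwind to the matching opener; anything skipped was left open.
            while stack:
                top = stack.pop()
                if top == name:
                    break
                problems.append(f"unclosed [{top}]")
        else:
            problems.append(f"stray [/{name}]")
    problems.extend(f"unclosed [{name}]" for name in reversed(stack))
    return problems


if __name__ == "__main__":
    print(find_unbalanced_tags("bold text[/b] and [quote]a quote with no end"))
```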

How do I get all of these problems fixed? I managed to fix the < and > HTML entities by hand, and it doesn't appear that I accidentally broke any of the <br /> tags that SMF is gratuitously littering everywhere. There weren't many occurrences, though, and all of the affected posts came from vBulletin, so there were no competing < and > characters to interfere with find and replace. I'm not sure how to fix these new problems, since I'm not sure what's going on with them.

Can I add here that, after this experience, I have discovered that \r\n is much preferable to HTML tags in the stored data, since it is SO MUCH easier to find and fix problems without HTML everywhere. The HTML ought to be added after reading the data from the database.

Red G. Brown:
Any new info on this?
