As a lot of people are discovering, AOL mangles the XML files so that
Palantir cannot import them.
Thanks to some volunteers from my team, I've worked out what exactly
is happening to the files, and I have a workaround which will allow
people to edit their XML files so that Palantir can import them. I've
only worked out exactly how to do this on Windows, so maybe some Mac
techie can work out how to do it there. Basically, we want to insert
the bytes FEFF at the very beginning of the XML file.
Unfortunately, this edit prevents Automagic from importing the XML
files, so make sure you either import your XML into Automagic before
doing this, or make a backup copy for use with Automagic first!
To make XML files downloaded through AOL usable by Palantir:
1) Open the XML file with Wordpad. Do NOT use Notepad.
You should see a bunch of undisplayable characters before each letter.
2) At the very beginning of the file, before the first undisplayable
character, use the numeric keypad to enter two special characters.
To do this, hold down alt, then type in on the numeric keypad
"0254". Then let up alt. You should see a thorn character (looking
rather like a P with the loop halfway up the shaft rather than at the
top of the shaft) appear. After the thorn, hold down alt again and
type "0255". This time you should get a lower case y with two dots
over it.
3) Save the file. At this point, Palantir should be able to read it.
Don't do this for files that didn't come through AOL, or for files
that don't show the boxes indicating undisplayable characters before
each letter.
You may not be able to load the file in Wordpad anymore - when I do
this it allows Wordpad to recognize the encoding as UTF-16, which it
knows it can't handle. So it's hard to go back.
For those interested in the technical details - what's happening is
that for some reason AOL is changing the encoding of the XML document
from UTF-8 to UTF-16. However, they're omitting the byte order marker
which is supposed to begin any UTF-16 XML document. The process above
adds that byte order marker, which is all Palantir needs to identify
the document as UTF-16 rather than UTF-8. Unfortunately, Automagic
gets thrown off by the byte order marker, and can't import the XML
anymore.
Hope this proves helpful to those stuck with AOL. If anyone is
interested I can also describe how to convert AOL files using Emacs,
my editor of choice.
-Peter