Stream: implementers
Topic: Encoding in XML
Grahame Grieve (Nov 29 2017 at 10:37):
The Specification is very conservative with non-ascii characters when creating XML files. Although the files are all UTF-8, I also encode any character with a unicode code point over 127 with an XML escape character. @Lin Zhang queries whether this is really necessary. I did it because there were some tools that didn't parse UTF-8 correctly a few years back, but I don't remember which ones, and I suppose they are fixed anyway. Does anyone have any comment about this?
Lin Zhang (Nov 29 2017 at 12:18):
XML supports Chinese characters directly other than using "&#x"-like encodings. These encodings might prevent normal displaying and searching. At least for some tools, such as notepad and XMLSpy. But I am not very familiar with such a character set issue and don't know how to solve it.
Vadim Peretokin (Nov 30 2017 at 13:48):
Are there sample files available that we can test with?
Grahame Grieve (Nov 30 2017 at 20:33):
build.fhir.org/codesystem-resource-types.xml
Vadim Peretokin (Dec 01 2017 at 09:38):
Simplifier has no issues with it, in fact it stores it unencoded and if you download the file back, all of the characters are unencoded. Oxygen doesn't render it correctly however:
Last updated: Apr 12 2022 at 19:14 UTC