文字化け

The word of the day is Mojibake, and I am not happy about it.

The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)

For completeness: Joel on Unicode.

PHP and Unicode

For future reference: writing Unicode applications with PHP.

On the Goodness of Unicode

Tim Bray’s introduction to Unicode
Link

Internationalization Weblog

Fieldmethods found a weblog on internationalization.

Why Unicode Won’t Work on the Internet

To read closer: the problem is that the Unicode space, around 64,000 code points, is smaller than the 170,000 characters used in modern and ancient Chinese (mainland, RoC, and expatriate communities).

Named Character Elements for XML

XML defines only a few entities, so if you want to use € you have have a definition for it in a DTD. Tony Coates and Zarella Rendon propose a non-DTD way around the problem for entities in text nodes using an XSLT transform library.
This is the first entry in a new category, I18N. I’ve [...]