<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Evils of java.lang.String.toUpperCase</title>
	<atom:link href="http://www.brosinski.com/stephan/2008/02/09/the-evils-of-javalangstringtouppercase/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.brosinski.com/stephan/2008/02/09/the-evils-of-javalangstringtouppercase/</link>
	<description></description>
	<lastBuildDate>Mon, 08 Jun 2009 10:13:18 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
	<item>
		<title>By: Thomas Jacob</title>
		<link>http://www.brosinski.com/stephan/2008/02/09/the-evils-of-javalangstringtouppercase/comment-page-1/#comment-19</link>
		<dc:creator>Thomas Jacob</dc:creator>
		<pubDate>Sat, 17 May 2008 10:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://brosinski.com/stephan/2008/02/09/the-evils-of-javalangstringtouppercase/#comment-19</guid>
		<description>The whole thing is even worse. length() is not the number of characters, but the number of 16-bit-numbers using the UTF-16 encoding. For most cases, this is the same, but traditional chinese characters sometimes require two UTF-16 numbers. The same holds for charAt etc. You always get the UTF-16 encoding.

The other day, there was a long discussion on how to introduce the characters above 64k. How to extend Java. 7 approaches have been dropped due to various reasons. Finally, they chose this one.

http://java.sun.com/developer/technicalArticles/Intl/Supplementary/

If you want to have the correct length of a string, use methods like codePointCount().</description>
		<content:encoded><![CDATA[<p>The whole thing is even worse. length() is not the number of characters, but the number of 16-bit-numbers using the UTF-16 encoding. For most cases, this is the same, but traditional chinese characters sometimes require two UTF-16 numbers. The same holds for charAt etc. You always get the UTF-16 encoding.</p>
<p>The other day, there was a long discussion on how to introduce the characters above 64k. How to extend Java. 7 approaches have been dropped due to various reasons. Finally, they chose this one.</p>
<p><a href="http://java.sun.com/developer/technicalArticles/Intl/Supplementary/" rel="nofollow">http://java.sun.com/developer/technicalArticles/Intl/Supplementary/</a></p>
<p>If you want to have the correct length of a string, use methods like codePointCount().</p>
]]></content:encoded>
	</item>
</channel>
</rss>

