15 Oct
2013
3:05 a.m.
On 15/10/2013 04:03, Adam P. Goucher wrote:
Just out of interest, what's the policy on UTF-8 (extended ASCII)? In particular, can everyone read the following sentence?
`Professor Béla Bollobás has an Erdös number of (¾ + ½²).'
There should be two acute accents, one umlaut (since Hungarian double acute accents are absent from UTF-8), two rationals and a square. Everything else is ASCII.
The fourth letter of Erdős's name is in the Unicode character set. It is represented by a single code point, at position 337 (decimal), which in the UTF-8 encoding becomes two octets, 0xC5 0x91.
-- g
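For anyone who wants to check the figures above, here is a minimal Python 3 sketch. It is only an illustration, not part of the original exchange; the variable names are made up for this example, and the character is written as the escape `\u0151` (337 in hexadecimal) so that the snippet itself stays pure ASCII.

```python
import unicodedata

# The fourth letter of "Erdős", written as an escape so this file is ASCII-only.
ch = "\u0151"

code_point = ord(ch)               # position in the Unicode character set
utf8_octets = ch.encode("utf-8")   # the octets that go over the wire in UTF-8
name = unicodedata.name(ch)        # official Unicode character name

# Redo the two-octet UTF-8 encoding by hand for code points in 0x80..0x7FF:
# the top 5 bits go into 110xxxxx, the low 6 bits into 10xxxxxx.
first = 0b11000000 | (code_point >> 6)
second = 0b10000000 | (code_point & 0b00111111)
manual_octets = bytes([first, second])

print(code_point)                          # 337
print([hex(b) for b in utf8_octets])       # ['0xc5', '0x91']
print(name)                                # LATIN SMALL LETTER O WITH DOUBLE ACUTE
print(manual_octets == utf8_octets)        # True
```

The manual step is there only to show where 0xC5 0x91 comes from; in practice `str.encode("utf-8")` does the same job.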