The specification for decoding UTF-8 sequences (RFC 3629):
http://www.faqs.org/rfcs/rfc3629.html
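
As a rough illustration of what the specification requires of a decoder, here is a minimal sketch in Python. The function name decode_utf8 and the choice to raise ValueError on malformed input are assumptions made for this example, not part of the RFC. The byte ranges, the rejection of overlong forms and surrogates, and the U+10FFFF upper limit follow RFC 3629.

```python
def decode_utf8(data: bytes) -> list:
    """Decode UTF-8 bytes into a list of Unicode code points.

    Hypothetical helper for illustration; raises ValueError on malformed input.
    """
    code_points = []
    i = 0
    n = len(data)
    while i < n:
        b0 = data[i]
        # The lead byte determines the sequence length, the payload bits it
        # carries, and the smallest code point that length may legally encode.
        if b0 < 0x80:
            length, cp, min_cp = 1, b0, 0x0
        elif 0xC2 <= b0 <= 0xDF:
            length, cp, min_cp = 2, b0 & 0x1F, 0x80
        elif 0xE0 <= b0 <= 0xEF:
            length, cp, min_cp = 3, b0 & 0x0F, 0x800
        elif 0xF0 <= b0 <= 0xF4:
            length, cp, min_cp = 4, b0 & 0x07, 0x10000
        else:
            # 0x80..0xBF (stray continuation), 0xC0, 0xC1, and 0xF5..0xFF
            # can never start a valid sequence.
            raise ValueError(f"invalid lead byte 0x{b0:02X} at offset {i}")

        if i + length > n:
            raise ValueError(f"truncated sequence at offset {i}")

        # Each continuation byte must look like 10xxxxxx and adds 6 bits.
        for j in range(1, length):
            b = data[i + j]
            if b & 0xC0 != 0x80:
                raise ValueError(f"invalid continuation byte at offset {i + j}")
            cp = (cp << 6) | (b & 0x3F)

        if cp < min_cp:
            raise ValueError(f"overlong encoding at offset {i}")
        if 0xD800 <= cp <= 0xDFFF:
            raise ValueError(f"surrogate code point at offset {i}")
        if cp > 0x10FFFF:
            raise ValueError(f"code point above U+10FFFF at offset {i}")

        code_points.append(cp)
        i += length
    return code_points


if __name__ == "__main__":
    # "A", the euro sign, and a 4-byte character (U+10348).
    print([hex(cp) for cp in decode_utf8("A\u20ac\U00010348".encode("utf-8"))])
```

In practice Python's built-in bytes.decode("utf-8") already performs this validation; the sketch only spells the rules out.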