unicode(str, "utf-8") and str.encode("utf-8")

the key is, unicode in Python is an object, unicode(str, “utf-8″) makes that object from an utf-9 str, and str.encode(”utf-8″) encode a string to the utf-8 encoding.

To write unicode-aware python code, I’ll need to:

  • when getting data, use unicode(str, “the_encoding”) to get an unicode object
  • use unicode object inside my program, like all internal strings should be u”some_thing”
  • when output, convert the unicode object to whatever fits, that is, use str.encode(”the_encoding”)

Good reference:

Tags: , ,

Post a Comment

You could use <code type="name"> to get your code colorized

Your email is never published nor shared. Required fields are marked *

Close
E-mail It