Regular Expressions Python - unicode to plain text -

April 15, 2012

anyone know simple way following?

given unicode string containing 'usa' in chinese (美国), japanese (米国), or korean (미국), write function returns plain python byte string in international version of 'usa' translated 'usa' in english. example: translate_usa(u'美国 country.') should return 'usa country.'

you chain str.replace or use regex matches of 3 strings.

>>> import re >>> usa = (u'ç¾Žå›½', u'ç±³å›½', u'ë¯¸êµ') >>> re.sub('|'.join(usa), 'usa', u'ç¾Žå›½ country.') u'usa country.'

Search This Blog

Two

Regular Expressions Python - unicode to plain text -

Comments

Post a Comment

Popular posts from this blog

get url and add instance to a model with prefilled foreign key :django admin -

android - Keyboard hides my half of edit-text and button below it even in scroll view -

css - Make div keyboard-scrollable in jQuery Mobile? -