Does url_encoding interprete correclty utf8 greek characters?

Andreas Ericsson ae at op5.se
Fri Jul 11 10:43:25 CEST 2008


Tsatsaros Tryfon wrote:
> This may be so.  However my issue is that url_encoding might have problems 
> encoding multibyte utf-8 greek characters even if called once.
> This may be because url_encoding interpretes its input buffer bytes as signed 
> characters, and so, characters having the high bit of their high nibble set 
> (that is 8X, 9X...,FX) are interpreted as negative and thus < 0x20 and 
> consequently are covnerted to '+' in the first place. Moreover no special care 
> is taken for multibyte characters whose 1st byte may be < 0x20 even if positive 
> and as such also converted to '+'.
> 

imo, this isn't worth spending time on, since a PHP-based GUI will be released
soon.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

-------------------------------------------------------------------------
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08




More information about the Developers mailing list