Re: strcasecmp/tolower/toupper breakage

From: Owen Taylor <otaylor redhat com>
To: kevin lyda <kevin suberic net>
Cc: Alan Cox <alan redhat com>, Havoc Pennington <hp redhat com>, George <jirka 5z com>, gnome-devel-list gnome org, gnome-hackers gnome org
Subject: Re: strcasecmp/tolower/toupper breakage
Date: 09 May 2001 10:18:59 -0400

kevin lyda <kevin suberic net> writes:

> On Fri, May 04, 2001 at 10:07:45AM -0400, Alan Cox wrote:
> > > utf8_strcasecmp() is pretty easy to implement using unichar_tolower(),
> > > if you don't change its behavior according to locale.
> > The traditional utf8 'oh my god' appears to be regexps...
> 
> ok, i'm an ascii weenie[0] so i'm out of my depth, but doesn't perl handle
> unicode now - and by extension have regexps working?

As Havoc pointed out, Perl's regex code is completely unextractable.

Beyond that, Perl's Unicode regex support has a lot of bugs in 5.6
(they may have been fixed for 5.8). I started porting pcre to UTF-8
at one point, intending to copy Perl's behavior, but quickly ran
into misbehavior deep in Perl's regex optimizer...

Regards,
                                        Owen
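
For illustration, the locale-independent utf8_strcasecmp() idea quoted
above could be sketched roughly as follows. This is only a sketch under
stated assumptions, not code from any library discussed in the thread:
the names decode_utf8(), sketch_unichar_tolower() and
utf8_strcasecmp_sketch() are hypothetical, the case mapping is a tiny
stand-in that covers only ASCII and Latin-1 (a real unichar_tolower()
would consult the full Unicode tables), and the decoder does only
minimal validation.

```c
#include <stdint.h>

/* Decode one UTF-8 sequence at *s and advance *s past it.
 * Minimal validation only: a bad lead or continuation byte
 * yields U+FFFD. Hypothetical helper, for illustration. */
static uint32_t decode_utf8(const unsigned char **s)
{
    const unsigned char *p = *s;
    uint32_t c = *p++;
    int extra = 0;

    if (c < 0x80)                { extra = 0; }
    else if ((c & 0xE0) == 0xC0) { c &= 0x1F; extra = 1; }
    else if ((c & 0xF0) == 0xE0) { c &= 0x0F; extra = 2; }
    else if ((c & 0xF8) == 0xF0) { c &= 0x07; extra = 3; }
    else { *s = p; return 0xFFFD; }

    while (extra-- > 0) {
        /* Stops safely at a NUL, since NUL is not a continuation byte. */
        if ((*p & 0xC0) != 0x80) { *s = p; return 0xFFFD; }
        c = (c << 6) | (uint32_t)(*p++ & 0x3F);
    }
    *s = p;
    return c;
}

/* Stand-in for unichar_tolower(): ASCII and Latin-1 only (assumption).
 * It never looks at the current locale. */
static uint32_t sketch_unichar_tolower(uint32_t c)
{
    if (c >= 'A' && c <= 'Z')
        return c + 32;
    /* U+00C0..U+00DE are uppercase, except U+00D7 (multiplication sign). */
    if (c >= 0xC0 && c <= 0xDE && c != 0xD7)
        return c + 32;
    return c;
}

/* Compare two NUL-terminated UTF-8 strings code point by code point,
 * ignoring case as defined by sketch_unichar_tolower().
 * Returns <0, 0 or >0 like strcasecmp(). */
int utf8_strcasecmp_sketch(const char *a, const char *b)
{
    const unsigned char *pa = (const unsigned char *)a;
    const unsigned char *pb = (const unsigned char *)b;

    for (;;) {
        if (*pa == 0 || *pb == 0)
            return (*pa != 0) - (*pb != 0);

        uint32_t ca = sketch_unichar_tolower(decode_utf8(&pa));
        uint32_t cb = sketch_unichar_tolower(decode_utf8(&pb));
        if (ca != cb)
            return ca < cb ? -1 : 1;
    }
}
```

Because the mapping is a fixed table rather than a call into the C
library's tolower(), the result does not change with the process locale,
which is the property the quoted suggestion depends on. Simple per-code-point
lowercasing like this does not handle mappings that change string
length, so it should be read as a starting point only.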

References:
- Re: strcasecmp/tolower/toupper breakage
  - From: Havoc Pennington
- Re: strcasecmp/tolower/toupper breakage
  - From: Alan Cox
- Re: strcasecmp/tolower/toupper breakage
  - From: kevin lyda
