00503: Searching for utf-8 characters doesn't work

Summary: Searching for utf-8 characters doesn't work
Created: 2005-09-01 11:07
Status: Closed : fixed in PmWiki 2.2.x beta versions
Category: Bug
From: floozy
Assigned:
Priority: 51
Version: pmwiki-2.0.beta55

Description: Entering non-iso-8859-1 characters (i.e. ☻ or ☼) into a wiki page and then searching pmwiki for these characters won't return any results.

What about using utf-8 as the default file encoding, so utf-8 characters won't have to be converted to entities?


Yes, this is a bug. It may take a while to fix. It might not be fixed.

I've often thought about using utf-8 as the default file encoding, but utf-8 support in PHP isn't all there yet, especially in regular expressions (which PmWiki uses a lot of). In particular, it's not possible to do reliably test for upper/lowercase characters or perform case conversions in PHP 4.

At some point in the future we may switch things to utf-8 encoding by default, but at the moment the language support in PHP isn't quite there or universally available to be able to set it as the default.

--Pm