diff --git a/README.md b/README.md index db66558..99adb30 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,7 @@ # PostgreSQL ASCII folding -Reasonably fast ASCII folding functions (based on [Lucene asciifolding filter](https://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html)) for PostgreSQL +Reasonably fast (tested on Musicbrainz dataset, is 40% faster than a simple `UPPER()`) +ASCII folding functions based on [Lucene's ASCIIFoldingFilter](https://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html) for PostgreSQL *Example:* ``` @@ -9,6 +10,12 @@ postgres=# SELECT asciifold('Hello, ⒩ᴐⱤú⒴⁈~!'); ---------------------- Hello, (n)ORu(y)?!~! (1 row) + +postgres=# SELECT asciifold_lower('Hello, ⒩ᴐⱤú⒴⁈~!'); + asciifold +---------------------- + hello, (n)oru(y)?!~! +(1 row) ``` UTF8 input string is not sanitized (invalid UTF8 might lead to undefined behavior)