tool.lucene
Class AccentFilter
public
class
AccentFilter
extends TokenFilter
Produce an additional 7-bit ASCII version of Term,
since on some systems it's awkward to type in accents.
Keeps characters numbered 0..127 as is,
drops accents on letters (e.g., "À" => "A")
and translates various other symbols (e.g., "©" => "(C)"),
and drops other characters not in translation tables.
Version: $Revision: 1.3 $ $Date: 2003/06/01 07:33:52 $
public AccentFilter(TokenStream in)
public Token next()