Name | Description | Type | Package | Framework |
DefaultICUTokenizerConfig | Default ICUTokenizerConfig that is generally applicable Generally tokenizes Unicode text according to UAX#29 | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
ICUTokenizer | Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
ICUTokenizerConfig | Class that allows for tailored Unicode Text Segmentation on a per-writing system basis. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
ICUTokenizerFactory | Factory for ICUTokenizer. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |