Name | Description | Type | Package | Framework |
AnalyzerProfile | Manages analysis data configuration for SmartChineseAnalyzer. SmartChineseAnalyzer has a built-in dictionary and stopword list out of the box. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
CharType | Internal SmartChineseAnalyzer character type constants. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
HHMMSegmenter | Finds the optimal segmentation of a sentence into Chinese words. WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
HMMChineseTokenizer | Tokenizer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
HMMChineseTokenizerFactory | Factory for HMMChineseTokenizer. Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
SegToken | SmartChineseAnalyzer internal token. WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
SegTokenFilter | Filters a SegToken by converting full-width Latin characters to half-width, then lowercasing Latin characters. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
SentenceTokenizer | Tokenizes input text into sentences. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
SmartChineseAnalyzer | SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
SmartChineseSentenceTokenizerFactory | Factory for the SmartChineseAnalyzer SentenceTokenizer. WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
SmartChineseWordTokenFilterFactory | Factory for the SmartChineseAnalyzer WordTokenFilter. Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
Utility | SmartChineseAnalyzer utility constants and methods. WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
WordTokenFilter | A TokenFilter that breaks sentences into words. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
WordType | Internal SmartChineseAnalyzer token type constants. WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
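
The SegTokenFilter entry above describes a two-step normalization: full-width Latin characters are converted to half-width, then Latin letters are lowercased. The sketch below illustrates that idea using only the Java standard library. It is not Lucene's implementation: the class `FullWidthNormalizer` and its `normalize` method are hypothetical names introduced here, and the sketch assumes the full-width forms are the Unicode block U+FF01 to U+FF5E, which sits at a fixed offset of 0xFEE0 from ASCII U+0021 to U+007E.

```java
// Hypothetical illustration only; not part of Apache Lucene.
class FullWidthNormalizer {

    // Offset between a full-width form (U+FF01..U+FF5E) and its ASCII counterpart.
    private static final int FULL_WIDTH_OFFSET = 0xFEE0;

    // Converts full-width ASCII variants to half-width, then lowercases A-Z.
    // All other characters (for example Chinese text) pass through unchanged.
    static String normalize(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c >= 0xFF01 && c <= 0xFF5E) {
                c = (char) (c - FULL_WIDTH_OFFSET); // step 1: full-width to half-width
            }
            if (c >= 'A' && c <= 'Z') {
                c = (char) (c + ('a' - 'A'));       // step 2: lowercase Latin letters
            }
            out.append(c);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Input is the word LUCENE written with full-width capital letters.
        String fullWidth = "\uFF2C\uFF35\uFF23\uFF25\uFF2E\uFF25";
        System.out.println(normalize(fullWidth)); // prints "lucene"
    }
}
```

Normalizing this way means a query typed with half-width lowercase letters can match text that was written with full-width or uppercase forms, which is the apparent motivation for the filter described in the table.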