Name | Description | Type | Package | Framework |
Analyzer | An Analyzer builds TokenStreams, which analyze text. | Class | org.apache.lucene.analysis | Apache Lucene |
|
Analyzer .ReuseStrategy | Strategy defining how TokenStreamComponents are reused per call to Analyzer. | Class | org.apache.lucene.analysis.Analyzer | Apache Lucene |
|
Analyzer .TokenStreamComponents | This class encapsulates the outer components of a token stream. | Class | org.apache.lucene.analysis.Analyzer | Apache Lucene |
|
AnalyzerWrapper | Extension to Analyzer suitable for Analyzers which wrap getWrappedAnalyzer(String) allows the Analyzer | Class | org.apache.lucene.analysis | Apache Lucene |
|
BytesTermAttribute | This attribute can be used if you have the raw term bytes to be indexed. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
BytesTermAttributeImpl | NOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
CachingTokenFilter | This class can be used if the token attributes of a TokenStream are intended to be consumed more than once. | Class | org.apache.lucene.analysis | Apache Lucene |
|
CharFilter | Subclasses of CharFilter can be chained to filter a Reader They can be used as Reader with additional offset | Class | org.apache.lucene.analysis | Apache Lucene |
|
CharTermAttribute | The term text of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
CharTermAttributeImpl | Default implementation of CharTermAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
DelegatingAnalyzerWrapper | An analyzer wrapper, that doesn't allow to wrap components or readers. | Class | org.apache.lucene.analysis | Apache Lucene |
|
FlagsAttribute | This attribute can be used to pass different flags down the Tokenizer chain, e. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
FlagsAttributeImpl | Default implementation of FlagsAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
KeywordAttribute | This attribute can be used to mark a token as a keyword. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
KeywordAttributeImpl | Default implementation of KeywordAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
NumericTokenStream | Expert: This class provides a TokenStream for indexing numeric values that can be used by NumericRangeQuery. | Class | org.apache.lucene.analysis | Apache Lucene |
|
NumericTokenStream .NumericTermAttribute | Expert: Use this attribute to get the details of the currently generated token. | Interface | org.apache.lucene.analysis.NumericTokenStream | Apache Lucene |
|
NumericTokenStream .NumericTermAttributeImpl | | Class | org.apache.lucene.analysis.NumericTokenStream | Apache Lucene |
|
OffsetAttribute | The start and end character offset of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
OffsetAttributeImpl | Default implementation of OffsetAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PackedTokenAttributeImpl | Default implementation of the common attributes used by Lucene:CharTermAttribute PositionIncrementAttribute | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PayloadAttribute | The payload of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PayloadAttributeImpl | Default implementation of PayloadAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PositionIncrementAttribute | Determines the position of this token relative to the previous Token in a TokenStream, used in phrase | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PositionIncrementAttributeImpl | Default implementation of PositionIncrementAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PositionLengthAttribute | Determines how many positions this token spans. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
PositionLengthAttributeImpl | Default implementation of PositionLengthAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
TermToBytesRefAttribute | This attribute is requested by TermsHashPerField to index the contents. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
Token | A Token is an occurrence of a term from the text of a field. | Class | org.apache.lucene.analysis | Apache Lucene |
|
TokenFilter | A TokenFilter is a TokenStream whose input is another TokenStream. | Class | org.apache.lucene.analysis | Apache Lucene |
|
Tokenizer | A Tokenizer is a TokenStream whose input is a Reader. | Class | org.apache.lucene.analysis | Apache Lucene |
|
TokenStream | A TokenStream enumerates the sequence of tokens, either from Fields of a Document or from query text. | Class | org.apache.lucene.analysis | Apache Lucene |
|
TokenStreamToAutomaton | Consumes a TokenStream and creates an Automaton where the transition labels are UTF8 bytes (or Unicode | Class | org.apache.lucene.analysis | Apache Lucene |
|
TypeAttribute | A Token's lexical type. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
TypeAttributeImpl | Default implementation of TypeAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
AbstractAnalysisFactory | Abstract parent class for analysis factories TokenizerFactory, TokenFilterFactory and CharFilterFactory. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
AbstractEncoder | Base class for payload encoders. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
AnalysisSPILoader | Helper class for loading named SPIs from classpath (e. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
ApostropheFilter | Strips all characters after an apostrophe (including the apostrophe itself). | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
ApostropheFilterFactory | Factory for ApostropheFilter. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
ArabicAnalyzer | Analyzer for Arabic. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicNormalizationFilter | A TokenFilter that applies ArabicNormalizer to normalize the orthography. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicNormalizationFilterFactory | Factory for ArabicNormalizationFilter. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicNormalizer | Normalizer for Arabic. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicStemFilter | A TokenFilter that applies ArabicStemmer to stem Arabic words. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicStemFilterFactory | Factory for ArabicStemFilter. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArabicStemmer | Stemming is done in-place for efficiency, operating on a termbuffer. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
ArmenianAnalyzer | Analyzer for Armenian. | Class | org.apache.lucene.analysis.hy | Apache Lucene |
|
ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the Basic Latin Unicode | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ASCIIFoldingFilterFactory | Factory for ASCIIFoldingFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
BaseCharFilter | Base utility class for implementing a CharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
BasqueAnalyzer | Analyzer for Basque. | Class | org.apache.lucene.analysis.eu | Apache Lucene |
|
BrazilianAnalyzer | Analyzer for Brazilian Portuguese language. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
BrazilianStemFilter | A TokenFilter that applies BrazilianStemmer. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
BrazilianStemFilterFactory | Factory for BrazilianStemFilter. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
BrazilianStemmer | A stemmer for Brazilian Portuguese words. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
BulgarianAnalyzer | Analyzer for Bulgarian. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
BulgarianStemFilter | A TokenFilter that applies BulgarianStemmer to stem Bulgarian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
BulgarianStemFilterFactory | Factory for BulgarianStemFilter. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
BulgarianStemmer | Light Stemmer for Bulgarian. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
ByteVector | This class implements a simple byte vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
CapitalizationFilter | A filter to apply normal capitalization rules to Tokens. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
CapitalizationFilterFactory | Factory for CapitalizationFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
CatalanAnalyzer | Analyzer for Catalan. | Class | org.apache.lucene.analysis.ca | Apache Lucene |
|
CharacterUtils | CharacterUtils provides a unified interface to Character-related operations to implement backwards compatible character operations based on a | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharacterUtils .CharacterBuffer | A simple IO buffer to use with CharacterUtils. | Class | org.apache.lucene.analysis.util.CharacterUtils | Apache Lucene |
|
CharArrayIterator | A CharacterIterator used internally for use with BreakIteratorNOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharArrayMap | A simple class that stores key Strings as char[]'s in a hash table. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharArraySet | A simple class that stores Strings as char[]'s in a hash table. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharFilterFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharTokenizer | An abstract base class for simple, character-oriented tokenizers. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CharVector | This class implements a simple char vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
CJKAnalyzer | An Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
CJKBigramFilter | Forms bigrams of CJK terms that are generated from StandardTokenizer CJK types are set by these tokenizers, but you can also use | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
CJKBigramFilterFactory | Factory for CJKBigramFilter. | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
CJKWidthFilter | A TokenFilter that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
CJKWidthFilterFactory | Factory for CJKWidthFilter. | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
ClassicAnalyzer | Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
ClassicFilter | Normalizes tokens extracted with ClassicTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
ClassicFilterFactory | Factory for ClassicFilter. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
ClassicTokenizer | A grammar-based tokenizer constructed with JFlex This should be a good tokenizer for most European-language documents: | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
ClassicTokenizerFactory | Factory for ClassicTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
ClasspathResourceLoader | Simple ResourceLoader that uses ClassLoader. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
CodepointCountFilter | Removes words that are too long or too short from the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
CodepointCountFilterFactory | Factory for CodepointCountFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
CommonGramsFilter | Construct bigrams for frequently occurring terms while indexing. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
CommonGramsFilterFactory | Constructs a CommonGramsFilter. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
CommonGramsQueryFilter | Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
CommonGramsQueryFilterFactory | Construct CommonGramsQueryFilter. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
CompoundWordTokenFilterBase | Base class for decomposition token filters. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
CustomAnalyzer | A general-purpose Analyzer that can be created with a builder-style API. | Class | org.apache.lucene.analysis.custom | Apache Lucene |
|
CustomAnalyzer .Builder | Builder for CustomAnalyzer. | Class | org.apache.lucene.analysis.custom.CustomAnalyzer | Apache Lucene |
|
CzechAnalyzer | Analyzer for Czech language. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
CzechStemFilter | A TokenFilter that applies CzechStemmer to stem Czech words. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
CzechStemFilterFactory | Factory for CzechStemFilter. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
CzechStemmer | Light Stemmer for Czech. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
DanishAnalyzer | Analyzer for Danish. | Class | org.apache.lucene.analysis.da | Apache Lucene |
|
DateRecognizerFilter | Filters all tokens that cannot be parsed to a date, using the provided DateFormat. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
DateRecognizerFilterFactory | Factory for DateRecognizerFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
DecimalDigitFilter | Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9). | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
DecimalDigitFilterFactory | Factory for DecimalDigitFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
DelimitedPayloadTokenFilter | Characters before the delimiter are the "token", those after are the payload. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
DelimitedPayloadTokenFilterFactory | Factory for DelimitedPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
Dictionary | In-memory structure for the dictionary (. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
DictionaryCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
DictionaryCompoundWordTokenFilterFactory | Factory for DictionaryCompoundWordTokenFilter. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
DutchAnalyzer | Analyzer for Dutch language. | Class | org.apache.lucene.analysis.nl | Apache Lucene |
|
EdgeNGramFilterFactory | Creates new instances of EdgeNGramTokenFilter. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
EdgeNGramTokenizer | Tokenizes the input from an edge into n-grams of given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
EdgeNGramTokenizerFactory | Creates new instances of EdgeNGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
ElisionFilter | Removes elisions from a TokenStream. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
ElisionFilterFactory | Factory for ElisionFilter. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
EmptyTokenStream | An always exhausted token stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
EnglishAnalyzer | Analyzer for English. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
EnglishMinimalStemFilter | A TokenFilter that applies EnglishMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
EnglishMinimalStemFilterFactory | Factory for EnglishMinimalStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
EnglishMinimalStemmer | Minimal plural stemmer for English. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
EnglishPossessiveFilter | TokenFilter that removes possessives (trailing 's) from words. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
EnglishPossessiveFilterFactory | Factory for EnglishPossessiveFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
FilesystemResourceLoader | Simple ResourceLoader that opens resource files from the local file system, optionally resolving against | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
FilteringTokenFilter | Abstract base class for TokenFilters that may remove tokens. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
FingerprintFilter | Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
FingerprintFilterFactory | Factory for FingerprintFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
FinnishAnalyzer | Analyzer for Finnish. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
FinnishLightStemFilter | A TokenFilter that applies FinnishLightStemmer to stem Finnish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
FinnishLightStemFilterFactory | Factory for FinnishLightStemFilter. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
FinnishLightStemmer | Light Stemmer for Finnish. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
FloatEncoder | Encode a character array Float as a BytesRef. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
FrenchAnalyzer | Analyzer for French language. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchLightStemFilter | A TokenFilter that applies FrenchLightStemmer to stem French To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchLightStemFilterFactory | Factory for FrenchLightStemFilter. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchLightStemmer | Light Stemmer for French. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchMinimalStemFilter | A TokenFilter that applies FrenchMinimalStemmer to stem French To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchMinimalStemFilterFactory | Factory for FrenchMinimalStemFilter. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
FrenchMinimalStemmer | Light Stemmer for French. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
GalicianAnalyzer | Analyzer for Galician. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianMinimalStemFilter | A TokenFilter that applies GalicianMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianMinimalStemFilterFactory | Factory for GalicianMinimalStemFilter. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianMinimalStemmer | Minimal Stemmer for Galician This follows the "RSLP-S" algorithm, but modified for Galician. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianStemFilter | A TokenFilter that applies GalicianStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianStemFilterFactory | Factory for GalicianStemFilter. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GalicianStemmer | Galician stemmer implementing "Regras do lematizador para o galego". | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
GermanAnalyzer | Analyzer for German language. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanLightStemFilter | A TokenFilter that applies GermanLightStemmer to stem German To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanLightStemFilterFactory | Factory for GermanLightStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanLightStemmer | Light Stemmer for German. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanMinimalStemFilter | A TokenFilter that applies GermanMinimalStemmer to stem German To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanMinimalStemFilterFactory | Factory for GermanMinimalStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanMinimalStemmer | Minimal Stemmer for German. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanNormalizationFilter | Normalizes German characters according to the heuristics German2 snowball algorithm. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanNormalizationFilterFactory | Factory for GermanNormalizationFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanStemFilter | A TokenFilter that stems German words. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanStemFilterFactory | Factory for GermanStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GermanStemmer | A stemmer for German words. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
GreekAnalyzer | Analyzer for the Greek language. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
GreekLowerCaseFilter | Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
GreekLowerCaseFilterFactory | Factory for GreekLowerCaseFilter. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
GreekStemFilter | A TokenFilter that applies GreekStemmer to stem Greek To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
GreekStemFilterFactory | Factory for GreekStemFilter. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
GreekStemmer | A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
HindiAnalyzer | | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiNormalizationFilter | A TokenFilter that applies HindiNormalizer to normalize the In some cases the normalization may cause unrelated terms to conflate, so | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiNormalizationFilterFactory | Factory for HindiNormalizationFilter. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiNormalizer | Normalizer for Hindi. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiStemFilter | A TokenFilter that applies HindiStemmer to stem Hindi words. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiStemFilterFactory | Factory for HindiStemFilter. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HindiStemmer | Light Stemmer for Hindi. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
HTMLStripCharFilter | A CharFilter that wraps another Reader and attempts to strip out HTML constructs. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
HTMLStripCharFilterFactory | Factory for HTMLStripCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
HungarianAnalyzer | Analyzer for Hungarian. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
HungarianLightStemFilter | A TokenFilter that applies HungarianLightStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
HungarianLightStemFilterFactory | Factory for HungarianLightStemFilter. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
HungarianLightStemmer | Light Stemmer for Hungarian. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
HunspellStemFilter | TokenFilter that uses hunspell affix rules and words to stem tokens. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
HunspellStemFilterFactory | TokenFilterFactory that creates instances of HunspellStemFilter. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
Hyphen | This class represents a hyphen. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
HyphenatedWordsFilterFactory | Factory for HyphenatedWordsFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
Hyphenation | This class represents a hyphenated word. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
HyphenationCompoundWordTokenFilterFactory | Factory for HyphenationCompoundWordTokenFilter. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
HyphenationTree | This tree structure stores the hyphenation patterns in an efficient way for fast lookup. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
IdentityEncoder | Does nothing other than convert the char array to a byte array using the specified encoding. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
IndicNormalizationFilter | A TokenFilter that applies IndicNormalizer to normalize text in Indian Languages. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
IndicNormalizationFilterFactory | Factory for IndicNormalizationFilter. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
IndicNormalizer | Normalizes the Unicode representation of text in Indian languages. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
IndonesianAnalyzer | | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
IndonesianStemFilter | A TokenFilter that applies IndonesianStemmer to stem Indonesian words. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
IndonesianStemFilterFactory | Factory for IndonesianStemFilter. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
IndonesianStemmer | Stemmer for Indonesian. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
IntegerEncoder | Encode a character array Integer as a BytesRef. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
IrishAnalyzer | | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
IrishLowerCaseFilter | Normalises token text to lower case, handling t-prothesis and n-eclipsis (i. | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
IrishLowerCaseFilterFactory | Factory for IrishLowerCaseFilter. | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
ItalianAnalyzer | Analyzer for Italian. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
ItalianLightStemFilter | A TokenFilter that applies ItalianLightStemmer to stem Italian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
ItalianLightStemFilterFactory | Factory for ItalianLightStemFilter. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
ItalianLightStemmer | Light Stemmer for Italian. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeepWordFilterFactory | Factory for KeepWordFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeywordAnalyzer | "Tokenizes" the entire stream as a single token. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeywordMarkerFilterFactory | Factory for KeywordMarkerFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeywordRepeatFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeywordRepeatFilterFactory | Factory for KeywordRepeatFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
KeywordTokenizer | Emits the entire input as a single token. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
KeywordTokenizerFactory | Factory for KeywordTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
KStemFilter | A high-performance kstem filter for english. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
KStemFilterFactory | Factory for KStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
KStemmer | | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
LatvianAnalyzer | Analyzer for Latvian. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
LatvianStemFilter | A TokenFilter that applies LatvianStemmer to stem Latvian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
LatvianStemFilterFactory | Factory for LatvianStemFilter. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
LatvianStemmer | Light stemmer for Latvian. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
LengthFilter | Removes words that are too long or too short from the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LengthFilterFactory | Factory for LengthFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
LetterTokenizerFactory | Factory for LetterTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
LimitTokenCountAnalyzer | This Analyzer limits the number of tokens while indexing. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenCountFilterFactory | Factory for LimitTokenCountFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenOffsetFilter | Lets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenOffsetFilterFactory | Factory for LimitTokenOffsetFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenPositionFilter | This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LimitTokenPositionFilterFactory | Factory for LimitTokenPositionFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
LithuanianAnalyzer | Analyzer for Lithuanian. | Class | org.apache.lucene.analysis.lt | Apache Lucene |
|
LowerCaseFilter | Normalizes token text to lower case. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
LowerCaseFilterFactory | Factory for LowerCaseFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
LowerCaseTokenizer | LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
LowerCaseTokenizerFactory | Factory for LowerCaseTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
Lucene43CompoundWordTokenFilterBase | Base class for decomposition token filters using pre-4. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
Lucene43DictionaryCompoundWordTokenFilter | | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
Lucene43EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s), using pre-4. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
Lucene43EdgeNGramTokenizer | Old version of EdgeNGramTokenizer which doesn't handle correctly supplementary characters. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
Lucene43EdgeNGramTokenizer .Side | Specifies which side of the input the n-gram should be generated fromGet the n-gram from the end of the input | Class | org.apache.lucene.analysis.ngram.Lucene43EdgeNGramTokenizer | Apache Lucene |
|
Lucene43FilteringTokenFilter | Backcompat FilteringTokenFilter for versions 4. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
Lucene43HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages, using pre-4. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
Lucene43KeepWordFilter | Backcompat for KeepWordFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
Lucene43LengthFilter | Backcompat LengthFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
Lucene43NGramTokenFilter | Tokenizes the input into n-grams of the given size(s), matching Lucene 4. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
Lucene43NGramTokenizer | Old broken version of NGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
Lucene43StopFilter | Backcompat StopFilter for versions 4. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
Lucene43TrimFilter | Backcompat TrimFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
Lucene43TypeTokenFilter | Backcompat TypeTokenFilter for versions 4. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
Lucene47WordDelimiterFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
MappingCharFilter | Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
MappingCharFilterFactory | Factory for MappingCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
MultiTermAwareComponent | Add to any analysis factory component to allow returning an analysis component factory for use with partial terms in prefix queries, | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
NGramFilterFactory | Factory for NGramTokenFilter. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
NGramTokenFilter | Tokenizes the input into n-grams of the given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
NGramTokenizer | Tokenizes the input into n-grams of the given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
NGramTokenizerFactory | Factory for NGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
NormalizeCharMap | Holds a map of String input to String output, to be used with MappingCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
NormalizeCharMap .Builder | Builds an NormalizeCharMap. | Class | org.apache.lucene.analysis.charfilter.NormalizeCharMap | Apache Lucene |
|
NorwegianAnalyzer | Analyzer for Norwegian. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianLightStemFilter | A TokenFilter that applies NorwegianLightStemmer to stem Norwegian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianLightStemFilterFactory | Factory for NorwegianLightStemFilter. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianLightStemmer | Light Stemmer for Norwegian. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianMinimalStemFilter | A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianMinimalStemFilterFactory | Factory for NorwegianMinimalStemFilter. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NorwegianMinimalStemmer | | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
NumericPayloadTokenFilter | Assigns a payload to a token based on the PackedTokenAttributeImpl. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
NumericPayloadTokenFilterFactory | Factory for NumericPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
OpenStringBuilder | A StringBuilder that allows one to access the array. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
PathHierarchyTokenizer | Tokenizer for path-like hierarchies. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
PathHierarchyTokenizerFactory | Factory for PathHierarchyTokenizer. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
PatternCaptureGroupFilterFactory | Factory for PatternCaptureGroupTokenFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternCaptureGroupTokenFilter | CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternConsumer | This interface is used to connect the XML pattern file parser to the This class has been taken from the Apache FOP project (http://xmlgraphics. | Interface | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
PatternKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
PatternParser | A SAX document handler to read and parse hyphenation patterns from a XML This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
PatternReplaceCharFilter | CharFilter that uses a regular expression for the target of replace string. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternReplaceCharFilterFactory | Factory for PatternReplaceCharFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternReplaceFilter | A TokenFilter which applies a Pattern to each token in the stream, replacing match occurances with the specified replacement string. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternReplaceFilterFactory | Factory for PatternReplaceFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternTokenizer | This tokenizer uses regex pattern matching to construct distinct tokens for the input stream. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PatternTokenizerFactory | Factory for PatternTokenizer. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
PayloadEncoder | Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to NOTE: This interface is subject to change | Interface | org.apache.lucene.analysis.payloads | Apache Lucene |
|
PayloadHelper | Utility methods for encoding payloads. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
PersianAnalyzer | Analyzer for Persian. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PersianCharFilter | | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PersianCharFilterFactory | Factory for PersianCharFilter. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PersianNormalizationFilter | | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PersianNormalizationFilterFactory | Factory for PersianNormalizationFilter. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PersianNormalizer | Normalizer for Persian. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
PorterStemFilterFactory | Factory for PorterStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
PortugueseAnalyzer | Analyzer for Portuguese. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseLightStemFilter | A TokenFilter that applies PortugueseLightStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseLightStemFilterFactory | Factory for PortugueseLightStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseLightStemmer | Light Stemmer for Portuguese This stemmer implements the "UniNE" algorithm in: | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseMinimalStemFilter | A TokenFilter that applies PortugueseMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseMinimalStemFilterFactory | Factory for PortugueseMinimalStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseMinimalStemmer | Minimal Stemmer for Portuguese This follows the "RSLP-S" algorithm presented in: | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseStemFilter | A TokenFilter that applies PortugueseStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseStemFilterFactory | Factory for PortugueseStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PortugueseStemmer | Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
PrefixAndSuffixAwareTokenFilter | Links two PrefixAwareTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
PrefixAwareTokenFilter | Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
QueryAutoStopWordAnalyzer | An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries. | Class | org.apache.lucene.analysis.query | Apache Lucene |
|
RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
RemoveDuplicatesTokenFilterFactory | Factory for RemoveDuplicatesTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ResourceLoader | Abstraction for loading resources (streams, files, and classes). | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
ResourceLoaderAware | Interface for a component that needs to be initialized by an implementation of ResourceLoader. | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
ReversePathHierarchyTokenizer | Tokenizer for domain-like hierarchies. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
ReverseStringFilter | Reverse token string, for example "country" => "yrtnuoc". | Class | org.apache.lucene.analysis.reverse | Apache Lucene |
|
ReverseStringFilterFactory | Factory for ReverseStringFilter. | Class | org.apache.lucene.analysis.reverse | Apache Lucene |
|
RollingCharBuffer | Acts like a forever growing char[] as you read characters into it from the provided reader, but | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
RomanianAnalyzer | Analyzer for Romanian. | Class | org.apache.lucene.analysis.ro | Apache Lucene |
|
RSLPStemmerBase | Base class for stemmers that use a set of RSLP-like stemming steps. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
RSLPStemmerBase .Rule | A basic rule, with no exceptions. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
RSLPStemmerBase .RuleWithSetExceptions | A rule with a set of whole-word exceptions. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
RSLPStemmerBase .RuleWithSuffixExceptions | A rule with a set of exceptional suffixes. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
RSLPStemmerBase .Step | A step containing a list of rules. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
RussianAnalyzer | Analyzer for Russian language. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
RussianLightStemFilter | A TokenFilter that applies RussianLightStemmer to stem Russian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
RussianLightStemFilterFactory | Factory for RussianLightStemFilter. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
RussianLightStemmer | Light Stemmer for Russian. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
ScandinavianFoldingFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ScandinavianFoldingFilterFactory | Factory for ScandinavianFoldingFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ScandinavianNormalizationFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ScandinavianNormalizationFilterFactory | Factory for ScandinavianNormalizationFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
SegmentingTokenizerBase | Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
SerbianNormalizationFilter | Normalizes Serbian Cyrillic and Latin characters to "bald" Latin. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
SerbianNormalizationFilterFactory | Factory for SerbianNormalizationFilter. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
SerbianNormalizationRegularFilter | Normalizes Serbian Cyrillic to Latin. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
SetKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
ShingleAnalyzerWrapper | A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
ShingleFilter | A ShingleFilter constructs shingles (token n-grams) from a token stream. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
ShingleFilterFactory | Factory for ShingleFilter. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
SimpleAnalyzer | An Analyzer that filters LetterTokenizer with LowerCaseFilter | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
SingleTokenTokenStream | A TokenStream containing a single token. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
SnowballFilter | A filter that stems words using a Snowball-generated stemmer. | Class | org.apache.lucene.analysis.snowball | Apache Lucene |
|
SnowballPorterFilterFactory | Factory for SnowballFilter, with configurable language Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection. | Class | org.apache.lucene.analysis.snowball | Apache Lucene |
|
SolrSynonymParser | Parser for the Solr synonyms format. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
SoraniAnalyzer | Analyzer for Sorani Kurdish. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniNormalizationFilter | | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniNormalizationFilterFactory | Factory for SoraniNormalizationFilter. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniNormalizer | Normalizes the Unicode representation of Sorani text. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniStemFilter | A TokenFilter that applies SoraniStemmer to stem Sorani words. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniStemFilterFactory | Factory for SoraniStemFilter. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SoraniStemmer | | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
SpanishAnalyzer | Analyzer for Spanish. | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
SpanishLightStemFilter | A TokenFilter that applies SpanishLightStemmer to stem Spanish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
SpanishLightStemFilterFactory | Factory for SpanishLightStemFilter. | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
SpanishLightStemmer | Light Stemmer for Spanish This stemmer implements the algorithm described in: | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
StandardAnalyzer | Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardFilter | Normalizes tokens extracted with StandardTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardFilterFactory | Factory for StandardFilter. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardTokenizer | A grammar-based tokenizer constructed with JFlex. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardTokenizer40 | Backcompat standard tokenizer for Lucene 4. | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
StandardTokenizerFactory | Factory for StandardTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardTokenizerImpl | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
StandardTokenizerImpl40 | | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
StemmerOverrideFilter .Builder | | Class | org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter | Apache Lucene |
|
StemmerOverrideFilter .StemmerOverrideMap | A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilter | Class | org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter | Apache Lucene |
|
StemmerOverrideFilterFactory | Factory for StemmerOverrideFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
StemmerUtil | Some commonly-used stemming functionsNOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
StopAnalyzer | Filters LetterTokenizer with LowerCaseFilter and StopFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
StopFilter | Removes stop words from a token stream. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
StopFilterFactory | Factory for StopFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
StopwordAnalyzerBase | Base class for Analyzers that need to make use of stopword sets. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
SwedishAnalyzer | Analyzer for Swedish. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
SwedishLightStemFilter | A TokenFilter that applies SwedishLightStemmer to stem Swedish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
SwedishLightStemFilterFactory | Factory for SwedishLightStemFilter. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
SwedishLightStemmer | Light Stemmer for Swedish. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
SynonymFilter | Matches single or multi word synonyms in a token stream. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
SynonymFilterFactory | Factory for SynonymFilter. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
SynonymMap | A map of synonyms, keys and values are phrases. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
SynonymMap .Builder | Builds an FSTSynonymMap. | Class | org.apache.lucene.analysis.synonym.SynonymMap | Apache Lucene |
|
SynonymMap .Parser | Abstraction for parsing synonym files. | Class | org.apache.lucene.analysis.synonym.SynonymMap | Apache Lucene |
|
TeeSinkTokenFilter | This TokenFilter provides the ability to set aside attribute states that have already been analyzed. | Class | org.apache.lucene.analysis.sinks | Apache Lucene |
|
TeeSinkTokenFilter .SinkTokenStream | TokenStream output from a tee. | Class | org.apache.lucene.analysis.sinks.TeeSinkTokenFilter | Apache Lucene |
|
TernaryTree | Ternary Search Tree. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
ThaiAnalyzer | Analyzer for Thai language. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
ThaiTokenizer | Tokenizer that use BreakIterator to tokenize Thai text. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
ThaiTokenizerFactory | Factory for ThaiTokenizer. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
ThaiWordFilter | TokenFilter that use BreakIterator to break each Token that is Thai into separate Token(s) for each Thai word. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
ThaiWordFilterFactory | Factory for ThaiWordFilter. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
TokenFilterFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
TokenizerFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
TokenOffsetPayloadTokenFilter | Adds the OffsetAttribute. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
TokenOffsetPayloadTokenFilterFactory | Factory for TokenOffsetPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
TrimFilter | Trims leading and trailing whitespace from Tokens in the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
TrimFilterFactory | Factory for TrimFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
TruncateTokenFilter | A token filter for truncating the terms into a specific length. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
TruncateTokenFilterFactory | Factory for TruncateTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
TurkishAnalyzer | Analyzer for Turkish. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
TurkishLowerCaseFilter | Normalizes Turkish token text to lower case. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
TurkishLowerCaseFilterFactory | Factory for TurkishLowerCaseFilter. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
TypeAsPayloadTokenFilter | Makes the PackedTokenAttributeImpl. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
TypeAsPayloadTokenFilterFactory | Factory for TypeAsPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
TypeTokenFilterFactory | Factory class for TypeTokenFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
UAX29URLEmailAnalyzer | Filters UAX29URLEmailTokenizer with StandardFilter, | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
UAX29URLEmailTokenizer | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
UAX29URLEmailTokenizer40 | Backcompat uax29 tokenizer for Lucene 4. | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
UAX29URLEmailTokenizerFactory | Factory for UAX29URLEmailTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
UAX29URLEmailTokenizerImpl | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
UAX29URLEmailTokenizerImpl40 | | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
UnicodeProps | This file contains unicode properties used by various CharTokenizers. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
UnicodeWhitespaceAnalyzer | An Analyzer that uses UnicodeWhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
UnicodeWhitespaceTokenizer | A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
UpperCaseFilter | Normalizes token text to UPPER CASE. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
UpperCaseFilterFactory | Factory for UpperCaseFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
WhitespaceAnalyzer | An Analyzer that uses WhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
WhitespaceTokenizer | A tokenizer that divides text at whitespace characters as defined by Character. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
WhitespaceTokenizerFactory | Factory for WhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
WikipediaTokenizer | Extension of StandardTokenizer that is aware of Wikipedia syntax. | Class | org.apache.lucene.analysis.wikipedia | Apache Lucene |
|
WikipediaTokenizerFactory | Factory for WikipediaTokenizer. | Class | org.apache.lucene.analysis.wikipedia | Apache Lucene |
|
WordDelimiterFilter | Splits words into subwords and performs optional transformations on subword groups. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
WordDelimiterFilterFactory | Factory for WordDelimiterFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
WordDelimiterIterator | A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
WordlistLoader | Loader for text files that represent a list of stopwords. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
WordnetSynonymParser | Parser for wordnet prolog format See http://wordnet. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
DefaultICUTokenizerConfig | Default ICUTokenizerConfig that is generally applicable Generally tokenizes Unicode text according to UAX#29 | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
ICUFoldingFilter | A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUFoldingFilterFactory | Factory for ICUFoldingFilter. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUNormalizer2CharFilter | Normalize token text with ICU's Normalizer2. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUNormalizer2CharFilterFactory | Factory for ICUNormalizer2CharFilter Supports the following attributes: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUNormalizer2Filter | Normalize token text with ICU's Normalizer2 With this filter, you can normalize text in the following ways: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUNormalizer2FilterFactory | Factory for ICUNormalizer2Filter Supports the following attributes: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUTokenizer | Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
ICUTokenizerConfig | Class that allows for tailored Unicode Text Segmentation on a per-writing system basis. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
ICUTokenizerFactory | Factory for ICUTokenizer. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
ICUTransformFilter | A TokenFilter that transforms text with ICU. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ICUTransformFilterFactory | Factory for ICUTransformFilter. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
ScriptAttribute | This attribute stores the UTR #24 script value for a token of text. | Interface | org.apache.lucene.analysis.icu.tokenattributes | Apache Lucene |
|
ScriptAttributeImpl | WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.icu.tokenattributes | Apache Lucene |
|
BaseFormAttribute | Attribute for Token. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
BaseFormAttributeImpl | Attribute for Token. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
BinaryDictionary | Base class for a binary-encoded in-memory dictionary. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
CharacterDefinition | Character category data. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
ConnectionCosts | | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
CSVUtil | | Class | org.apache.lucene.analysis.ja.util | Apache Lucene |
|
Dictionary | | Interface | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
GraphvizFormatter | Outputs the dot (graphviz) string for the viterbi lattice. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
InflectionAttribute | Attribute for Kuromoji inflection data. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
InflectionAttributeImpl | Attribute for Kuromoji inflection data. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
JapaneseAnalyzer | Analyzer for Japanese that uses morphological analysis. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseBaseFormFilter | Replaces term text with the BaseFormAttribute. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseBaseFormFilterFactory | Factory for JapaneseBaseFormFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseIterationMarkCharFilter | Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseIterationMarkCharFilterFactory | Factory for JapaneseIterationMarkCharFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseKatakanaStemFilter | A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseKatakanaStemFilterFactory | Factory for JapaneseKatakanaStemFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapanesePartOfSpeechStopFilter | Removes tokens that match a set of part-of-speech tags. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapanesePartOfSpeechStopFilterFactory | Factory for JapanesePartOfSpeechStopFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseReadingFormFilter | A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseReadingFormFilterFactory | Factory for JapaneseReadingFormFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseTokenizer | Tokenizer for Japanese that uses morphological analysis. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
JapaneseTokenizer .Mode | Tokenization mode: this determines how the tokenizer handles compound and unknown words. | Class | org.apache.lucene.analysis.ja.JapaneseTokenizer | Apache Lucene |
|
JapaneseTokenizer .Type | Token type reflecting the original source of this tokenKnown words from the system dictionary. | Class | org.apache.lucene.analysis.ja.JapaneseTokenizer | Apache Lucene |
|
JapaneseTokenizerFactory | Factory for JapaneseTokenizer. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
Lucene43JapanesePartOfSpeechStopFilter | Backcompat JapanesePartOfSpeechStopFilter for versions 4. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
PartOfSpeechAttribute | Attribute for Token. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
PartOfSpeechAttributeImpl | Attribute for Token. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
ReadingAttribute | Attribute for Kuromoji reading data Note: in some cases this value may not be applicable, | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
ReadingAttributeImpl | | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
Token | Analyzed token with morphological data from its dictionary. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
TokenInfoDictionary | Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
TokenInfoFST | Thin wrapper around an FST with root-arc caching for Japanese. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
ToStringUtil | Utility class for english translations of morphological data, used only for debugging. | Class | org.apache.lucene.analysis.ja.util | Apache Lucene |
|
UnknownDictionary | Dictionary for unknown-word handling. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
UserDictionary | Class for building a User Dictionary. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
MorfologikAnalyzer | Analyzer using Morfologik library. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
MorfologikFilter | TokenFilter using Morfologik library to transform input tokens into lemma and morphosyntactic (POS) tokens. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
MorfologikFilterFactory | Filter factory for MorfologikFilter. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
MorphosyntacticTagsAttribute | Morfologik provides morphosyntactic annotations for surface forms. | Interface | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
MorphosyntacticTagsAttributeImpl | Morphosyntactic annotations for surface forms. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
BeiderMorseFilter | TokenFilter for Beider-Morse phonetic encoding. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
BeiderMorseFilterFactory | Factory for BeiderMorseFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
DaitchMokotoffSoundexFilter | | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
DaitchMokotoffSoundexFilterFactory | Factory for DaitchMokotoffSoundexFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
DoubleMetaphoneFilter | | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
DoubleMetaphoneFilterFactory | Factory for DoubleMetaphoneFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
PhoneticFilter | Create tokens for phonetic matches. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
PhoneticFilterFactory | Factory for PhoneticFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
AnalyzerProfile | Manages analysis data configuration for SmartChineseAnalyzer SmartChineseAnalyzer has a built-in dictionary and stopword list out-of-box. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
CharType | Internal SmartChineseAnalyzer character type constants. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
HHMMSegmenter | Finds the optimal segmentation of a sentence into Chinese wordsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
HMMChineseTokenizer | Tokenizer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
HMMChineseTokenizerFactory | Factory for HMMChineseTokenizer Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
SegToken | SmartChineseAnalyzer internal tokenWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
SegTokenFilter | Filters a SegToken by converting full-width latin to half-width, then lowercasing latin. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
SentenceTokenizer | Tokenizes input text into sentences. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
SmartChineseAnalyzer | SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
SmartChineseSentenceTokenizerFactory | Factory for the SmartChineseAnalyzer SentenceTokenizerWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
SmartChineseWordTokenFilterFactory | Factory for the SmartChineseAnalyzer WordTokenFilter Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
Utility | SmartChineseAnalyzer utility constants and methodsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
WordTokenFilter | A TokenFilter that breaks sentences into words. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
WordType | Internal SmartChineseAnalyzer token type constantsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
PolishAnalyzer | Analyzer for Polish. | Class | org.apache.lucene.analysis.pl | Apache Lucene |
|
StempelFilter | Transforms the token stream as per the stemming algorithm. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
StempelPolishStemFilterFactory | Factory for StempelFilter using a Polish stemming table. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
StempelStemmer | Stemmer class is a convenient facade for other stemmer-related classes. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
AEProvider | | Interface | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
AEProviderFactory | | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
BaseUIMATokenizer | Abstract base implementation of a Tokenizer which is able to analyze the given input with a UIMA AnalysisEngine | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
BasicAEProvider | Basic AEProvider which just instantiates a UIMA AnalysisEngine with no additional metadata, parameters or resources | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
OverridingParamsAEProvider | AEProvider implementation that creates an Aggregate AE from the given path, also injecting runtime parameters defined in the solrconfig. | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
UIMAAnnotationsTokenizer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
UIMAAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
UIMABaseAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
UIMATypeAwareAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
UIMATypeAwareAnnotationsTokenizer | A Tokenizer which creates token from UIMA Annotations filling also their TypeAttribute according to FeaturePaths specified | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
UIMATypeAwareAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
BaseTokenStreamTestCase | Base class for all Lucene unit tests that use TokenStreams. | Class | org.apache.lucene.analysis | Apache Lucene |
|
BaseTokenStreamTestCase .CheckClearAttributesAttribute | Attribute that records if it was cleared or not. | Interface | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
|
BaseTokenStreamTestCase .CheckClearAttributesAttributeImpl | Attribute that records if it was cleared or not. | Class | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
|
CannedBinaryTokenStream | | Class | org.apache.lucene.analysis | Apache Lucene |
|
CannedBinaryTokenStream .BinaryToken | Represents a binary token. | Class | org.apache.lucene.analysis.CannedBinaryTokenStream | Apache Lucene |
|
CannedTokenStream | TokenStream from a canned list of Tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
|
CollationTestBase | Base test class for testing Unicode collation. | Class | org.apache.lucene.analysis | Apache Lucene |
|
CrankyTokenFilter | Throws IOException from random Tokenstream methods. | Class | org.apache.lucene.analysis | Apache Lucene |
|
LookaheadTokenFilter | An abstract TokenFilter to make it easier to build graph token filters requiring some lookahead. | Class | org.apache.lucene.analysis | Apache Lucene |
|
LookaheadTokenFilter .Position | Holds all state for a single position; subclass this to record other state at each position. | Class | org.apache.lucene.analysis.LookaheadTokenFilter | Apache Lucene |
|
MockAnalyzer | Analyzer for testing This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockBytesAnalyzer | Analyzer for testing that encodes terms as UTF-16 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockCharFilter | the purpose of this charfilter is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockFixedLengthPayloadFilter | TokenFilter that adds random fixed-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockGraphTokenFilter | Randomly inserts overlapped (posInc=0) tokens with posLength sometimes > 1. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockHoleInjectingTokenFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockPayloadAnalyzer | Wraps a whitespace tokenizer with a filter that sets the first token, and odd tokens to posinc=1, and all others | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockRandomLookaheadTokenFilter | Uses LookaheadTokenFilter to randomly peek at future tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockReaderWrapper | Wraps a Reader, and can throw random or fixed exceptions, and spoon feed read chars. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockTokenFilter | A tokenfilter for testing that removes terms accepted by a DFA. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockTokenizer | Tokenizer for testing. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockUTF16TermAttributeImpl | Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
|
MockVariableLengthPayloadFilter | TokenFilter that adds random variable-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
|
SimplePayloadFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
|
TokenStreamToDot | Consumes a TokenStream and outputs the dot (graphviz) string (graph). | Class | org.apache.lucene.analysis | Apache Lucene |
|
ValidatingTokenFilter | A TokenFilter that checks consistency of the tokens (eg offsets are consistent with one another). | Class | org.apache.lucene.analysis | Apache Lucene |
|
VocabularyAssert | | Class | org.apache.lucene.analysis | Apache Lucene |