| Name | Description | Type | Package | Framework |
| Analyzer | An Analyzer builds TokenStreams, which analyze text. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| Analyzer .ReuseStrategy | Strategy defining how TokenStreamComponents are reused per call to Analyzer. | Class | org.apache.lucene.analysis.Analyzer | Apache Lucene |
|
| Analyzer .TokenStreamComponents | This class encapsulates the outer components of a token stream. | Class | org.apache.lucene.analysis.Analyzer | Apache Lucene |
|
| AnalyzerWrapper | Extension to Analyzer suitable for Analyzers which wrap getWrappedAnalyzer(String) allows the Analyzer | Class | org.apache.lucene.analysis | Apache Lucene |
|
| BytesTermAttribute | This attribute can be used if you have the raw term bytes to be indexed. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| BytesTermAttributeImpl | NOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| CachingTokenFilter | This class can be used if the token attributes of a TokenStream are intended to be consumed more than once. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| CharFilter | Subclasses of CharFilter can be chained to filter a Reader They can be used as Reader with additional offset | Class | org.apache.lucene.analysis | Apache Lucene |
|
| CharTermAttribute | The term text of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| CharTermAttributeImpl | Default implementation of CharTermAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| DelegatingAnalyzerWrapper | An analyzer wrapper, that doesn't allow to wrap components or readers. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| FlagsAttribute | This attribute can be used to pass different flags down the Tokenizer chain, e. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| FlagsAttributeImpl | Default implementation of FlagsAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| KeywordAttribute | This attribute can be used to mark a token as a keyword. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| KeywordAttributeImpl | Default implementation of KeywordAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| NumericTokenStream | Expert: This class provides a TokenStream for indexing numeric values that can be used by NumericRangeQuery. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| NumericTokenStream .NumericTermAttribute | Expert: Use this attribute to get the details of the currently generated token. | Interface | org.apache.lucene.analysis.NumericTokenStream | Apache Lucene |
|
| NumericTokenStream .NumericTermAttributeImpl | | Class | org.apache.lucene.analysis.NumericTokenStream | Apache Lucene |
|
| OffsetAttribute | The start and end character offset of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| OffsetAttributeImpl | Default implementation of OffsetAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PackedTokenAttributeImpl | Default implementation of the common attributes used by Lucene:CharTermAttribute PositionIncrementAttribute | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PayloadAttribute | The payload of a Token. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PayloadAttributeImpl | Default implementation of PayloadAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PositionIncrementAttribute | Determines the position of this token relative to the previous Token in a TokenStream, used in phrase | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PositionIncrementAttributeImpl | Default implementation of PositionIncrementAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PositionLengthAttribute | Determines how many positions this token spans. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| PositionLengthAttributeImpl | Default implementation of PositionLengthAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| TermToBytesRefAttribute | This attribute is requested by TermsHashPerField to index the contents. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| Token | A Token is an occurrence of a term from the text of a field. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| TokenFilter | A TokenFilter is a TokenStream whose input is another TokenStream. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| Tokenizer | A Tokenizer is a TokenStream whose input is a Reader. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| TokenStream | A TokenStream enumerates the sequence of tokens, either from Fields of a Document or from query text. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| TokenStreamToAutomaton | Consumes a TokenStream and creates an Automaton where the transition labels are UTF8 bytes (or Unicode | Class | org.apache.lucene.analysis | Apache Lucene |
|
| TypeAttribute | A Token's lexical type. | Interface | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| TypeAttributeImpl | Default implementation of TypeAttribute. | Class | org.apache.lucene.analysis.tokenattributes | Apache Lucene |
|
| AbstractAnalysisFactory | Abstract parent class for analysis factories TokenizerFactory, TokenFilterFactory and CharFilterFactory. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| AbstractEncoder | Base class for payload encoders. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| AnalysisSPILoader | Helper class for loading named SPIs from classpath (e. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| ApostropheFilter | Strips all characters after an apostrophe (including the apostrophe itself). | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
| ApostropheFilterFactory | Factory for ApostropheFilter. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
| ArabicAnalyzer | Analyzer for Arabic. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicNormalizationFilter | A TokenFilter that applies ArabicNormalizer to normalize the orthography. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicNormalizationFilterFactory | Factory for ArabicNormalizationFilter. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicNormalizer | Normalizer for Arabic. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicStemFilter | A TokenFilter that applies ArabicStemmer to stem Arabic words. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicStemFilterFactory | Factory for ArabicStemFilter. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArabicStemmer | Stemming is done in-place for efficiency, operating on a termbuffer. | Class | org.apache.lucene.analysis.ar | Apache Lucene |
|
| ArmenianAnalyzer | Analyzer for Armenian. | Class | org.apache.lucene.analysis.hy | Apache Lucene |
|
| ASCIIFoldingFilter | This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the Basic Latin Unicode | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ASCIIFoldingFilterFactory | Factory for ASCIIFoldingFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| BaseCharFilter | Base utility class for implementing a CharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| BasqueAnalyzer | Analyzer for Basque. | Class | org.apache.lucene.analysis.eu | Apache Lucene |
|
| BrazilianAnalyzer | Analyzer for Brazilian Portuguese language. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
| BrazilianStemFilter | A TokenFilter that applies BrazilianStemmer. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
| BrazilianStemFilterFactory | Factory for BrazilianStemFilter. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
| BrazilianStemmer | A stemmer for Brazilian Portuguese words. | Class | org.apache.lucene.analysis.br | Apache Lucene |
|
| BulgarianAnalyzer | Analyzer for Bulgarian. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
| BulgarianStemFilter | A TokenFilter that applies BulgarianStemmer to stem Bulgarian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
| BulgarianStemFilterFactory | Factory for BulgarianStemFilter. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
| BulgarianStemmer | Light Stemmer for Bulgarian. | Class | org.apache.lucene.analysis.bg | Apache Lucene |
|
| ByteVector | This class implements a simple byte vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| CapitalizationFilter | A filter to apply normal capitalization rules to Tokens. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| CapitalizationFilterFactory | Factory for CapitalizationFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| CatalanAnalyzer | Analyzer for Catalan. | Class | org.apache.lucene.analysis.ca | Apache Lucene |
|
| CharacterUtils | CharacterUtils provides a unified interface to Character-related operations to implement backwards compatible character operations based on a | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharacterUtils .CharacterBuffer | A simple IO buffer to use with CharacterUtils. | Class | org.apache.lucene.analysis.util.CharacterUtils | Apache Lucene |
|
| CharArrayIterator | A CharacterIterator used internally for use with BreakIteratorNOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharArrayMap | A simple class that stores key Strings as char[]'s in a hash table. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharArraySet | A simple class that stores Strings as char[]'s in a hash table. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharFilterFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharTokenizer | An abstract base class for simple, character-oriented tokenizers. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CharVector | This class implements a simple char vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| CJKAnalyzer | An Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
| CJKBigramFilter | Forms bigrams of CJK terms that are generated from StandardTokenizer CJK types are set by these tokenizers, but you can also use | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
| CJKBigramFilterFactory | Factory for CJKBigramFilter. | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
| CJKWidthFilter | A TokenFilter that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
| CJKWidthFilterFactory | Factory for CJKWidthFilter. | Class | org.apache.lucene.analysis.cjk | Apache Lucene |
|
| ClassicAnalyzer | Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| ClassicFilter | Normalizes tokens extracted with ClassicTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| ClassicFilterFactory | Factory for ClassicFilter. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| ClassicTokenizer | A grammar-based tokenizer constructed with JFlex This should be a good tokenizer for most European-language documents: | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| ClassicTokenizerFactory | Factory for ClassicTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| ClasspathResourceLoader | Simple ResourceLoader that uses ClassLoader. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| CodepointCountFilter | Removes words that are too long or too short from the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| CodepointCountFilterFactory | Factory for CodepointCountFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| CommonGramsFilter | Construct bigrams for frequently occurring terms while indexing. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
| CommonGramsFilterFactory | Constructs a CommonGramsFilter. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
| CommonGramsQueryFilter | Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
| CommonGramsQueryFilterFactory | Construct CommonGramsQueryFilter. | Class | org.apache.lucene.analysis.commongrams | Apache Lucene |
|
| CompoundWordTokenFilterBase | Base class for decomposition token filters. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| CustomAnalyzer | A general-purpose Analyzer that can be created with a builder-style API. | Class | org.apache.lucene.analysis.custom | Apache Lucene |
|
| CustomAnalyzer .Builder | Builder for CustomAnalyzer. | Class | org.apache.lucene.analysis.custom.CustomAnalyzer | Apache Lucene |
|
| CzechAnalyzer | Analyzer for Czech language. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
| CzechStemFilter | A TokenFilter that applies CzechStemmer to stem Czech words. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
| CzechStemFilterFactory | Factory for CzechStemFilter. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
| CzechStemmer | Light Stemmer for Czech. | Class | org.apache.lucene.analysis.cz | Apache Lucene |
|
| DanishAnalyzer | Analyzer for Danish. | Class | org.apache.lucene.analysis.da | Apache Lucene |
|
| DateRecognizerFilter | Filters all tokens that cannot be parsed to a date, using the provided DateFormat. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| DateRecognizerFilterFactory | Factory for DateRecognizerFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| DecimalDigitFilter | Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9). | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| DecimalDigitFilterFactory | Factory for DecimalDigitFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| DelimitedPayloadTokenFilter | Characters before the delimiter are the "token", those after are the payload. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| DelimitedPayloadTokenFilterFactory | Factory for DelimitedPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| Dictionary | In-memory structure for the dictionary (. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
| DictionaryCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| DictionaryCompoundWordTokenFilterFactory | Factory for DictionaryCompoundWordTokenFilter. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| DutchAnalyzer | Analyzer for Dutch language. | Class | org.apache.lucene.analysis.nl | Apache Lucene |
|
| EdgeNGramFilterFactory | Creates new instances of EdgeNGramTokenFilter. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| EdgeNGramTokenizer | Tokenizes the input from an edge into n-grams of given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| EdgeNGramTokenizerFactory | Creates new instances of EdgeNGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| ElisionFilter | Removes elisions from a TokenStream. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| ElisionFilterFactory | Factory for ElisionFilter. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| EmptyTokenStream | An always exhausted token stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| EnglishAnalyzer | Analyzer for English. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| EnglishMinimalStemFilter | A TokenFilter that applies EnglishMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| EnglishMinimalStemFilterFactory | Factory for EnglishMinimalStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| EnglishMinimalStemmer | Minimal plural stemmer for English. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| EnglishPossessiveFilter | TokenFilter that removes possessives (trailing 's) from words. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| EnglishPossessiveFilterFactory | Factory for EnglishPossessiveFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| FilesystemResourceLoader | Simple ResourceLoader that opens resource files from the local file system, optionally resolving against | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| FilteringTokenFilter | Abstract base class for TokenFilters that may remove tokens. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| FingerprintFilter | Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| FingerprintFilterFactory | Factory for FingerprintFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| FinnishAnalyzer | Analyzer for Finnish. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
| FinnishLightStemFilter | A TokenFilter that applies FinnishLightStemmer to stem Finnish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
| FinnishLightStemFilterFactory | Factory for FinnishLightStemFilter. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
| FinnishLightStemmer | Light Stemmer for Finnish. | Class | org.apache.lucene.analysis.fi | Apache Lucene |
|
| FloatEncoder | Encode a character array Float as a BytesRef. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| FrenchAnalyzer | Analyzer for French language. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchLightStemFilter | A TokenFilter that applies FrenchLightStemmer to stem French To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchLightStemFilterFactory | Factory for FrenchLightStemFilter. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchLightStemmer | Light Stemmer for French. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchMinimalStemFilter | A TokenFilter that applies FrenchMinimalStemmer to stem French To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchMinimalStemFilterFactory | Factory for FrenchMinimalStemFilter. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| FrenchMinimalStemmer | Light Stemmer for French. | Class | org.apache.lucene.analysis.fr | Apache Lucene |
|
| GalicianAnalyzer | Analyzer for Galician. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianMinimalStemFilter | A TokenFilter that applies GalicianMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianMinimalStemFilterFactory | Factory for GalicianMinimalStemFilter. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianMinimalStemmer | Minimal Stemmer for Galician This follows the "RSLP-S" algorithm, but modified for Galician. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianStemFilter | A TokenFilter that applies GalicianStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianStemFilterFactory | Factory for GalicianStemFilter. | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GalicianStemmer | Galician stemmer implementing "Regras do lematizador para o galego". | Class | org.apache.lucene.analysis.gl | Apache Lucene |
|
| GermanAnalyzer | Analyzer for German language. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanLightStemFilter | A TokenFilter that applies GermanLightStemmer to stem German To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanLightStemFilterFactory | Factory for GermanLightStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanLightStemmer | Light Stemmer for German. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanMinimalStemFilter | A TokenFilter that applies GermanMinimalStemmer to stem German To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanMinimalStemFilterFactory | Factory for GermanMinimalStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanMinimalStemmer | Minimal Stemmer for German. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanNormalizationFilter | Normalizes German characters according to the heuristics German2 snowball algorithm. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanNormalizationFilterFactory | Factory for GermanNormalizationFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanStemFilter | A TokenFilter that stems German words. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanStemFilterFactory | Factory for GermanStemFilter. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GermanStemmer | A stemmer for German words. | Class | org.apache.lucene.analysis.de | Apache Lucene |
|
| GreekAnalyzer | Analyzer for the Greek language. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| GreekLowerCaseFilter | Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| GreekLowerCaseFilterFactory | Factory for GreekLowerCaseFilter. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| GreekStemFilter | A TokenFilter that applies GreekStemmer to stem Greek To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| GreekStemFilterFactory | Factory for GreekStemFilter. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| GreekStemmer | A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language. | Class | org.apache.lucene.analysis.el | Apache Lucene |
|
| HindiAnalyzer | | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiNormalizationFilter | A TokenFilter that applies HindiNormalizer to normalize the In some cases the normalization may cause unrelated terms to conflate, so | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiNormalizationFilterFactory | Factory for HindiNormalizationFilter. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiNormalizer | Normalizer for Hindi. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiStemFilter | A TokenFilter that applies HindiStemmer to stem Hindi words. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiStemFilterFactory | Factory for HindiStemFilter. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HindiStemmer | Light Stemmer for Hindi. | Class | org.apache.lucene.analysis.hi | Apache Lucene |
|
| HTMLStripCharFilter | A CharFilter that wraps another Reader and attempts to strip out HTML constructs. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| HTMLStripCharFilterFactory | Factory for HTMLStripCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| HungarianAnalyzer | Analyzer for Hungarian. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
| HungarianLightStemFilter | A TokenFilter that applies HungarianLightStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
| HungarianLightStemFilterFactory | Factory for HungarianLightStemFilter. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
| HungarianLightStemmer | Light Stemmer for Hungarian. | Class | org.apache.lucene.analysis.hu | Apache Lucene |
|
| HunspellStemFilter | TokenFilter that uses hunspell affix rules and words to stem tokens. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
| HunspellStemFilterFactory | TokenFilterFactory that creates instances of HunspellStemFilter. | Class | org.apache.lucene.analysis.hunspell | Apache Lucene |
|
| Hyphen | This class represents a hyphen. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| HyphenatedWordsFilter | When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| HyphenatedWordsFilterFactory | Factory for HyphenatedWordsFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| Hyphenation | This class represents a hyphenated word. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| HyphenationCompoundWordTokenFilterFactory | Factory for HyphenationCompoundWordTokenFilter. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| HyphenationTree | This tree structure stores the hyphenation patterns in an efficient way for fast lookup. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| IdentityEncoder | Does nothing other than convert the char array to a byte array using the specified encoding. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| IndicNormalizationFilter | A TokenFilter that applies IndicNormalizer to normalize text in Indian Languages. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
| IndicNormalizationFilterFactory | Factory for IndicNormalizationFilter. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
| IndicNormalizer | Normalizes the Unicode representation of text in Indian languages. | Class | org.apache.lucene.analysis.in | Apache Lucene |
|
| IndonesianAnalyzer | | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
| IndonesianStemFilter | A TokenFilter that applies IndonesianStemmer to stem Indonesian words. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
| IndonesianStemFilterFactory | Factory for IndonesianStemFilter. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
| IndonesianStemmer | Stemmer for Indonesian. | Class | org.apache.lucene.analysis.id | Apache Lucene |
|
| IntegerEncoder | Encode a character array Integer as a BytesRef. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| IrishAnalyzer | | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
| IrishLowerCaseFilter | Normalises token text to lower case, handling t-prothesis and n-eclipsis (i. | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
| IrishLowerCaseFilterFactory | Factory for IrishLowerCaseFilter. | Class | org.apache.lucene.analysis.ga | Apache Lucene |
|
| ItalianAnalyzer | Analyzer for Italian. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
| ItalianLightStemFilter | A TokenFilter that applies ItalianLightStemmer to stem Italian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
| ItalianLightStemFilterFactory | Factory for ItalianLightStemFilter. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
| ItalianLightStemmer | Light Stemmer for Italian. | Class | org.apache.lucene.analysis.it | Apache Lucene |
|
| KeepWordFilter | A TokenFilter that only keeps tokens with text contained in the required words. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeepWordFilterFactory | Factory for KeepWordFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeywordAnalyzer | "Tokenizes" the entire stream as a single token. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| KeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeywordMarkerFilterFactory | Factory for KeywordMarkerFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeywordRepeatFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeywordRepeatFilterFactory | Factory for KeywordRepeatFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| KeywordTokenizer | Emits the entire input as a single token. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| KeywordTokenizerFactory | Factory for KeywordTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| KStemFilter | A high-performance kstem filter for english. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| KStemFilterFactory | Factory for KStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| KStemmer | | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| LatvianAnalyzer | Analyzer for Latvian. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
| LatvianStemFilter | A TokenFilter that applies LatvianStemmer to stem Latvian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
| LatvianStemFilterFactory | Factory for LatvianStemFilter. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
| LatvianStemmer | Light stemmer for Latvian. | Class | org.apache.lucene.analysis.lv | Apache Lucene |
|
| LengthFilter | Removes words that are too long or too short from the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LengthFilterFactory | Factory for LengthFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| LetterTokenizerFactory | Factory for LetterTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| LimitTokenCountAnalyzer | This Analyzer limits the number of tokens while indexing. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenCountFilter | This TokenFilter limits the number of tokens while indexing. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenCountFilterFactory | Factory for LimitTokenCountFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenOffsetFilter | Lets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenOffsetFilterFactory | Factory for LimitTokenOffsetFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenPositionFilter | This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LimitTokenPositionFilterFactory | Factory for LimitTokenPositionFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| LithuanianAnalyzer | Analyzer for Lithuanian. | Class | org.apache.lucene.analysis.lt | Apache Lucene |
|
| LowerCaseFilter | Normalizes token text to lower case. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| LowerCaseFilterFactory | Factory for LowerCaseFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| LowerCaseTokenizer | LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| LowerCaseTokenizerFactory | Factory for LowerCaseTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| Lucene43CompoundWordTokenFilterBase | Base class for decomposition token filters using pre-4. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| Lucene43DictionaryCompoundWordTokenFilter | | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| Lucene43EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s), using pre-4. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| Lucene43EdgeNGramTokenizer | Old version of EdgeNGramTokenizer which doesn't handle correctly supplementary characters. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| Lucene43EdgeNGramTokenizer .Side | Specifies which side of the input the n-gram should be generated fromGet the n-gram from the end of the input | Class | org.apache.lucene.analysis.ngram.Lucene43EdgeNGramTokenizer | Apache Lucene |
|
| Lucene43FilteringTokenFilter | Backcompat FilteringTokenFilter for versions 4. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| Lucene43HyphenationCompoundWordTokenFilter | A TokenFilter that decomposes compound words found in many Germanic languages, using pre-4. | Class | org.apache.lucene.analysis.compound | Apache Lucene |
|
| Lucene43KeepWordFilter | Backcompat for KeepWordFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| Lucene43LengthFilter | Backcompat LengthFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| Lucene43NGramTokenFilter | Tokenizes the input into n-grams of the given size(s), matching Lucene 4. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| Lucene43NGramTokenizer | Old broken version of NGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| Lucene43StopFilter | Backcompat StopFilter for versions 4. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| Lucene43TrimFilter | Backcompat TrimFilter for versions 4. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| Lucene43TypeTokenFilter | Backcompat TypeTokenFilter for versions 4. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| Lucene47WordDelimiterFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| MappingCharFilter | Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| MappingCharFilterFactory | Factory for MappingCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| MultiTermAwareComponent | Add to any analysis factory component to allow returning an analysis component factory for use with partial terms in prefix queries, | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
| NGramFilterFactory | Factory for NGramTokenFilter. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| NGramTokenFilter | Tokenizes the input into n-grams of the given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| NGramTokenizer | Tokenizes the input into n-grams of the given size(s). | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| NGramTokenizerFactory | Factory for NGramTokenizer. | Class | org.apache.lucene.analysis.ngram | Apache Lucene |
|
| NormalizeCharMap | Holds a map of String input to String output, to be used with MappingCharFilter. | Class | org.apache.lucene.analysis.charfilter | Apache Lucene |
|
| NormalizeCharMap .Builder | Builds an NormalizeCharMap. | Class | org.apache.lucene.analysis.charfilter.NormalizeCharMap | Apache Lucene |
|
| NorwegianAnalyzer | Analyzer for Norwegian. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianLightStemFilter | A TokenFilter that applies NorwegianLightStemmer to stem Norwegian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianLightStemFilterFactory | Factory for NorwegianLightStemFilter. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianLightStemmer | Light Stemmer for Norwegian. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianMinimalStemFilter | A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianMinimalStemFilterFactory | Factory for NorwegianMinimalStemFilter. | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NorwegianMinimalStemmer | | Class | org.apache.lucene.analysis.no | Apache Lucene |
|
| NumericPayloadTokenFilter | Assigns a payload to a token based on the PackedTokenAttributeImpl. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| NumericPayloadTokenFilterFactory | Factory for NumericPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| OpenStringBuilder | A StringBuilder that allows one to access the array. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| PathHierarchyTokenizer | Tokenizer for path-like hierarchies. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
| PathHierarchyTokenizerFactory | Factory for PathHierarchyTokenizer. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
| PatternCaptureGroupFilterFactory | Factory for PatternCaptureGroupTokenFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternCaptureGroupTokenFilter | CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternConsumer | This interface is used to connect the XML pattern file parser to the This class has been taken from the Apache FOP project (http://xmlgraphics. | Interface | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| PatternKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| PatternParser | A SAX document handler to read and parse hyphenation patterns from a XML This class has been taken from the Apache FOP project (http://xmlgraphics. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| PatternReplaceCharFilter | CharFilter that uses a regular expression for the target of replace string. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternReplaceCharFilterFactory | Factory for PatternReplaceCharFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternReplaceFilter | A TokenFilter which applies a Pattern to each token in the stream, replacing match occurances with the specified replacement string. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternReplaceFilterFactory | Factory for PatternReplaceFilter. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternTokenizer | This tokenizer uses regex pattern matching to construct distinct tokens for the input stream. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PatternTokenizerFactory | Factory for PatternTokenizer. | Class | org.apache.lucene.analysis.pattern | Apache Lucene |
|
| PayloadEncoder | Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to NOTE: This interface is subject to change | Interface | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| PayloadHelper | Utility methods for encoding payloads. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| PersianAnalyzer | Analyzer for Persian. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PersianCharFilter | | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PersianCharFilterFactory | Factory for PersianCharFilter. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PersianNormalizationFilter | | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PersianNormalizationFilterFactory | Factory for PersianNormalizationFilter. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PersianNormalizer | Normalizer for Persian. | Class | org.apache.lucene.analysis.fa | Apache Lucene |
|
| PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| PorterStemFilterFactory | Factory for PorterStemFilter. | Class | org.apache.lucene.analysis.en | Apache Lucene |
|
| PortugueseAnalyzer | Analyzer for Portuguese. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseLightStemFilter | A TokenFilter that applies PortugueseLightStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseLightStemFilterFactory | Factory for PortugueseLightStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseLightStemmer | Light Stemmer for Portuguese This stemmer implements the "UniNE" algorithm in: | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseMinimalStemFilter | A TokenFilter that applies PortugueseMinimalStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseMinimalStemFilterFactory | Factory for PortugueseMinimalStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseMinimalStemmer | Minimal Stemmer for Portuguese This follows the "RSLP-S" algorithm presented in: | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseStemFilter | A TokenFilter that applies PortugueseStemmer to stem To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseStemFilterFactory | Factory for PortugueseStemFilter. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PortugueseStemmer | Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| PrefixAndSuffixAwareTokenFilter | Links two PrefixAwareTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| PrefixAwareTokenFilter | Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| QueryAutoStopWordAnalyzer | An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries. | Class | org.apache.lucene.analysis.query | Apache Lucene |
|
| RemoveDuplicatesTokenFilter | A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| RemoveDuplicatesTokenFilterFactory | Factory for RemoveDuplicatesTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ResourceLoader | Abstraction for loading resources (streams, files, and classes). | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
| ResourceLoaderAware | Interface for a component that needs to be initialized by an implementation of ResourceLoader. | Interface | org.apache.lucene.analysis.util | Apache Lucene |
|
| ReversePathHierarchyTokenizer | Tokenizer for domain-like hierarchies. | Class | org.apache.lucene.analysis.path | Apache Lucene |
|
| ReverseStringFilter | Reverse token string, for example "country" => "yrtnuoc". | Class | org.apache.lucene.analysis.reverse | Apache Lucene |
|
| ReverseStringFilterFactory | Factory for ReverseStringFilter. | Class | org.apache.lucene.analysis.reverse | Apache Lucene |
|
| RollingCharBuffer | Acts like a forever growing char[] as you read characters into it from the provided reader, but | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| RomanianAnalyzer | Analyzer for Romanian. | Class | org.apache.lucene.analysis.ro | Apache Lucene |
|
| RSLPStemmerBase | Base class for stemmers that use a set of RSLP-like stemming steps. | Class | org.apache.lucene.analysis.pt | Apache Lucene |
|
| RSLPStemmerBase .Rule | A basic rule, with no exceptions. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
| RSLPStemmerBase .RuleWithSetExceptions | A rule with a set of whole-word exceptions. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
| RSLPStemmerBase .RuleWithSuffixExceptions | A rule with a set of exceptional suffixes. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
| RSLPStemmerBase .Step | A step containing a list of rules. | Class | org.apache.lucene.analysis.pt.RSLPStemmerBase | Apache Lucene |
|
| RussianAnalyzer | Analyzer for Russian language. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
| RussianLightStemFilter | A TokenFilter that applies RussianLightStemmer to stem Russian To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
| RussianLightStemFilterFactory | Factory for RussianLightStemFilter. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
| RussianLightStemmer | Light Stemmer for Russian. | Class | org.apache.lucene.analysis.ru | Apache Lucene |
|
| ScandinavianFoldingFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ScandinavianFoldingFilterFactory | Factory for ScandinavianFoldingFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ScandinavianNormalizationFilter | | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ScandinavianNormalizationFilterFactory | Factory for ScandinavianNormalizationFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| SegmentingTokenizerBase | Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| SerbianNormalizationFilter | Normalizes Serbian Cyrillic and Latin characters to "bald" Latin. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
| SerbianNormalizationFilterFactory | Factory for SerbianNormalizationFilter. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
| SerbianNormalizationRegularFilter | Normalizes Serbian Cyrillic to Latin. | Class | org.apache.lucene.analysis.sr | Apache Lucene |
|
| SetKeywordMarkerFilter | Marks terms as keywords via the KeywordAttribute. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| ShingleAnalyzerWrapper | A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
| ShingleFilter | A ShingleFilter constructs shingles (token n-grams) from a token stream. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
| ShingleFilterFactory | Factory for ShingleFilter. | Class | org.apache.lucene.analysis.shingle | Apache Lucene |
|
| SimpleAnalyzer | An Analyzer that filters LetterTokenizer with LowerCaseFilter | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| SingleTokenTokenStream | A TokenStream containing a single token. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| SnowballFilter | A filter that stems words using a Snowball-generated stemmer. | Class | org.apache.lucene.analysis.snowball | Apache Lucene |
|
| SnowballPorterFilterFactory | Factory for SnowballFilter, with configurable language Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection. | Class | org.apache.lucene.analysis.snowball | Apache Lucene |
|
| SolrSynonymParser | Parser for the Solr synonyms format. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
| SoraniAnalyzer | Analyzer for Sorani Kurdish. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniNormalizationFilter | | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniNormalizationFilterFactory | Factory for SoraniNormalizationFilter. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniNormalizer | Normalizes the Unicode representation of Sorani text. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniStemFilter | A TokenFilter that applies SoraniStemmer to stem Sorani words. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniStemFilterFactory | Factory for SoraniStemFilter. | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SoraniStemmer | | Class | org.apache.lucene.analysis.ckb | Apache Lucene |
|
| SpanishAnalyzer | Analyzer for Spanish. | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
| SpanishLightStemFilter | A TokenFilter that applies SpanishLightStemmer to stem Spanish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
| SpanishLightStemFilterFactory | Factory for SpanishLightStemFilter. | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
| SpanishLightStemmer | Light Stemmer for Spanish This stemmer implements the algorithm described in: | Class | org.apache.lucene.analysis.es | Apache Lucene |
|
| StandardAnalyzer | Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardFilter | Normalizes tokens extracted with StandardTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardFilterFactory | Factory for StandardFilter. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardTokenizer | A grammar-based tokenizer constructed with JFlex. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardTokenizer40 | Backcompat standard tokenizer for Lucene 4. | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
| StandardTokenizerFactory | Factory for StandardTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardTokenizerImpl | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| StandardTokenizerImpl40 | | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
| StemmerOverrideFilter | Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| StemmerOverrideFilter .Builder | | Class | org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter | Apache Lucene |
|
| StemmerOverrideFilter .StemmerOverrideMap | A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilter | Class | org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter | Apache Lucene |
|
| StemmerOverrideFilterFactory | Factory for StemmerOverrideFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| StemmerUtil | Some commonly-used stemming functionsNOTE: This API is for internal purposes only and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| StopAnalyzer | Filters LetterTokenizer with LowerCaseFilter and StopFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| StopFilter | Removes stop words from a token stream. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| StopFilterFactory | Factory for StopFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| StopwordAnalyzerBase | Base class for Analyzers that need to make use of stopword sets. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| SwedishAnalyzer | Analyzer for Swedish. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
| SwedishLightStemFilter | A TokenFilter that applies SwedishLightStemmer to stem Swedish To prevent terms from being stemmed use an instance of | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
| SwedishLightStemFilterFactory | Factory for SwedishLightStemFilter. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
| SwedishLightStemmer | Light Stemmer for Swedish. | Class | org.apache.lucene.analysis.sv | Apache Lucene |
|
| SynonymFilter | Matches single or multi word synonyms in a token stream. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
| SynonymFilterFactory | Factory for SynonymFilter. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
| SynonymMap | A map of synonyms, keys and values are phrases. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
| SynonymMap .Builder | Builds an FSTSynonymMap. | Class | org.apache.lucene.analysis.synonym.SynonymMap | Apache Lucene |
|
| SynonymMap .Parser | Abstraction for parsing synonym files. | Class | org.apache.lucene.analysis.synonym.SynonymMap | Apache Lucene |
|
| TeeSinkTokenFilter | This TokenFilter provides the ability to set aside attribute states that have already been analyzed. | Class | org.apache.lucene.analysis.sinks | Apache Lucene |
|
| TeeSinkTokenFilter .SinkTokenStream | TokenStream output from a tee. | Class | org.apache.lucene.analysis.sinks.TeeSinkTokenFilter | Apache Lucene |
|
| TernaryTree | Ternary Search Tree. | Class | org.apache.lucene.analysis.compound.hyphenation | Apache Lucene |
|
| ThaiAnalyzer | Analyzer for Thai language. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
| ThaiTokenizer | Tokenizer that use BreakIterator to tokenize Thai text. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
| ThaiTokenizerFactory | Factory for ThaiTokenizer. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
| ThaiWordFilter | TokenFilter that use BreakIterator to break each Token that is Thai into separate Token(s) for each Thai word. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
| ThaiWordFilterFactory | Factory for ThaiWordFilter. | Class | org.apache.lucene.analysis.th | Apache Lucene |
|
| TokenFilterFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| TokenizerFactory | | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| TokenOffsetPayloadTokenFilter | Adds the OffsetAttribute. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| TokenOffsetPayloadTokenFilterFactory | Factory for TokenOffsetPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| TrimFilter | Trims leading and trailing whitespace from Tokens in the stream. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| TrimFilterFactory | Factory for TrimFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| TruncateTokenFilter | A token filter for truncating the terms into a specific length. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| TruncateTokenFilterFactory | Factory for TruncateTokenFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| TurkishAnalyzer | Analyzer for Turkish. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
| TurkishLowerCaseFilter | Normalizes Turkish token text to lower case. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
| TurkishLowerCaseFilterFactory | Factory for TurkishLowerCaseFilter. | Class | org.apache.lucene.analysis.tr | Apache Lucene |
|
| TypeAsPayloadTokenFilter | Makes the PackedTokenAttributeImpl. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| TypeAsPayloadTokenFilterFactory | Factory for TypeAsPayloadTokenFilter. | Class | org.apache.lucene.analysis.payloads | Apache Lucene |
|
| TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| TypeTokenFilterFactory | Factory class for TypeTokenFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| UAX29URLEmailAnalyzer | Filters UAX29URLEmailTokenizer with StandardFilter, | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| UAX29URLEmailTokenizer | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| UAX29URLEmailTokenizer40 | Backcompat uax29 tokenizer for Lucene 4. | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
| UAX29URLEmailTokenizerFactory | Factory for UAX29URLEmailTokenizer. | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| UAX29URLEmailTokenizerImpl | This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in | Class | org.apache.lucene.analysis.standard | Apache Lucene |
|
| UAX29URLEmailTokenizerImpl40 | | Class | org.apache.lucene.analysis.standard.std40 | Apache Lucene |
|
| UnicodeProps | This file contains unicode properties used by various CharTokenizers. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| UnicodeWhitespaceAnalyzer | An Analyzer that uses UnicodeWhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| UnicodeWhitespaceTokenizer | A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| UpperCaseFilter | Normalizes token text to UPPER CASE. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| UpperCaseFilterFactory | Factory for UpperCaseFilter. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| WhitespaceAnalyzer | An Analyzer that uses WhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| WhitespaceTokenizer | A tokenizer that divides text at whitespace characters as defined by Character. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| WhitespaceTokenizerFactory | Factory for WhitespaceTokenizer. | Class | org.apache.lucene.analysis.core | Apache Lucene |
|
| WikipediaTokenizer | Extension of StandardTokenizer that is aware of Wikipedia syntax. | Class | org.apache.lucene.analysis.wikipedia | Apache Lucene |
|
| WikipediaTokenizerFactory | Factory for WikipediaTokenizer. | Class | org.apache.lucene.analysis.wikipedia | Apache Lucene |
|
| WordDelimiterFilter | Splits words into subwords and performs optional transformations on subword groups. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| WordDelimiterFilterFactory | Factory for WordDelimiterFilter. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| WordDelimiterIterator | A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules. | Class | org.apache.lucene.analysis.miscellaneous | Apache Lucene |
|
| WordlistLoader | Loader for text files that represent a list of stopwords. | Class | org.apache.lucene.analysis.util | Apache Lucene |
|
| WordnetSynonymParser | Parser for wordnet prolog format See http://wordnet. | Class | org.apache.lucene.analysis.synonym | Apache Lucene |
|
| DefaultICUTokenizerConfig | Default ICUTokenizerConfig that is generally applicable Generally tokenizes Unicode text according to UAX#29 | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
| ICUFoldingFilter | A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUFoldingFilterFactory | Factory for ICUFoldingFilter. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUNormalizer2CharFilter | Normalize token text with ICU's Normalizer2. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUNormalizer2CharFilterFactory | Factory for ICUNormalizer2CharFilter Supports the following attributes: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUNormalizer2Filter | Normalize token text with ICU's Normalizer2 With this filter, you can normalize text in the following ways: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUNormalizer2FilterFactory | Factory for ICUNormalizer2Filter Supports the following attributes: | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUTokenizer | Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
| ICUTokenizerConfig | Class that allows for tailored Unicode Text Segmentation on a per-writing system basis. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
| ICUTokenizerFactory | Factory for ICUTokenizer. | Class | org.apache.lucene.analysis.icu.segmentation | Apache Lucene |
|
| ICUTransformFilter | A TokenFilter that transforms text with ICU. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ICUTransformFilterFactory | Factory for ICUTransformFilter. | Class | org.apache.lucene.analysis.icu | Apache Lucene |
|
| ScriptAttribute | This attribute stores the UTR #24 script value for a token of text. | Interface | org.apache.lucene.analysis.icu.tokenattributes | Apache Lucene |
|
| ScriptAttributeImpl | WARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.icu.tokenattributes | Apache Lucene |
|
| BaseFormAttribute | Attribute for Token. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| BaseFormAttributeImpl | Attribute for Token. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| BinaryDictionary | Base class for a binary-encoded in-memory dictionary. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| CharacterDefinition | Character category data. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| ConnectionCosts | | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| CSVUtil | | Class | org.apache.lucene.analysis.ja.util | Apache Lucene |
|
| Dictionary | | Interface | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| GraphvizFormatter | Outputs the dot (graphviz) string for the viterbi lattice. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| InflectionAttribute | Attribute for Kuromoji inflection data. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| InflectionAttributeImpl | Attribute for Kuromoji inflection data. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| JapaneseAnalyzer | Analyzer for Japanese that uses morphological analysis. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseBaseFormFilter | Replaces term text with the BaseFormAttribute. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseBaseFormFilterFactory | Factory for JapaneseBaseFormFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseIterationMarkCharFilter | Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseIterationMarkCharFilterFactory | Factory for JapaneseIterationMarkCharFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseKatakanaStemFilter | A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC). | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseKatakanaStemFilterFactory | Factory for JapaneseKatakanaStemFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapanesePartOfSpeechStopFilter | Removes tokens that match a set of part-of-speech tags. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapanesePartOfSpeechStopFilterFactory | Factory for JapanesePartOfSpeechStopFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseReadingFormFilter | A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseReadingFormFilterFactory | Factory for JapaneseReadingFormFilter. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseTokenizer | Tokenizer for Japanese that uses morphological analysis. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| JapaneseTokenizer .Mode | Tokenization mode: this determines how the tokenizer handles compound and unknown words. | Class | org.apache.lucene.analysis.ja.JapaneseTokenizer | Apache Lucene |
|
| JapaneseTokenizer .Type | Token type reflecting the original source of this tokenKnown words from the system dictionary. | Class | org.apache.lucene.analysis.ja.JapaneseTokenizer | Apache Lucene |
|
| JapaneseTokenizerFactory | Factory for JapaneseTokenizer. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| Lucene43JapanesePartOfSpeechStopFilter | Backcompat JapanesePartOfSpeechStopFilter for versions 4. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| PartOfSpeechAttribute | Attribute for Token. | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| PartOfSpeechAttributeImpl | Attribute for Token. | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| ReadingAttribute | Attribute for Kuromoji reading data Note: in some cases this value may not be applicable, | Interface | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| ReadingAttributeImpl | | Class | org.apache.lucene.analysis.ja.tokenattributes | Apache Lucene |
|
| Token | Analyzed token with morphological data from its dictionary. | Class | org.apache.lucene.analysis.ja | Apache Lucene |
|
| TokenInfoDictionary | Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| TokenInfoFST | Thin wrapper around an FST with root-arc caching for Japanese. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| ToStringUtil | Utility class for english translations of morphological data, used only for debugging. | Class | org.apache.lucene.analysis.ja.util | Apache Lucene |
|
| UnknownDictionary | Dictionary for unknown-word handling. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| UserDictionary | Class for building a User Dictionary. | Class | org.apache.lucene.analysis.ja.dict | Apache Lucene |
|
| MorfologikAnalyzer | Analyzer using Morfologik library. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
| MorfologikFilter | TokenFilter using Morfologik library to transform input tokens into lemma and morphosyntactic (POS) tokens. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
| MorfologikFilterFactory | Filter factory for MorfologikFilter. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
| MorphosyntacticTagsAttribute | Morfologik provides morphosyntactic annotations for surface forms. | Interface | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
| MorphosyntacticTagsAttributeImpl | Morphosyntactic annotations for surface forms. | Class | org.apache.lucene.analysis.morfologik | Apache Lucene |
|
| BeiderMorseFilter | TokenFilter for Beider-Morse phonetic encoding. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| BeiderMorseFilterFactory | Factory for BeiderMorseFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| DaitchMokotoffSoundexFilter | | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| DaitchMokotoffSoundexFilterFactory | Factory for DaitchMokotoffSoundexFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| DoubleMetaphoneFilter | | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| DoubleMetaphoneFilterFactory | Factory for DoubleMetaphoneFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| PhoneticFilter | Create tokens for phonetic matches. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| PhoneticFilterFactory | Factory for PhoneticFilter. | Class | org.apache.lucene.analysis.phonetic | Apache Lucene |
|
| AnalyzerProfile | Manages analysis data configuration for SmartChineseAnalyzer SmartChineseAnalyzer has a built-in dictionary and stopword list out-of-box. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| CharType | Internal SmartChineseAnalyzer character type constants. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| HHMMSegmenter | Finds the optimal segmentation of a sentence into Chinese wordsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
| HMMChineseTokenizer | Tokenizer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| HMMChineseTokenizerFactory | Factory for HMMChineseTokenizer Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| SegToken | SmartChineseAnalyzer internal tokenWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
| SegTokenFilter | Filters a SegToken by converting full-width latin to half-width, then lowercasing latin. | Class | org.apache.lucene.analysis.cn.smart.hhmm | Apache Lucene |
|
| SentenceTokenizer | Tokenizes input text into sentences. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| SmartChineseAnalyzer | SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| SmartChineseSentenceTokenizerFactory | Factory for the SmartChineseAnalyzer SentenceTokenizerWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| SmartChineseWordTokenFilterFactory | Factory for the SmartChineseAnalyzer WordTokenFilter Note: this class will currently emit tokens for punctuation. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| Utility | SmartChineseAnalyzer utility constants and methodsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| WordTokenFilter | A TokenFilter that breaks sentences into words. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| WordType | Internal SmartChineseAnalyzer token type constantsWARNING: This API is experimental and might change in incompatible ways in the next release. | Class | org.apache.lucene.analysis.cn.smart | Apache Lucene |
|
| PolishAnalyzer | Analyzer for Polish. | Class | org.apache.lucene.analysis.pl | Apache Lucene |
|
| StempelFilter | Transforms the token stream as per the stemming algorithm. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
| StempelPolishStemFilterFactory | Factory for StempelFilter using a Polish stemming table. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
| StempelStemmer | Stemmer class is a convenient facade for other stemmer-related classes. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
|
| AEProvider | | Interface | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
| AEProviderFactory | | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
| BaseUIMATokenizer | Abstract base implementation of a Tokenizer which is able to analyze the given input with a UIMA AnalysisEngine | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| BasicAEProvider | Basic AEProvider which just instantiates a UIMA AnalysisEngine with no additional metadata, parameters or resources | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
| OverridingParamsAEProvider | AEProvider implementation that creates an Aggregate AE from the given path, also injecting runtime parameters defined in the solrconfig. | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
|
| UIMAAnnotationsTokenizer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| UIMAAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| UIMABaseAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| UIMATypeAwareAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| UIMATypeAwareAnnotationsTokenizer | A Tokenizer which creates token from UIMA Annotations filling also their TypeAttribute according to FeaturePaths specified | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| UIMATypeAwareAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
|
| BaseTokenStreamTestCase | Base class for all Lucene unit tests that use TokenStreams. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| BaseTokenStreamTestCase .CheckClearAttributesAttribute | Attribute that records if it was cleared or not. | Interface | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
|
| BaseTokenStreamTestCase .CheckClearAttributesAttributeImpl | Attribute that records if it was cleared or not. | Class | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
|
| CannedBinaryTokenStream | | Class | org.apache.lucene.analysis | Apache Lucene |
|
| CannedBinaryTokenStream .BinaryToken | Represents a binary token. | Class | org.apache.lucene.analysis.CannedBinaryTokenStream | Apache Lucene |
|
| CannedTokenStream | TokenStream from a canned list of Tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| CollationTestBase | Base test class for testing Unicode collation. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| CrankyTokenFilter | Throws IOException from random Tokenstream methods. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| LookaheadTokenFilter | An abstract TokenFilter to make it easier to build graph token filters requiring some lookahead. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| LookaheadTokenFilter .Position | Holds all state for a single position; subclass this to record other state at each position. | Class | org.apache.lucene.analysis.LookaheadTokenFilter | Apache Lucene |
|
| MockAnalyzer | Analyzer for testing This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockBytesAnalyzer | Analyzer for testing that encodes terms as UTF-16 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockCharFilter | the purpose of this charfilter is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockFixedLengthPayloadFilter | TokenFilter that adds random fixed-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockGraphTokenFilter | Randomly inserts overlapped (posInc=0) tokens with posLength sometimes > 1. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockHoleInjectingTokenFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockPayloadAnalyzer | Wraps a whitespace tokenizer with a filter that sets the first token, and odd tokens to posinc=1, and all others | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockRandomLookaheadTokenFilter | Uses LookaheadTokenFilter to randomly peek at future tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockReaderWrapper | Wraps a Reader, and can throw random or fixed exceptions, and spoon feed read chars. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockTokenFilter | A tokenfilter for testing that removes terms accepted by a DFA. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockTokenizer | Tokenizer for testing. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockUTF16TermAttributeImpl | Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| MockVariableLengthPayloadFilter | TokenFilter that adds random variable-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
|
| SimplePayloadFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
|
| TokenStreamToDot | Consumes a TokenStream and outputs the dot (graphviz) string (graph). | Class | org.apache.lucene.analysis | Apache Lucene |
|
| ValidatingTokenFilter | A TokenFilter that checks consistency of the tokens (eg offsets are consistent with one another). | Class | org.apache.lucene.analysis | Apache Lucene |
|
| VocabularyAssert | | Class | org.apache.lucene.analysis | Apache Lucene |