# org.apache.lucene.analysis Classes and Interfaces - 537 results found.
NameDescriptionTypePackageFramework
AnalyzerAn Analyzer builds TokenStreams, which analyze text.Classorg.apache.lucene.analysisApache Lucene
Analyzer .ReuseStrategyStrategy defining how TokenStreamComponents are reused per call to Analyzer.Classorg.apache.lucene.analysis.AnalyzerApache Lucene
Analyzer .TokenStreamComponentsThis class encapsulates the outer components of a token stream.Classorg.apache.lucene.analysis.AnalyzerApache Lucene
AnalyzerWrapperExtension to Analyzer suitable for Analyzers which wrap getWrappedAnalyzer(String) allows the AnalyzerClassorg.apache.lucene.analysisApache Lucene
BytesTermAttributeThis attribute can be used if you have the raw term bytes to be indexed.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
BytesTermAttributeImplNOTE: This API is for internal purposes only and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
CachingTokenFilterThis class can be used if the token attributes of a TokenStream are intended to be consumed more than once.Classorg.apache.lucene.analysisApache Lucene
CharFilterSubclasses of CharFilter can be chained to filter a Reader They can be used as Reader with additional offsetClassorg.apache.lucene.analysisApache Lucene
CharTermAttributeThe term text of a Token.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
CharTermAttributeImplDefault implementation of CharTermAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
DelegatingAnalyzerWrapperAn analyzer wrapper, that doesn't allow to wrap components or readers.Classorg.apache.lucene.analysisApache Lucene
FlagsAttributeThis attribute can be used to pass different flags down the Tokenizer chain, e.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
FlagsAttributeImplDefault implementation of FlagsAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
KeywordAttributeThis attribute can be used to mark a token as a keyword.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
KeywordAttributeImplDefault implementation of KeywordAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
NumericTokenStreamExpert: This class provides a TokenStream for indexing numeric values that can be used by NumericRangeQuery.Classorg.apache.lucene.analysisApache Lucene
NumericTokenStream .NumericTermAttributeExpert: Use this attribute to get the details of the currently generated token.Interfaceorg.apache.lucene.analysis.NumericTokenStreamApache Lucene
NumericTokenStream .NumericTermAttributeImplClassorg.apache.lucene.analysis.NumericTokenStreamApache Lucene
OffsetAttributeThe start and end character offset of a Token.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
OffsetAttributeImplDefault implementation of OffsetAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
PackedTokenAttributeImplDefault implementation of the common attributes used by Lucene:CharTermAttribute PositionIncrementAttributeClassorg.apache.lucene.analysis.tokenattributesApache Lucene
PayloadAttributeThe payload of a Token.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
PayloadAttributeImplDefault implementation of PayloadAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
PositionIncrementAttributeDetermines the position of this token relative to the previous Token in a TokenStream, used in phraseInterfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
PositionIncrementAttributeImplDefault implementation of PositionIncrementAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
PositionLengthAttributeDetermines how many positions this token spans.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
PositionLengthAttributeImplDefault implementation of PositionLengthAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
TermToBytesRefAttributeThis attribute is requested by TermsHashPerField to index the contents.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
TokenA Token is an occurrence of a term from the text of a field.Classorg.apache.lucene.analysisApache Lucene
TokenFilterA TokenFilter is a TokenStream whose input is another TokenStream.Classorg.apache.lucene.analysisApache Lucene
TokenizerA Tokenizer is a TokenStream whose input is a Reader.Classorg.apache.lucene.analysisApache Lucene
TokenStreamA TokenStream enumerates the sequence of tokens, either from Fields of a Document or from query text.Classorg.apache.lucene.analysisApache Lucene
TokenStreamToAutomatonConsumes a TokenStream and creates an Automaton where the transition labels are UTF8 bytes (or Unicode Classorg.apache.lucene.analysisApache Lucene
TypeAttributeA Token's lexical type.Interfaceorg.apache.lucene.analysis.tokenattributesApache Lucene
TypeAttributeImplDefault implementation of TypeAttribute.Classorg.apache.lucene.analysis.tokenattributesApache Lucene
AbstractAnalysisFactoryAbstract parent class for analysis factories TokenizerFactory, TokenFilterFactory and CharFilterFactory.Classorg.apache.lucene.analysis.utilApache Lucene
AbstractEncoderBase class for payload encoders.Classorg.apache.lucene.analysis.payloadsApache Lucene
AnalysisSPILoaderHelper class for loading named SPIs from classpath (e.Classorg.apache.lucene.analysis.utilApache Lucene
ApostropheFilterStrips all characters after an apostrophe (including the apostrophe itself).Classorg.apache.lucene.analysis.trApache Lucene
ApostropheFilterFactoryFactory for ApostropheFilter.Classorg.apache.lucene.analysis.trApache Lucene
ArabicAnalyzerAnalyzer for Arabic.Classorg.apache.lucene.analysis.arApache Lucene
ArabicNormalizationFilterA TokenFilter that applies ArabicNormalizer to normalize the orthography.Classorg.apache.lucene.analysis.arApache Lucene
ArabicNormalizationFilterFactoryFactory for ArabicNormalizationFilter.Classorg.apache.lucene.analysis.arApache Lucene
ArabicNormalizerNormalizer for Arabic.Classorg.apache.lucene.analysis.arApache Lucene
ArabicStemFilterA TokenFilter that applies ArabicStemmer to stem Arabic words.Classorg.apache.lucene.analysis.arApache Lucene
ArabicStemFilterFactoryFactory for ArabicStemFilter.Classorg.apache.lucene.analysis.arApache Lucene
ArabicStemmer Stemming is done in-place for efficiency, operating on a termbuffer.Classorg.apache.lucene.analysis.arApache Lucene
ArmenianAnalyzerAnalyzer for Armenian.Classorg.apache.lucene.analysis.hyApache Lucene
ASCIIFoldingFilterThis class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the Basic Latin UnicodeClassorg.apache.lucene.analysis.miscellaneousApache Lucene
ASCIIFoldingFilterFactoryFactory for ASCIIFoldingFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
BaseCharFilterBase utility class for implementing a CharFilter.Classorg.apache.lucene.analysis.charfilterApache Lucene
BasqueAnalyzerAnalyzer for Basque.Classorg.apache.lucene.analysis.euApache Lucene
BrazilianAnalyzerAnalyzer for Brazilian Portuguese language.Classorg.apache.lucene.analysis.brApache Lucene
BrazilianStemFilterA TokenFilter that applies BrazilianStemmer.Classorg.apache.lucene.analysis.brApache Lucene
BrazilianStemFilterFactoryFactory for BrazilianStemFilter.Classorg.apache.lucene.analysis.brApache Lucene
BrazilianStemmerA stemmer for Brazilian Portuguese words.Classorg.apache.lucene.analysis.brApache Lucene
BulgarianAnalyzerAnalyzer for Bulgarian.Classorg.apache.lucene.analysis.bgApache Lucene
BulgarianStemFilterA TokenFilter that applies BulgarianStemmer to stem Bulgarian To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.bgApache Lucene
BulgarianStemFilterFactoryFactory for BulgarianStemFilter.Classorg.apache.lucene.analysis.bgApache Lucene
BulgarianStemmerLight Stemmer for Bulgarian.Classorg.apache.lucene.analysis.bgApache Lucene
ByteVectorThis class implements a simple byte vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
CapitalizationFilterA filter to apply normal capitalization rules to Tokens.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
CapitalizationFilterFactoryFactory for CapitalizationFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
CatalanAnalyzerAnalyzer for Catalan.Classorg.apache.lucene.analysis.caApache Lucene
CharacterUtilsCharacterUtils provides a unified interface to Character-related operations to implement backwards compatible character operations based on aClassorg.apache.lucene.analysis.utilApache Lucene
CharacterUtils .CharacterBufferA simple IO buffer to use with CharacterUtils.Classorg.apache.lucene.analysis.util.CharacterUtilsApache Lucene
CharArrayIteratorA CharacterIterator used internally for use with BreakIteratorNOTE: This API is for internal purposes only and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.utilApache Lucene
CharArrayMapA simple class that stores key Strings as char[]'s in a hash table.Classorg.apache.lucene.analysis.utilApache Lucene
CharArraySetA simple class that stores Strings as char[]'s in a hash table.Classorg.apache.lucene.analysis.utilApache Lucene
CharFilterFactoryClassorg.apache.lucene.analysis.utilApache Lucene
CharTokenizerAn abstract base class for simple, character-oriented tokenizers.Classorg.apache.lucene.analysis.utilApache Lucene
CharVectorThis class implements a simple char vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
CJKAnalyzerAn Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case withClassorg.apache.lucene.analysis.cjkApache Lucene
CJKBigramFilterForms bigrams of CJK terms that are generated from StandardTokenizer CJK types are set by these tokenizers, but you can also use Classorg.apache.lucene.analysis.cjkApache Lucene
CJKBigramFilterFactoryFactory for CJKBigramFilter.Classorg.apache.lucene.analysis.cjkApache Lucene
CJKWidthFilterA TokenFilter that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latinClassorg.apache.lucene.analysis.cjkApache Lucene
CJKWidthFilterFactoryFactory for CJKWidthFilter.Classorg.apache.lucene.analysis.cjkApache Lucene
ClassicAnalyzerFilters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.Classorg.apache.lucene.analysis.standardApache Lucene
ClassicFilterNormalizes tokens extracted with ClassicTokenizer.Classorg.apache.lucene.analysis.standardApache Lucene
ClassicFilterFactoryFactory for ClassicFilter.Classorg.apache.lucene.analysis.standardApache Lucene
ClassicTokenizerA grammar-based tokenizer constructed with JFlex This should be a good tokenizer for most European-language documents:Classorg.apache.lucene.analysis.standardApache Lucene
ClassicTokenizerFactoryFactory for ClassicTokenizer.Classorg.apache.lucene.analysis.standardApache Lucene
ClasspathResourceLoaderSimple ResourceLoader that uses ClassLoader.Classorg.apache.lucene.analysis.utilApache Lucene
CodepointCountFilterRemoves words that are too long or too short from the stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
CodepointCountFilterFactoryFactory for CodepointCountFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
CommonGramsFilterConstruct bigrams for frequently occurring terms while indexing.Classorg.apache.lucene.analysis.commongramsApache Lucene
CommonGramsFilterFactoryConstructs a CommonGramsFilter.Classorg.apache.lucene.analysis.commongramsApache Lucene
CommonGramsQueryFilterWrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.Classorg.apache.lucene.analysis.commongramsApache Lucene
CommonGramsQueryFilterFactoryConstruct CommonGramsQueryFilter.Classorg.apache.lucene.analysis.commongramsApache Lucene
CompoundWordTokenFilterBaseBase class for decomposition token filters.Classorg.apache.lucene.analysis.compoundApache Lucene
CustomAnalyzerA general-purpose Analyzer that can be created with a builder-style API.Classorg.apache.lucene.analysis.customApache Lucene
CustomAnalyzer .BuilderBuilder for CustomAnalyzer.Classorg.apache.lucene.analysis.custom.CustomAnalyzerApache Lucene
CzechAnalyzerAnalyzer for Czech language.Classorg.apache.lucene.analysis.czApache Lucene
CzechStemFilterA TokenFilter that applies CzechStemmer to stem Czech words.Classorg.apache.lucene.analysis.czApache Lucene
CzechStemFilterFactoryFactory for CzechStemFilter.Classorg.apache.lucene.analysis.czApache Lucene
CzechStemmerLight Stemmer for Czech.Classorg.apache.lucene.analysis.czApache Lucene
DanishAnalyzerAnalyzer for Danish.Classorg.apache.lucene.analysis.daApache Lucene
DateRecognizerFilterFilters all tokens that cannot be parsed to a date, using the provided DateFormat.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
DateRecognizerFilterFactoryFactory for DateRecognizerFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
DecimalDigitFilterFolds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9).Classorg.apache.lucene.analysis.coreApache Lucene
DecimalDigitFilterFactoryFactory for DecimalDigitFilter.Classorg.apache.lucene.analysis.coreApache Lucene
DelimitedPayloadTokenFilterCharacters before the delimiter are the "token", those after are the payload.Classorg.apache.lucene.analysis.payloadsApache Lucene
DelimitedPayloadTokenFilterFactoryFactory for DelimitedPayloadTokenFilter.Classorg.apache.lucene.analysis.payloadsApache Lucene
DictionaryIn-memory structure for the dictionary (.Classorg.apache.lucene.analysis.hunspellApache Lucene
DictionaryCompoundWordTokenFilterA TokenFilter that decomposes compound words found in many Germanic languages.Classorg.apache.lucene.analysis.compoundApache Lucene
DictionaryCompoundWordTokenFilterFactoryFactory for DictionaryCompoundWordTokenFilter.Classorg.apache.lucene.analysis.compoundApache Lucene
DutchAnalyzerAnalyzer for Dutch language.Classorg.apache.lucene.analysis.nlApache Lucene
EdgeNGramFilterFactoryCreates new instances of EdgeNGramTokenFilter.Classorg.apache.lucene.analysis.ngramApache Lucene
EdgeNGramTokenFilterTokenizes the given token into n-grams of given size(s).Classorg.apache.lucene.analysis.ngramApache Lucene
EdgeNGramTokenizerTokenizes the input from an edge into n-grams of given size(s).Classorg.apache.lucene.analysis.ngramApache Lucene
EdgeNGramTokenizerFactoryCreates new instances of EdgeNGramTokenizer.Classorg.apache.lucene.analysis.ngramApache Lucene
ElisionFilterRemoves elisions from a TokenStream.Classorg.apache.lucene.analysis.utilApache Lucene
ElisionFilterFactoryFactory for ElisionFilter.Classorg.apache.lucene.analysis.utilApache Lucene
EmptyTokenStreamAn always exhausted token stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
EnglishAnalyzerAnalyzer for English.Classorg.apache.lucene.analysis.enApache Lucene
EnglishMinimalStemFilterA TokenFilter that applies EnglishMinimalStemmer to stem To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.enApache Lucene
EnglishMinimalStemFilterFactoryFactory for EnglishMinimalStemFilter.Classorg.apache.lucene.analysis.enApache Lucene
EnglishMinimalStemmerMinimal plural stemmer for English.Classorg.apache.lucene.analysis.enApache Lucene
EnglishPossessiveFilterTokenFilter that removes possessives (trailing 's) from words.Classorg.apache.lucene.analysis.enApache Lucene
EnglishPossessiveFilterFactoryFactory for EnglishPossessiveFilter.Classorg.apache.lucene.analysis.enApache Lucene
FilesystemResourceLoaderSimple ResourceLoader that opens resource files from the local file system, optionally resolving againstClassorg.apache.lucene.analysis.utilApache Lucene
FilteringTokenFilterAbstract base class for TokenFilters that may remove tokens.Classorg.apache.lucene.analysis.utilApache Lucene
FingerprintFilterFilter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
FingerprintFilterFactoryFactory for FingerprintFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
FinnishAnalyzerAnalyzer for Finnish.Classorg.apache.lucene.analysis.fiApache Lucene
FinnishLightStemFilterA TokenFilter that applies FinnishLightStemmer to stem Finnish To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.fiApache Lucene
FinnishLightStemFilterFactoryFactory for FinnishLightStemFilter.Classorg.apache.lucene.analysis.fiApache Lucene
FinnishLightStemmerLight Stemmer for Finnish.Classorg.apache.lucene.analysis.fiApache Lucene
FloatEncoderEncode a character array Float as a BytesRef.Classorg.apache.lucene.analysis.payloadsApache Lucene
FrenchAnalyzerAnalyzer for French language.Classorg.apache.lucene.analysis.frApache Lucene
FrenchLightStemFilterA TokenFilter that applies FrenchLightStemmer to stem French To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.frApache Lucene
FrenchLightStemFilterFactoryFactory for FrenchLightStemFilter.Classorg.apache.lucene.analysis.frApache Lucene
FrenchLightStemmerLight Stemmer for French.Classorg.apache.lucene.analysis.frApache Lucene
FrenchMinimalStemFilterA TokenFilter that applies FrenchMinimalStemmer to stem French To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.frApache Lucene
FrenchMinimalStemFilterFactoryFactory for FrenchMinimalStemFilter.Classorg.apache.lucene.analysis.frApache Lucene
FrenchMinimalStemmerLight Stemmer for French.Classorg.apache.lucene.analysis.frApache Lucene
GalicianAnalyzerAnalyzer for Galician.Classorg.apache.lucene.analysis.glApache Lucene
GalicianMinimalStemFilterA TokenFilter that applies GalicianMinimalStemmer to stem To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.glApache Lucene
GalicianMinimalStemFilterFactoryFactory for GalicianMinimalStemFilter.Classorg.apache.lucene.analysis.glApache Lucene
GalicianMinimalStemmerMinimal Stemmer for Galician This follows the "RSLP-S" algorithm, but modified for Galician.Classorg.apache.lucene.analysis.glApache Lucene
GalicianStemFilterA TokenFilter that applies GalicianStemmer to stem To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.glApache Lucene
GalicianStemFilterFactoryFactory for GalicianStemFilter.Classorg.apache.lucene.analysis.glApache Lucene
GalicianStemmerGalician stemmer implementing "Regras do lematizador para o galego".Classorg.apache.lucene.analysis.glApache Lucene
GermanAnalyzerAnalyzer for German language.Classorg.apache.lucene.analysis.deApache Lucene
GermanLightStemFilterA TokenFilter that applies GermanLightStemmer to stem German To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.deApache Lucene
GermanLightStemFilterFactoryFactory for GermanLightStemFilter.Classorg.apache.lucene.analysis.deApache Lucene
GermanLightStemmerLight Stemmer for German.Classorg.apache.lucene.analysis.deApache Lucene
GermanMinimalStemFilterA TokenFilter that applies GermanMinimalStemmer to stem German To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.deApache Lucene
GermanMinimalStemFilterFactoryFactory for GermanMinimalStemFilter.Classorg.apache.lucene.analysis.deApache Lucene
GermanMinimalStemmerMinimal Stemmer for German.Classorg.apache.lucene.analysis.deApache Lucene
GermanNormalizationFilterNormalizes German characters according to the heuristics German2 snowball algorithm.Classorg.apache.lucene.analysis.deApache Lucene
GermanNormalizationFilterFactoryFactory for GermanNormalizationFilter.Classorg.apache.lucene.analysis.deApache Lucene
GermanStemFilterA TokenFilter that stems German words.Classorg.apache.lucene.analysis.deApache Lucene
GermanStemFilterFactoryFactory for GermanStemFilter.Classorg.apache.lucene.analysis.deApache Lucene
GermanStemmerA stemmer for German words.Classorg.apache.lucene.analysis.deApache Lucene
GreekAnalyzerAnalyzer for the Greek language.Classorg.apache.lucene.analysis.elApache Lucene
GreekLowerCaseFilterNormalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.Classorg.apache.lucene.analysis.elApache Lucene
GreekLowerCaseFilterFactoryFactory for GreekLowerCaseFilter.Classorg.apache.lucene.analysis.elApache Lucene
GreekStemFilterA TokenFilter that applies GreekStemmer to stem Greek To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.elApache Lucene
GreekStemFilterFactoryFactory for GreekStemFilter.Classorg.apache.lucene.analysis.elApache Lucene
GreekStemmerA stemmer for Greek words, according to: Development of a Stemmer for the Greek Language.Classorg.apache.lucene.analysis.elApache Lucene
HindiAnalyzerClassorg.apache.lucene.analysis.hiApache Lucene
HindiNormalizationFilterA TokenFilter that applies HindiNormalizer to normalize the In some cases the normalization may cause unrelated terms to conflate, soClassorg.apache.lucene.analysis.hiApache Lucene
HindiNormalizationFilterFactoryFactory for HindiNormalizationFilter.Classorg.apache.lucene.analysis.hiApache Lucene
HindiNormalizerNormalizer for Hindi.Classorg.apache.lucene.analysis.hiApache Lucene
HindiStemFilterA TokenFilter that applies HindiStemmer to stem Hindi words.Classorg.apache.lucene.analysis.hiApache Lucene
HindiStemFilterFactoryFactory for HindiStemFilter.Classorg.apache.lucene.analysis.hiApache Lucene
HindiStemmerLight Stemmer for Hindi.Classorg.apache.lucene.analysis.hiApache Lucene
HTMLStripCharFilterA CharFilter that wraps another Reader and attempts to strip out HTML constructs.Classorg.apache.lucene.analysis.charfilterApache Lucene
HTMLStripCharFilterFactoryFactory for HTMLStripCharFilter.Classorg.apache.lucene.analysis.charfilterApache Lucene
HungarianAnalyzerAnalyzer for Hungarian.Classorg.apache.lucene.analysis.huApache Lucene
HungarianLightStemFilterA TokenFilter that applies HungarianLightStemmer to stem To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.huApache Lucene
HungarianLightStemFilterFactoryFactory for HungarianLightStemFilter.Classorg.apache.lucene.analysis.huApache Lucene
HungarianLightStemmerLight Stemmer for Hungarian.Classorg.apache.lucene.analysis.huApache Lucene
HunspellStemFilterTokenFilter that uses hunspell affix rules and words to stem tokens.Classorg.apache.lucene.analysis.hunspellApache Lucene
HunspellStemFilterFactoryTokenFilterFactory that creates instances of HunspellStemFilter.Classorg.apache.lucene.analysis.hunspellApache Lucene
HyphenThis class represents a hyphen.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
HyphenatedWordsFilterWhen the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
HyphenatedWordsFilterFactoryFactory for HyphenatedWordsFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
HyphenationThis class represents a hyphenated word.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
HyphenationCompoundWordTokenFilterA TokenFilter that decomposes compound words found in many Germanic languages.Classorg.apache.lucene.analysis.compoundApache Lucene
HyphenationCompoundWordTokenFilterFactoryFactory for HyphenationCompoundWordTokenFilter.Classorg.apache.lucene.analysis.compoundApache Lucene
HyphenationTreeThis tree structure stores the hyphenation patterns in an efficient way for fast lookup.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
IdentityEncoderDoes nothing other than convert the char array to a byte array using the specified encoding.Classorg.apache.lucene.analysis.payloadsApache Lucene
IndicNormalizationFilterA TokenFilter that applies IndicNormalizer to normalize text in Indian Languages.Classorg.apache.lucene.analysis.inApache Lucene
IndicNormalizationFilterFactoryFactory for IndicNormalizationFilter.Classorg.apache.lucene.analysis.inApache Lucene
IndicNormalizerNormalizes the Unicode representation of text in Indian languages.Classorg.apache.lucene.analysis.inApache Lucene
IndonesianAnalyzerClassorg.apache.lucene.analysis.idApache Lucene
IndonesianStemFilterA TokenFilter that applies IndonesianStemmer to stem Indonesian words.Classorg.apache.lucene.analysis.idApache Lucene
IndonesianStemFilterFactoryFactory for IndonesianStemFilter.Classorg.apache.lucene.analysis.idApache Lucene
IndonesianStemmerStemmer for Indonesian.Classorg.apache.lucene.analysis.idApache Lucene
IntegerEncoderEncode a character array Integer as a BytesRef.Classorg.apache.lucene.analysis.payloadsApache Lucene
IrishAnalyzerClassorg.apache.lucene.analysis.gaApache Lucene
IrishLowerCaseFilterNormalises token text to lower case, handling t-prothesis and n-eclipsis (i.Classorg.apache.lucene.analysis.gaApache Lucene
IrishLowerCaseFilterFactoryFactory for IrishLowerCaseFilter.Classorg.apache.lucene.analysis.gaApache Lucene
ItalianAnalyzerAnalyzer for Italian.Classorg.apache.lucene.analysis.itApache Lucene
ItalianLightStemFilterA TokenFilter that applies ItalianLightStemmer to stem Italian To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.itApache Lucene
ItalianLightStemFilterFactoryFactory for ItalianLightStemFilter.Classorg.apache.lucene.analysis.itApache Lucene
ItalianLightStemmerLight Stemmer for Italian.Classorg.apache.lucene.analysis.itApache Lucene
KeepWordFilterA TokenFilter that only keeps tokens with text contained in the required words.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
KeepWordFilterFactoryFactory for KeepWordFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
KeywordAnalyzer"Tokenizes" the entire stream as a single token.Classorg.apache.lucene.analysis.coreApache Lucene
KeywordMarkerFilterMarks terms as keywords via the KeywordAttribute.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
KeywordMarkerFilterFactoryFactory for KeywordMarkerFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
KeywordRepeatFilterClassorg.apache.lucene.analysis.miscellaneousApache Lucene
KeywordRepeatFilterFactoryFactory for KeywordRepeatFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
KeywordTokenizerEmits the entire input as a single token.Classorg.apache.lucene.analysis.coreApache Lucene
KeywordTokenizerFactoryFactory for KeywordTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
KStemFilterA high-performance kstem filter for english.Classorg.apache.lucene.analysis.enApache Lucene
KStemFilterFactoryFactory for KStemFilter.Classorg.apache.lucene.analysis.enApache Lucene
KStemmerClassorg.apache.lucene.analysis.enApache Lucene
LatvianAnalyzerAnalyzer for Latvian.Classorg.apache.lucene.analysis.lvApache Lucene
LatvianStemFilterA TokenFilter that applies LatvianStemmer to stem Latvian To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.lvApache Lucene
LatvianStemFilterFactoryFactory for LatvianStemFilter.Classorg.apache.lucene.analysis.lvApache Lucene
LatvianStemmerLight stemmer for Latvian.Classorg.apache.lucene.analysis.lvApache Lucene
LengthFilterRemoves words that are too long or too short from the stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LengthFilterFactoryFactory for LengthFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LetterTokenizerA LetterTokenizer is a tokenizer that divides text at non-letters.Classorg.apache.lucene.analysis.coreApache Lucene
LetterTokenizerFactoryFactory for LetterTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
LimitTokenCountAnalyzerThis Analyzer limits the number of tokens while indexing.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenCountFilterThis TokenFilter limits the number of tokens while indexing.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenCountFilterFactoryFactory for LimitTokenCountFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenOffsetFilterLets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenOffsetFilterFactoryFactory for LimitTokenOffsetFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenPositionFilterThis TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LimitTokenPositionFilterFactoryFactory for LimitTokenPositionFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
LithuanianAnalyzerAnalyzer for Lithuanian.Classorg.apache.lucene.analysis.ltApache Lucene
LowerCaseFilterNormalizes token text to lower case.Classorg.apache.lucene.analysis.coreApache Lucene
LowerCaseFilterFactoryFactory for LowerCaseFilter.Classorg.apache.lucene.analysis.coreApache Lucene
LowerCaseTokenizerLowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together.Classorg.apache.lucene.analysis.coreApache Lucene
LowerCaseTokenizerFactoryFactory for LowerCaseTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
Lucene43CompoundWordTokenFilterBaseBase class for decomposition token filters using pre-4.Classorg.apache.lucene.analysis.compoundApache Lucene
Lucene43DictionaryCompoundWordTokenFilterClassorg.apache.lucene.analysis.compoundApache Lucene
Lucene43EdgeNGramTokenFilterTokenizes the given token into n-grams of given size(s), using pre-4.Classorg.apache.lucene.analysis.ngramApache Lucene
Lucene43EdgeNGramTokenizerOld version of EdgeNGramTokenizer which doesn't handle correctly supplementary characters.Classorg.apache.lucene.analysis.ngramApache Lucene
Lucene43EdgeNGramTokenizer .SideSpecifies which side of the input the n-gram should be generated fromGet the n-gram from the end of the inputClassorg.apache.lucene.analysis.ngram.Lucene43EdgeNGramTokenizerApache Lucene
Lucene43FilteringTokenFilterBackcompat FilteringTokenFilter for versions 4.Classorg.apache.lucene.analysis.utilApache Lucene
Lucene43HyphenationCompoundWordTokenFilterA TokenFilter that decomposes compound words found in many Germanic languages, using pre-4.Classorg.apache.lucene.analysis.compoundApache Lucene
Lucene43KeepWordFilterBackcompat for KeepWordFilter for versions 4.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
Lucene43LengthFilterBackcompat LengthFilter for versions 4.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
Lucene43NGramTokenFilterTokenizes the input into n-grams of the given size(s), matching Lucene 4.Classorg.apache.lucene.analysis.ngramApache Lucene
Lucene43NGramTokenizerOld broken version of NGramTokenizer.Classorg.apache.lucene.analysis.ngramApache Lucene
Lucene43StopFilterBackcompat StopFilter for versions 4.Classorg.apache.lucene.analysis.coreApache Lucene
Lucene43TrimFilterBackcompat TrimFilter for versions 4.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
Lucene43TypeTokenFilterBackcompat TypeTokenFilter for versions 4.Classorg.apache.lucene.analysis.coreApache Lucene
Lucene47WordDelimiterFilterClassorg.apache.lucene.analysis.miscellaneousApache Lucene
MappingCharFilterSimplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the characterClassorg.apache.lucene.analysis.charfilterApache Lucene
MappingCharFilterFactoryFactory for MappingCharFilter.Classorg.apache.lucene.analysis.charfilterApache Lucene
MultiTermAwareComponentAdd to any analysis factory component to allow returning an analysis component factory for use with partial terms in prefix queries,Interfaceorg.apache.lucene.analysis.utilApache Lucene
NGramFilterFactoryFactory for NGramTokenFilter.Classorg.apache.lucene.analysis.ngramApache Lucene
NGramTokenFilterTokenizes the input into n-grams of the given size(s).Classorg.apache.lucene.analysis.ngramApache Lucene
NGramTokenizerTokenizes the input into n-grams of the given size(s).Classorg.apache.lucene.analysis.ngramApache Lucene
NGramTokenizerFactoryFactory for NGramTokenizer.Classorg.apache.lucene.analysis.ngramApache Lucene
NormalizeCharMapHolds a map of String input to String output, to be used with MappingCharFilter.Classorg.apache.lucene.analysis.charfilterApache Lucene
NormalizeCharMap .BuilderBuilds a NormalizeCharMap.Classorg.apache.lucene.analysis.charfilter.NormalizeCharMapApache Lucene
NorwegianAnalyzerAnalyzer for Norwegian.Classorg.apache.lucene.analysis.noApache Lucene
NorwegianLightStemFilterA TokenFilter that applies NorwegianLightStemmer to stem Norwegian words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.noApache Lucene
NorwegianLightStemFilterFactoryFactory for NorwegianLightStemFilter.Classorg.apache.lucene.analysis.noApache Lucene
NorwegianLightStemmerLight Stemmer for Norwegian.Classorg.apache.lucene.analysis.noApache Lucene
NorwegianMinimalStemFilterA TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.noApache Lucene
NorwegianMinimalStemFilterFactoryFactory for NorwegianMinimalStemFilter.Classorg.apache.lucene.analysis.noApache Lucene
NorwegianMinimalStemmerClassorg.apache.lucene.analysis.noApache Lucene
NumericPayloadTokenFilterAssigns a payload to a token based on the PackedTokenAttributeImpl.Classorg.apache.lucene.analysis.payloadsApache Lucene
NumericPayloadTokenFilterFactoryFactory for NumericPayloadTokenFilter.Classorg.apache.lucene.analysis.payloadsApache Lucene
OpenStringBuilderA StringBuilder that allows one to access the array.Classorg.apache.lucene.analysis.utilApache Lucene
PathHierarchyTokenizerTokenizer for path-like hierarchies.Classorg.apache.lucene.analysis.pathApache Lucene
PathHierarchyTokenizerFactoryFactory for PathHierarchyTokenizer.Classorg.apache.lucene.analysis.pathApache Lucene
PatternCaptureGroupFilterFactoryFactory for PatternCaptureGroupTokenFilter.Classorg.apache.lucene.analysis.patternApache Lucene
PatternCaptureGroupTokenFilterCaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.Classorg.apache.lucene.analysis.patternApache Lucene
PatternConsumerThis interface is used to connect the XML pattern file parser to the This class has been taken from the Apache FOP project (http://xmlgraphics.Interfaceorg.apache.lucene.analysis.compound.hyphenationApache Lucene
PatternKeywordMarkerFilterMarks terms as keywords via the KeywordAttribute.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
PatternParserA SAX document handler to read and parse hyphenation patterns from an XML file. This class has been taken from the Apache FOP project (http://xmlgraphics.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
PatternReplaceCharFilterCharFilter that uses a regular expression for the target of replace string.Classorg.apache.lucene.analysis.patternApache Lucene
PatternReplaceCharFilterFactoryFactory for PatternReplaceCharFilter.Classorg.apache.lucene.analysis.patternApache Lucene
PatternReplaceFilterA TokenFilter which applies a Pattern to each token in the stream, replacing match occurrences with the specified replacement string.Classorg.apache.lucene.analysis.patternApache Lucene
PatternReplaceFilterFactoryFactory for PatternReplaceFilter.Classorg.apache.lucene.analysis.patternApache Lucene
PatternTokenizerThis tokenizer uses regex pattern matching to construct distinct tokens for the input stream.Classorg.apache.lucene.analysis.patternApache Lucene
PatternTokenizerFactoryFactory for PatternTokenizer.Classorg.apache.lucene.analysis.patternApache Lucene
PayloadEncoderMainly for use with the DelimitedPayloadTokenFilter, converts char buffers to NOTE: This interface is subject to changeInterfaceorg.apache.lucene.analysis.payloadsApache Lucene
PayloadHelperUtility methods for encoding payloads.Classorg.apache.lucene.analysis.payloadsApache Lucene
PerFieldAnalyzerWrapperThis analyzer is used to facilitate scenarios where different fields require different analysis techniques.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
PersianAnalyzerAnalyzer for Persian.Classorg.apache.lucene.analysis.faApache Lucene
PersianCharFilterClassorg.apache.lucene.analysis.faApache Lucene
PersianCharFilterFactoryFactory for PersianCharFilter.Classorg.apache.lucene.analysis.faApache Lucene
PersianNormalizationFilterClassorg.apache.lucene.analysis.faApache Lucene
PersianNormalizationFilterFactoryFactory for PersianNormalizationFilter.Classorg.apache.lucene.analysis.faApache Lucene
PersianNormalizerNormalizer for Persian.Classorg.apache.lucene.analysis.faApache Lucene
PorterStemFilterTransforms the token stream as per the Porter stemming algorithm.Classorg.apache.lucene.analysis.enApache Lucene
PorterStemFilterFactoryFactory for PorterStemFilter.Classorg.apache.lucene.analysis.enApache Lucene
PortugueseAnalyzerAnalyzer for Portuguese.Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseLightStemFilterA TokenFilter that applies PortugueseLightStemmer to stem Portuguese words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.ptApache Lucene
PortugueseLightStemFilterFactoryFactory for PortugueseLightStemFilter.Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseLightStemmerLight Stemmer for Portuguese. This stemmer implements the "UniNE" algorithm in:Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseMinimalStemFilterA TokenFilter that applies PortugueseMinimalStemmer to stem Portuguese words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.ptApache Lucene
PortugueseMinimalStemFilterFactoryFactory for PortugueseMinimalStemFilter.Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseMinimalStemmerMinimal Stemmer for Portuguese. This follows the "RSLP-S" algorithm presented in:Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseStemFilterA TokenFilter that applies PortugueseStemmer to stem Portuguese words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.ptApache Lucene
PortugueseStemFilterFactoryFactory for PortugueseStemFilter.Classorg.apache.lucene.analysis.ptApache Lucene
PortugueseStemmerPortuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm.Classorg.apache.lucene.analysis.ptApache Lucene
PrefixAndSuffixAwareTokenFilterLinks two PrefixAwareTokenFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
PrefixAwareTokenFilterJoins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
QueryAutoStopWordAnalyzerAn Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries.Classorg.apache.lucene.analysis.queryApache Lucene
RemoveDuplicatesTokenFilterA TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
RemoveDuplicatesTokenFilterFactoryFactory for RemoveDuplicatesTokenFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
ResourceLoaderAbstraction for loading resources (streams, files, and classes).Interfaceorg.apache.lucene.analysis.utilApache Lucene
ResourceLoaderAwareInterface for a component that needs to be initialized by an implementation of ResourceLoader.Interfaceorg.apache.lucene.analysis.utilApache Lucene
ReversePathHierarchyTokenizerTokenizer for domain-like hierarchies.Classorg.apache.lucene.analysis.pathApache Lucene
ReverseStringFilterReverse token string, for example "country" => "yrtnuoc".Classorg.apache.lucene.analysis.reverseApache Lucene
ReverseStringFilterFactoryFactory for ReverseStringFilter.Classorg.apache.lucene.analysis.reverseApache Lucene
RollingCharBufferActs like a forever growing char[] as you read characters into it from the provided reader, butClassorg.apache.lucene.analysis.utilApache Lucene
RomanianAnalyzerAnalyzer for Romanian.Classorg.apache.lucene.analysis.roApache Lucene
RSLPStemmerBaseBase class for stemmers that use a set of RSLP-like stemming steps.Classorg.apache.lucene.analysis.ptApache Lucene
RSLPStemmerBase .RuleA basic rule, with no exceptions.Classorg.apache.lucene.analysis.pt.RSLPStemmerBaseApache Lucene
RSLPStemmerBase .RuleWithSetExceptionsA rule with a set of whole-word exceptions.Classorg.apache.lucene.analysis.pt.RSLPStemmerBaseApache Lucene
RSLPStemmerBase .RuleWithSuffixExceptionsA rule with a set of exceptional suffixes.Classorg.apache.lucene.analysis.pt.RSLPStemmerBaseApache Lucene
RSLPStemmerBase .StepA step containing a list of rules.Classorg.apache.lucene.analysis.pt.RSLPStemmerBaseApache Lucene
RussianAnalyzerAnalyzer for Russian language.Classorg.apache.lucene.analysis.ruApache Lucene
RussianLightStemFilterA TokenFilter that applies RussianLightStemmer to stem Russian words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.ruApache Lucene
RussianLightStemFilterFactoryFactory for RussianLightStemFilter.Classorg.apache.lucene.analysis.ruApache Lucene
RussianLightStemmerLight Stemmer for Russian.Classorg.apache.lucene.analysis.ruApache Lucene
ScandinavianFoldingFilterClassorg.apache.lucene.analysis.miscellaneousApache Lucene
ScandinavianFoldingFilterFactoryFactory for ScandinavianFoldingFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
ScandinavianNormalizationFilterClassorg.apache.lucene.analysis.miscellaneousApache Lucene
ScandinavianNormalizationFilterFactoryFactory for ScandinavianNormalizationFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
SegmentingTokenizerBaseBreaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words.Classorg.apache.lucene.analysis.utilApache Lucene
SerbianNormalizationFilterNormalizes Serbian Cyrillic and Latin characters to "bald" Latin.Classorg.apache.lucene.analysis.srApache Lucene
SerbianNormalizationFilterFactoryFactory for SerbianNormalizationFilter.Classorg.apache.lucene.analysis.srApache Lucene
SerbianNormalizationRegularFilterNormalizes Serbian Cyrillic to Latin.Classorg.apache.lucene.analysis.srApache Lucene
SetKeywordMarkerFilterMarks terms as keywords via the KeywordAttribute.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
ShingleAnalyzerWrapperA ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer.Classorg.apache.lucene.analysis.shingleApache Lucene
ShingleFilterA ShingleFilter constructs shingles (token n-grams) from a token stream.Classorg.apache.lucene.analysis.shingleApache Lucene
ShingleFilterFactoryFactory for ShingleFilter.Classorg.apache.lucene.analysis.shingleApache Lucene
SimpleAnalyzerAn Analyzer that filters LetterTokenizer with LowerCaseFilterClassorg.apache.lucene.analysis.coreApache Lucene
SingleTokenTokenStreamA TokenStream containing a single token.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
SnowballFilterA filter that stems words using a Snowball-generated stemmer.Classorg.apache.lucene.analysis.snowballApache Lucene
SnowballPorterFilterFactoryFactory for SnowballFilter, with configurable language. Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection.Classorg.apache.lucene.analysis.snowballApache Lucene
SolrSynonymParserParser for the Solr synonyms format.Classorg.apache.lucene.analysis.synonymApache Lucene
SoraniAnalyzerAnalyzer for Sorani Kurdish.Classorg.apache.lucene.analysis.ckbApache Lucene
SoraniNormalizationFilterClassorg.apache.lucene.analysis.ckbApache Lucene
SoraniNormalizationFilterFactoryFactory for SoraniNormalizationFilter.Classorg.apache.lucene.analysis.ckbApache Lucene
SoraniNormalizerNormalizes the Unicode representation of Sorani text.Classorg.apache.lucene.analysis.ckbApache Lucene
SoraniStemFilterA TokenFilter that applies SoraniStemmer to stem Sorani words.Classorg.apache.lucene.analysis.ckbApache Lucene
SoraniStemFilterFactoryFactory for SoraniStemFilter.Classorg.apache.lucene.analysis.ckbApache Lucene
SoraniStemmerClassorg.apache.lucene.analysis.ckbApache Lucene
SpanishAnalyzerAnalyzer for Spanish.Classorg.apache.lucene.analysis.esApache Lucene
SpanishLightStemFilterA TokenFilter that applies SpanishLightStemmer to stem Spanish words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.esApache Lucene
SpanishLightStemFilterFactoryFactory for SpanishLightStemFilter.Classorg.apache.lucene.analysis.esApache Lucene
SpanishLightStemmerLight Stemmer for Spanish. This stemmer implements the algorithm described in:Classorg.apache.lucene.analysis.esApache Lucene
StandardAnalyzerFilters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words.Classorg.apache.lucene.analysis.standardApache Lucene
StandardFilterNormalizes tokens extracted with StandardTokenizer.Classorg.apache.lucene.analysis.standardApache Lucene
StandardFilterFactoryFactory for StandardFilter.Classorg.apache.lucene.analysis.standardApache Lucene
StandardTokenizerA grammar-based tokenizer constructed with JFlex.Classorg.apache.lucene.analysis.standardApache Lucene
StandardTokenizer40Backcompat standard tokenizer for Lucene 4.Classorg.apache.lucene.analysis.standard.std40Apache Lucene
StandardTokenizerFactoryFactory for StandardTokenizer.Classorg.apache.lucene.analysis.standardApache Lucene
StandardTokenizerImplThis class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Classorg.apache.lucene.analysis.standardApache Lucene
StandardTokenizerImpl40Classorg.apache.lucene.analysis.standard.std40Apache Lucene
StemmerOverrideFilterProvides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
StemmerOverrideFilter .BuilderClassorg.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterApache Lucene
StemmerOverrideFilter .StemmerOverrideMapA read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilterClassorg.apache.lucene.analysis.miscellaneous.StemmerOverrideFilterApache Lucene
StemmerOverrideFilterFactoryFactory for StemmerOverrideFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
StemmerUtilSome commonly-used stemming functions. NOTE: This API is for internal purposes only and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.utilApache Lucene
StopAnalyzerFilters LetterTokenizer with LowerCaseFilter and StopFilter.Classorg.apache.lucene.analysis.coreApache Lucene
StopFilterRemoves stop words from a token stream.Classorg.apache.lucene.analysis.coreApache Lucene
StopFilterFactoryFactory for StopFilter.Classorg.apache.lucene.analysis.coreApache Lucene
StopwordAnalyzerBaseBase class for Analyzers that need to make use of stopword sets.Classorg.apache.lucene.analysis.utilApache Lucene
SwedishAnalyzerAnalyzer for Swedish.Classorg.apache.lucene.analysis.svApache Lucene
SwedishLightStemFilterA TokenFilter that applies SwedishLightStemmer to stem Swedish words. To prevent terms from being stemmed use an instance ofClassorg.apache.lucene.analysis.svApache Lucene
SwedishLightStemFilterFactoryFactory for SwedishLightStemFilter.Classorg.apache.lucene.analysis.svApache Lucene
SwedishLightStemmerLight Stemmer for Swedish.Classorg.apache.lucene.analysis.svApache Lucene
SynonymFilterMatches single or multi word synonyms in a token stream.Classorg.apache.lucene.analysis.synonymApache Lucene
SynonymFilterFactoryFactory for SynonymFilter.Classorg.apache.lucene.analysis.synonymApache Lucene
SynonymMapA map of synonyms, keys and values are phrases.Classorg.apache.lucene.analysis.synonymApache Lucene
SynonymMap .BuilderBuilds an FSTSynonymMap.Classorg.apache.lucene.analysis.synonym.SynonymMapApache Lucene
SynonymMap .ParserAbstraction for parsing synonym files.Classorg.apache.lucene.analysis.synonym.SynonymMapApache Lucene
TeeSinkTokenFilterThis TokenFilter provides the ability to set aside attribute states that have already been analyzed.Classorg.apache.lucene.analysis.sinksApache Lucene
TeeSinkTokenFilter .SinkTokenStreamTokenStream output from a tee.Classorg.apache.lucene.analysis.sinks.TeeSinkTokenFilterApache Lucene
TernaryTreeTernary Search Tree.Classorg.apache.lucene.analysis.compound.hyphenationApache Lucene
ThaiAnalyzerAnalyzer for Thai language.Classorg.apache.lucene.analysis.thApache Lucene
ThaiTokenizerTokenizer that uses BreakIterator to tokenize Thai text.Classorg.apache.lucene.analysis.thApache Lucene
ThaiTokenizerFactoryFactory for ThaiTokenizer.Classorg.apache.lucene.analysis.thApache Lucene
ThaiWordFilterTokenFilter that uses BreakIterator to break each Token that is Thai into separate Token(s) for each Thai word.Classorg.apache.lucene.analysis.thApache Lucene
ThaiWordFilterFactoryFactory for ThaiWordFilter.Classorg.apache.lucene.analysis.thApache Lucene
TokenFilterFactoryClassorg.apache.lucene.analysis.utilApache Lucene
TokenizerFactoryClassorg.apache.lucene.analysis.utilApache Lucene
TokenOffsetPayloadTokenFilterAdds the OffsetAttribute.Classorg.apache.lucene.analysis.payloadsApache Lucene
TokenOffsetPayloadTokenFilterFactoryFactory for TokenOffsetPayloadTokenFilter.Classorg.apache.lucene.analysis.payloadsApache Lucene
TrimFilterTrims leading and trailing whitespace from Tokens in the stream.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
TrimFilterFactoryFactory for TrimFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
TruncateTokenFilterA token filter for truncating the terms into a specific length.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
TruncateTokenFilterFactoryFactory for TruncateTokenFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
TurkishAnalyzerAnalyzer for Turkish.Classorg.apache.lucene.analysis.trApache Lucene
TurkishLowerCaseFilterNormalizes Turkish token text to lower case.Classorg.apache.lucene.analysis.trApache Lucene
TurkishLowerCaseFilterFactoryFactory for TurkishLowerCaseFilter.Classorg.apache.lucene.analysis.trApache Lucene
TypeAsPayloadTokenFilterMakes the PackedTokenAttributeImpl.Classorg.apache.lucene.analysis.payloadsApache Lucene
TypeAsPayloadTokenFilterFactoryFactory for TypeAsPayloadTokenFilter.Classorg.apache.lucene.analysis.payloadsApache Lucene
TypeTokenFilterRemoves tokens whose types appear in a set of blocked types from a token stream.Classorg.apache.lucene.analysis.coreApache Lucene
TypeTokenFilterFactoryFactory class for TypeTokenFilter.Classorg.apache.lucene.analysis.coreApache Lucene
UAX29URLEmailAnalyzerFilters UAX29URLEmailTokenizer with StandardFilter,Classorg.apache.lucene.analysis.standardApache Lucene
UAX29URLEmailTokenizerThis class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Classorg.apache.lucene.analysis.standardApache Lucene
UAX29URLEmailTokenizer40Backcompat uax29 tokenizer for Lucene 4.Classorg.apache.lucene.analysis.standard.std40Apache Lucene
UAX29URLEmailTokenizerFactoryFactory for UAX29URLEmailTokenizer.Classorg.apache.lucene.analysis.standardApache Lucene
UAX29URLEmailTokenizerImplThis class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in Classorg.apache.lucene.analysis.standardApache Lucene
UAX29URLEmailTokenizerImpl40Classorg.apache.lucene.analysis.standard.std40Apache Lucene
UnicodePropsThis file contains unicode properties used by various CharTokenizers.Classorg.apache.lucene.analysis.utilApache Lucene
UnicodeWhitespaceAnalyzerAn Analyzer that uses UnicodeWhitespaceTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
UnicodeWhitespaceTokenizerA UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace.Classorg.apache.lucene.analysis.coreApache Lucene
UpperCaseFilterNormalizes token text to UPPER CASE.Classorg.apache.lucene.analysis.coreApache Lucene
UpperCaseFilterFactoryFactory for UpperCaseFilter.Classorg.apache.lucene.analysis.coreApache Lucene
WhitespaceAnalyzerAn Analyzer that uses WhitespaceTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
WhitespaceTokenizerA tokenizer that divides text at whitespace characters as defined by Character.Classorg.apache.lucene.analysis.coreApache Lucene
WhitespaceTokenizerFactoryFactory for WhitespaceTokenizer.Classorg.apache.lucene.analysis.coreApache Lucene
WikipediaTokenizerExtension of StandardTokenizer that is aware of Wikipedia syntax.Classorg.apache.lucene.analysis.wikipediaApache Lucene
WikipediaTokenizerFactoryFactory for WikipediaTokenizer.Classorg.apache.lucene.analysis.wikipediaApache Lucene
WordDelimiterFilterSplits words into subwords and performs optional transformations on subword groups.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
WordDelimiterFilterFactoryFactory for WordDelimiterFilter.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
WordDelimiterIteratorA BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.Classorg.apache.lucene.analysis.miscellaneousApache Lucene
WordlistLoaderLoader for text files that represent a list of stopwords.Classorg.apache.lucene.analysis.utilApache Lucene
WordnetSynonymParserParser for WordNet Prolog format. See http://wordnet.Classorg.apache.lucene.analysis.synonymApache Lucene
DefaultICUTokenizerConfigDefault ICUTokenizerConfig that is generally applicable. Generally tokenizes Unicode text according to UAX#29 Classorg.apache.lucene.analysis.icu.segmentationApache Lucene
ICUFoldingFilterA TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.Classorg.apache.lucene.analysis.icuApache Lucene
ICUFoldingFilterFactoryFactory for ICUFoldingFilter.Classorg.apache.lucene.analysis.icuApache Lucene
ICUNormalizer2CharFilterNormalize token text with ICU's Normalizer2.Classorg.apache.lucene.analysis.icuApache Lucene
ICUNormalizer2CharFilterFactoryFactory for ICUNormalizer2CharFilter. Supports the following attributes:Classorg.apache.lucene.analysis.icuApache Lucene
ICUNormalizer2FilterNormalize token text with ICU's Normalizer2. With this filter, you can normalize text in the following ways:Classorg.apache.lucene.analysis.icuApache Lucene
ICUNormalizer2FilterFactoryFactory for ICUNormalizer2Filter. Supports the following attributes:Classorg.apache.lucene.analysis.icuApache Lucene
ICUTokenizerBreaks text into words according to UAX #29: Unicode Text Segmentation (http://www.Classorg.apache.lucene.analysis.icu.segmentationApache Lucene
ICUTokenizerConfigClass that allows for tailored Unicode Text Segmentation on a per-writing system basis.Classorg.apache.lucene.analysis.icu.segmentationApache Lucene
ICUTokenizerFactoryFactory for ICUTokenizer.Classorg.apache.lucene.analysis.icu.segmentationApache Lucene
ICUTransformFilterA TokenFilter that transforms text with ICU.Classorg.apache.lucene.analysis.icuApache Lucene
ICUTransformFilterFactoryFactory for ICUTransformFilter.Classorg.apache.lucene.analysis.icuApache Lucene
ScriptAttributeThis attribute stores the UTR #24 script value for a token of text.Interfaceorg.apache.lucene.analysis.icu.tokenattributesApache Lucene
ScriptAttributeImplWARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.icu.tokenattributesApache Lucene
BaseFormAttributeAttribute for Token.Interfaceorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
BaseFormAttributeImplAttribute for Token.Classorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
BinaryDictionaryBase class for a binary-encoded in-memory dictionary.Classorg.apache.lucene.analysis.ja.dictApache Lucene
CharacterDefinitionCharacter category data.Classorg.apache.lucene.analysis.ja.dictApache Lucene
ConnectionCostsClassorg.apache.lucene.analysis.ja.dictApache Lucene
CSVUtilClassorg.apache.lucene.analysis.ja.utilApache Lucene
DictionaryInterfaceorg.apache.lucene.analysis.ja.dictApache Lucene
GraphvizFormatterOutputs the dot (graphviz) string for the viterbi lattice.Classorg.apache.lucene.analysis.jaApache Lucene
InflectionAttributeAttribute for Kuromoji inflection data.Interfaceorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
InflectionAttributeImplAttribute for Kuromoji inflection data.Classorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
JapaneseAnalyzerAnalyzer for Japanese that uses morphological analysis.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseBaseFormFilterReplaces term text with the BaseFormAttribute.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseBaseFormFilterFactoryFactory for JapaneseBaseFormFilter.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseIterationMarkCharFilterNormalizes Japanese horizontal iteration marks (odoriji) to their expanded form.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseIterationMarkCharFilterFactoryFactory for JapaneseIterationMarkCharFilter.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseKatakanaStemFilterA TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseKatakanaStemFilterFactoryFactory for JapaneseKatakanaStemFilter.Classorg.apache.lucene.analysis.jaApache Lucene
JapanesePartOfSpeechStopFilterRemoves tokens that match a set of part-of-speech tags.Classorg.apache.lucene.analysis.jaApache Lucene
JapanesePartOfSpeechStopFilterFactoryFactory for JapanesePartOfSpeechStopFilter.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseReadingFormFilterA TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseReadingFormFilterFactoryFactory for JapaneseReadingFormFilter.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseTokenizerTokenizer for Japanese that uses morphological analysis.Classorg.apache.lucene.analysis.jaApache Lucene
JapaneseTokenizer .ModeTokenization mode: this determines how the tokenizer handles compound and unknown words.Classorg.apache.lucene.analysis.ja.JapaneseTokenizerApache Lucene
JapaneseTokenizer .TypeToken type reflecting the original source of this token. Known words from the system dictionary.Classorg.apache.lucene.analysis.ja.JapaneseTokenizerApache Lucene
JapaneseTokenizerFactoryFactory for JapaneseTokenizer.Classorg.apache.lucene.analysis.jaApache Lucene
Lucene43JapanesePartOfSpeechStopFilterBackcompat JapanesePartOfSpeechStopFilter for versions 4.Classorg.apache.lucene.analysis.jaApache Lucene
PartOfSpeechAttributeAttribute for Token.Interfaceorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
PartOfSpeechAttributeImplAttribute for Token.Classorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
ReadingAttributeAttribute for Kuromoji reading data. Note: in some cases this value may not be applicable,Interfaceorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
ReadingAttributeImplClassorg.apache.lucene.analysis.ja.tokenattributesApache Lucene
TokenAnalyzed token with morphological data from its dictionary.Classorg.apache.lucene.analysis.jaApache Lucene
TokenInfoDictionaryBinary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.Classorg.apache.lucene.analysis.ja.dictApache Lucene
TokenInfoFSTThin wrapper around an FST with root-arc caching for Japanese.Classorg.apache.lucene.analysis.ja.dictApache Lucene
ToStringUtilUtility class for English translations of morphological data, used only for debugging.Classorg.apache.lucene.analysis.ja.utilApache Lucene
UnknownDictionaryDictionary for unknown-word handling.Classorg.apache.lucene.analysis.ja.dictApache Lucene
UserDictionaryClass for building a User Dictionary.Classorg.apache.lucene.analysis.ja.dictApache Lucene
MorfologikAnalyzerAnalyzer using Morfologik library.Classorg.apache.lucene.analysis.morfologikApache Lucene
MorfologikFilterTokenFilter using Morfologik library to transform input tokens into lemma and morphosyntactic (POS) tokens.Classorg.apache.lucene.analysis.morfologikApache Lucene
MorfologikFilterFactoryFilter factory for MorfologikFilter.Classorg.apache.lucene.analysis.morfologikApache Lucene
MorphosyntacticTagsAttributeMorfologik provides morphosyntactic annotations for surface forms.Interfaceorg.apache.lucene.analysis.morfologikApache Lucene
MorphosyntacticTagsAttributeImplMorphosyntactic annotations for surface forms.Classorg.apache.lucene.analysis.morfologikApache Lucene
BeiderMorseFilterTokenFilter for Beider-Morse phonetic encoding.Classorg.apache.lucene.analysis.phoneticApache Lucene
BeiderMorseFilterFactoryFactory for BeiderMorseFilter.Classorg.apache.lucene.analysis.phoneticApache Lucene
DaitchMokotoffSoundexFilterClassorg.apache.lucene.analysis.phoneticApache Lucene
DaitchMokotoffSoundexFilterFactoryFactory for DaitchMokotoffSoundexFilter.Classorg.apache.lucene.analysis.phoneticApache Lucene
DoubleMetaphoneFilterClassorg.apache.lucene.analysis.phoneticApache Lucene
DoubleMetaphoneFilterFactoryFactory for DoubleMetaphoneFilter.Classorg.apache.lucene.analysis.phoneticApache Lucene
PhoneticFilterCreate tokens for phonetic matches.Classorg.apache.lucene.analysis.phoneticApache Lucene
PhoneticFilterFactoryFactory for PhoneticFilter.Classorg.apache.lucene.analysis.phoneticApache Lucene
AnalyzerProfileManages analysis data configuration for SmartChineseAnalyzer. SmartChineseAnalyzer has a built-in dictionary and stopword list out-of-box.Classorg.apache.lucene.analysis.cn.smartApache Lucene
CharTypeInternal SmartChineseAnalyzer character type constants.Classorg.apache.lucene.analysis.cn.smartApache Lucene
HHMMSegmenterFinds the optimal segmentation of a sentence into Chinese words. WARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.cn.smart.hhmmApache Lucene
HMMChineseTokenizerTokenizer for Chinese or mixed Chinese-English text.Classorg.apache.lucene.analysis.cn.smartApache Lucene
HMMChineseTokenizerFactoryFactory for HMMChineseTokenizer. Note: this class will currently emit tokens for punctuation.Classorg.apache.lucene.analysis.cn.smartApache Lucene
SegTokenSmartChineseAnalyzer internal token. WARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.cn.smart.hhmmApache Lucene
SegTokenFilter Filters a SegToken by converting full-width Latin to half-width, then lowercasing Latin.Classorg.apache.lucene.analysis.cn.smart.hhmmApache Lucene
SentenceTokenizerTokenizes input text into sentences.Classorg.apache.lucene.analysis.cn.smartApache Lucene
SmartChineseAnalyzer SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text.Classorg.apache.lucene.analysis.cn.smartApache Lucene
SmartChineseSentenceTokenizerFactoryFactory for the SmartChineseAnalyzer SentenceTokenizer. WARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.cn.smartApache Lucene
SmartChineseWordTokenFilterFactoryFactory for the SmartChineseAnalyzer WordTokenFilter. Note: this class will currently emit tokens for punctuation.Classorg.apache.lucene.analysis.cn.smartApache Lucene
UtilitySmartChineseAnalyzer utility constants and methods. WARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.cn.smartApache Lucene
WordTokenFilterA TokenFilter that breaks sentences into words.Classorg.apache.lucene.analysis.cn.smartApache Lucene
WordTypeInternal SmartChineseAnalyzer token type constantsWARNING: This API is experimental and might change in incompatible ways in the next release.Classorg.apache.lucene.analysis.cn.smartApache Lucene
| PolishAnalyzer | Analyzer for Polish. | Class | org.apache.lucene.analysis.pl | Apache Lucene |
| StempelFilter | Transforms the token stream as per the stemming algorithm. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
| StempelPolishStemFilterFactory | Factory for StempelFilter using a Polish stemming table. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
| StempelStemmer | Stemmer class is a convenient facade for other stemmer-related classes. | Class | org.apache.lucene.analysis.stempel | Apache Lucene |
| AEProvider | | Interface | org.apache.lucene.analysis.uima.ae | Apache Lucene |
| AEProviderFactory | | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
| BaseUIMATokenizer | Abstract base implementation of a Tokenizer which is able to analyze the given input with a UIMA AnalysisEngine. | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| BasicAEProvider | Basic AEProvider which just instantiates a UIMA AnalysisEngine with no additional metadata, parameters or resources. | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
| OverridingParamsAEProvider | AEProvider implementation that creates an Aggregate AE from the given path, also injecting runtime parameters defined in the solrconfig. | Class | org.apache.lucene.analysis.uima.ae | Apache Lucene |
| UIMAAnnotationsTokenizer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| UIMAAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| UIMABaseAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| UIMATypeAwareAnalyzer | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| UIMATypeAwareAnnotationsTokenizer | A Tokenizer which creates tokens from UIMA Annotations, also filling their TypeAttribute according to the specified FeaturePaths. | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| UIMATypeAwareAnnotationsTokenizerFactory | | Class | org.apache.lucene.analysis.uima | Apache Lucene |
| BaseTokenStreamTestCase | Base class for all Lucene unit tests that use TokenStreams. | Class | org.apache.lucene.analysis | Apache Lucene |
| BaseTokenStreamTestCase.CheckClearAttributesAttribute | Attribute that records if it was cleared or not. | Interface | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
| BaseTokenStreamTestCase.CheckClearAttributesAttributeImpl | Attribute that records if it was cleared or not. | Class | org.apache.lucene.analysis.BaseTokenStreamTestCase | Apache Lucene |
| CannedBinaryTokenStream | | Class | org.apache.lucene.analysis | Apache Lucene |
| CannedBinaryTokenStream.BinaryToken | Represents a binary token. | Class | org.apache.lucene.analysis.CannedBinaryTokenStream | Apache Lucene |
| CannedTokenStream | TokenStream from a canned list of Tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
| CollationTestBase | Base test class for testing Unicode collation. | Class | org.apache.lucene.analysis | Apache Lucene |
| CrankyTokenFilter | Throws IOException from random TokenStream methods. | Class | org.apache.lucene.analysis | Apache Lucene |
| LookaheadTokenFilter | An abstract TokenFilter to make it easier to build graph token filters requiring some lookahead. | Class | org.apache.lucene.analysis | Apache Lucene |
| LookaheadTokenFilter.Position | Holds all state for a single position; subclass this to record other state at each position. | Class | org.apache.lucene.analysis.LookaheadTokenFilter | Apache Lucene |
| MockAnalyzer | Analyzer for testing. This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers | Class | org.apache.lucene.analysis | Apache Lucene |
| MockBytesAnalyzer | Analyzer for testing that encodes terms as UTF-16 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockCharFilter | The purpose of this CharFilter is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockFixedLengthPayloadFilter | TokenFilter that adds random fixed-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockGraphTokenFilter | Randomly inserts overlapped (posInc=0) tokens with posLength sometimes > 1. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockHoleInjectingTokenFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
| MockPayloadAnalyzer | Wraps a whitespace tokenizer with a filter that sets the first token, and odd tokens to posinc=1, and all others | Class | org.apache.lucene.analysis | Apache Lucene |
| MockRandomLookaheadTokenFilter | Uses LookaheadTokenFilter to randomly peek at future tokens. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockReaderWrapper | Wraps a Reader, and can throw random or fixed exceptions, and spoon-feed read chars. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockTokenFilter | A TokenFilter for testing that removes terms accepted by a DFA. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockTokenizer | Tokenizer for testing. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockUTF16TermAttributeImpl | Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes. | Class | org.apache.lucene.analysis | Apache Lucene |
| MockVariableLengthPayloadFilter | TokenFilter that adds random variable-length payloads. | Class | org.apache.lucene.analysis | Apache Lucene |
| SimplePayloadFilter | | Class | org.apache.lucene.analysis | Apache Lucene |
| TokenStreamToDot | Consumes a TokenStream and outputs the dot (graphviz) string (graph). | Class | org.apache.lucene.analysis | Apache Lucene |
| ValidatingTokenFilter | A TokenFilter that checks consistency of the tokens (e.g., offsets are consistent with one another). | Class | org.apache.lucene.analysis | Apache Lucene |
| VocabularyAssert | | Class | org.apache.lucene.analysis | Apache Lucene |
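
The SegTokenFilter entry above describes a two-step normalization: full-width Latin is converted to half-width, then Latin letters are lowercased. The following is a minimal, self-contained sketch of that idea in plain Java. It is not the Lucene source: the class name `WidthFoldSketch` and the method `normalize` are hypothetical, and the sketch assumes that "full-width Latin" means the Unicode full-width ASCII variants U+FF01 to U+FF5E, which sit at a fixed offset of 0xFEE0 from U+0021 to U+007E. The ideographic space U+3000 is deliberately left untouched.

```java
// Hypothetical illustration only; this is not Lucene's SegTokenFilter.
final class WidthFoldSketch {
    // Assumption: full-width ASCII variants occupy U+FF01..U+FF5E and map to
    // U+0021..U+007E by subtracting a fixed offset of 0xFEE0.
    private static final char FULL_WIDTH_START = '\uFF01';
    private static final char FULL_WIDTH_END = '\uFF5E';
    private static final int FULL_TO_HALF_OFFSET = 0xFEE0;

    /** Converts full-width ASCII variants to half-width, then lowercases A-Z. */
    static String normalize(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            // Step 1: fold full-width forms down to their ASCII counterparts.
            if (c >= FULL_WIDTH_START && c <= FULL_WIDTH_END) {
                c = (char) (c - FULL_TO_HALF_OFFSET);
            }
            // Step 2: lowercase basic Latin capitals; other characters pass through.
            if (c >= 'A' && c <= 'Z') {
                c = (char) (c + ('a' - 'A'));
            }
            out.append(c);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Full-width "Lucene", a space, two Chinese characters, a space, then "ABC".
        String input = "\uFF2C\uFF55\uFF43\uFF45\uFF4E\uFF45 \u4E2D\u6587 ABC";
        System.out.println(normalize(input));
    }
}
```

Doing the width fold before the case fold means a single ASCII range check handles both half-width and formerly full-width capitals; Chinese characters fall outside both ranges and pass through unchanged.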