#Org.apache.lucene.analysis Classes and Interfaces

_{#Org.apache.lucene.analysis Classes and Interfaces - 537 results found.}

Name	Description	Type	Package	Framework
Analyzer	An Analyzer builds TokenStreams, which analyze text.	Class	org.apache.lucene.analysis	Apache Lucene
Analyzer .ReuseStrategy	Strategy defining how TokenStreamComponents are reused per call to Analyzer.	Class	org.apache.lucene.analysis.Analyzer	Apache Lucene
Analyzer .TokenStreamComponents	This class encapsulates the outer components of a token stream.	Class	org.apache.lucene.analysis.Analyzer	Apache Lucene
AnalyzerWrapper	Extension to Analyzer suitable for Analyzers which wrap getWrappedAnalyzer(String) allows the Analyzer	Class	org.apache.lucene.analysis	Apache Lucene
BytesTermAttribute	This attribute can be used if you have the raw term bytes to be indexed.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
BytesTermAttributeImpl	NOTE: This API is for internal purposes only and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
CachingTokenFilter	This class can be used if the token attributes of a TokenStream are intended to be consumed more than once.	Class	org.apache.lucene.analysis	Apache Lucene
CharFilter	Subclasses of CharFilter can be chained to filter a Reader They can be used as Reader with additional offset	Class	org.apache.lucene.analysis	Apache Lucene
CharTermAttribute	The term text of a Token.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
CharTermAttributeImpl	Default implementation of CharTermAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
DelegatingAnalyzerWrapper	An analyzer wrapper, that doesn't allow to wrap components or readers.	Class	org.apache.lucene.analysis	Apache Lucene
FlagsAttribute	This attribute can be used to pass different flags down the Tokenizer chain, e.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
FlagsAttributeImpl	Default implementation of FlagsAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
KeywordAttribute	This attribute can be used to mark a token as a keyword.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
KeywordAttributeImpl	Default implementation of KeywordAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
NumericTokenStream	Expert: This class provides a TokenStream for indexing numeric values that can be used by NumericRangeQuery.	Class	org.apache.lucene.analysis	Apache Lucene
NumericTokenStream .NumericTermAttribute	Expert: Use this attribute to get the details of the currently generated token.	Interface	org.apache.lucene.analysis.NumericTokenStream	Apache Lucene
NumericTokenStream .NumericTermAttributeImpl		Class	org.apache.lucene.analysis.NumericTokenStream	Apache Lucene
OffsetAttribute	The start and end character offset of a Token.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
OffsetAttributeImpl	Default implementation of OffsetAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PackedTokenAttributeImpl	Default implementation of the common attributes used by Lucene:CharTermAttribute PositionIncrementAttribute	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PayloadAttribute	The payload of a Token.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PayloadAttributeImpl	Default implementation of PayloadAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PositionIncrementAttribute	Determines the position of this token relative to the previous Token in a TokenStream, used in phrase	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PositionIncrementAttributeImpl	Default implementation of PositionIncrementAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PositionLengthAttribute	Determines how many positions this token spans.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
PositionLengthAttributeImpl	Default implementation of PositionLengthAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
TermToBytesRefAttribute	This attribute is requested by TermsHashPerField to index the contents.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
Token	A Token is an occurrence of a term from the text of a field.	Class	org.apache.lucene.analysis	Apache Lucene
TokenFilter	A TokenFilter is a TokenStream whose input is another TokenStream.	Class	org.apache.lucene.analysis	Apache Lucene
Tokenizer	A Tokenizer is a TokenStream whose input is a Reader.	Class	org.apache.lucene.analysis	Apache Lucene
TokenStream	A TokenStream enumerates the sequence of tokens, either from Fields of a Document or from query text.	Class	org.apache.lucene.analysis	Apache Lucene
TokenStreamToAutomaton	Consumes a TokenStream and creates an Automaton where the transition labels are UTF8 bytes (or Unicode	Class	org.apache.lucene.analysis	Apache Lucene
TypeAttribute	A Token's lexical type.	Interface	org.apache.lucene.analysis.tokenattributes	Apache Lucene
TypeAttributeImpl	Default implementation of TypeAttribute.	Class	org.apache.lucene.analysis.tokenattributes	Apache Lucene
AbstractAnalysisFactory	Abstract parent class for analysis factories TokenizerFactory, TokenFilterFactory and CharFilterFactory.	Class	org.apache.lucene.analysis.util	Apache Lucene
AbstractEncoder	Base class for payload encoders.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
AnalysisSPILoader	Helper class for loading named SPIs from classpath (e.	Class	org.apache.lucene.analysis.util	Apache Lucene
ApostropheFilter	Strips all characters after an apostrophe (including the apostrophe itself).	Class	org.apache.lucene.analysis.tr	Apache Lucene
ApostropheFilterFactory	Factory for ApostropheFilter.	Class	org.apache.lucene.analysis.tr	Apache Lucene
ArabicAnalyzer	Analyzer for Arabic.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicNormalizationFilter	A TokenFilter that applies ArabicNormalizer to normalize the orthography.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicNormalizationFilterFactory	Factory for ArabicNormalizationFilter.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicNormalizer	Normalizer for Arabic.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicStemFilter	A TokenFilter that applies ArabicStemmer to stem Arabic words.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicStemFilterFactory	Factory for ArabicStemFilter.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArabicStemmer	Stemming is done in-place for efficiency, operating on a termbuffer.	Class	org.apache.lucene.analysis.ar	Apache Lucene
ArmenianAnalyzer	Analyzer for Armenian.	Class	org.apache.lucene.analysis.hy	Apache Lucene
ASCIIFoldingFilter	This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the Basic Latin Unicode	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ASCIIFoldingFilterFactory	Factory for ASCIIFoldingFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
BaseCharFilter	Base utility class for implementing a CharFilter.	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
BasqueAnalyzer	Analyzer for Basque.	Class	org.apache.lucene.analysis.eu	Apache Lucene
BrazilianAnalyzer	Analyzer for Brazilian Portuguese language.	Class	org.apache.lucene.analysis.br	Apache Lucene
BrazilianStemFilter	A TokenFilter that applies BrazilianStemmer.	Class	org.apache.lucene.analysis.br	Apache Lucene
BrazilianStemFilterFactory	Factory for BrazilianStemFilter.	Class	org.apache.lucene.analysis.br	Apache Lucene
BrazilianStemmer	A stemmer for Brazilian Portuguese words.	Class	org.apache.lucene.analysis.br	Apache Lucene
BulgarianAnalyzer	Analyzer for Bulgarian.	Class	org.apache.lucene.analysis.bg	Apache Lucene
BulgarianStemFilter	A TokenFilter that applies BulgarianStemmer to stem Bulgarian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.bg	Apache Lucene
BulgarianStemFilterFactory	Factory for BulgarianStemFilter.	Class	org.apache.lucene.analysis.bg	Apache Lucene
BulgarianStemmer	Light Stemmer for Bulgarian.	Class	org.apache.lucene.analysis.bg	Apache Lucene
ByteVector	This class implements a simple byte vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
CapitalizationFilter	A filter to apply normal capitalization rules to Tokens.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
CapitalizationFilterFactory	Factory for CapitalizationFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
CatalanAnalyzer	Analyzer for Catalan.	Class	org.apache.lucene.analysis.ca	Apache Lucene
CharacterUtils	CharacterUtils provides a unified interface to Character-related operations to implement backwards compatible character operations based on a	Class	org.apache.lucene.analysis.util	Apache Lucene
CharacterUtils .CharacterBuffer	A simple IO buffer to use with CharacterUtils.	Class	org.apache.lucene.analysis.util.CharacterUtils	Apache Lucene
CharArrayIterator	A CharacterIterator used internally for use with BreakIteratorNOTE: This API is for internal purposes only and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.util	Apache Lucene
CharArrayMap	A simple class that stores key Strings as char[]'s in a hash table.	Class	org.apache.lucene.analysis.util	Apache Lucene
CharArraySet	A simple class that stores Strings as char[]'s in a hash table.	Class	org.apache.lucene.analysis.util	Apache Lucene
CharFilterFactory		Class	org.apache.lucene.analysis.util	Apache Lucene
CharTokenizer	An abstract base class for simple, character-oriented tokenizers.	Class	org.apache.lucene.analysis.util	Apache Lucene
CharVector	This class implements a simple char vector with access to the underlying This class has been taken from the Apache FOP project (http://xmlgraphics.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
CJKAnalyzer	An Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with	Class	org.apache.lucene.analysis.cjk	Apache Lucene
CJKBigramFilter	Forms bigrams of CJK terms that are generated from StandardTokenizer CJK types are set by these tokenizers, but you can also use	Class	org.apache.lucene.analysis.cjk	Apache Lucene
CJKBigramFilterFactory	Factory for CJKBigramFilter.	Class	org.apache.lucene.analysis.cjk	Apache Lucene
CJKWidthFilter	A TokenFilter that normalizes CJK width differences: Folds fullwidth ASCII variants into the equivalent basic latin	Class	org.apache.lucene.analysis.cjk	Apache Lucene
CJKWidthFilterFactory	Factory for CJKWidthFilter.	Class	org.apache.lucene.analysis.cjk	Apache Lucene
ClassicAnalyzer	Filters ClassicTokenizer with ClassicFilter, LowerCaseFilter and StopFilter, using a list of English stop words.	Class	org.apache.lucene.analysis.standard	Apache Lucene
ClassicFilter	Normalizes tokens extracted with ClassicTokenizer.	Class	org.apache.lucene.analysis.standard	Apache Lucene
ClassicFilterFactory	Factory for ClassicFilter.	Class	org.apache.lucene.analysis.standard	Apache Lucene
ClassicTokenizer	A grammar-based tokenizer constructed with JFlex This should be a good tokenizer for most European-language documents:	Class	org.apache.lucene.analysis.standard	Apache Lucene
ClassicTokenizerFactory	Factory for ClassicTokenizer.	Class	org.apache.lucene.analysis.standard	Apache Lucene
ClasspathResourceLoader	Simple ResourceLoader that uses ClassLoader.	Class	org.apache.lucene.analysis.util	Apache Lucene
CodepointCountFilter	Removes words that are too long or too short from the stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
CodepointCountFilterFactory	Factory for CodepointCountFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
CommonGramsFilter	Construct bigrams for frequently occurring terms while indexing.	Class	org.apache.lucene.analysis.commongrams	Apache Lucene
CommonGramsFilterFactory	Constructs a CommonGramsFilter.	Class	org.apache.lucene.analysis.commongrams	Apache Lucene
CommonGramsQueryFilter	Wrap a CommonGramsFilter optimizing phrase queries by only returning single words when they are not a member of a bigram.	Class	org.apache.lucene.analysis.commongrams	Apache Lucene
CommonGramsQueryFilterFactory	Construct CommonGramsQueryFilter.	Class	org.apache.lucene.analysis.commongrams	Apache Lucene
CompoundWordTokenFilterBase	Base class for decomposition token filters.	Class	org.apache.lucene.analysis.compound	Apache Lucene
CustomAnalyzer	A general-purpose Analyzer that can be created with a builder-style API.	Class	org.apache.lucene.analysis.custom	Apache Lucene
CustomAnalyzer .Builder	Builder for CustomAnalyzer.	Class	org.apache.lucene.analysis.custom.CustomAnalyzer	Apache Lucene
CzechAnalyzer	Analyzer for Czech language.	Class	org.apache.lucene.analysis.cz	Apache Lucene
CzechStemFilter	A TokenFilter that applies CzechStemmer to stem Czech words.	Class	org.apache.lucene.analysis.cz	Apache Lucene
CzechStemFilterFactory	Factory for CzechStemFilter.	Class	org.apache.lucene.analysis.cz	Apache Lucene
CzechStemmer	Light Stemmer for Czech.	Class	org.apache.lucene.analysis.cz	Apache Lucene
DanishAnalyzer	Analyzer for Danish.	Class	org.apache.lucene.analysis.da	Apache Lucene
DateRecognizerFilter	Filters all tokens that cannot be parsed to a date, using the provided DateFormat.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
DateRecognizerFilterFactory	Factory for DateRecognizerFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
DecimalDigitFilter	Folds all Unicode digits in [:General_Category=Decimal_Number:] to Basic Latin digits (0-9).	Class	org.apache.lucene.analysis.core	Apache Lucene
DecimalDigitFilterFactory	Factory for DecimalDigitFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
DelimitedPayloadTokenFilter	Characters before the delimiter are the "token", those after are the payload.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
DelimitedPayloadTokenFilterFactory	Factory for DelimitedPayloadTokenFilter.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
Dictionary	In-memory structure for the dictionary (.	Class	org.apache.lucene.analysis.hunspell	Apache Lucene
DictionaryCompoundWordTokenFilter	A TokenFilter that decomposes compound words found in many Germanic languages.	Class	org.apache.lucene.analysis.compound	Apache Lucene
DictionaryCompoundWordTokenFilterFactory	Factory for DictionaryCompoundWordTokenFilter.	Class	org.apache.lucene.analysis.compound	Apache Lucene
DutchAnalyzer	Analyzer for Dutch language.	Class	org.apache.lucene.analysis.nl	Apache Lucene
EdgeNGramFilterFactory	Creates new instances of EdgeNGramTokenFilter.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
EdgeNGramTokenFilter	Tokenizes the given token into n-grams of given size(s).	Class	org.apache.lucene.analysis.ngram	Apache Lucene
EdgeNGramTokenizer	Tokenizes the input from an edge into n-grams of given size(s).	Class	org.apache.lucene.analysis.ngram	Apache Lucene
EdgeNGramTokenizerFactory	Creates new instances of EdgeNGramTokenizer.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
ElisionFilter	Removes elisions from a TokenStream.	Class	org.apache.lucene.analysis.util	Apache Lucene
ElisionFilterFactory	Factory for ElisionFilter.	Class	org.apache.lucene.analysis.util	Apache Lucene
EmptyTokenStream	An always exhausted token stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
EnglishAnalyzer	Analyzer for English.	Class	org.apache.lucene.analysis.en	Apache Lucene
EnglishMinimalStemFilter	A TokenFilter that applies EnglishMinimalStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.en	Apache Lucene
EnglishMinimalStemFilterFactory	Factory for EnglishMinimalStemFilter.	Class	org.apache.lucene.analysis.en	Apache Lucene
EnglishMinimalStemmer	Minimal plural stemmer for English.	Class	org.apache.lucene.analysis.en	Apache Lucene
EnglishPossessiveFilter	TokenFilter that removes possessives (trailing 's) from words.	Class	org.apache.lucene.analysis.en	Apache Lucene
EnglishPossessiveFilterFactory	Factory for EnglishPossessiveFilter.	Class	org.apache.lucene.analysis.en	Apache Lucene
FilesystemResourceLoader	Simple ResourceLoader that opens resource files from the local file system, optionally resolving against	Class	org.apache.lucene.analysis.util	Apache Lucene
FilteringTokenFilter	Abstract base class for TokenFilters that may remove tokens.	Class	org.apache.lucene.analysis.util	Apache Lucene
FingerprintFilter	Filter outputs a single token which is a concatenation of the sorted and de-duplicated set of input tokens.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
FingerprintFilterFactory	Factory for FingerprintFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
FinnishAnalyzer	Analyzer for Finnish.	Class	org.apache.lucene.analysis.fi	Apache Lucene
FinnishLightStemFilter	A TokenFilter that applies FinnishLightStemmer to stem Finnish To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.fi	Apache Lucene
FinnishLightStemFilterFactory	Factory for FinnishLightStemFilter.	Class	org.apache.lucene.analysis.fi	Apache Lucene
FinnishLightStemmer	Light Stemmer for Finnish.	Class	org.apache.lucene.analysis.fi	Apache Lucene
FloatEncoder	Encode a character array Float as a BytesRef.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
FrenchAnalyzer	Analyzer for French language.	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchLightStemFilter	A TokenFilter that applies FrenchLightStemmer to stem French To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchLightStemFilterFactory	Factory for FrenchLightStemFilter.	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchLightStemmer	Light Stemmer for French.	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchMinimalStemFilter	A TokenFilter that applies FrenchMinimalStemmer to stem French To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchMinimalStemFilterFactory	Factory for FrenchMinimalStemFilter.	Class	org.apache.lucene.analysis.fr	Apache Lucene
FrenchMinimalStemmer	Light Stemmer for French.	Class	org.apache.lucene.analysis.fr	Apache Lucene
GalicianAnalyzer	Analyzer for Galician.	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianMinimalStemFilter	A TokenFilter that applies GalicianMinimalStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianMinimalStemFilterFactory	Factory for GalicianMinimalStemFilter.	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianMinimalStemmer	Minimal Stemmer for Galician This follows the "RSLP-S" algorithm, but modified for Galician.	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianStemFilter	A TokenFilter that applies GalicianStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianStemFilterFactory	Factory for GalicianStemFilter.	Class	org.apache.lucene.analysis.gl	Apache Lucene
GalicianStemmer	Galician stemmer implementing "Regras do lematizador para o galego".	Class	org.apache.lucene.analysis.gl	Apache Lucene
GermanAnalyzer	Analyzer for German language.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanLightStemFilter	A TokenFilter that applies GermanLightStemmer to stem German To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanLightStemFilterFactory	Factory for GermanLightStemFilter.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanLightStemmer	Light Stemmer for German.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanMinimalStemFilter	A TokenFilter that applies GermanMinimalStemmer to stem German To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanMinimalStemFilterFactory	Factory for GermanMinimalStemFilter.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanMinimalStemmer	Minimal Stemmer for German.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanNormalizationFilter	Normalizes German characters according to the heuristics German2 snowball algorithm.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanNormalizationFilterFactory	Factory for GermanNormalizationFilter.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanStemFilter	A TokenFilter that stems German words.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanStemFilterFactory	Factory for GermanStemFilter.	Class	org.apache.lucene.analysis.de	Apache Lucene
GermanStemmer	A stemmer for German words.	Class	org.apache.lucene.analysis.de	Apache Lucene
GreekAnalyzer	Analyzer for the Greek language.	Class	org.apache.lucene.analysis.el	Apache Lucene
GreekLowerCaseFilter	Normalizes token text to lower case, removes some Greek diacritics, and standardizes final sigma to sigma.	Class	org.apache.lucene.analysis.el	Apache Lucene
GreekLowerCaseFilterFactory	Factory for GreekLowerCaseFilter.	Class	org.apache.lucene.analysis.el	Apache Lucene
GreekStemFilter	A TokenFilter that applies GreekStemmer to stem Greek To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.el	Apache Lucene
GreekStemFilterFactory	Factory for GreekStemFilter.	Class	org.apache.lucene.analysis.el	Apache Lucene
GreekStemmer	A stemmer for Greek words, according to: Development of a Stemmer for the Greek Language.	Class	org.apache.lucene.analysis.el	Apache Lucene
HindiAnalyzer		Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiNormalizationFilter	A TokenFilter that applies HindiNormalizer to normalize the In some cases the normalization may cause unrelated terms to conflate, so	Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiNormalizationFilterFactory	Factory for HindiNormalizationFilter.	Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiNormalizer	Normalizer for Hindi.	Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiStemFilter	A TokenFilter that applies HindiStemmer to stem Hindi words.	Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiStemFilterFactory	Factory for HindiStemFilter.	Class	org.apache.lucene.analysis.hi	Apache Lucene
HindiStemmer	Light Stemmer for Hindi.	Class	org.apache.lucene.analysis.hi	Apache Lucene
HTMLStripCharFilter	A CharFilter that wraps another Reader and attempts to strip out HTML constructs.	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
HTMLStripCharFilterFactory	Factory for HTMLStripCharFilter.	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
HungarianAnalyzer	Analyzer for Hungarian.	Class	org.apache.lucene.analysis.hu	Apache Lucene
HungarianLightStemFilter	A TokenFilter that applies HungarianLightStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.hu	Apache Lucene
HungarianLightStemFilterFactory	Factory for HungarianLightStemFilter.	Class	org.apache.lucene.analysis.hu	Apache Lucene
HungarianLightStemmer	Light Stemmer for Hungarian.	Class	org.apache.lucene.analysis.hu	Apache Lucene
HunspellStemFilter	TokenFilter that uses hunspell affix rules and words to stem tokens.	Class	org.apache.lucene.analysis.hunspell	Apache Lucene
HunspellStemFilterFactory	TokenFilterFactory that creates instances of HunspellStemFilter.	Class	org.apache.lucene.analysis.hunspell	Apache Lucene
Hyphen	This class represents a hyphen.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
HyphenatedWordsFilter	When the plain text is extracted from documents, we will often have many words hyphenated and broken into two lines.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
HyphenatedWordsFilterFactory	Factory for HyphenatedWordsFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
Hyphenation	This class represents a hyphenated word.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
HyphenationCompoundWordTokenFilter	A TokenFilter that decomposes compound words found in many Germanic languages.	Class	org.apache.lucene.analysis.compound	Apache Lucene
HyphenationCompoundWordTokenFilterFactory	Factory for HyphenationCompoundWordTokenFilter.	Class	org.apache.lucene.analysis.compound	Apache Lucene
HyphenationTree	This tree structure stores the hyphenation patterns in an efficient way for fast lookup.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
IdentityEncoder	Does nothing other than convert the char array to a byte array using the specified encoding.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
IndicNormalizationFilter	A TokenFilter that applies IndicNormalizer to normalize text in Indian Languages.	Class	org.apache.lucene.analysis.in	Apache Lucene
IndicNormalizationFilterFactory	Factory for IndicNormalizationFilter.	Class	org.apache.lucene.analysis.in	Apache Lucene
IndicNormalizer	Normalizes the Unicode representation of text in Indian languages.	Class	org.apache.lucene.analysis.in	Apache Lucene
IndonesianAnalyzer		Class	org.apache.lucene.analysis.id	Apache Lucene
IndonesianStemFilter	A TokenFilter that applies IndonesianStemmer to stem Indonesian words.	Class	org.apache.lucene.analysis.id	Apache Lucene
IndonesianStemFilterFactory	Factory for IndonesianStemFilter.	Class	org.apache.lucene.analysis.id	Apache Lucene
IndonesianStemmer	Stemmer for Indonesian.	Class	org.apache.lucene.analysis.id	Apache Lucene
IntegerEncoder	Encode a character array Integer as a BytesRef.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
IrishAnalyzer		Class	org.apache.lucene.analysis.ga	Apache Lucene
IrishLowerCaseFilter	Normalises token text to lower case, handling t-prothesis and n-eclipsis (i.	Class	org.apache.lucene.analysis.ga	Apache Lucene
IrishLowerCaseFilterFactory	Factory for IrishLowerCaseFilter.	Class	org.apache.lucene.analysis.ga	Apache Lucene
ItalianAnalyzer	Analyzer for Italian.	Class	org.apache.lucene.analysis.it	Apache Lucene
ItalianLightStemFilter	A TokenFilter that applies ItalianLightStemmer to stem Italian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.it	Apache Lucene
ItalianLightStemFilterFactory	Factory for ItalianLightStemFilter.	Class	org.apache.lucene.analysis.it	Apache Lucene
ItalianLightStemmer	Light Stemmer for Italian.	Class	org.apache.lucene.analysis.it	Apache Lucene
KeepWordFilter	A TokenFilter that only keeps tokens with text contained in the required words.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeepWordFilterFactory	Factory for KeepWordFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeywordAnalyzer	"Tokenizes" the entire stream as a single token.	Class	org.apache.lucene.analysis.core	Apache Lucene
KeywordMarkerFilter	Marks terms as keywords via the KeywordAttribute.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeywordMarkerFilterFactory	Factory for KeywordMarkerFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeywordRepeatFilter		Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeywordRepeatFilterFactory	Factory for KeywordRepeatFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
KeywordTokenizer	Emits the entire input as a single token.	Class	org.apache.lucene.analysis.core	Apache Lucene
KeywordTokenizerFactory	Factory for KeywordTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
KStemFilter	A high-performance kstem filter for english.	Class	org.apache.lucene.analysis.en	Apache Lucene
KStemFilterFactory	Factory for KStemFilter.	Class	org.apache.lucene.analysis.en	Apache Lucene
KStemmer		Class	org.apache.lucene.analysis.en	Apache Lucene
LatvianAnalyzer	Analyzer for Latvian.	Class	org.apache.lucene.analysis.lv	Apache Lucene
LatvianStemFilter	A TokenFilter that applies LatvianStemmer to stem Latvian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.lv	Apache Lucene
LatvianStemFilterFactory	Factory for LatvianStemFilter.	Class	org.apache.lucene.analysis.lv	Apache Lucene
LatvianStemmer	Light stemmer for Latvian.	Class	org.apache.lucene.analysis.lv	Apache Lucene
LengthFilter	Removes words that are too long or too short from the stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LengthFilterFactory	Factory for LengthFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LetterTokenizer	A LetterTokenizer is a tokenizer that divides text at non-letters.	Class	org.apache.lucene.analysis.core	Apache Lucene
LetterTokenizerFactory	Factory for LetterTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
LimitTokenCountAnalyzer	This Analyzer limits the number of tokens while indexing.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenCountFilter	This TokenFilter limits the number of tokens while indexing.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenCountFilterFactory	Factory for LimitTokenCountFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenOffsetFilter	Lets all tokens pass through until it sees one with a start offset <= a configured limit, which won't pass and ends the stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenOffsetFilterFactory	Factory for LimitTokenOffsetFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenPositionFilter	This TokenFilter limits its emitted tokens to those with positions that are not greater than the configured limit.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LimitTokenPositionFilterFactory	Factory for LimitTokenPositionFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
LithuanianAnalyzer	Analyzer for Lithuanian.	Class	org.apache.lucene.analysis.lt	Apache Lucene
LowerCaseFilter	Normalizes token text to lower case.	Class	org.apache.lucene.analysis.core	Apache Lucene
LowerCaseFilterFactory	Factory for LowerCaseFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
LowerCaseTokenizer	LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together.	Class	org.apache.lucene.analysis.core	Apache Lucene
LowerCaseTokenizerFactory	Factory for LowerCaseTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
Lucene43CompoundWordTokenFilterBase	Base class for decomposition token filters using pre-4.	Class	org.apache.lucene.analysis.compound	Apache Lucene
Lucene43DictionaryCompoundWordTokenFilter		Class	org.apache.lucene.analysis.compound	Apache Lucene
Lucene43EdgeNGramTokenFilter	Tokenizes the given token into n-grams of given size(s), using pre-4.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
Lucene43EdgeNGramTokenizer	Old version of EdgeNGramTokenizer which doesn't handle correctly supplementary characters.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
Lucene43EdgeNGramTokenizer .Side	Specifies which side of the input the n-gram should be generated fromGet the n-gram from the end of the input	Class	org.apache.lucene.analysis.ngram.Lucene43EdgeNGramTokenizer	Apache Lucene
Lucene43FilteringTokenFilter	Backcompat FilteringTokenFilter for versions 4.	Class	org.apache.lucene.analysis.util	Apache Lucene
Lucene43HyphenationCompoundWordTokenFilter	A TokenFilter that decomposes compound words found in many Germanic languages, using pre-4.	Class	org.apache.lucene.analysis.compound	Apache Lucene
Lucene43KeepWordFilter	Backcompat for KeepWordFilter for versions 4.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
Lucene43LengthFilter	Backcompat LengthFilter for versions 4.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
Lucene43NGramTokenFilter	Tokenizes the input into n-grams of the given size(s), matching Lucene 4.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
Lucene43NGramTokenizer	Old broken version of NGramTokenizer.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
Lucene43StopFilter	Backcompat StopFilter for versions 4.	Class	org.apache.lucene.analysis.core	Apache Lucene
Lucene43TrimFilter	Backcompat TrimFilter for versions 4.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
Lucene43TypeTokenFilter	Backcompat TypeTokenFilter for versions 4.	Class	org.apache.lucene.analysis.core	Apache Lucene
Lucene47WordDelimiterFilter		Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
MappingCharFilter	Simplistic CharFilter that applies the mappings contained in a NormalizeCharMap to the character	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
MappingCharFilterFactory	Factory for MappingCharFilter.	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
MultiTermAwareComponent	Add to any analysis factory component to allow returning an analysis component factory for use with partial terms in prefix queries,	Interface	org.apache.lucene.analysis.util	Apache Lucene
NGramFilterFactory	Factory for NGramTokenFilter.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
NGramTokenFilter	Tokenizes the input into n-grams of the given size(s).	Class	org.apache.lucene.analysis.ngram	Apache Lucene
NGramTokenizer	Tokenizes the input into n-grams of the given size(s).	Class	org.apache.lucene.analysis.ngram	Apache Lucene
NGramTokenizerFactory	Factory for NGramTokenizer.	Class	org.apache.lucene.analysis.ngram	Apache Lucene
NormalizeCharMap	Holds a map of String input to String output, to be used with MappingCharFilter.	Class	org.apache.lucene.analysis.charfilter	Apache Lucene
NormalizeCharMap .Builder	Builds an NormalizeCharMap.	Class	org.apache.lucene.analysis.charfilter.NormalizeCharMap	Apache Lucene
NorwegianAnalyzer	Analyzer for Norwegian.	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianLightStemFilter	A TokenFilter that applies NorwegianLightStemmer to stem Norwegian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianLightStemFilterFactory	Factory for NorwegianLightStemFilter.	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianLightStemmer	Light Stemmer for Norwegian.	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianMinimalStemFilter	A TokenFilter that applies NorwegianMinimalStemmer to stem Norwegian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianMinimalStemFilterFactory	Factory for NorwegianMinimalStemFilter.	Class	org.apache.lucene.analysis.no	Apache Lucene
NorwegianMinimalStemmer		Class	org.apache.lucene.analysis.no	Apache Lucene
NumericPayloadTokenFilter	Assigns a payload to a token based on the PackedTokenAttributeImpl.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
NumericPayloadTokenFilterFactory	Factory for NumericPayloadTokenFilter.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
OpenStringBuilder	A StringBuilder that allows one to access the array.	Class	org.apache.lucene.analysis.util	Apache Lucene
PathHierarchyTokenizer	Tokenizer for path-like hierarchies.	Class	org.apache.lucene.analysis.path	Apache Lucene
PathHierarchyTokenizerFactory	Factory for PathHierarchyTokenizer.	Class	org.apache.lucene.analysis.path	Apache Lucene
PatternCaptureGroupFilterFactory	Factory for PatternCaptureGroupTokenFilter.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternCaptureGroupTokenFilter	CaptureGroup uses Java regexes to emit multiple tokens - one for each capture group in one or more patterns.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternConsumer	This interface is used to connect the XML pattern file parser to the This class has been taken from the Apache FOP project (http://xmlgraphics.	Interface	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
PatternKeywordMarkerFilter	Marks terms as keywords via the KeywordAttribute.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
PatternParser	A SAX document handler to read and parse hyphenation patterns from a XML This class has been taken from the Apache FOP project (http://xmlgraphics.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
PatternReplaceCharFilter	CharFilter that uses a regular expression for the target of replace string.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternReplaceCharFilterFactory	Factory for PatternReplaceCharFilter.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternReplaceFilter	A TokenFilter which applies a Pattern to each token in the stream, replacing match occurances with the specified replacement string.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternReplaceFilterFactory	Factory for PatternReplaceFilter.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternTokenizer	This tokenizer uses regex pattern matching to construct distinct tokens for the input stream.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PatternTokenizerFactory	Factory for PatternTokenizer.	Class	org.apache.lucene.analysis.pattern	Apache Lucene
PayloadEncoder	Mainly for use with the DelimitedPayloadTokenFilter, converts char buffers to NOTE: This interface is subject to change	Interface	org.apache.lucene.analysis.payloads	Apache Lucene
PayloadHelper	Utility methods for encoding payloads.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
PerFieldAnalyzerWrapper	This analyzer is used to facilitate scenarios where different fields require different analysis techniques.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
PersianAnalyzer	Analyzer for Persian.	Class	org.apache.lucene.analysis.fa	Apache Lucene
PersianCharFilter		Class	org.apache.lucene.analysis.fa	Apache Lucene
PersianCharFilterFactory	Factory for PersianCharFilter.	Class	org.apache.lucene.analysis.fa	Apache Lucene
PersianNormalizationFilter		Class	org.apache.lucene.analysis.fa	Apache Lucene
PersianNormalizationFilterFactory	Factory for PersianNormalizationFilter.	Class	org.apache.lucene.analysis.fa	Apache Lucene
PersianNormalizer	Normalizer for Persian.	Class	org.apache.lucene.analysis.fa	Apache Lucene
PorterStemFilter	Transforms the token stream as per the Porter stemming algorithm.	Class	org.apache.lucene.analysis.en	Apache Lucene
PorterStemFilterFactory	Factory for PorterStemFilter.	Class	org.apache.lucene.analysis.en	Apache Lucene
PortugueseAnalyzer	Analyzer for Portuguese.	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseLightStemFilter	A TokenFilter that applies PortugueseLightStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseLightStemFilterFactory	Factory for PortugueseLightStemFilter.	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseLightStemmer	Light Stemmer for Portuguese This stemmer implements the "UniNE" algorithm in:	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseMinimalStemFilter	A TokenFilter that applies PortugueseMinimalStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseMinimalStemFilterFactory	Factory for PortugueseMinimalStemFilter.	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseMinimalStemmer	Minimal Stemmer for Portuguese This follows the "RSLP-S" algorithm presented in:	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseStemFilter	A TokenFilter that applies PortugueseStemmer to stem To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseStemFilterFactory	Factory for PortugueseStemFilter.	Class	org.apache.lucene.analysis.pt	Apache Lucene
PortugueseStemmer	Portuguese stemmer implementing the RSLP (Removedor de Sufixos da Lingua Portuguesa) algorithm.	Class	org.apache.lucene.analysis.pt	Apache Lucene
PrefixAndSuffixAwareTokenFilter	Links two PrefixAwareTokenFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
PrefixAwareTokenFilter	Joins two token streams and leaves the last token of the first stream available to be used when updating the token values in the second stream based on that token.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
QueryAutoStopWordAnalyzer	An Analyzer used primarily at query time to wrap another analyzer and provide a layer of protection which prevents very common words from being passed into queries.	Class	org.apache.lucene.analysis.query	Apache Lucene
RemoveDuplicatesTokenFilter	A TokenFilter which filters out Tokens at the same position and Term text as the previous token in the stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
RemoveDuplicatesTokenFilterFactory	Factory for RemoveDuplicatesTokenFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ResourceLoader	Abstraction for loading resources (streams, files, and classes).	Interface	org.apache.lucene.analysis.util	Apache Lucene
ResourceLoaderAware	Interface for a component that needs to be initialized by an implementation of ResourceLoader.	Interface	org.apache.lucene.analysis.util	Apache Lucene
ReversePathHierarchyTokenizer	Tokenizer for domain-like hierarchies.	Class	org.apache.lucene.analysis.path	Apache Lucene
ReverseStringFilter	Reverse token string, for example "country" => "yrtnuoc".	Class	org.apache.lucene.analysis.reverse	Apache Lucene
ReverseStringFilterFactory	Factory for ReverseStringFilter.	Class	org.apache.lucene.analysis.reverse	Apache Lucene
RollingCharBuffer	Acts like a forever growing char[] as you read characters into it from the provided reader, but	Class	org.apache.lucene.analysis.util	Apache Lucene
RomanianAnalyzer	Analyzer for Romanian.	Class	org.apache.lucene.analysis.ro	Apache Lucene
RSLPStemmerBase	Base class for stemmers that use a set of RSLP-like stemming steps.	Class	org.apache.lucene.analysis.pt	Apache Lucene
RSLPStemmerBase .Rule	A basic rule, with no exceptions.	Class	org.apache.lucene.analysis.pt.RSLPStemmerBase	Apache Lucene
RSLPStemmerBase .RuleWithSetExceptions	A rule with a set of whole-word exceptions.	Class	org.apache.lucene.analysis.pt.RSLPStemmerBase	Apache Lucene
RSLPStemmerBase .RuleWithSuffixExceptions	A rule with a set of exceptional suffixes.	Class	org.apache.lucene.analysis.pt.RSLPStemmerBase	Apache Lucene
RSLPStemmerBase .Step	A step containing a list of rules.	Class	org.apache.lucene.analysis.pt.RSLPStemmerBase	Apache Lucene
RussianAnalyzer	Analyzer for Russian language.	Class	org.apache.lucene.analysis.ru	Apache Lucene
RussianLightStemFilter	A TokenFilter that applies RussianLightStemmer to stem Russian To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.ru	Apache Lucene
RussianLightStemFilterFactory	Factory for RussianLightStemFilter.	Class	org.apache.lucene.analysis.ru	Apache Lucene
RussianLightStemmer	Light Stemmer for Russian.	Class	org.apache.lucene.analysis.ru	Apache Lucene
ScandinavianFoldingFilter		Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ScandinavianFoldingFilterFactory	Factory for ScandinavianFoldingFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ScandinavianNormalizationFilter		Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ScandinavianNormalizationFilterFactory	Factory for ScandinavianNormalizationFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
SegmentingTokenizerBase	Breaks text into sentences with a BreakIterator and allows subclasses to decompose these sentences into words.	Class	org.apache.lucene.analysis.util	Apache Lucene
SerbianNormalizationFilter	Normalizes Serbian Cyrillic and Latin characters to "bald" Latin.	Class	org.apache.lucene.analysis.sr	Apache Lucene
SerbianNormalizationFilterFactory	Factory for SerbianNormalizationFilter.	Class	org.apache.lucene.analysis.sr	Apache Lucene
SerbianNormalizationRegularFilter	Normalizes Serbian Cyrillic to Latin.	Class	org.apache.lucene.analysis.sr	Apache Lucene
SetKeywordMarkerFilter	Marks terms as keywords via the KeywordAttribute.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
ShingleAnalyzerWrapper	A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer.	Class	org.apache.lucene.analysis.shingle	Apache Lucene
ShingleFilter	A ShingleFilter constructs shingles (token n-grams) from a token stream.	Class	org.apache.lucene.analysis.shingle	Apache Lucene
ShingleFilterFactory	Factory for ShingleFilter.	Class	org.apache.lucene.analysis.shingle	Apache Lucene
SimpleAnalyzer	An Analyzer that filters LetterTokenizer with LowerCaseFilter	Class	org.apache.lucene.analysis.core	Apache Lucene
SingleTokenTokenStream	A TokenStream containing a single token.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
SnowballFilter	A filter that stems words using a Snowball-generated stemmer.	Class	org.apache.lucene.analysis.snowball	Apache Lucene
SnowballPorterFilterFactory	Factory for SnowballFilter, with configurable language Note: Use of the "Lovins" stemmer is not recommended, as it is implemented with reflection.	Class	org.apache.lucene.analysis.snowball	Apache Lucene
SolrSynonymParser	Parser for the Solr synonyms format.	Class	org.apache.lucene.analysis.synonym	Apache Lucene
SoraniAnalyzer	Analyzer for Sorani Kurdish.	Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniNormalizationFilter		Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniNormalizationFilterFactory	Factory for SoraniNormalizationFilter.	Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniNormalizer	Normalizes the Unicode representation of Sorani text.	Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniStemFilter	A TokenFilter that applies SoraniStemmer to stem Sorani words.	Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniStemFilterFactory	Factory for SoraniStemFilter.	Class	org.apache.lucene.analysis.ckb	Apache Lucene
SoraniStemmer		Class	org.apache.lucene.analysis.ckb	Apache Lucene
SpanishAnalyzer	Analyzer for Spanish.	Class	org.apache.lucene.analysis.es	Apache Lucene
SpanishLightStemFilter	A TokenFilter that applies SpanishLightStemmer to stem Spanish To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.es	Apache Lucene
SpanishLightStemFilterFactory	Factory for SpanishLightStemFilter.	Class	org.apache.lucene.analysis.es	Apache Lucene
SpanishLightStemmer	Light Stemmer for Spanish This stemmer implements the algorithm described in:	Class	org.apache.lucene.analysis.es	Apache Lucene
StandardAnalyzer	Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words.	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardFilter	Normalizes tokens extracted with StandardTokenizer.	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardFilterFactory	Factory for StandardFilter.	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardTokenizer	A grammar-based tokenizer constructed with JFlex.	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardTokenizer40	Backcompat standard tokenizer for Lucene 4.	Class	org.apache.lucene.analysis.standard.std40	Apache Lucene
StandardTokenizerFactory	Factory for StandardTokenizer.	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardTokenizerImpl	This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in	Class	org.apache.lucene.analysis.standard	Apache Lucene
StandardTokenizerImpl40		Class	org.apache.lucene.analysis.standard.std40	Apache Lucene
StemmerOverrideFilter	Provides the ability to override any KeywordAttribute aware stemmer with custom dictionary-based stemming.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
StemmerOverrideFilter .Builder		Class	org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter	Apache Lucene
StemmerOverrideFilter .StemmerOverrideMap	A read-only 4-byte FST backed map that allows fast case-insensitive key value lookups for StemmerOverrideFilter	Class	org.apache.lucene.analysis.miscellaneous.StemmerOverrideFilter	Apache Lucene
StemmerOverrideFilterFactory	Factory for StemmerOverrideFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
StemmerUtil	Some commonly-used stemming functionsNOTE: This API is for internal purposes only and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.util	Apache Lucene
StopAnalyzer	Filters LetterTokenizer with LowerCaseFilter and StopFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
StopFilter	Removes stop words from a token stream.	Class	org.apache.lucene.analysis.core	Apache Lucene
StopFilterFactory	Factory for StopFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
StopwordAnalyzerBase	Base class for Analyzers that need to make use of stopword sets.	Class	org.apache.lucene.analysis.util	Apache Lucene
SwedishAnalyzer	Analyzer for Swedish.	Class	org.apache.lucene.analysis.sv	Apache Lucene
SwedishLightStemFilter	A TokenFilter that applies SwedishLightStemmer to stem Swedish To prevent terms from being stemmed use an instance of	Class	org.apache.lucene.analysis.sv	Apache Lucene
SwedishLightStemFilterFactory	Factory for SwedishLightStemFilter.	Class	org.apache.lucene.analysis.sv	Apache Lucene
SwedishLightStemmer	Light Stemmer for Swedish.	Class	org.apache.lucene.analysis.sv	Apache Lucene
SynonymFilter	Matches single or multi word synonyms in a token stream.	Class	org.apache.lucene.analysis.synonym	Apache Lucene
SynonymFilterFactory	Factory for SynonymFilter.	Class	org.apache.lucene.analysis.synonym	Apache Lucene
SynonymMap	A map of synonyms, keys and values are phrases.	Class	org.apache.lucene.analysis.synonym	Apache Lucene
SynonymMap .Builder	Builds an FSTSynonymMap.	Class	org.apache.lucene.analysis.synonym.SynonymMap	Apache Lucene
SynonymMap .Parser	Abstraction for parsing synonym files.	Class	org.apache.lucene.analysis.synonym.SynonymMap	Apache Lucene
TeeSinkTokenFilter	This TokenFilter provides the ability to set aside attribute states that have already been analyzed.	Class	org.apache.lucene.analysis.sinks	Apache Lucene
TeeSinkTokenFilter .SinkTokenStream	TokenStream output from a tee.	Class	org.apache.lucene.analysis.sinks.TeeSinkTokenFilter	Apache Lucene
TernaryTree	Ternary Search Tree.	Class	org.apache.lucene.analysis.compound.hyphenation	Apache Lucene
ThaiAnalyzer	Analyzer for Thai language.	Class	org.apache.lucene.analysis.th	Apache Lucene
ThaiTokenizer	Tokenizer that use BreakIterator to tokenize Thai text.	Class	org.apache.lucene.analysis.th	Apache Lucene
ThaiTokenizerFactory	Factory for ThaiTokenizer.	Class	org.apache.lucene.analysis.th	Apache Lucene
ThaiWordFilter	TokenFilter that use BreakIterator to break each Token that is Thai into separate Token(s) for each Thai word.	Class	org.apache.lucene.analysis.th	Apache Lucene
ThaiWordFilterFactory	Factory for ThaiWordFilter.	Class	org.apache.lucene.analysis.th	Apache Lucene
TokenFilterFactory		Class	org.apache.lucene.analysis.util	Apache Lucene
TokenizerFactory		Class	org.apache.lucene.analysis.util	Apache Lucene
TokenOffsetPayloadTokenFilter	Adds the OffsetAttribute.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
TokenOffsetPayloadTokenFilterFactory	Factory for TokenOffsetPayloadTokenFilter.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
TrimFilter	Trims leading and trailing whitespace from Tokens in the stream.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
TrimFilterFactory	Factory for TrimFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
TruncateTokenFilter	A token filter for truncating the terms into a specific length.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
TruncateTokenFilterFactory	Factory for TruncateTokenFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
TurkishAnalyzer	Analyzer for Turkish.	Class	org.apache.lucene.analysis.tr	Apache Lucene
TurkishLowerCaseFilter	Normalizes Turkish token text to lower case.	Class	org.apache.lucene.analysis.tr	Apache Lucene
TurkishLowerCaseFilterFactory	Factory for TurkishLowerCaseFilter.	Class	org.apache.lucene.analysis.tr	Apache Lucene
TypeAsPayloadTokenFilter	Makes the PackedTokenAttributeImpl.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
TypeAsPayloadTokenFilterFactory	Factory for TypeAsPayloadTokenFilter.	Class	org.apache.lucene.analysis.payloads	Apache Lucene
TypeTokenFilter	Removes tokens whose types appear in a set of blocked types from a token stream.	Class	org.apache.lucene.analysis.core	Apache Lucene
TypeTokenFilterFactory	Factory class for TypeTokenFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
UAX29URLEmailAnalyzer	Filters UAX29URLEmailTokenizer with StandardFilter,	Class	org.apache.lucene.analysis.standard	Apache Lucene
UAX29URLEmailTokenizer	This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in	Class	org.apache.lucene.analysis.standard	Apache Lucene
UAX29URLEmailTokenizer40	Backcompat uax29 tokenizer for Lucene 4.	Class	org.apache.lucene.analysis.standard.std40	Apache Lucene
UAX29URLEmailTokenizerFactory	Factory for UAX29URLEmailTokenizer.	Class	org.apache.lucene.analysis.standard	Apache Lucene
UAX29URLEmailTokenizerImpl	This class implements Word Break rules from the Unicode Text Segmentation algorithm, as specified in	Class	org.apache.lucene.analysis.standard	Apache Lucene
UAX29URLEmailTokenizerImpl40		Class	org.apache.lucene.analysis.standard.std40	Apache Lucene
UnicodeProps	This file contains unicode properties used by various CharTokenizers.	Class	org.apache.lucene.analysis.util	Apache Lucene
UnicodeWhitespaceAnalyzer	An Analyzer that uses UnicodeWhitespaceTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
UnicodeWhitespaceTokenizer	A UnicodeWhitespaceTokenizer is a tokenizer that divides text at whitespace.	Class	org.apache.lucene.analysis.core	Apache Lucene
UpperCaseFilter	Normalizes token text to UPPER CASE.	Class	org.apache.lucene.analysis.core	Apache Lucene
UpperCaseFilterFactory	Factory for UpperCaseFilter.	Class	org.apache.lucene.analysis.core	Apache Lucene
WhitespaceAnalyzer	An Analyzer that uses WhitespaceTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
WhitespaceTokenizer	A tokenizer that divides text at whitespace characters as defined by Character.	Class	org.apache.lucene.analysis.core	Apache Lucene
WhitespaceTokenizerFactory	Factory for WhitespaceTokenizer.	Class	org.apache.lucene.analysis.core	Apache Lucene
WikipediaTokenizer	Extension of StandardTokenizer that is aware of Wikipedia syntax.	Class	org.apache.lucene.analysis.wikipedia	Apache Lucene
WikipediaTokenizerFactory	Factory for WikipediaTokenizer.	Class	org.apache.lucene.analysis.wikipedia	Apache Lucene
WordDelimiterFilter	Splits words into subwords and performs optional transformations on subword groups.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
WordDelimiterFilterFactory	Factory for WordDelimiterFilter.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
WordDelimiterIterator	A BreakIterator-like API for iterating over subwords in text, according to WordDelimiterFilter rules.	Class	org.apache.lucene.analysis.miscellaneous	Apache Lucene
WordlistLoader	Loader for text files that represent a list of stopwords.	Class	org.apache.lucene.analysis.util	Apache Lucene
WordnetSynonymParser	Parser for wordnet prolog format See http://wordnet.	Class	org.apache.lucene.analysis.synonym	Apache Lucene
DefaultICUTokenizerConfig	Default ICUTokenizerConfig that is generally applicable Generally tokenizes Unicode text according to UAX#29	Class	org.apache.lucene.analysis.icu.segmentation	Apache Lucene
ICUFoldingFilter	A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUFoldingFilterFactory	Factory for ICUFoldingFilter.	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUNormalizer2CharFilter	Normalize token text with ICU's Normalizer2.	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUNormalizer2CharFilterFactory	Factory for ICUNormalizer2CharFilter Supports the following attributes:	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUNormalizer2Filter	Normalize token text with ICU's Normalizer2 With this filter, you can normalize text in the following ways:	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUNormalizer2FilterFactory	Factory for ICUNormalizer2Filter Supports the following attributes:	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUTokenizer	Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.	Class	org.apache.lucene.analysis.icu.segmentation	Apache Lucene
ICUTokenizerConfig	Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.	Class	org.apache.lucene.analysis.icu.segmentation	Apache Lucene
ICUTokenizerFactory	Factory for ICUTokenizer.	Class	org.apache.lucene.analysis.icu.segmentation	Apache Lucene
ICUTransformFilter	A TokenFilter that transforms text with ICU.	Class	org.apache.lucene.analysis.icu	Apache Lucene
ICUTransformFilterFactory	Factory for ICUTransformFilter.	Class	org.apache.lucene.analysis.icu	Apache Lucene
ScriptAttribute	This attribute stores the UTR #24 script value for a token of text.	Interface	org.apache.lucene.analysis.icu.tokenattributes	Apache Lucene
ScriptAttributeImpl	WARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.icu.tokenattributes	Apache Lucene
BaseFormAttribute	Attribute for Token.	Interface	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
BaseFormAttributeImpl	Attribute for Token.	Class	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
BinaryDictionary	Base class for a binary-encoded in-memory dictionary.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
CharacterDefinition	Character category data.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
ConnectionCosts		Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
CSVUtil		Class	org.apache.lucene.analysis.ja.util	Apache Lucene
Dictionary		Interface	org.apache.lucene.analysis.ja.dict	Apache Lucene
GraphvizFormatter	Outputs the dot (graphviz) string for the viterbi lattice.	Class	org.apache.lucene.analysis.ja	Apache Lucene
InflectionAttribute	Attribute for Kuromoji inflection data.	Interface	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
InflectionAttributeImpl	Attribute for Kuromoji inflection data.	Class	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
JapaneseAnalyzer	Analyzer for Japanese that uses morphological analysis.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseBaseFormFilter	Replaces term text with the BaseFormAttribute.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseBaseFormFilterFactory	Factory for JapaneseBaseFormFilter.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseIterationMarkCharFilter	Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseIterationMarkCharFilterFactory	Factory for JapaneseIterationMarkCharFilter.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseKatakanaStemFilter	A TokenFilter that normalizes common katakana spelling variations ending in a long sound character by removing this character (U+30FC).	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseKatakanaStemFilterFactory	Factory for JapaneseKatakanaStemFilter.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapanesePartOfSpeechStopFilter	Removes tokens that match a set of part-of-speech tags.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapanesePartOfSpeechStopFilterFactory	Factory for JapanesePartOfSpeechStopFilter.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseReadingFormFilter	A TokenFilter that replaces the term attribute with the reading of a token in either katakana or romaji form.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseReadingFormFilterFactory	Factory for JapaneseReadingFormFilter.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseTokenizer	Tokenizer for Japanese that uses morphological analysis.	Class	org.apache.lucene.analysis.ja	Apache Lucene
JapaneseTokenizer .Mode	Tokenization mode: this determines how the tokenizer handles compound and unknown words.	Class	org.apache.lucene.analysis.ja.JapaneseTokenizer	Apache Lucene
JapaneseTokenizer .Type	Token type reflecting the original source of this tokenKnown words from the system dictionary.	Class	org.apache.lucene.analysis.ja.JapaneseTokenizer	Apache Lucene
JapaneseTokenizerFactory	Factory for JapaneseTokenizer.	Class	org.apache.lucene.analysis.ja	Apache Lucene
Lucene43JapanesePartOfSpeechStopFilter	Backcompat JapanesePartOfSpeechStopFilter for versions 4.	Class	org.apache.lucene.analysis.ja	Apache Lucene
PartOfSpeechAttribute	Attribute for Token.	Interface	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
PartOfSpeechAttributeImpl	Attribute for Token.	Class	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
ReadingAttribute	Attribute for Kuromoji reading data Note: in some cases this value may not be applicable,	Interface	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
ReadingAttributeImpl		Class	org.apache.lucene.analysis.ja.tokenattributes	Apache Lucene
Token	Analyzed token with morphological data from its dictionary.	Class	org.apache.lucene.analysis.ja	Apache Lucene
TokenInfoDictionary	Binary dictionary implementation for a known-word dictionary model: Words are encoded into an FST mapping to a list of wordIDs.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
TokenInfoFST	Thin wrapper around an FST with root-arc caching for Japanese.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
ToStringUtil	Utility class for english translations of morphological data, used only for debugging.	Class	org.apache.lucene.analysis.ja.util	Apache Lucene
UnknownDictionary	Dictionary for unknown-word handling.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
UserDictionary	Class for building a User Dictionary.	Class	org.apache.lucene.analysis.ja.dict	Apache Lucene
MorfologikAnalyzer	Analyzer using Morfologik library.	Class	org.apache.lucene.analysis.morfologik	Apache Lucene
MorfologikFilter	TokenFilter using Morfologik library to transform input tokens into lemma and morphosyntactic (POS) tokens.	Class	org.apache.lucene.analysis.morfologik	Apache Lucene
MorfologikFilterFactory	Filter factory for MorfologikFilter.	Class	org.apache.lucene.analysis.morfologik	Apache Lucene
MorphosyntacticTagsAttribute	Morfologik provides morphosyntactic annotations for surface forms.	Interface	org.apache.lucene.analysis.morfologik	Apache Lucene
MorphosyntacticTagsAttributeImpl	Morphosyntactic annotations for surface forms.	Class	org.apache.lucene.analysis.morfologik	Apache Lucene
BeiderMorseFilter	TokenFilter for Beider-Morse phonetic encoding.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
BeiderMorseFilterFactory	Factory for BeiderMorseFilter.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
DaitchMokotoffSoundexFilter		Class	org.apache.lucene.analysis.phonetic	Apache Lucene
DaitchMokotoffSoundexFilterFactory	Factory for DaitchMokotoffSoundexFilter.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
DoubleMetaphoneFilter		Class	org.apache.lucene.analysis.phonetic	Apache Lucene
DoubleMetaphoneFilterFactory	Factory for DoubleMetaphoneFilter.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
PhoneticFilter	Create tokens for phonetic matches.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
PhoneticFilterFactory	Factory for PhoneticFilter.	Class	org.apache.lucene.analysis.phonetic	Apache Lucene
AnalyzerProfile	Manages analysis data configuration for SmartChineseAnalyzer SmartChineseAnalyzer has a built-in dictionary and stopword list out-of-box.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
CharType	Internal SmartChineseAnalyzer character type constants.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
HHMMSegmenter	Finds the optimal segmentation of a sentence into Chinese wordsWARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.cn.smart.hhmm	Apache Lucene
HMMChineseTokenizer	Tokenizer for Chinese or mixed Chinese-English text.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
HMMChineseTokenizerFactory	Factory for HMMChineseTokenizer Note: this class will currently emit tokens for punctuation.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
SegToken	SmartChineseAnalyzer internal tokenWARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.cn.smart.hhmm	Apache Lucene
SegTokenFilter	Filters a SegToken by converting full-width latin to half-width, then lowercasing latin.	Class	org.apache.lucene.analysis.cn.smart.hhmm	Apache Lucene
SentenceTokenizer	Tokenizes input text into sentences.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
SmartChineseAnalyzer	SmartChineseAnalyzer is an analyzer for Chinese or mixed Chinese-English text.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
SmartChineseSentenceTokenizerFactory	Factory for the SmartChineseAnalyzer SentenceTokenizerWARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
SmartChineseWordTokenFilterFactory	Factory for the SmartChineseAnalyzer WordTokenFilter Note: this class will currently emit tokens for punctuation.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
Utility	SmartChineseAnalyzer utility constants and methodsWARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
WordTokenFilter	A TokenFilter that breaks sentences into words.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
WordType	Internal SmartChineseAnalyzer token type constantsWARNING: This API is experimental and might change in incompatible ways in the next release.	Class	org.apache.lucene.analysis.cn.smart	Apache Lucene
PolishAnalyzer	Analyzer for Polish.	Class	org.apache.lucene.analysis.pl	Apache Lucene
StempelFilter	Transforms the token stream as per the stemming algorithm.	Class	org.apache.lucene.analysis.stempel	Apache Lucene
StempelPolishStemFilterFactory	Factory for StempelFilter using a Polish stemming table.	Class	org.apache.lucene.analysis.stempel	Apache Lucene
StempelStemmer	Stemmer class is a convenient facade for other stemmer-related classes.	Class	org.apache.lucene.analysis.stempel	Apache Lucene
AEProvider		Interface	org.apache.lucene.analysis.uima.ae	Apache Lucene
AEProviderFactory		Class	org.apache.lucene.analysis.uima.ae	Apache Lucene
BaseUIMATokenizer	Abstract base implementation of a Tokenizer which is able to analyze the given input with a UIMA AnalysisEngine	Class	org.apache.lucene.analysis.uima	Apache Lucene
BasicAEProvider	Basic AEProvider which just instantiates a UIMA AnalysisEngine with no additional metadata, parameters or resources	Class	org.apache.lucene.analysis.uima.ae	Apache Lucene
OverridingParamsAEProvider	AEProvider implementation that creates an Aggregate AE from the given path, also injecting runtime parameters defined in the solrconfig.	Class	org.apache.lucene.analysis.uima.ae	Apache Lucene
UIMAAnnotationsTokenizer		Class	org.apache.lucene.analysis.uima	Apache Lucene
UIMAAnnotationsTokenizerFactory		Class	org.apache.lucene.analysis.uima	Apache Lucene
UIMABaseAnalyzer		Class	org.apache.lucene.analysis.uima	Apache Lucene
UIMATypeAwareAnalyzer		Class	org.apache.lucene.analysis.uima	Apache Lucene
UIMATypeAwareAnnotationsTokenizer	A Tokenizer which creates token from UIMA Annotations filling also their TypeAttribute according to FeaturePaths specified	Class	org.apache.lucene.analysis.uima	Apache Lucene
UIMATypeAwareAnnotationsTokenizerFactory		Class	org.apache.lucene.analysis.uima	Apache Lucene
BaseTokenStreamTestCase	Base class for all Lucene unit tests that use TokenStreams.	Class	org.apache.lucene.analysis	Apache Lucene
BaseTokenStreamTestCase .CheckClearAttributesAttribute	Attribute that records if it was cleared or not.	Interface	org.apache.lucene.analysis.BaseTokenStreamTestCase	Apache Lucene
BaseTokenStreamTestCase .CheckClearAttributesAttributeImpl	Attribute that records if it was cleared or not.	Class	org.apache.lucene.analysis.BaseTokenStreamTestCase	Apache Lucene
CannedBinaryTokenStream		Class	org.apache.lucene.analysis	Apache Lucene
CannedBinaryTokenStream .BinaryToken	Represents a binary token.	Class	org.apache.lucene.analysis.CannedBinaryTokenStream	Apache Lucene
CannedTokenStream	TokenStream from a canned list of Tokens.	Class	org.apache.lucene.analysis	Apache Lucene
CollationTestBase	Base test class for testing Unicode collation.	Class	org.apache.lucene.analysis	Apache Lucene
CrankyTokenFilter	Throws IOException from random Tokenstream methods.	Class	org.apache.lucene.analysis	Apache Lucene
LookaheadTokenFilter	An abstract TokenFilter to make it easier to build graph token filters requiring some lookahead.	Class	org.apache.lucene.analysis	Apache Lucene
LookaheadTokenFilter .Position	Holds all state for a single position; subclass this to record other state at each position.	Class	org.apache.lucene.analysis.LookaheadTokenFilter	Apache Lucene
MockAnalyzer	Analyzer for testing This analyzer is a replacement for Whitespace/Simple/KeywordAnalyzers	Class	org.apache.lucene.analysis	Apache Lucene
MockBytesAnalyzer	Analyzer for testing that encodes terms as UTF-16 bytes.	Class	org.apache.lucene.analysis	Apache Lucene
MockCharFilter	the purpose of this charfilter is to send offsets out of bounds if the analyzer doesn't use correctOffset or does incorrect offset math.	Class	org.apache.lucene.analysis	Apache Lucene
MockFixedLengthPayloadFilter	TokenFilter that adds random fixed-length payloads.	Class	org.apache.lucene.analysis	Apache Lucene
MockGraphTokenFilter	Randomly inserts overlapped (posInc=0) tokens with posLength sometimes > 1.	Class	org.apache.lucene.analysis	Apache Lucene
MockHoleInjectingTokenFilter		Class	org.apache.lucene.analysis	Apache Lucene
MockPayloadAnalyzer	Wraps a whitespace tokenizer with a filter that sets the first token, and odd tokens to posinc=1, and all others	Class	org.apache.lucene.analysis	Apache Lucene
MockRandomLookaheadTokenFilter	Uses LookaheadTokenFilter to randomly peek at future tokens.	Class	org.apache.lucene.analysis	Apache Lucene
MockReaderWrapper	Wraps a Reader, and can throw random or fixed exceptions, and spoon feed read chars.	Class	org.apache.lucene.analysis	Apache Lucene
MockTokenFilter	A tokenfilter for testing that removes terms accepted by a DFA.	Class	org.apache.lucene.analysis	Apache Lucene
MockTokenizer	Tokenizer for testing.	Class	org.apache.lucene.analysis	Apache Lucene
MockUTF16TermAttributeImpl	Extension of CharTermAttributeImpl that encodes the term text as UTF-16 bytes instead of as UTF-8 bytes.	Class	org.apache.lucene.analysis	Apache Lucene
MockVariableLengthPayloadFilter	TokenFilter that adds random variable-length payloads.	Class	org.apache.lucene.analysis	Apache Lucene
SimplePayloadFilter		Class	org.apache.lucene.analysis	Apache Lucene
TokenStreamToDot	Consumes a TokenStream and outputs the dot (graphviz) string (graph).	Class	org.apache.lucene.analysis	Apache Lucene
ValidatingTokenFilter	A TokenFilter that checks consistency of the tokens (eg offsets are consistent with one another).	Class	org.apache.lucene.analysis	Apache Lucene
VocabularyAssert		Class	org.apache.lucene.analysis	Apache Lucene

Search Java Classes and Packages