Package org.apache.lucene.analysis.hunspell
A Java implementation of Hunspell stemming and
spell-checking algorithms (
Hunspell
), and a stemming
TokenFilter (HunspellStemFilter
) based on it.
For dictionaries, see e.g. LibreOffice repository or Titus Wormer's collection (UTF)
-
Interface Summary Interface Description DictEntries An object representing homonym dictionary entries.FragmentChecker An oracle for quickly checking that a specific part of a word can never be a valid word.NGramFragmentChecker.NGramConsumer A callback for n-gram ranges in words -
Class Summary Class Description AffixedWord An object representing the analysis result of a simple (non-compound) wordAffixedWord.Affix An object representing a prefix or a suffix applied to a word stemDictEntry An object representing *.dic file entry with its word, flags and morphological data.Dictionary In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.EntrySuggestion Suggestion to add/edit dictionary entries to generate a given list of words created byWordFormGenerator.compress(java.util.List<java.lang.String>, java.util.Set<java.lang.String>, java.lang.Runnable)
.Hunspell A spell checker based on Hunspell dictionaries.HunspellStemFilter TokenFilter that uses hunspell affix rules and words to stem tokens.HunspellStemFilterFactory TokenFilterFactory that creates instances ofHunspellStemFilter
.NGramFragmentChecker AFragmentChecker
based on all character n-grams possible in a certain language, keeping them in a relatively memory-efficient, but probabilistic data structure.Suggester A generator for misspelled word corrections based on Hunspell flags.WordFormGenerator A utility class used for generating possible word forms by adding affixes to stems (WordFormGenerator.getAllWordForms(String, String, Runnable)
), and suggesting stems and flags to generate the given set of words (WordFormGenerator.compress(List, Set, Runnable)
). -
Enum Summary Enum Description TimeoutPolicy A strategy determining what to do when Hunspell API calls take too much time -
Exception Summary Exception Description SuggestionTimeoutException An exception thrown whenHunspell.suggest(java.lang.String)
call takes too long, ifTimeoutPolicy.THROW_EXCEPTION
is used.