Class DictionaryBuilder


  • public class DictionaryBuilder
    extends Object
    Tool to build dictionaries. Usage:
        java -cp [lucene classpath] org.apache.lucene.analysis.ja.util.DictionaryBuilder \
              ${inputDir} ${outputDir} ${encoding}
     

    The input directory is expected to contain unk.def and matrix.def, plus any number of .csv files, roughly following the conventions of IPADIC. JapaneseTokenizer uses dictionaries built with this tool. Note that these input files generally must be generated from a corpus of real text using tools that are not part of Lucene.
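
    Because the tool depends on a particular directory layout, it can help to check the input directory before running it. The following is a minimal sketch under stated assumptions, not part of Lucene: the class DictionaryInputCheck and its methods are hypothetical names, it uses only java.nio.file, and it checks only that the files named above are present, not that their contents are valid.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not a Lucene class): checks that an input directory
// has the layout described above before DictionaryBuilder is run on it.
final class DictionaryInputCheck {

  private DictionaryInputCheck() {}

  /** Returns the names of required files (unk.def, matrix.def) that are absent from inputDir. */
  public static List<String> missingRequired(Path inputDir) {
    List<String> missing = new ArrayList<>();
    for (String name : new String[] {"unk.def", "matrix.def"}) {
      if (!Files.isRegularFile(inputDir.resolve(name))) {
        missing.add(name);
      }
    }
    return missing;
  }

  /**
   * Counts the .csv files directly inside inputDir. The documentation allows
   * "any number" of them, so the count is informational rather than an error check.
   * Throws an IOException if inputDir cannot be listed (for example, if it is not a directory).
   */
  public static int countCsv(Path inputDir) throws IOException {
    int count = 0;
    try (DirectoryStream<Path> csvFiles = Files.newDirectoryStream(inputDir, "*.csv")) {
      for (Path ignored : csvFiles) {
        count++;
      }
    }
    return count;
  }
}
```

    A caller might run missingRequired(inputDir) first and invoke the tool only when the returned list is empty, logging countCsv(inputDir) as a sanity check on how many lexicon files will be read.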

    WARNING: This API is experimental and might change in incompatible ways in the next release.