/**
 * Wraps the incoming {@link TokenStream} in a {@link JapaneseBasicFormFilter}.
 *
 * @param stream the upstream token stream to filter
 * @return the stream wrapped in a {@link JapaneseBasicFormFilter}
 */
public TokenStream create(TokenStream stream) {
  TokenStream basicFormStream = new JapaneseBasicFormFilter(stream);
  return basicFormStream;
}
}
/**
 * Creates
 * {@link org.apache.lucene.analysis.util.ReusableAnalyzerBase.TokenStreamComponents}
 * used to tokenize all the text in the provided {@link Reader}.
 *
 * @return {@link org.apache.lucene.analysis.util.ReusableAnalyzerBase.TokenStreamComponents}
 *         built from a {@link JapaneseTokenizer} filtered with
 *         {@link JapaneseWidthFilter}, {@link JapanesePunctuationFilter},
 *         {@link JapanesePartOfSpeechStopFilter}, {@link StopFilter} (over the
 *         configured stopword set), {@link KeywordMarkerFilter} if a stem
 *         exclusion set is provided, {@link JapaneseBasicFormFilter},
 *         {@link JapaneseKatakanaStemFilter}, and {@link LowerCaseFilter}
 */
@Override
protected TokenStreamComponents createComponents(String field, Reader reader) {
  // NOTE(review): second JapaneseTokenizer argument is passed as null here —
  // presumably an optional user-dictionary parameter; confirm against the
  // JapaneseTokenizer constructor.
  Tokenizer tokenizer = new JapaneseTokenizer(reader, null, dictionaryDir);
  // Width normalization runs first so every later filter sees canonical forms.
  TokenStream stream = new JapaneseWidthFilter(tokenizer);
  stream = new JapanesePunctuationFilter(true, stream);
  // Remove tokens matching the configured part-of-speech stop tags.
  stream = new JapanesePartOfSpeechStopFilter(true, stream, stoptags);
  stream = new StopFilter(matchVersion, stream, stopwords);
  // Mark stem-exclusion terms as keywords so the stemming filters below
  // leave them untouched; skipped entirely when the set is empty.
  if (!stemExclusionSet.isEmpty())
    stream = new KeywordMarkerFilter(stream, stemExclusionSet);
  stream = new JapaneseBasicFormFilter(stream);
  stream = new JapaneseKatakanaStemFilter(stream);
  // Lowercasing happens last, after all Japanese-specific filtering.
  stream = new LowerCaseFilter(matchVersion, stream);
  return new TokenStreamComponents(tokenizer, stream);
}
}