How to use
NaiveBayesMultinomialText
in
weka.classifiers.bayes

Best Java code snippets using weka.classifiers.bayes.NaiveBayesMultinomialText (Showing top 8 results out of 315)

 /**
  * Main method for testing this class.
  *
  * @param args the options
  */
 public static void main(String[] args) {
  runClassifier(new NaiveBayesMultinomialText(), args);
 }
}

if (getUseWordFrequencies()) {
 options.add("-W");
options.add("" + getPeriodicPruning());
options.add("-M");
options.add("" + getMinWordFrequency());
if (getNormalizeDocLength()) {
 options.add("-normalize");
options.add("" + getNorm());
options.add("-lnorm");
options.add("" + getLNorm());
if (getLowercaseTokens()) {
 options.add("-lowercase");
if (getStopwordsHandler() != null) {
 options.add("-stopwords-handler");
 String spec = getStopwordsHandler().getClass().getName();
 if (getStopwordsHandler() instanceof OptionHandler) {
  spec +=
   " "
    + Utils.joinOptions(((OptionHandler) getStopwordsHandler())
     .getOptions());
String spec = getTokenizer().getClass().getName();
if (getTokenizer() instanceof OptionHandler) {
 spec +=
  " " + Utils.joinOptions(((OptionHandler) getTokenizer()).getOptions());

reset();
getCapabilities().testWithFail(data);
 updateClassifier(data.instance(i));
 pruneDictionary(true);

/** Creates a default NaiveBayesMultinomialText */
public Classifier getClassifier() {
 return new NaiveBayesMultinomialText();
}

if (getUseWordFrequencies()) {
 options.add("-W");
options.add("" + getPeriodicPruning());
options.add("-M");
options.add("" + getMinWordFrequency());
if (getNormalizeDocLength()) {
 options.add("-normalize");
options.add("" + getNorm());
options.add("-lnorm");
options.add("" + getLNorm());
if (getLowercaseTokens()) {
 options.add("-lowercase");
if (getStopwordsHandler() != null) {
 options.add("-stopwords-handler");
 String spec = getStopwordsHandler().getClass().getName();
 if (getStopwordsHandler() instanceof OptionHandler) {
  spec +=
   " "
    + Utils.joinOptions(((OptionHandler) getStopwordsHandler())
     .getOptions());
String spec = getTokenizer().getClass().getName();
if (getTokenizer() instanceof OptionHandler) {
 spec +=
  " " + Utils.joinOptions(((OptionHandler) getTokenizer()).getOptions());

reset();
getCapabilities().testWithFail(data);
 updateClassifier(data.instance(i));
 pruneDictionary(true);

/** Creates a default NaiveBayesMultinomialText */
public Classifier getClassifier() {
 return new NaiveBayesMultinomialText();
}

 /**
  * Main method for testing this class.
  *
  * @param args the options
  */
 public static void main(String[] args) {
  runClassifier(new NaiveBayesMultinomialText(), args);
 }
}

Javadoc

Multinomial naive bayes for text data. Operates directly (and only) on String attributes. Other types of input attributes are accepted but ignored during training and classification

Valid options are:

 -W 
Use word frequencies instead of binary bag of words.

 -P <# instances> 
How often to prune the dictionary of low frequency words (default = 0, i.e. don't prune)

 -M <double> 
Minimum word frequency. Words with less than this frequence are ignored. 
If periodic pruning is turned on then this is also used to determine which 
words to remove from the dictionary (default = 3).

 -normalize 
Normalize document length (use in conjunction with -norm and -lnorm)

 -norm <num> 
Specify the norm that each instance must have (default 1.0)

 -lnorm <num> 
Specify L-norm to use (default 2.0)

 -lowercase 
Convert all tokens to lowercase before adding to the dictionary.

 -stopwords-handler 
The stopwords handler to use (default Null).

 -tokenizer <spec> 
The tokenizing algorihtm (classname plus parameters) to use. 
(default: weka.core.tokenizers.WordTokenizer)

 -stemmer <spec> 
The stemmering algorihtm (classname plus parameters) to use.

 -output-debug-info 
If set, classifier is run in debug mode and 
may output additional info to the console

 -do-not-check-capabilities 
If set, classifier capabilities are not checked before classifier is built 
(use with caution).

Most used methods

<init>
getCapabilities
Returns default capabilities of the classifier.
getLNorm
Get the L Norm used.
getLowercaseTokens
Get whether to convert all tokens to lowercase
getMinWordFrequency
Get the minimum word frequency. Words that don't occur at least min freq times are ignored when upda
getNorm
Get the instance's Norm.
getNormalizeDocLength
Get whether to normalize the length of each document
getPeriodicPruning
Get how often to prune the dictionary
getStemmer
Returns the current stemming algorithm, null if none is used.
getStopwordsHandler
Gets the stopwords handler.
getTokenizer
Returns the current tokenizer algorithm.
getUseWordFrequencies
Get whether to use word frequencies rather than binary bag of words representation.

Popular in Java

Reading from database using SQL prepared statement
setRequestProperty (URLConnection)
findViewById (Activity)
runOnUiThread (Activity)
String (java.lang)
URLEncoder (java.net)
This class is used to encode a string using the format required by application/x-www-form-urlencoded
ResultSet (java.sql)
An interface for an object which represents a database table entry, returned as the result of the qu
LinkedHashMap (java.util)
LinkedHashMap is an implementation of Map that guarantees iteration order. All optional operations a
JTable (javax.swing)
Location (org.springframework.beans.factory.parsing)
Class that models an arbitrary location in a Resource.Typically used to track the location of proble
Top plugins for Android Studio

How to useNaiveBayesMultinomialText in weka.classifiers.bayes

Best Java code snippets using weka.classifiers.bayes.NaiveBayesMultinomialText (Showing top 8 results out of 315)

How to use
NaiveBayesMultinomialText
in
weka.classifiers.bayes