Naive Bayes classifier. A naive Bayes classifier is a simple probabilistic
classifier based on applying Bayes' theorem with strong (naive) independence
assumptions. Depending on the precise nature of the probability model, naive
Bayes classifiers can be trained very efficiently in a supervised learning
setting.
In spite of their naive design and apparently over-simplified assumptions,
naive Bayes classifiers have worked quite well in many complex real-world
situations and are very popular in Natural Language Processing (NLP).
For a general-purpose naive Bayes classifier without any assumptions
about the underlying distribution of each variable, we don't provide
a learning method to infer the variable distributions from the training data.
Instead, users can fit any appropriate distributions to the data themselves
with the various Distribution classes. Although the
#predict method takes an array of double values as a general form of independent variables,
users are free to use any discrete distributions to model categorical or
ordinal random variables.
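The idea above can be sketched in Python. This is a minimal illustration, not the library's API: the `NaiveBayes` and `Gaussian` classes and the `pdf` interface are assumptions standing in for the Distribution classes; the caller fits one distribution per feature per class and the classifier only combines them.

```python
import math

class NaiveBayes:
    """Minimal general-purpose naive Bayes: the caller supplies, per class,
    one already-fitted distribution for each feature."""

    def __init__(self, priors, distributions):
        # priors[c]: prior probability of class c
        # distributions[c][j]: fitted distribution of feature j given class c,
        # exposing pdf(x) (a pmf works the same way for discrete features)
        self.priors = priors
        self.distributions = distributions

    def predict(self, x):
        # Pick the class with the highest log-posterior. x is an array of
        # doubles, but any feature may be modeled by a discrete distribution.
        best, best_score = None, -math.inf
        for c, prior in enumerate(self.priors):
            score = math.log(prior)
            for j, xj in enumerate(x):
                score += math.log(self.distributions[c][j].pdf(xj))
            if score > best_score:
                best, best_score = c, score
        return best

class Gaussian:
    """Toy continuous distribution for the sketch."""
    def __init__(self, mu, sigma):
        self.mu, self.sigma = mu, sigma
    def pdf(self, x):
        z = (x - self.mu) / self.sigma
        return math.exp(-0.5 * z * z) / (self.sigma * math.sqrt(2 * math.pi))

# One feature, two classes: the user fits the distributions beforehand.
nb = NaiveBayes(
    priors=[0.5, 0.5],
    distributions=[[Gaussian(0.0, 1.0)], [Gaussian(5.0, 1.0)]],
)
```

Because the classifier never looks inside the distribution objects, mixing a Gaussian for one feature with a discrete distribution for another is straightforward.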
For document classification in NLP, there are two major ways to set
up a naive Bayes classifier: the multinomial model and the Bernoulli model. The
multinomial model generates one term from the vocabulary in each position
of the document. The multivariate Bernoulli model, or Bernoulli model,
generates an indicator for each term of the vocabulary, indicating either
the presence or the absence of the term in the document.
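The two document representations can be contrasted in a short sketch (the vocabulary and document are made-up examples):

```python
from collections import Counter

vocabulary = ["chinese", "beijing", "shanghai", "tokyo", "japan"]
doc = "chinese chinese chinese tokyo".split()

# Multinomial model: one term is generated per position, so a document
# is represented by a vector of term counts.
counts = Counter(doc)
multinomial = [counts[t] for t in vocabulary]

# Bernoulli model: one presence/absence indicator per vocabulary term;
# the number of occurrences is discarded.
bernoulli = [1 if t in counts else 0 for t in vocabulary]
```

Here `multinomial` records that "chinese" occurred three times, while `bernoulli` only records that it occurred at all.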
Of the two models, the Bernoulli model is particularly sensitive to noise
features. A Bernoulli naive Bayes classifier requires some form of feature
selection or else its accuracy will be low.
The different generation models imply different estimation strategies and
different classification rules. The Bernoulli model estimates P(t | c) as the
fraction of documents of class c that contain term t. In contrast, the
multinomial model estimates P(t | c) as the fraction of tokens, or fraction of
positions, in documents of class c in which term t occurs. When classifying a
test document, the Bernoulli model uses binary occurrence information,
ignoring the number of occurrences, whereas the multinomial model keeps
track of multiple occurrences. As a result, the Bernoulli model typically
makes many mistakes when classifying long documents. However, the Bernoulli
model has been reported to work better for sentiment analysis.
The models also differ in how non-occurring terms are used in classification.
Non-occurring terms do not affect the classification decision in the multinomial
model, but in the Bernoulli model the probability of non-occurrence is factored
in when computing P(c | d). This is because only the Bernoulli model
models the absence of terms explicitly.
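The Bernoulli estimation and classification rules can be sketched as follows. This is an illustrative implementation, not the library's; the function names are assumptions, and add-one (Laplace) smoothing is assumed for the estimates. Note the `log(1 - p)` term: every vocabulary word contributes to the score, including the absent ones.

```python
import math

def train_bernoulli(docs, labels, vocab):
    # Estimate P(t | c) as the fraction of documents of class c that
    # contain term t, with add-one smoothing: (count + 1) / (n + 2).
    classes = sorted(set(labels))
    cond = {}
    for c in classes:
        class_docs = [set(d) for d, y in zip(docs, labels) if y == c]
        n = len(class_docs)
        cond[c] = {t: (sum(t in d for d in class_docs) + 1) / (n + 2)
                   for t in vocab}
    priors = {c: labels.count(c) / len(labels) for c in classes}
    return priors, cond

def classify_bernoulli(doc, priors, cond, vocab):
    present = set(doc)
    scores = {}
    for c in priors:
        s = math.log(priors[c])
        for t in vocab:
            p = cond[c][t]
            # Absent terms are factored in via the (1 - p) factor;
            # this is where the Bernoulli model differs from the multinomial.
            s += math.log(p) if t in present else math.log(1.0 - p)
        scores[c] = s
    return max(scores, key=scores.get)

# Tiny worked example: class 1 = China-related, class 0 = not.
train = [("chinese beijing chinese", 1),
         ("chinese chinese shanghai", 1),
         ("chinese macao", 1),
         ("tokyo japan chinese", 0)]
docs = [d.split() for d, _ in train]
labels = [y for _, y in train]
vocab = sorted({t for d in docs for t in d})
priors, cond = train_bernoulli(docs, labels, vocab)
label = classify_bernoulli("chinese chinese chinese tokyo japan".split(),
                           priors, cond, vocab)
```

Note that the test document repeats "chinese" three times, yet the Bernoulli model sees only its presence; the absence probabilities for the remaining vocabulary tip the decision.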
A third setting is the Pólya urn model, which simply counts each term
observed in the training data twice instead of once.
See the references for more detail.
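As a sketch of the counting rule just described (the helper name is an assumption), the Pólya urn variant differs from standard multinomial counting only in the increment applied per observed token:

```python
from collections import Counter

def term_counts(doc_tokens, polya_urn=False):
    # Standard multinomial counting adds 1 per observed token; the
    # Pólya urn variant adds each observed token twice instead.
    weight = 2 if polya_urn else 1
    counts = Counter()
    for t in doc_tokens:
        counts[t] += weight
    return counts
```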