public String[] lemmatize(String[] toks, String[] tags) { try { LemmaSample predsSample = mSampleStream.read(); // checks if the streams are sync for (int i = 0; i < toks.length; i++) { if (!toks[i].equals(predsSample.getTokens()[i]) || !tags[i].equals(predsSample.getTags()[i])) { throw new RuntimeException("The streams are not sync!" + "\n expected sentence: " + Arrays.toString(toks) + "\n expected tags: " + Arrays.toString(tags) + "\n predicted sentence: " + Arrays.toString(predsSample.getTokens()) + "\n predicted tags: " + Arrays.toString(predsSample.getTags())); } } return predsSample.getLemmas(); } catch (IOException e) { throw new RuntimeException(e); } }
/**
 * Checks the evaluator results against the results got using the conlleval,
 * available at http://www.cnts.ua.ac.be/conll2000/chunking/output.html but
 * containing lemmas instead of chunks.
 *
 * @throws IOException if the test data file cannot be read
 */
@Test
public void testEvaluator() throws IOException {
  // Both streams read the same file: it carries expected and predicted lemmas
  // in separate columns, selected by the boolean flag passed to the stream.
  String inPredicted = "opennlp/tools/lemmatizer/output.txt";
  String inExpected = "opennlp/tools/lemmatizer/output.txt";
  String encoding = "UTF-8";

  DummyLemmaSampleStream predictedSample = new DummyLemmaSampleStream(
      new PlainTextByLineStream(
          new MockInputStreamFactory(new File(inPredicted)), encoding), true);

  DummyLemmaSampleStream expectedSample = new DummyLemmaSampleStream(
      new PlainTextByLineStream(
          new MockInputStreamFactory(new File(inExpected)), encoding), false);

  Lemmatizer dummyLemmatizer = new DummyLemmatizer(predictedSample);

  OutputStream stream = new ByteArrayOutputStream();
  LemmatizerEvaluationMonitor listener = new LemmaEvaluationErrorListener(stream);
  LemmatizerEvaluator evaluator = new LemmatizerEvaluator(dummyLemmatizer, listener);

  evaluator.evaluate(expectedSample);

  Assert.assertEquals(0.9877049180327869, evaluator.getWordAccuracy(), DELTA);
  // The error listener must have written at least one report. The original
  // assertNotSame(length, 0) compared boxed Integer references (identity, not
  // value) with reversed expected/actual order; assert the value directly.
  Assert.assertTrue("Expected error listener output to be non-empty",
      stream.toString().length() > 0);
}