
How to use the spanList method in com.yahoo.document.annotation.SpanTree

Best Java code snippets using com.yahoo.document.annotation.SpanTree.spanList (Showing top 2 results out of 315)

@Override
protected void doExecute(ExecutionContext ctx) {
  StringFieldValue input = (StringFieldValue)ctx.getValue();
  SpanList spanList = input.setSpanTree(new SpanTree(SpanTrees.LINGUISTICS)).spanList();
  int lastPosition = 0;
  for (Iterator<GramSplitter.Gram> it = linguistics.getGramSplitter().split(input.getString(), gramSize); it.hasNext();) {
    GramSplitter.Gram gram = it.next();
    // if there is a gap before this gram, then annotate the gram as punctuation
    // (technically it may be of various types, but it does not matter - we just
    // need to annotate it somehow (as a non-term) to make sure it is added to the summary)
    if (lastPosition < gram.getStart()) {
      typedSpan(lastPosition, gram.getStart() - lastPosition, TokenType.PUNCTUATION, spanList);
    }
    // annotate gram as a word term
    String gramString = gram.extractFrom(input.getString());
    typedSpan(gram.getStart(), gram.getLength(), TokenType.ALPHABETIC, spanList).
        annotate(LinguisticsAnnotator.lowerCaseTermAnnotation(gramString, gramString));
    lastPosition = gram.getStart() + gram.getLength();
  }
  // handle punctuation at the end
  if (lastPosition < input.toString().length()) {
    typedSpan(lastPosition, input.toString().length() - lastPosition, TokenType.PUNCTUATION, spanList);
  }
}
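
The gap handling in the snippet above (annotate each gram, and label any text before, between, or after grams as punctuation so the whole string is covered) can be sketched without the Vespa classes. This is a minimal illustration only: `GapFillSketch`, `Piece`, `cover`, and the `TERM`/`GAP` labels are hypothetical names that are not part of the Vespa API, and the sketch assumes the pieces arrive sorted by start offset.

```java
import java.util.ArrayList;
import java.util.List;

public class GapFillSketch {

    /** Hypothetical stand-in for a gram: a start offset and a length within the text. */
    static final class Piece {
        final int start;
        final int length;

        Piece(int start, int length) {
            this.start = start;
            this.length = length;
        }
    }

    /**
     * Returns labeled segments for the text: every piece becomes a TERM segment,
     * and any text not reached by the previous piece becomes a GAP segment.
     * Assumes the pieces are sorted by start offset (an assumption of this sketch).
     */
    public static List<String> cover(String text, List<Piece> pieces) {
        List<String> segments = new ArrayList<>();
        int lastPosition = 0;
        for (Piece piece : pieces) {
            // Text between the end of the previous piece and the start of this one.
            if (lastPosition < piece.start) {
                segments.add("GAP:" + text.substring(lastPosition, piece.start));
            }
            segments.add("TERM:" + text.substring(piece.start, piece.start + piece.length));
            lastPosition = piece.start + piece.length;
        }
        // Trailing text after the last piece.
        if (lastPosition < text.length()) {
            segments.add("GAP:" + text.substring(lastPosition));
        }
        return segments;
    }

    /** Small fixed example: two pieces ("ab" and "cd") inside "ab, cd!". */
    public static List<String> demo() {
        List<Piece> pieces = new ArrayList<>();
        pieces.add(new Piece(0, 2));
        pieces.add(new Piece(4, 2));
        return cover("ab, cd!", pieces);
    }

    public static void main(String[] args) {
        for (String segment : demo()) {
            System.out.println(segment);
        }
    }
}
```

As in the snippet, a single running position is enough to detect gaps, because each piece only needs to be compared with the end of the piece before it.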

/**
 * Annotates the given string with the appropriate linguistics annotations.
 *
 * @param text the text to annotate
 * @return whether or not anything was annotated
 */
public boolean annotate(StringFieldValue text) {
  if (text.getSpanTree(SpanTrees.LINGUISTICS) != null) return true;  // Already annotated with LINGUISTICS.
  Tokenizer tokenizer = factory.getTokenizer();
  String input = (text.getString().length() <= config.getMaxTokenizeLength())
      ? text.getString()
      : text.getString().substring(0, config.getMaxTokenizeLength());
  Iterable<Token> tokens = tokenizer.tokenize(input, config.getLanguage(), config.getStemMode(),
                        config.getRemoveAccents());
  TermOccurrences termOccurrences = new TermOccurrences(config.getMaxTermOccurrences());
  SpanTree tree = new SpanTree(SpanTrees.LINGUISTICS);
  for (Token token : tokens) {
    addAnnotationSpan(text.getString(), tree.spanList(), tokenizer, token, config.getStemMode(), termOccurrences);
  }
  if (tree.numAnnotations() == 0) return false;
  text.setSpanTree(tree);
  return true;
}

Javadoc

Convenience shorthand for (SpanList)getRoot(). This must of course only be used when it is known that the root in this tree actually is a SpanList.
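
To illustrate what a "convenience shorthand for a cast of the root" implies, here is a minimal self-contained sketch of the pattern. The classes below (`Node`, `Span`, `NodeList`, `Tree`) are hypothetical stand-ins written for this example and are not the Vespa classes; the point is only that the shorthand succeeds when the root really is a list node and fails with a `ClassCastException` otherwise.

```java
import java.util.ArrayList;
import java.util.List;

public class SpanListShorthandSketch {

    /** Hypothetical node type; stands in for a span node in this sketch. */
    interface Node {}

    /** Hypothetical leaf node covering a range of text. */
    static final class Span implements Node {
        final int from;
        final int length;

        Span(int from, int length) {
            this.from = from;
            this.length = length;
        }
    }

    /** Hypothetical list node holding child nodes. */
    static final class NodeList implements Node {
        final List<Node> children = new ArrayList<>();

        NodeList add(Node node) {
            children.add(node);
            return this;
        }
    }

    /** Hypothetical tree with a root node and a cast shorthand. */
    static final class Tree {
        private final Node root;

        Tree(Node root) {
            this.root = root;
        }

        Node getRoot() {
            return root;
        }

        /** Shorthand for (NodeList) getRoot(); throws ClassCastException if the root is not a NodeList. */
        NodeList nodeList() {
            return (NodeList) getRoot();
        }
    }

    /** Builds a tree with a list root, adds one span through the shorthand, and returns the child count. */
    public static int childCountAfterAdd() {
        Tree tree = new Tree(new NodeList());
        tree.nodeList().add(new Span(0, 3));
        return tree.nodeList().children.size();
    }

    /** Returns true if the shorthand fails when the root is a plain Span rather than a list. */
    public static boolean failsOnNonListRoot() {
        Tree tree = new Tree(new Span(0, 3));
        try {
            tree.nodeList();
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("children: " + childCountAfterAdd());
        System.out.println("fails on non-list root: " + failsOnNonListRoot());
    }
}
```

Code that cannot be sure of the root type can check it with `instanceof` before casting instead of relying on the shorthand.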

Popular methods of SpanTree

<init>: Creates a new SpanTree with a given root node.
annotate: Adds an Annotation. Convenience shorthand for annotate(node, new Annotation(type, value)).
numAnnotations: Returns the total number of annotations in the tree.
annotateInternal
annotationsEquals
cleanup: Ensures consistency of the tree in case SpanNodes have been removed, and there are still Annotations
clearIndex
copySpan
createIndex
getAnnotations
getCurrentIndexes
getName: Returns the name of this span tree.

