// Look up the token in WordNet: if a usable POS tag is present, restrict the
// lookup to that part of speech; otherwise try every WordNet part of speech.
if (t.getPos() != null && POS.getPartOfSpeech(t.getPos().charAt(0)) != null) {
    POS pos = POS.getPartOfSpeech(t.getPos().charAt(0));
    List<String> stems = stemmer.findStems(t.getCoveredText(), pos);
    if (!stems.isEmpty()) {
        IIndexWord wnWord = dict.getIndexWord(stems.get(0), pos);
        if (wnWord != null) {
            WordnetDictTerm wdt = new WordnetDictTerm(jCas, t.getBegin(), t.getEnd());
            wdt.setDictCanon(stems.get(0));
            wdt.setEntityId(wnWord.getID().toString());
            wdt.addToIndexes(); // register the annotation in the CAS
        }
    }
} else {
    for (POS pos : POS.values()) {
        List<String> stems = stemmer.findStems(t.getCoveredText(), pos);
        if (!stems.isEmpty()) {
            IIndexWord wnWord = dict.getIndexWord(stems.get(0), pos);
            if (wnWord != null) {
                WordnetDictTerm wdt = new WordnetDictTerm(jCas, t.getBegin(), t.getEnd());
                wdt.setDictCanon(stems.get(0));
                wdt.setEntityId(wnWord.getID().toString());
                wdt.addToIndexes(); // register the annotation in the CAS
                LOG.trace("WordnetDictTerm >" + t.getCoveredText() + "< ["
                        + t.getBegin() + ":" + t.getEnd() + "] from doc "
                        + getHeaderDocId(jCas));
            }
        }
    }
}
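// The lookup above presupposes an open JWI IDictionary and a WordnetStemmer
// built on it. A minimal setup sketch, assuming JWI's standard API and a
// placeholder dictionary path (typically done once in initialize()):
import java.io.File;
import java.io.IOException;

import edu.mit.jwi.Dictionary;
import edu.mit.jwi.IDictionary;
import edu.mit.jwi.morph.WordnetStemmer;

public class WordnetSetup {
    public static void main(String[] args) throws IOException {
        // Point this at a local WordNet 'dict' directory (placeholder path)
        IDictionary dict = new Dictionary(new File("/path/to/wordnet/dict"));
        dict.open();
        // The stemmer uses the dictionary's morphological exception lists
        WordnetStemmer stemmer = new WordnetStemmer(dict);
        // A null POS returns stems over all parts of speech, e.g. [neuron]
        System.out.println(stemmer.findStems("neurons", null));
    }
}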
public FeatureStructure createFS(int addr, CASImpl cas) {
    if (Token_Type.this.useExistingInstance) {
        // Return eq fs instance if already created
        FeatureStructure fs = Token_Type.this.jcas.getJfsFromCaddr(addr);
        if (null == fs) {
            fs = new Token(addr, Token_Type.this);
            Token_Type.this.jcas.putJfsFromCaddr(addr, fs);
            return fs;
        }
        return fs;
    } else
        return new Token(addr, Token_Type.this);
}
};
/**
 * This process(JCas) method cycles through all annotations in the CAS. For
 * those that are identified as tokens by the {@link AnnotationDataExtractor}
 * implementation being used, an attempt is made to extract part-of-speech
 * information. The covered text for each token is then lemmatized with the
 * {@link BioLemmatizer}, using the part-of-speech information if it was
 * available.
 */
@Override
public void process(JCas jCas) throws AnalysisEngineProcessException {
    for (Token t : JCasUtil.select(jCas, Token.class)) {
        String pos = BlueCasUtil.getSinglePosTag(t);
        String lemma = lemmatize(t.getCoveredText(), pos);
        if (lemma != null)
            t.setLemmaStr(lemma);
    }
}
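// The lemmatize(String, String) helper is not part of this excerpt. A plausible
// sketch on top of the BioLemmatizer API (hedged reconstruction assuming a
// 'bioLemmatizer' field created in initialize(); not the project's actual code):
import java.util.Collection;

import edu.ucdenver.ccp.nlp.biolemmatizer.LemmataEntry;
import edu.ucdenver.ccp.nlp.biolemmatizer.LemmataEntry.Lemma;

private String lemmatize(String text, String pos) {
    // lemmatizeByLexiconAndRules() tolerates a null/empty POS tag and then
    // proposes lemmata for every candidate part of speech
    LemmataEntry entry = bioLemmatizer.lemmatizeByLexiconAndRules(text, pos);
    Collection<Lemma> lemmas = entry.getLemmas();
    return lemmas.isEmpty() ? null : lemmas.iterator().next().getLemma();
}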
if (a instanceof Token) {
    final Token token = (Token) a;
    if (prevToken != null && prevToken.getEnd() < token.getBegin()) {
        // Gap between the previous token and this one: add boundary states
        // and a transition bridging the gap
        states.put(prevToken.getEnd(), new State(prevToken.getEnd()));
        states.put(token.getBegin(), new State(token.getBegin()));
        transitions.put(prevToken.getEnd(),
                new Transition(0, prevToken.getEnd(), token.getBegin(), null));
    }
    prevToken = token; // remember the token for the next gap check
}
// Check whether the current token starts inside one of the BR annotations
for (int i = 0; i < allBrs.length; i++) {
    if (allBrs[i] != null && token.getEnd() > allBrs[i].getBegin()) {
        coveringBr = allBrs[i];
        allBrs[i] = null; // consume this BR so it is not matched twice
        // Advance the token iterator until the covering BR has been spanned
        while (!endOfBR && tokenIt.hasNext()) {
            Token nextT = tokenIt.next();
            if (nextT.getEnd() >= coveringBr.getEnd())
                endOfBR = true;
        }
    }
}
if (coveringBr != null) {
    // Token belongs to a brain-region (BR) mention
    feats[FORM] = token.getCoveredText();
    feats[LEMMA] = token.getLemmaStr(); // FIXME ensure lemma is set
    feats[POS] = token.getPos();
    feats[ENTITY_TYPE] = BR_LABEL;
} else {
    // Token outside any BR mention
    feats[FORM] = token.getCoveredText();
    feats[LEMMA] = token.getLemmaStr(); // FIXME ensure lemma is set
    feats[POS] = token.getPos();
    feats[ENTITY_TYPE] = Word.OTHER_LABEL;
    feats[LABEL] = Word.OTHER_LABEL;
}
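// The FORM, LEMMA, POS, ENTITY_TYPE and LABEL indices above address columns of
// a CoNLL-style feature row; their concrete values are not in this excerpt.
// A purely illustrative layout (hypothetical constants, not the project's):
static final int FORM = 0;
static final int LEMMA = 1;
static final int POS = 2;
static final int ENTITY_TYPE = 3;
static final int LABEL = 4;
// one feats row per token, later written out tab-separated
String[] feats = new String[5];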
// Wrap the UIMA token's covered text as a Mallet token and attach features
cc.mallet.types.Token malletToken = new cc.mallet.types.Token(
        t.getCoveredText());
data.add(malletToken);
malletToken.setFeatureValue(PROPERTY_POS + t.getPos(), 1.0);
if (t.getLemmaStr() != null && t.getLemmaStr().length() > 1)
    malletToken.setFeatureValue(PROPERTY_LEMMA + t.getLemmaStr(), 1.0);
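// In Mallet, feature-bearing tokens like these are usually collected into a
// TokenSequence and wrapped in an Instance, so 'data' above would be that
// sequence. A sketch under that assumption (the instance name is illustrative):
import cc.mallet.types.Instance;
import cc.mallet.types.TokenSequence;

TokenSequence data = new TokenSequence();
// ... add one cc.mallet.types.Token per UIMA token, as shown above ...
Instance inst = new Instance(data, null /* target */, "doc-1" /* name */, null /* source */);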
return "Token[" + t.getCoveredText() + "]";
normalized = ((Token) a).getLemmaStr();
if (!caseSensitive && normalized != null) {
    normalized = normalized.toLowerCase();
}