How to use usesSpaceDelimiters method in org.carrot2.core.LanguageCode

Best Java code snippets using org.carrot2.core.LanguageCode.usesSpaceDelimiters (Showing top 2 results out of 315)

/**
 * Formats a cluster label for final rendering.
 */
public String format(PreprocessingContext context, int featureIndex)
{
  final char [][] wordsImage = context.allWords.image;
  final int [][] phrasesWordIndices = context.allPhrases.wordIndices;
  final int wordCount = wordsImage.length;
  final StringBuilder label = new StringBuilder();
  if (featureIndex < wordCount)
  {
    final char [] image = wordsImage[featureIndex];
    appendFormatted(label, image, true, false);
  }
  else
  {
    final boolean insertSpace = context.language.getLanguageCode().usesSpaceDelimiters();
    final int [] wordIndices = phrasesWordIndices[featureIndex - wordCount];
    final short [] termTypes = context.allWords.type;
    for (int i = 0; i < wordIndices.length; i++)
    {
      if (insertSpace && i > 0) label.append(' ');
      final int wordIndex = wordIndices[i];
      appendFormatted(label, wordsImage[wordIndex], i == 0,
        TokenTypeUtils.isCommon(termTypes[wordIndex]));
    }
  }
  return label.toString();
}

/**
 * Build the cluster's label from suffix tree edge indices. 
 */
private String buildLabel(int [] phraseIndices)
{
  // Count the number of terms first.
  int termsCount = 0;
  for (int j = 0; j < phraseIndices.length; j += 2)
  {
    termsCount += phraseIndices[j + 1] - phraseIndices[j] + 1;
  }

  // Extract terms info for the phrase and construct the label.
  final boolean [] stopwords = new boolean[termsCount];
  final char [][] images = new char [termsCount][];
  final short [] tokenTypes = context.allWords.type;
  int k = 0;
  for (int i = 0; i < phraseIndices.length; i += 2)
  {
    for (int j = phraseIndices[i]; j <= phraseIndices[i + 1]; j++, k++)
    {
      final int termIndex = sb.input.get(j);
      images[k] = context.allWords.image[termIndex];
      stopwords[k] = TokenTypeUtils.isCommon(tokenTypes[termIndex]);
    }
  }
  
  return LabelFormatter.format(images, stopwords, 
    context.language.getLanguageCode().usesSpaceDelimiters());
}

Javadoc

Returns true if this language uses space delimiters between words. This is a hint for formatting cluster labels.
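Both snippets above use the flag the same way: a space goes between words only when the language reports that it uses space delimiters. The following is a minimal, self-contained sketch of that idea. It does not use the Carrot2 classes: `SketchLanguage` and `LabelJoinSketch` are hypothetical names introduced only for illustration, and the two enum constants and their flag values are assumptions, not the library's actual definitions.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a language enum; only the delimiter flag matters here.
enum SketchLanguage {
    ENGLISH(true),
    CHINESE_SIMPLIFIED(false);

    private final boolean usesSpaceDelimiters;

    SketchLanguage(boolean usesSpaceDelimiters) {
        this.usesSpaceDelimiters = usesSpaceDelimiters;
    }

    // Hint for label formatting: true if words are separated by spaces.
    boolean usesSpaceDelimiters() {
        return usesSpaceDelimiters;
    }
}

class LabelJoinSketch {
    // Joins words into a label, inserting a space between words
    // only when the language uses space delimiters.
    static String join(List<String> words, SketchLanguage language) {
        final boolean insertSpace = language.usesSpaceDelimiters();
        final StringBuilder label = new StringBuilder();
        for (int i = 0; i < words.size(); i++) {
            if (insertSpace && i > 0) label.append(' ');
            label.append(words.get(i));
        }
        return label.toString();
    }

    public static void main(String[] args) {
        System.out.println(join(Arrays.asList("data", "mining"), SketchLanguage.ENGLISH));
        System.out.println(join(Arrays.asList("数据", "挖掘"), SketchLanguage.CHINESE_SIMPLIFIED));
    }
}
```

Reading the flag once before the loop, as the first snippet does, keeps the per-word check cheap and makes the formatting decision easy to see.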

Popular methods of LanguageCode

forISOCode
Returns the LanguageCode constant for the given ISO code, or null if none is available.
name
valueOf
getIsoCode
toString
values

