com.ibm.icu.text.Normalizer2.quickCheck java code examples

/**
 * {@inheritDoc}
 * @stable ICU 4.4
 */
@Override
public Normalizer.QuickCheckResult quickCheck(CharSequence s) {
  Normalizer.QuickCheckResult result=Normalizer.YES;
  UnicodeSet.SpanCondition spanCondition=UnicodeSet.SpanCondition.SIMPLE;
  for(int prevSpanLimit=0; prevSpanLimit<s.length();) {
    int spanLimit=set.span(s, prevSpanLimit, spanCondition);
    if(spanCondition==UnicodeSet.SpanCondition.NOT_CONTAINED) {
      spanCondition=UnicodeSet.SpanCondition.SIMPLE;
    } else {
      Normalizer.QuickCheckResult qcResult=
        norm2.quickCheck(s.subSequence(prevSpanLimit, spanLimit));
      if(qcResult==Normalizer.NO) {
        return qcResult;
      } else if(qcResult==Normalizer.MAYBE) {
        result=qcResult;
      }
      spanCondition=UnicodeSet.SpanCondition.NOT_CONTAINED;
    }
    prevSpanLimit=spanLimit;
  }
  return result;
}
/**

/**
 * Performing quick check on a string, to quickly determine if the string is
 * in a particular normalization format.
 * Three types of result can be returned Normalizer.YES, Normalizer.NO or
 * Normalizer.MAYBE. Result Normalizer.YES indicates that the argument
 * string is in the desired normalized format, Normalizer.NO determines that
 * argument string is not in the desired normalized format. A
 * Normalizer.MAYBE result indicates that a more thorough check is required,
 * the user may have to put the string in its normalized form and compare
 * the results.
 *
 * @param source   string for determining if it is in a normalized format
 * @param mode     normalization format (Normalizer.NFC,Normalizer.NFD,
 *                  Normalizer.NFKC,Normalizer.NFKD)
 * @param options   Options for use with exclusion set and tailored Normalization
 *                                   The only option that is currently recognized is UNICODE_3_2
 * @return         Return code to specify if the text is normalized or not
 *                     (Normalizer.YES, Normalizer.NO or Normalizer.MAYBE)
 * @deprecated ICU 56 Use {@link Normalizer2} instead.
 */
@Deprecated
public static QuickCheckResult quickCheck(String source, Mode mode, int options) {
  return mode.getNormalizer2(options).quickCheck(source);
}

/**
 * Performing quick check on a string, to quickly determine if the string is
 * in a particular normalization format.
 * Three types of result can be returned Normalizer.YES, Normalizer.NO or
 * Normalizer.MAYBE. Result Normalizer.YES indicates that the argument
 * string is in the desired normalized format, Normalizer.NO determines that
 * argument string is not in the desired normalized format. A
 * Normalizer.MAYBE result indicates that a more thorough check is required,
 * the user may have to put the string in its normalized form and compare
 * the results.
 *
 * @param source    string for determining if it is in a normalized format
 * @param start     the start index of the source
 * @param limit     the limit index of the source it is equal to the length
 * @param mode      normalization format (Normalizer.NFC,Normalizer.NFD,
 *                   Normalizer.NFKC,Normalizer.NFKD)
 * @param options   Options for use with exclusion set and tailored Normalization
 *                                   The only option that is currently recognized is UNICODE_3_2
 * @return          Return code to specify if the text is normalized or not
 *                   (Normalizer.YES, Normalizer.NO or
 *                   Normalizer.MAYBE)
 * @deprecated ICU 56 Use {@link Normalizer2} instead.
 */
@Deprecated
public static QuickCheckResult quickCheck(char[] source,int start,
                     int limit, Mode mode,int options) {
  CharBuffer srcBuffer = CharBuffer.wrap(source, start, limit - start);
  return mode.getNormalizer2(options).quickCheck(srcBuffer);
}

 @Override
 public final boolean incrementToken() throws IOException {
  if (input.incrementToken()) {
   if (normalizer.quickCheck(termAtt) != Normalizer.YES) {
    buffer.setLength(0);
    normalizer.normalize(termAtt, buffer);
    termAtt.setEmpty().append(buffer);
   }
   return true;
  } else {
   return false;
  }
 }
}

 @Override
 public final boolean incrementToken() throws IOException {
  if (input.incrementToken()) {
   if (normalizer.quickCheck(termAtt) != Normalizer.YES) {
    buffer.setLength(0);
    normalizer.normalize(termAtt, buffer);
    termAtt.setEmpty().append(buffer);
   }
   return true;
  } else {
   return false;
  }
 }
}

Javadoc

Tests if the string is normalized. For the two COMPOSE modes, the result could be "maybe" in cases that would take a little more work to resolve definitively. Use spanQuickCheckYes() and normalizeSecondAndAppend() for a faster combination of quick check + normalization, to avoid re-checking the "yes" prefix.

Popular methods of Normalizer2

normalize
getInstance
Returns a Normalizer2 instance which uses the specified data file (an ICU data file if data=null, or
getNFCInstance
Returns a Normalizer2 instance for Unicode NFC normalization. Same as getInstance(null, "nfc", Mode.
getNFDInstance
Returns a Normalizer2 instance for Unicode NFD normalization. Same as getInstance(null, "nfc", Mode.
getNFKCInstance
Returns a Normalizer2 instance for Unicode NFKC normalization. Same as getInstance(null, "nfkc", Mod
getNFKDInstance
Returns a Normalizer2 instance for Unicode NFKD normalization. Same as getInstance(null, "nfkc", Mod
hasBoundaryBefore
Tests if the character always has a normalization boundary before it, regardless of context. If true
isInert
Tests if the character is normalization-inert. If true, then the character does not change, nor norm
normalizeSecondAndAppend
Appends the normalized form of the second string to the first string (merging them at the boundary)
spanQuickCheckYes
Returns the end of the normalized substring of the input string. In other words, with end=spanQuickC
append
Appends the second string to the first string (merging them at the boundary) and returns the first s
composePair
Performs pairwise composition of a & b and returns the composite if there is one.Returns a composite

Popular in Java

Creating JSON documents from java classes using gson
onRequestPermissionsResult (Fragment)
findViewById (Activity)
setContentView (Activity)
BufferedReader (java.io)
Wraps an existing Reader and buffers the input. Expensive interaction with the underlying reader is
FileReader (java.io)
A specialized Reader that reads from a file in the file system. All read requests made by calling me
SimpleDateFormat (java.text)
Formats and parses dates in a locale-sensitive manner. Formatting turns a Date into a String, and pa
Random (java.util)
This class provides methods that return pseudo-random values.It is dangerous to seed Random with the
Pattern (java.util.regex)
Patterns are compiled regular expressions. In many cases, convenience methods such as String#matches
BoxLayout (javax.swing)
Best IntelliJ plugins

How to use quickCheckmethodin com.ibm.icu.text.Normalizer2

Best Java code snippets using com.ibm.icu.text.Normalizer2.quickCheck (Showing top 5 results out of 315)

How to use
quickCheck
method
in
com.ibm.icu.text.Normalizer2