How to use
AbstractCSVAnnotationsExtension
in
org.phenotips.vocabulary

Best Java code snippets using org.phenotips.vocabulary.AbstractCSVAnnotationsExtension (Showing top 2 results out of 315)

@Override
public boolean isVocabularySupported(@Nonnull final Vocabulary vocabulary)
{
  return getTargetVocabularyIds().contains(vocabulary.getIdentifier());
}

@Override
public void indexingStarted(@Nonnull final Vocabulary vocabulary)
{
  if (this.operationsInProgress.incrementAndGet() == 1) {
    this.data = new HashMap<>();
    try (BufferedReader in = new BufferedReader(
      new InputStreamReader(
        new URL(getAnnotationSource()).openConnection().getInputStream(), StandardCharsets.UTF_8))) {
      CSVFormat parser = setupCSVParser(vocabulary);
      for (final CSVRecord row : parser.parse(in)) {
        processCSVRecordRow(row, vocabulary);
      }
    } catch (final IOException ex) {
      this.logger.error("Failed to load annotation source: {}", ex.getMessage());
    }
  }
}

Javadoc

Implements VocabularyExtension to annotate VocabularyInputTerm from #getTargetVocabularyIds with data from #getAnnotationSource. The default behavior implemented in this base class is to gather data from the named columns in the file, and add this data to the respective terms when reindexing a supported vocabulary. Setting up the names of the columns is done by the concrete class, either by #setupCSVParser the CSV parser to treat the first row as the header definition, or by explicitly assigning names to columns.

To let the first row be parsed as the column names:

 
protected CSVFormat setupCSVParser(Vocabulary vocabulary)}

To explicitly name columns:

 
protected CSVFormat setupCSVParser(Vocabulary vocabulary)}

With the default implementation of #processCSVRecordRow, having a column named id is mandatory.

Columns that are not named are ignored.

Missing, empty, or whitespace-only cells will be ignored.

If multiple rows for the same term identifier exists, then the values are accumulated in lists of values.

If one or more of the fields parsed happen to already have values already in the term being extended, then the existing values will be discarded and replaced with the data read from the input file.

If multiple rows for the same term identifier exists, then the values are accumulated in lists of values. If in the schema definition a field is set as non-multi-valued, then it's the responsibility of the user to make sure that only one value will be specified for such fields. If a value is specified multiple times in the input file, then it will be added multiple times in the field.

Example: for the following parser set-up:

 
CSVFormat.CSV.withHeader("id", null, "symptom", null, "frequency")

and the following input file:

 
MIM:162200,"NEUROFIBROMATOSIS, TYPE I",HP:0009737,"Lisch nodules",HP:0040284,HPO:curators

the following fields will be added: "symptom" "HP:0009737", HP:0001256 "frequency" "HP:0040284", HP:0040283, "HP:0040284"

Most used methods

getAnnotationSource
getTargetVocabularyIds
Specifies the vocabularies targeted by this extension.
processCSVRecordRow
Processes and caches the row data. By default, it simply copies every mapped value from the row. Ove
setupCSVParser
Sets up a CSV parser so that it accepts the format of the input file, and has names for each column

Popular in Java

Parsing JSON documents to java classes using gson
getSupportFragmentManager (FragmentActivity)
startActivity (Activity)
setRequestProperty (URLConnection)
URL (java.net)
A Uniform Resource Locator that identifies the location of an Internet resource as specified by RFC
UnknownHostException (java.net)
Thrown when a hostname can not be resolved.
SecureRandom (java.security)
This class generates cryptographically secure pseudo-random numbers. It is best to invoke SecureRand
SortedSet (java.util)
SortedSet is a Set which iterates over its elements in a sorted order. The order is determined eithe
LoggerFactory (org.slf4j)
The LoggerFactory is a utility class producing Loggers for various logging APIs, most notably for lo
Container (java.awt)
A generic Abstract Window Toolkit(AWT) container object is a component that can contain other AWT co
Top plugins for WebStorm

How to useAbstractCSVAnnotationsExtension in org.phenotips.vocabulary

Best Java code snippets using org.phenotips.vocabulary.AbstractCSVAnnotationsExtension (Showing top 2 results out of 315)

How to use
AbstractCSVAnnotationsExtension
in
org.phenotips.vocabulary