/**
 * Set the input format used to create the Hive table, given as a class.
 * Note that this can be different than the input format used for the
 * file set itself.
 *
 * @param inputFormat the input format class; its fully-qualified name is recorded
 * @return this builder, for chaining
 */
public Builder setExploreInputFormat(Class<?> inputFormat) {
  // Delegate to the String-based overload using the class's binary name.
  String className = inputFormat.getName();
  return setExploreInputFormat(className);
}
/** * Configure a file set to use ORC file format with a given schema. The schema is parsed * validated and converted into a Hive schema which is compatible with ORC format. The file set is configured to use * ORC input and output format, and also configured for Explore to use Hive. The schema is added * to the file set properties in all the different required ways: * <ul> * <li>As a top-level dataset property;</li> * <li>As the schema for the input and output format;</li> * <li>As the schema to be used by the ORC serde (which is used by Hive).</li> * </ul> * * @param configuredSchema the original schema configured for the table * @param properties a builder for the file set properties */ public static void configureORCFileSet(String configuredSchema, FileSetProperties.Builder properties) { //TODO test if complex cases run with lowercase schema only String lowerCaseSchema = configuredSchema.toLowerCase(); String hiveSchema = parseHiveSchema(lowerCaseSchema, configuredSchema); hiveSchema = hiveSchema.substring(1, hiveSchema.length() - 1); properties.setExploreInputFormat("org.apache.hadoop.hive.ql.io.orc.OrcInputFormat") .setExploreOutputFormat("org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat") .setSerDe("org.apache.hadoop.hive.ql.io.orc.OrcSerde") .setExploreSchema(hiveSchema) .setEnableExploreOnCreate(true) .add(DatasetProperties.SCHEMA, configuredSchema) .build(); }
// NOTE(review): detached fragment of a builder chain (starts mid-expression, no enclosing
// method visible) — looks like a leftover snippet or a partial duplicate of the ORC
// configuration above, with a placeholder schema "record STRING". TODO confirm whether this
// belongs to a method outside this view or should be deleted.
.setExploreInputFormat("org.apache.hadoop.hive.ql.io.orc.OrcInputFormat") .setExploreOutputFormat("org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat") .setExploreSchema("record STRING")
/**
 * Configure a file set to use Avro file format with a given schema. The schema is parsed
 * as an Avro schema, validated and converted into a Hive schema. The file set is configured
 * to use Avro key input and output format, and also configured for Explore to use Avro.
 * The schema is added to the file set properties in all the different required ways:
 * <ul>
 *   <li>As a top-level dataset property;</li>
 *   <li>As the schema for the input and output format;</li>
 *   <li>As the schema of the Hive table;</li>
 *   <li>As the schema to be used by the Avro serde (which is used by Hive).</li>
 * </ul>
 *
 * @param configuredSchema the original schema configured for the table
 * @param properties a builder for the file set properties
 */
public static void configureAvroFileSet(String configuredSchema, FileSetProperties.Builder properties) {
  // Hive/Avro integration classes, named here for readability.
  final String avroSerDe = "org.apache.hadoop.hive.serde2.avro.AvroSerDe";
  final String avroInputFormat = "org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat";
  final String avroOutputFormat = "org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat";
  properties
    .setEnableExploreOnCreate(true)
    .setSerDe(avroSerDe)
    .setExploreInputFormat(avroInputFormat)
    .setExploreOutputFormat(avroOutputFormat)
    // The Avro serde reads the schema from this table property.
    .setTableProperty("avro.schema.literal", configuredSchema)
    .add(DatasetProperties.SCHEMA, configuredSchema);
}