// Point the MRUnit driver's Configuration at the cube under test. The first
// variant identifies the segment by ID, the second by name.
mapReduceDriver.getConfiguration().set(BatchConstants.CFG_CUBE_NAME, cubeName);
mapReduceDriver.getConfiguration().set(BatchConstants.CFG_CUBE_SEGMENT_ID, segmentID);

mapReduceDriver.getConfiguration().set(BatchConstants.CFG_CUBE_NAME, cubeName);
mapReduceDriver.getConfiguration().set(BatchConstants.CFG_CUBE_SEGMENT_NAME, segmentName);
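For context, a minimal, self-contained sketch of how such configuration keys reach a mapper under MRUnit. The test class, EchoCubeNameMapper, and the raw "cube.name" key are stand-ins invented for this sketch (Kylin's real key sits behind BatchConstants.CFG_CUBE_NAME); only the MRUnit MapDriver API is taken as given.

import java.io.IOException;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mrunit.mapreduce.MapDriver;
import org.junit.Test;

public class CubeConfigWiringTest {

  // Stand-in for Kylin's BatchConstants.CFG_CUBE_NAME, to keep the sketch
  // free of Kylin dependencies.
  static final String CFG_CUBE_NAME = "cube.name";

  // Echoes the cube name found in the task configuration, so the test can
  // verify that values set on the driver reach the mapper.
  static class EchoCubeNameMapper extends Mapper<Text, Text, Text, Text> {
    @Override
    protected void map(Text key, Text value, Context context)
        throws IOException, InterruptedException {
      context.write(key, new Text(context.getConfiguration().get(CFG_CUBE_NAME)));
    }
  }

  @Test
  public void mapperSeesDriverConfiguration() throws IOException {
    MapDriver<Text, Text, Text, Text> mapDriver =
        MapDriver.newMapDriver(new EchoCubeNameMapper());
    mapDriver.getConfiguration().set(CFG_CUBE_NAME, "test_kylin_cube");
    mapDriver.withInput(new Text("k"), new Text("v"))
        .withOutput(new Text("k"), new Text("test_kylin_cube"))
        .runTest();
  }
}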
protected List<KeyValueReuseList<K2, V2>> sortAndGroup(
    final List<Pair<K2, V2>> mapOutputs) {
  if (mapOutputs.isEmpty()) {
    return Collections.emptyList();
  }

  // Derive the sort and grouping comparators from the job configuration,
  // unless the test has already supplied its own.
  if (keyValueOrderComparator == null || keyGroupComparator == null) {
    JobConf conf = new JobConf(getConfiguration());
    conf.setMapOutputKeyClass(mapOutputs.get(0).getFirst().getClass());
    if (keyGroupComparator == null) {
      keyGroupComparator = conf.getOutputValueGroupingComparator();
    }
    if (keyValueOrderComparator == null) {
      keyValueOrderComparator = conf.getOutputKeyComparator();
    }
  }

  // Mimic the shuffle: sort the map output, then group values by key.
  ReduceFeeder<K2, V2> reduceFeeder = new ReduceFeeder<K2, V2>(getConfiguration());
  return reduceFeeder.sortAndGroup(mapOutputs, keyValueOrderComparator,
      keyGroupComparator);
}
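Because sortAndGroup() falls back to the JobConf comparators only when none were supplied, a test can override both, for example for a secondary sort. A brief sketch using MRUnit's setKeyOrderComparator/setKeyGroupingComparator setters; the mapper, reducer, key, and comparator classes named here are hypothetical.

// Hypothetical secondary-sort wiring: FullKeyComparator orders composite keys
// during the simulated shuffle, while NaturalKeyComparator decides which of
// them collapse into a single reduce() call.
MapReduceDriver<LongWritable, Text, CompositeKeyWritable, Text, Text, Text> driver =
    MapReduceDriver.newMapReduceDriver(new SecondarySortMapper(), new SecondarySortReducer());
driver.setKeyOrderComparator(new FullKeyComparator());
driver.setKeyGroupingComparator(new NaturalKeyComparator());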
@Override
public List<Pair<K3, V3>> run() throws IOException {
  try {
    preRunChecks(myMapper, myReducer);
    initDistributedCache();

    List<Pair<K2, V2>> mapOutputs = new ArrayList<Pair<K2, V2>>();

    // run map component
    LOG.debug("Starting map phase with mapper: " + myMapper);
    mapOutputs.addAll(MapDriver.newMapDriver(myMapper)
        .withCounters(getCounters())
        .withConfiguration(getConfiguration())
        .withAll(inputList)
        .withMapInputPath(getMapInputPath())
        .run());

    if (myCombiner != null) {
      // User has specified a combiner. Run this and replace the mapper
      // outputs with the result of the combiner.
      LOG.debug("Starting combine phase with combiner: " + myCombiner);
      mapOutputs = new ReducePhaseRunner<K2, V2, K2, V2>(inputFormatClass,
          getConfiguration(), counters,
          getOutputSerializationConfiguration(), outputFormatClass)
          .runReduce(sortAndGroup(mapOutputs), myCombiner);
    }

    // Run the reduce phase.
    LOG.debug("Starting reduce phase with reducer: " + myReducer);
    return new ReducePhaseRunner<K2, V2, K3, V3>(inputFormatClass,
        getConfiguration(), counters,
        getOutputSerializationConfiguration(), outputFormatClass)
        .runReduce(sortAndGroup(mapOutputs), myReducer);
  } finally {
    cleanupDistributedCache();
  }
}
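The combiner branch above is exercised whenever a test registers a combiner on the driver. A minimal, self-contained sketch using MRUnit's withCombiner(); the word-count mapper and reducer are written here purely for illustration.

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mrunit.mapreduce.MapReduceDriver;
import org.junit.Test;

public class CombinerPathTest {

  // Emits (token, 1) for every whitespace-separated token in the input line.
  static class TokenMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    @Override
    protected void map(LongWritable key, Text value, Context context)
        throws IOException, InterruptedException {
      for (String token : value.toString().split("\\s+")) {
        context.write(new Text(token), ONE);
      }
    }
  }

  // Sums counts; usable as both combiner and reducer since it is associative.
  static class SumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable value : values) {
        sum += value.get();
      }
      context.write(key, new IntWritable(sum));
    }
  }

  @Test
  public void combinerOutputFeedsTheReducer() throws IOException {
    MapReduceDriver.newMapReduceDriver(new TokenMapper(), new SumReducer())
        .withCombiner(new SumReducer()) // triggers the myCombiner != null branch of run()
        .withInput(new LongWritable(0), new Text("a a b"))
        .withOutput(new Text("a"), new IntWritable(2))
        .withOutput(new Text("b"), new IntWritable(1))
        .runTest();
  }
}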
@Test
public void testHypercubeMapReduce() throws IOException {
  // Wire Mahout's streaming k-means mapper and reducer into one MRUnit
  // pipeline: (Writable, VectorWritable) map input, (IntWritable,
  // CentroidWritable) intermediate and final output.
  MapReduceDriver<Writable, VectorWritable, IntWritable, CentroidWritable,
      IntWritable, CentroidWritable> mapReduceDriver =
          new MapReduceDriver<Writable, VectorWritable, IntWritable,
              CentroidWritable, IntWritable, CentroidWritable>(
                  new StreamingKMeansMapper(), new StreamingKMeansReducer());

  Configuration configuration = mapReduceDriver.getConfiguration();
  configure(configuration);
  System.out.printf("%s full test\n",
      configuration.get(StreamingKMeansDriver.SEARCHER_CLASS_OPTION));

  // Feed every synthetic data point to the single map task under key 0.
  for (Centroid datapoint : syntheticData.getFirst()) {
    mapReduceDriver.addInput(new IntWritable(0), new VectorWritable(datapoint));
  }

  List<org.apache.hadoop.mrunit.types.Pair<IntWritable, CentroidWritable>> results =
      mapReduceDriver.run();
  testReducerResults(syntheticData.getFirst().size(), results);
}
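The configure(...) and testReducerResults(...) helpers sit outside this excerpt. As an illustration only, a plausible configure(...) would populate the options the streaming k-means mapper and reducer read; the constants are assumed from Mahout 0.9's StreamingKMeansDriver and DefaultOptionCreator, and the values are arbitrary settings for a small synthetic data set, not the actual helper's contents.

// Plausible sketch, not the actual helper: option keys assumed from Mahout 0.9,
// values chosen arbitrarily for a small synthetic data set.
private static void configure(Configuration configuration) {
  configuration.set(DefaultOptionCreator.DISTANCE_MEASURE_OPTION,
      SquaredEuclideanDistanceMeasure.class.getName());
  configuration.set(StreamingKMeansDriver.SEARCHER_CLASS_OPTION,
      ProjectionSearch.class.getName());
  configuration.setInt(StreamingKMeansDriver.NUM_PROJECTIONS_OPTION, 3);
  configuration.setInt(StreamingKMeansDriver.SEARCH_SIZE_OPTION, 2);
  configuration.setInt(StreamingKMeansDriver.ESTIMATED_NUM_MAP_CLUSTERS, 200);
}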