The abstract Combiner class is used to build combiners for the Job. Combiners are distributed across the cluster and run alongside the Mapper implementations on the same node. Combiners are called in a thread-safe way, so internal locking is not required.

Combiners are normally used to build intermediate results on the mapping nodes, lowering the traffic between nodes before the reducing phase. A Combiner must be able to combine data in multiple chunks, which yields a more streaming-like internal behavior.

A simple Combiner implementation, in combination with a Reducer, could look like this avg-function implementation:
public class AvgCombiner
    extends Combiner<Integer, Tuple<Long, Long>>
{
  private long count;
  private long amount;

  @Override
  public void combine( Integer value )
  {
    count++;
    amount += value;
  }

  @Override
  public Tuple<Long, Long> finalizeChunk()
  {
    Tuple<Long, Long> tuple = new Tuple<>( count, amount );
    // Reset the internal state so the next chunk starts fresh
    count = 0;
    amount = 0;
    return tuple;
  }
}
public class SumReducer
    extends Reducer<Tuple<Long, Long>, Integer>
{
  private long count;
  private long amount;

  @Override
  public void reduce( Tuple<Long, Long> value )
  {
    count += value.getFirst();
    amount += value.getSecond();
  }

  @Override
  public Integer finalizeReduce()
  {
    // Cast is required: a long cannot be autoboxed into an Integer
    return (int) ( amount / count );
  }
}
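To make the chunked interplay between the two classes concrete, the following self-contained sketch drives a combiner over two chunks of mapped values and feeds each partial result into a reducer. The minimal Tuple class and the plain (framework-free) versions of the two classes are assumptions made here for illustration only; in a real job, the framework supplies these base types and invokes the callbacks itself.

```java
import java.util.Arrays;
import java.util.List;

// Minimal stand-in for the framework's Tuple type, assumed for illustration.
final class Tuple<A, B> {
  private final A first;
  private final B second;
  Tuple( A first, B second ) { this.first = first; this.second = second; }
  A getFirst() { return first; }
  B getSecond() { return second; }
}

// Same logic as the AvgCombiner above, without the framework base class.
class AvgCombiner {
  private long count;
  private long amount;
  public void combine( Integer value ) { count++; amount += value; }
  public Tuple<Long, Long> finalizeChunk() {
    Tuple<Long, Long> tuple = new Tuple<>( count, amount );
    count = 0;   // reset so the next chunk starts fresh
    amount = 0;
    return tuple;
  }
}

// Same logic as the SumReducer above, without the framework base class.
class SumReducer {
  private long count;
  private long amount;
  public void reduce( Tuple<Long, Long> value ) {
    count += value.getFirst();
    amount += value.getSecond();
  }
  public Integer finalizeReduce() { return (int) ( amount / count ); }
}

public class ChunkedAvgDemo {
  public static void main( String[] args ) {
    AvgCombiner combiner = new AvgCombiner();
    SumReducer reducer = new SumReducer();
    // Two chunks of mapped values, as they might arrive on a mapping node.
    List<List<Integer>> chunks = Arrays.asList(
        Arrays.asList( 1, 2, 3 ),
        Arrays.asList( 4, 5, 6 ) );
    for ( List<Integer> chunk : chunks ) {
      for ( Integer value : chunk ) {
        combiner.combine( value );
      }
      // finalizeChunk() emits the partial (count, sum) pair and resets
      // the combiner, so each chunk produces an independent partial result.
      reducer.reduce( combiner.finalizeChunk() );
    }
    // Integer average of 1..6: (1+2+3+4+5+6) / 6
    System.out.println( reducer.finalizeReduce() );
  }
}
```

Note how only one small (count, sum) tuple per chunk crosses the node boundary instead of every mapped value, which is exactly the traffic reduction the Combiner is meant to provide.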