- initialize
Initialize this DoFn. This initialization will happen before the actual
#process(Object,Emitter) is
- configure
Configure this DoFn. Subclasses may override this method to modify the
configuration of the Job that
- setContext
Called during setup to pass the TaskInputOutputContext to this DoFn instance.
The specified TaskInpu
- process
Processes the records from a PCollection.
Note: Crunch can reuse a single input record object whose
- cleanup
Called during the cleanup of the MapReduce job this DoFn is associated with.
Subclasses may override
- increment
- scaleFactor
Returns an estimate of how applying this function to a PCollection will cause it
to change in size. T
- disableDeepCopy
By default, Crunch will do a defensive deep copy of the outputs of a DoFn when
there are multiple do
- progress
- setStatus
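
The summaries above say that initialize runs before process and that cleanup runs when the job is cleaned up. The sketch below illustrates that ordering only. It does not use the Crunch classes: `SketchDoFn`, `SketchEmitter`, `SplitWords`, and `run` are hypothetical stand-ins invented for this example, and the assumption that cleanup receives an emitter is part of the sketch, not something the listing states.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class DoFnLifecycleSketch {

    // Hypothetical stand-in for an emitter; NOT the Crunch Emitter type.
    interface SketchEmitter<T> {
        void emit(T value);
    }

    // Hypothetical stand-in that mirrors three of the listed methods.
    abstract static class SketchDoFn<S, T> {
        void initialize() {}                              // runs once, before any process call
        abstract void process(S input, SketchEmitter<T> emitter);
        void cleanup(SketchEmitter<T> emitter) {}         // runs once, after the last record
    }

    // Example function: emits each word of a line, then a summary in cleanup.
    static class SplitWords extends SketchDoFn<String, String> {
        private int seen;

        @Override
        void initialize() {
            seen = 0; // set up per-task state here rather than in a constructor
        }

        @Override
        void process(String input, SketchEmitter<String> emitter) {
            for (String word : input.trim().split("\\s+")) {
                if (!word.isEmpty()) {
                    emitter.emit(word);
                    seen++;
                }
            }
        }

        @Override
        void cleanup(SketchEmitter<String> emitter) {
            emitter.emit("words=" + seen);
        }
    }

    // Drives the lifecycle in order: initialize, process per record, cleanup.
    static <S, T> List<T> run(SketchDoFn<S, T> fn, List<S> inputs) {
        List<T> out = new ArrayList<>();
        SketchEmitter<T> emitter = out::add;
        fn.initialize();
        for (S input : inputs) {
            fn.process(input, emitter);
        }
        fn.cleanup(emitter);
        return out;
    }

    public static void main(String[] args) {
        // prints [a, b, c, words=3]
        System.out.println(run(new SplitWords(), Arrays.asList("a b", "c")));
    }
}
```

In a real pipeline the framework, not user code, would drive these calls; the `run` helper exists only to make the ordering visible.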