Package org.apache.avro.mapred
Class AvroJob
java.lang.Object
org.apache.avro.mapred.AvroJob
Setters to configure jobs for Avro data.
-
Field Summary
Fields
Modifier and Type / Field / Description
static final String BINARY_PREFIX
    The configuration key prefix for binary output metadata.
static final String INPUT_IS_REFLECT
    The configuration key for reflection-based input representation.
static final String INPUT_SCHEMA
    The configuration key for a job's input schema.
static final String MAP_OUTPUT_IS_REFLECT
    The configuration key for reflection-based map output representation.
static final String MAP_OUTPUT_SCHEMA
    The configuration key for a job's intermediate schema.
static final String OUTPUT_CODEC
    The configuration key for a job's output compression codec.
static final String OUTPUT_SCHEMA
    The configuration key for a job's output schema.
static final String TEXT_PREFIX
    The configuration key prefix for text output metadata.
-
Method Summary
Modifier and Type / Method / Description
static GenericData createDataModel(Configuration conf)
static GenericData createInputDataModel
static GenericData createMapOutputDataModel
static Class<? extends GenericData> getDataModelClass
    Return the job's data model implementation class.
static Schema getInputSchema
    Return a job's map input schema.
static Schema getMapOutputSchema
    Return a job's map output key schema.
static Schema getOutputSchema
    Return a job's output key schema.
static void setCombinerClass(JobConf job, Class<? extends AvroReducer> c)
    Configure a job's combiner implementation.
static void setDataModelClass(JobConf job, Class<? extends GenericData> modelClass)
    Configure a job's data model implementation class.
static void setInputReflect(JobConf job)
    Indicate that a job's input data should use reflect representation.
static void setInputSchema(JobConf job, Schema s)
    Configure a job's map input schema.
static void setInputSequenceFile
    Indicate that a job's input files are in SequenceFile format.
static void setMapOutputReflect
    Indicate that a job's map output data should use reflect representation.
static void setMapOutputSchema(JobConf job, Schema s)
    Configure a job's map output schema.
static void setMapperClass(JobConf job, Class<? extends AvroMapper> c)
    Configure a job's mapper implementation.
static void setOutputCodec(JobConf job, String codec)
    Configure a job's output compression codec.
static void setOutputMeta(JobConf job, String key, byte[] value)
    Add metadata to job output files.
static void setOutputMeta(JobConf job, String key, long value)
    Add metadata to job output files.
static void setOutputMeta(JobConf job, String key, String value)
    Add metadata to job output files.
static void setOutputSchema(JobConf job, Schema s)
    Configure a job's output schema.
static void setReducerClass(JobConf job, Class<? extends AvroReducer> c)
    Configure a job's reducer implementation.
static void setReflect(JobConf job)
    Indicate that all of a job's data should use the reflect representation.
-
Field Details
-
INPUT_SCHEMA
The configuration key for a job's input schema.
-
MAP_OUTPUT_SCHEMA
The configuration key for a job's intermediate schema.
-
OUTPUT_SCHEMA
The configuration key for a job's output schema.
-
OUTPUT_CODEC
The configuration key for a job's output compression codec. This takes one of the strings registered in CodecFactory.
-
TEXT_PREFIX
The configuration key prefix for text output metadata.
-
BINARY_PREFIX
The configuration key prefix for binary output metadata.
-
INPUT_IS_REFLECT
The configuration key for reflection-based input representation.
-
MAP_OUTPUT_IS_REFLECT
The configuration key for reflection-based map output representation.
-
-
Method Details
-
setInputSchema
Configure a job's map input schema. -
getInputSchema
Return a job's map input schema. -
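As a minimal sketch of the setInputSchema/getInputSchema pair, the example below stores a schema on a job and reads it back. It assumes avro-mapred and the Hadoop client libraries are on the classpath; the class name InputSchemaSketch and the User record schema are hypothetical and exist only for illustration.

```java
import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroJob;
import org.apache.hadoop.mapred.JobConf;

class InputSchemaSketch {
    // Hypothetical record schema, used only for illustration.
    static final String USER_SCHEMA_JSON =
        "{\"type\":\"record\",\"name\":\"User\",\"namespace\":\"example\","
      + "\"fields\":[{\"name\":\"name\",\"type\":\"string\"},"
      + "{\"name\":\"age\",\"type\":\"int\"}]}";

    static JobConf configure() {
        JobConf job = new JobConf();
        Schema input = new Schema.Parser().parse(USER_SCHEMA_JSON);
        // Records the schema in the job configuration under AvroJob.INPUT_SCHEMA.
        AvroJob.setInputSchema(job, input);
        return job;
    }

    public static void main(String[] args) {
        JobConf job = configure();
        // Reads the schema back from the configuration.
        Schema roundTripped = AvroJob.getInputSchema(job);
        System.out.println(roundTripped.toString(true));
    }
}
```

Because the schema travels as a configuration entry, map tasks can recover it from the job configuration without access to the driver's objects.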
setMapOutputSchema
Configure a job's map output schema. The map output schema defaults to the output schema and need only be specified when it differs. This must be a Pair schema. -
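The defaulting rule described above can be sketched as follows, assuming avro-mapred and Hadoop are on the classpath. The class and method names (MapOutputSchemaSketch, outputOnly, withIntermediate) are hypothetical; Pair.getPairSchema builds the key/value Pair schema from two component schemas.

```java
import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroJob;
import org.apache.avro.mapred.Pair;
import org.apache.hadoop.mapred.JobConf;

class MapOutputSchemaSketch {
    private static final Schema STRING = Schema.create(Schema.Type.STRING);
    private static final Schema LONG = Schema.create(Schema.Type.LONG);
    private static final Schema INT = Schema.create(Schema.Type.INT);

    // Only the output schema is set, so the map output schema falls back to it.
    static JobConf outputOnly() {
        JobConf job = new JobConf();
        AvroJob.setOutputSchema(job, Pair.getPairSchema(STRING, LONG));
        return job;
    }

    // The intermediate data differs from the final output, so it is set explicitly.
    static JobConf withIntermediate() {
        JobConf job = outputOnly();
        AvroJob.setMapOutputSchema(job, Pair.getPairSchema(STRING, INT));
        return job;
    }

    public static void main(String[] args) {
        System.out.println(AvroJob.getMapOutputSchema(outputOnly()));
        System.out.println(AvroJob.getMapOutputSchema(withIntermediate()));
    }
}
```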
getMapOutputSchema
Return a job's map output key schema. -
setOutputSchema
Configure a job's output schema. Unless this is a map-only job, this must be a Pair schema. -
setOutputCodec
Configure a job's output compression codec. -
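A small sketch of selecting a codec, assuming avro-mapred and Hadoop are on the classpath. The class name OutputCodecSketch is hypothetical, and checking the name up front with CodecFactory.fromString is this example's own choice rather than something setOutputCodec is documented to do.

```java
import org.apache.avro.file.CodecFactory;
import org.apache.avro.file.DataFileConstants;
import org.apache.avro.mapred.AvroJob;
import org.apache.hadoop.mapred.JobConf;

class OutputCodecSketch {
    static JobConf configure(String codecName) {
        // Fail fast in the driver if CodecFactory does not recognize the name;
        // fromString throws AvroRuntimeException for unknown codecs.
        CodecFactory.fromString(codecName);

        JobConf job = new JobConf();
        AvroJob.setOutputCodec(job, codecName);
        return job;
    }

    public static void main(String[] args) {
        JobConf job = configure(DataFileConstants.DEFLATE_CODEC);
        System.out.println(job.get(AvroJob.OUTPUT_CODEC));
    }
}
```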
setOutputMeta
Add metadata to job output files. -
setOutputMeta
Add metadata to job output files. -
setOutputMeta
Add metadata to job output files. -
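The three setOutputMeta overloads can be exercised as in the sketch below, assuming avro-mapred and Hadoop are on the classpath. The class name OutputMetaSketch and the metadata keys and values are hypothetical.

```java
import org.apache.avro.mapred.AvroJob;
import org.apache.hadoop.mapred.JobConf;

class OutputMetaSketch {
    static JobConf configure() {
        JobConf job = new JobConf();
        // String overload: stored under AvroJob.TEXT_PREFIX + key.
        AvroJob.setOutputMeta(job, "pipeline", "nightly-export");
        // long overload: the 42L literal selects it unambiguously.
        AvroJob.setOutputMeta(job, "runId", 42L);
        // byte[] overload: stored under AvroJob.BINARY_PREFIX + key.
        AvroJob.setOutputMeta(job, "signature", new byte[] {1, 2, 3});
        return job;
    }

    public static void main(String[] args) {
        JobConf job = configure();
        System.out.println(job.get(AvroJob.TEXT_PREFIX + "pipeline"));
        System.out.println(job.get(AvroJob.BINARY_PREFIX + "signature"));
    }
}
```

Passing an int literal such as 42 would also resolve to the long overload through widening, but the explicit L suffix makes the intent clearer.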
setInputSequenceFile
Indicate that a job's input files are in SequenceFile format. -
setReflect
Indicate that all of a job's data should use the reflect representation. -
setInputReflect
Indicate that a job's input data should use reflect representation. -
setMapOutputReflect
Indicate that a job's map output data should use reflect representation. -
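The difference between the three reflect setters can be sketched as below, assuming avro-mapred and Hadoop are on the classpath. SensorReading is a hypothetical plain Java class, ReflectSketch is a hypothetical driver, and ReflectData.get().getSchema(...) is used here to derive the schema from the class.

```java
import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroJob;
import org.apache.avro.reflect.ReflectData;
import org.apache.hadoop.mapred.JobConf;

// Hypothetical plain Java class whose schema is derived by reflection.
class SensorReading {
    String sensor;
    double value;
}

class ReflectSketch {
    // Only the job's input is read with the reflect representation.
    static JobConf inputOnly() {
        JobConf job = new JobConf();
        Schema schema = ReflectData.get().getSchema(SensorReading.class);
        AvroJob.setInputSchema(job, schema);
        AvroJob.setInputReflect(job);
        return job;
    }

    // Both input and map output use the reflect representation.
    static JobConf everything() {
        JobConf job = new JobConf();
        AvroJob.setReflect(job);
        return job;
    }

    public static void main(String[] args) {
        JobConf job = inputOnly();
        System.out.println(job.getBoolean(AvroJob.INPUT_IS_REFLECT, false));
        System.out.println(job.getBoolean(AvroJob.MAP_OUTPUT_IS_REFLECT, false));
    }
}
```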
getOutputSchema
Return a job's output key schema. -
setMapperClass
Configure a job's mapper implementation. -
setCombinerClass
Configure a job's combiner implementation. -
setReducerClass
Configure a job's reducer implementation. -
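The mapper, combiner, and reducer setters are typically used together. The word-count sketch below assumes avro-mapred and Hadoop are on the classpath; WordCountSketch, TokenMapper, and SumReducer are hypothetical names, and reusing the reducer as the combiner is a choice made for this example (it works here because the reducer's output type matches the map output type).

```java
import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroCollector;
import org.apache.avro.mapred.AvroJob;
import org.apache.avro.mapred.AvroMapper;
import org.apache.avro.mapred.AvroReducer;
import org.apache.avro.mapred.Pair;
import org.apache.avro.util.Utf8;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.Reporter;

class WordCountSketch {
    // Emits a (word, 1) pair for every whitespace-separated token.
    public static class TokenMapper extends AvroMapper<Utf8, Pair<Utf8, Long>> {
        @Override
        public void map(Utf8 line, AvroCollector<Pair<Utf8, Long>> out, Reporter reporter)
                throws IOException {
            StringTokenizer tokens = new StringTokenizer(line.toString());
            while (tokens.hasMoreTokens()) {
                out.collect(new Pair<Utf8, Long>(new Utf8(tokens.nextToken()), 1L));
            }
        }
    }

    // Sums the counts for one word and emits a single (word, total) pair.
    public static class SumReducer extends AvroReducer<Utf8, Long, Pair<Utf8, Long>> {
        @Override
        public void reduce(Utf8 word, Iterable<Long> counts,
                AvroCollector<Pair<Utf8, Long>> out, Reporter reporter) throws IOException {
            long sum = 0;
            for (long count : counts) {
                sum += count;
            }
            out.collect(new Pair<Utf8, Long>(word, sum));
        }
    }

    static JobConf configure() {
        JobConf job = new JobConf();
        Schema string = Schema.create(Schema.Type.STRING);
        Schema count = Schema.create(Schema.Type.LONG);
        AvroJob.setInputSchema(job, string);
        AvroJob.setOutputSchema(job, Pair.getPairSchema(string, count));
        AvroJob.setMapperClass(job, TokenMapper.class);
        AvroJob.setCombinerClass(job, SumReducer.class);
        AvroJob.setReducerClass(job, SumReducer.class);
        return job;
    }
}
```

Input and output paths are left out; a real driver would still set them and submit the job.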
setDataModelClass
Configure a job's data model implementation class. -
getDataModelClass
Return the job's data model implementation class. -
createDataModel
-
createInputDataModel
-
createMapOutputDataModel
-
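The data model methods can be tried together as in the sketch below, assuming avro-mapred and Hadoop are on the classpath. DataModelSketch is a hypothetical class name, and SpecificData is chosen only as an example of a GenericData subclass. The entries above do not describe createDataModel; the sketch assumes it instantiates the class returned by getDataModelClass.

```java
import org.apache.avro.generic.GenericData;
import org.apache.avro.mapred.AvroJob;
import org.apache.avro.specific.SpecificData;
import org.apache.hadoop.mapred.JobConf;

class DataModelSketch {
    static JobConf configure() {
        JobConf job = new JobConf();
        // Any GenericData subclass is accepted by the method's signature.
        AvroJob.setDataModelClass(job, SpecificData.class);
        return job;
    }

    public static void main(String[] args) {
        JobConf job = configure();
        Class<? extends GenericData> modelClass = AvroJob.getDataModelClass(job);
        // Assumption: createDataModel builds an instance of the configured class.
        GenericData model = AvroJob.createDataModel(job);
        System.out.println(modelClass.getName() + " -> " + model.getClass().getName());
    }
}
```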