public class InsertIntoParquetTable extends SparkPlan implements UnaryNode, SparkHadoopMapReduceUtil, scala.Product, scala.Serializable
WARNING: EXPERIMENTAL! InsertIntoParquetTable with overwrite=false may cause data corruption in the case that multiple users try to append to the same table simultaneously. Inserting into a table that was previously generated by other means (e.g., by creating an HDFS directory and importing Parquet files generated by other tools) may cause unpredicted behaviour and therefore results in a RuntimeException (only detected via filename pattern so will not catch all cases).
| Constructor and Description |
|---|
InsertIntoParquetTable(ParquetRelation relation,
SparkPlan child,
boolean overwrite) |
| Modifier and Type | Method and Description |
|---|---|
SparkPlan |
child() |
RDD<org.apache.spark.sql.catalyst.expressions.Row> |
execute()
Inserts all rows into the Parquet file.
|
scala.collection.Seq<org.apache.spark.sql.catalyst.expressions.Attribute> |
output() |
boolean |
overwrite() |
ParquetRelation |
relation() |
codegenEnabled, executeCollect, makeCopy, outputPartitioning, requiredChildDistributionexpressions, inputSet, missingInput, org$apache$spark$sql$catalyst$plans$QueryPlan$$transformExpressionDown$1, org$apache$spark$sql$catalyst$plans$QueryPlan$$transformExpressionUp$1, outputSet, printSchema, references, schema, schemaString, simpleString, statePrefix, transformAllExpressions, transformExpressions, transformExpressionsDown, transformExpressionsUpapply, argString, asCode, children, collect, fastEquals, flatMap, foreach, generateTreeString, getNodeNumbered, map, mapChildren, nodeName, numberedTreeString, otherCopyArgs, stringArgs, toString, transform, transformChildrenDown, transformChildrenUp, transformDown, transformUp, treeString, withNewChildrenoutputPartitioningfirstAvailableClass, newJobContext, newTaskAttemptContext, newTaskAttemptIDproductArity, productElement, productIterator, productPrefixinitializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic InsertIntoParquetTable(ParquetRelation relation, SparkPlan child, boolean overwrite)
public ParquetRelation relation()
public SparkPlan child()
child in interface org.apache.spark.sql.catalyst.trees.UnaryNode<SparkPlan>public boolean overwrite()
public RDD<org.apache.spark.sql.catalyst.expressions.Row> execute()
public scala.collection.Seq<org.apache.spark.sql.catalyst.expressions.Attribute> output()
output in class org.apache.spark.sql.catalyst.plans.QueryPlan<SparkPlan>