1.

What Is Lineage Graph?

Answer»

The RDDs in Spark, depend on one or more other RDDs. The REPRESENTATION of DEPENDENCIES in between RDDs is known as the lineage graph. Lineage graph information is USED to compute each RDD on demand, so that whenever a part of persistent RDD is lost, the data that is lost can be RECOVERED USING the lineage graph information.

The RDDs in Spark, depend on one or more other RDDs. The representation of dependencies in between RDDs is known as the lineage graph. Lineage graph information is used to compute each RDD on demand, so that whenever a part of persistent RDD is lost, the data that is lost can be recovered using the lineage graph information.



Discussion

No Comment Found