1.

What happens if RDD partition is lost due to worker node failure?

Answer»

In Spark, if any partition of an RDD is lost due to the failure of a worker node, that partition can be re-computed USING the LINEAGE of operations from the original fault-tolerant dataset.

12. What is spark GraphX used for?

Here are the uses of GraphX in Spark:

  • It can be used for unifying ETL, exploratory analysis, and COMPUTATION of iterative graphs within a single system.
  • It can be used to present DATA in the form of graphs and collections while transforming and joining charts with RDD.
  • It can be used for writing custom iterative GRAPH algorithms with the help of Pregel API.


Discussion

No Comment Found