1.

Explain About The Major Libraries That Constitute The Spark Ecosystem

Answer»
  • Spark MLib- Machine LEARNING library in Spark for commonly used learning ALGORITHMS LIKE clustering, regression, classification, etc.
  • Spark Streaming – This library is used to process real TIME streaming data.
  • Spark GraphX – Spark API for graph parallel computations with basic operators like joinVertices, subgraph, aggregateMessages, etc.
  • Spark SQL – Helps execute SQL like queries on Spark data using standard visualization or BI tools.



Discussion

No Comment Found