| 1. |
What Are The Major Libraries That Constitute The Spark Ecosystem? |
|
Answer» Spark MLlib: the machine learning library in Spark, providing commonly used learning algorithms such as clustering, regression, and classification. Spark Streaming: the library used to process real-time streaming data. Spark GraphX: the Spark API for graph-parallel computations, with basic operators such as joinVertices, subgraph, and aggregateMessages. Spark SQL: executes SQL-like queries on Spark data, including queries issued from standard visualization or BI tools. |
|
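Of the GraphX operators named in the answer, aggregateMessages is probably the least self-explanatory: each edge sends messages to its endpoints, and messages bound for the same vertex are merged. The sketch below mimics that pattern in plain, single-machine Python so it can run without a Spark installation. It is not the GraphX API (which is Scala and operates on distributed RDDs); the function `aggregate_messages`, its parameters, and the sample follower graph are all hypothetical names chosen for illustration.

```python
def aggregate_messages(edges, send_msg, merge_msg):
    """Toy stand-in for the aggregateMessages pattern (not the GraphX API).

    edges:     iterable of (src, dst, attr) triples.
    send_msg:  takes one edge triple, returns a list of (vertex_id, message).
    merge_msg: combines two messages bound for the same vertex.
    """
    inbox = {}
    for edge in edges:
        for vertex_id, msg in send_msg(edge):
            if vertex_id in inbox:
                # A message already arrived for this vertex: merge them.
                inbox[vertex_id] = merge_msg(inbox[vertex_id], msg)
            else:
                inbox[vertex_id] = msg
    # Vertices that received no message are simply absent from the result.
    return inbox


# Hypothetical follower graph: (follower, followed, edge attribute).
edges = [("alice", "bob", 1), ("carol", "bob", 1), ("bob", "alice", 1)]

# In-degree: every edge sends 1 to its destination; messages are summed.
in_degree = aggregate_messages(
    edges,
    send_msg=lambda e: [(e[1], 1)],
    merge_msg=lambda a, b: a + b,
)
print(in_degree)  # {'bob': 2, 'alice': 1}
```

Here "carol" has no incoming edges, so she does not appear in the result at all, which mirrors how a message-aggregation step only returns vertices that actually received a message.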