|
Answer» There are 4 CHARACTERISTICS of PySpark: - ABSTRACTED Nodes: This means that the individual worker nodes can not be addressed.
- Spark API: PySpark provides APIs for utilizing Spark features.
- Map-REDUCE Model: PySpark is based on Hadoop’s Map-Reduce model this means that the programmer provides the map and the reduce functions.
- Abstracted Network: Networks are abstracted in PySpark which means that the only possible COMMUNICATION is implicit communication.
|