Saved Bookmarks
| 1. |
Explain the distributed Cache in the MapReduce framework. |
|
Answer» Distributed Cache is a significant feature provided by the MapReduce Framework, practiced when you want to share the files across all nodes in a Hadoop CLUSTER. These files can be jar files or simple properties files. Hadoop's MapReduce framework allows the facility to cache small to moderate read-only files such as TEXT files, zip files, jar files, etc., and DISTRIBUTE them to all the Datanodes(worker-nodes) MapReduce jobs are running. All DATANODE gets a copy of the file(local-copy), which Distributed Cache sends. |
|