InterviewSolution
Saved Bookmarks
| 1. |
What is an InputFormat and Record Reader in Hadoop? What are the various Input Formats in Hadoop? |
|
Answer» A file is read by a Map-Reduce job using an InputFormat. It defines how the file being read needs to be split up and read. InputFormat, in turn, defines a RecordReader which is responsible for reading ACTUAL records from the input files. The split computed by InputFormat is operated upon by map TASK. Map task USES Record Reader corresponding to InputFormat to read the data within each split and create key-value pairs. The various TYPES of InputFormat in Hadoop are:
|
|