1.

Point out the correct statement.
(a) The framework groups Reducer inputs by keys
(b) The shuffle and sort phases occur simultaneously, i.e. while outputs are being fetched they are merged
(c) Since JobConf.setOutputKeyComparatorClass(Class) can be used to control how intermediate keys are grouped, these can be used in conjunction to simulate secondary sort on values
(d) All of the mentioned

This question was posed to me in an interview for an internship. The query is from the Java Interface topic in the HDFS – Hadoop Distributed File System portion of Hadoop.

Answer:

The correct answer is (d) All of the mentioned

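For explanation, I would say: If the equivalence rules for keys while grouping the intermediates are different from those for grouping keys before reduction, then one may specify a Comparator via JobConf.setOutputValueGroupingComparator(Class). Used together with JobConf.setOutputKeyComparatorClass(Class), this is what allows a secondary sort on values to be simulated, as statement (c) says.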
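To make the two-comparator idea concrete, here is a minimal sketch in plain Java that imitates the behaviour without using Hadoop at all. The names SecondarySortSketch, CompositeKey, SORT_COMPARATOR, GROUPING_COMPARATOR and reduceGroups are hypothetical and exist only for illustration. The sketch assumes the value to be ordered has been folded into a composite key: one comparator sorts the full composite key, and a coarser one decides which consecutive keys belong to the same reduce group.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Plain-Java simulation of secondary sort; not Hadoop code.
final class SecondarySortSketch {

    // Hypothetical composite intermediate key: natural key plus the value to order by.
    static final class CompositeKey {
        private final String naturalKey;
        private final int value;

        CompositeKey(String naturalKey, int value) {
            this.naturalKey = naturalKey;
            this.value = value;
        }

        String getNaturalKey() { return naturalKey; }
        int getValue() { return value; }
    }

    // Stands in for the comparator registered with setOutputKeyComparatorClass:
    // orders by natural key first, then by value.
    static final Comparator<CompositeKey> SORT_COMPARATOR =
            Comparator.comparing(CompositeKey::getNaturalKey)
                      .thenComparingInt(CompositeKey::getValue);

    // Stands in for the comparator registered with setOutputValueGroupingComparator:
    // two keys fall into the same reduce group when their natural keys match.
    static final Comparator<CompositeKey> GROUPING_COMPARATOR =
            Comparator.comparing(CompositeKey::getNaturalKey);

    // Sorts the "map output", then walks it once, cutting a new group whenever
    // the grouping comparator reports a different key.
    static List<String> reduceGroups(List<CompositeKey> mapOutput) {
        List<CompositeKey> sorted = new ArrayList<>(mapOutput);
        sorted.sort(SORT_COMPARATOR);

        List<String> groups = new ArrayList<>();
        int i = 0;
        while (i < sorted.size()) {
            CompositeKey first = sorted.get(i);
            List<Integer> values = new ArrayList<>();
            while (i < sorted.size()
                    && GROUPING_COMPARATOR.compare(first, sorted.get(i)) == 0) {
                values.add(sorted.get(i).getValue());
                i++;
            }
            groups.add(first.getNaturalKey() + "=" + values);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<CompositeKey> mapOutput = Arrays.asList(
                new CompositeKey("b", 3), new CompositeKey("a", 9),
                new CompositeKey("b", 1), new CompositeKey("a", 2));
        System.out.println(reduceGroups(mapOutput)); // prints [a=[2, 9], b=[1, 3]]
    }
}
```

Each group sees its values in ascending order even though only keys were ever compared, which is the effect the two JobConf settings are combined to achieve. In a real job the comparators would be RawComparator implementations registered on the JobConf, and a partitioner on the natural key would typically also be needed so that all records for one natural key reach the same reducer.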


