This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

1.

Why is checkpointing important in Hadoop?

2.

Can Hadoop handle streaming data?

3.

What is speculative execution in Hadoop?

4.

What are the problems with small files in HDFS?

5.

Suppose Hadoop spawned 100 tasks for a job and one of the tasks failed. What will Hadoop do?

6.

Consider this case scenario: in an M/R system, the HDFS block size is 64 MB.

7.

What is a Combiner?

8.

What does "file could only be replicated to 0 nodes, instead of 1" mean?

9.

What happens when two clients try to write into the same HDFS file?

10.

Can we search for files using wildcards?

11.

How do you make a large cluster smaller by taking out some of the nodes?

12.

What happens if one Hadoop client renames a file or a directory containing this file while another client is still writing into it?

13.

Does the Namenode stay in safe mode until all under-replicated files are fully replicated?

14.

How do you gracefully stop a running job?

15.

The requirement is to add a new data node to a running Hadoop cluster; how do I start services on just one data node?

16.

Is there an HDFS command to see the available free space in HDFS?

17.

Which file holds the Hadoop core configuration?

18.

What is rack awareness?

19.

The default replication factor for a file is 3.

20.

How do you copy a directory from one node in the cluster to another?

21.

Why is reading done in parallel in HDFS, but writing is not?

22.

Explain how ‘map’ and ‘reduce’ work.

23.

What is a Secondary Namenode? Is it a substitute for the Namenode?

24.

What is a rack?

25.

What is the communication channel between client and namenode/datanode?

26.

Are the job tracker and task trackers present on separate machines?

27.

How is indexing done in HDFS?

28.

What are the benefits of block transfer?

29.

What is a ‘block’ in HDFS?

30.

What is a heartbeat in HDFS?

31.

What is a task tracker?

32.

What is a job tracker?

33.

Why do we use HDFS for applications with large data sets and not when there are a lot of small files?

34.

What is a Datanode?

35.

Is the Namenode also commodity hardware?

36.

What is a Namenode?

37.

Since the data is replicated thrice in HDFS, does it mean that any calculation done on one node will also be replicated on the other two?

38.

Replication causes data redundancy, so why is it pursued in HDFS?

39.

What is Fault Tolerance?

40.

What is the basic difference between traditional RDBMS and Hadoop?

41.

Why do we need Hadoop?

42.

How is analysis of Big Data useful for organizations?

43.

What are the four characteristics of Big Data?

44.

What is Big Data?

45.

What happens to the job tracker when the Namenode is down?

46.

What if a Namenode has no data?

47.

What does /etc/init.d do?

48.

Which are the three modes in which Hadoop can be run?

49.

How do you restart the Namenode?

50.

What does the ‘jps’ command do?
