

InterviewSolution
Saved Bookmarks
This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.
1. |
Why is Checkpointing Important in Hadoop? |
Answer» | |
2. |
Can Hadoop handle streaming data? |
Answer» | |
3. |
What is speculative execution in Hadoop? |
Answer» | |
4. |
What are Problems with small files and HDFS? |
Answer» | |
5. |
Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do? |
Answer» | |
6. |
Consider case scenario: In M/R system, - HDFS block size is 64 MB |
Answer» | |
7. |
What is a Combiner? |
Answer» | |
8. |
What does "file could only be replicated to 0 nodes, instead of 1" mean? |
Answer» | |
9. |
What happens when two clients try to write into the same HDFS file? |
Answer» | |
10. |
Can we search for files using wildcards? |
Answer» | |
11. |
How to make a large cluster smaller by taking out some of the nodes? |
Answer» | |
12. |
What happens if one Hadoop client renames a file or a directory containing this file while another client is still writing into it? |
Answer» | |
13. |
Does the name-node stay in safe mode till all under-replicated files are fully replicated? |
Answer» | |
14. |
How do you gracefully stop a running job? |
Answer» | |
15. |
The requirement is to add a new data node to a running Hadoop cluster; how do I start services on just one data node? |
Answer» | |
16. |
Is there a hdfs command to see available free space in hdfs |
Answer» | |
17. |
Which file does the Hadoop-core configuration? |
Answer» | |
18. |
What is rack awareness? |
Answer» | |
19. |
Default replication factor to a file is 3. |
Answer» | |
20. |
Copy a directory from one node in the cluster to another |
Answer» | |
21. |
Why ‘Reading‘ is done in parallel and ‘Writing‘ is not in HDFS? |
Answer» | |
22. |
Explain how do ‘map’ and ‘reduce’ works. |
Answer» | |
23. |
What is a Secondary Namenode? Is it a substitute to the Namenode? |
Answer» | |
24. |
What is a rack? |
Answer» | |
25. |
What is the communication channel between client and namenode/datanode? |
Answer» | |
26. |
Are job tracker and task trackers present in separate machines? |
Answer» | |
27. |
How indexing is done in HDFS? |
Answer» | |
28. |
What are the benefits of block transfer? |
Answer» | |
29. |
What is a ‘block’ in HDFS? |
Answer» | |
30. |
What is a heartbeat in HDFS? |
Answer» | |
31. |
What is a task tracker? |
Answer» | |
32. |
What is a job tracker? |
Answer» | |
33. |
Why do we use HDFS for applications having large data sets and not when there are lot of small files? |
Answer» | |
34. |
What is a Datanode? |
Answer» | |
35. |
Is Namenode also a commodity hardware? |
Answer» | |
36. |
What is a Namenode? |
Answer» | |
37. |
Since the data is replicated thrice in HDFS, does it mean that any calculation done on one node will also be replicated on the other two? |
Answer» | |
38. |
Replication causes data redundancy, then why is it pursued in HDFS? |
Answer» | |
39. |
What is Fault Tolerance? |
Answer» | |
40. |
What is the basic difference between traditional RDBMS and Hadoop? |
Answer» | |
41. |
Why do we need Hadoop? |
Answer» | |
42. |
How is analysis of Big Data useful for organizations? |
Answer» | |
43. |
What are the four characteristics of Big Data? |
Answer» | |
44. |
What is Big Data? |
Answer» | |
45. |
What happens to job tracker when Namenode is down? |
Answer» | |
46. |
What if a Namenode has no data? |
Answer» | |
47. |
What does /etc /init.d do? |
Answer» | |
48. |
Which are the three modes in which Hadoop can be run? |
Answer» | |
49. |
How to restart Namenode? |
Answer» | |
50. |
What does ‘jps’ command do? |
Answer» | |