InterviewSolution
1. How to process Big Data?
Answer» There are various frameworks for Big Data processing. One of the most popular is MapReduce. It consists of two main phases, the Map phase and the Reduce phase, with an intermediate Shuffle phase between them. The given job is divided into map tasks and reduce tasks:
The input is divided into splits of fixed size, and each split is given to one mapper. The mappers run in parallel, so the execution time is drastically reduced and the output is produced much faster. The input to a mapper is a key-value pair, and its output is another set of key-value pairs. This intermediate result is then shuffled (grouped by key) and passed to the reducers. The output of the reducers is the desired final result.
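
The Map, Shuffle, and Reduce phases described above can be sketched in plain Python with the classic word-count job. This is a minimal single-machine illustration, not a real framework: the function names (`mapper`, `reducer`, `run_job`) and the in-memory shuffle are illustrative assumptions; in a real MapReduce system the splits would run on separate machines in parallel.

```python
from collections import defaultdict

# Illustrative word-count sketch of the Map -> Shuffle -> Reduce flow.
# All names here are hypothetical, not part of any framework's API.

def mapper(offset, line):
    # Input key-value pair: (byte/line offset, line of text).
    # Output: intermediate (word, 1) pairs.
    for word in line.split():
        yield (word, 1)

def reducer(word, counts):
    # Aggregates all intermediate values for one key.
    yield (word, sum(counts))

def run_job(splits):
    # Map phase: in a real cluster, each split runs on its own mapper in parallel.
    intermediate = []
    for split in splits:
        for offset, line in enumerate(split):
            intermediate.extend(mapper(offset, line))
    # Shuffle phase: group the intermediate pairs by key.
    grouped = defaultdict(list)
    for key, value in intermediate:
        grouped[key].append(value)
    # Reduce phase: one reducer invocation per distinct key.
    result = {}
    for key, values in grouped.items():
        for k, v in reducer(key, values):
            result[k] = v
    return result

splits = [["big data is big"], ["data processing"]]
print(run_job(splits))  # {'big': 2, 'data': 2, 'is': 1, 'processing': 1}
```

Note how the reducer never sees raw mapper output directly: the shuffle step guarantees that all values for a given key arrive together, which is what lets reducers also run independently in parallel.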