Explore topic-wise InterviewSolutions in .

This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

1.

Define three step to deploy Big Data solution?

Answer»

Define three step to deploy Big Data solution?
Below are the three step to deploy solution in Big Data:-
(1)Data Ingestion:-This is the FIRST step where we begin with collection of data from multiple sources. And sources can be social media platforms, Log Files, business documents and any other data relevant to our business. Here we can extract data through real-time streaming or from any BATCH jobs.
(2)Data STORAGE:-After the first step data is extracted now our job is to store data in a database. Herer we can do through HDFS or HBase. Here we USE HDFS storage if we need sequential access and HBase is best for random read/write access.
(3)Data Processing:-This is the last step and this will relates to data processing. And data processing is usually done with framework like Hadoop, Spark, MapReduce, Flink and Pig etc.

2.

Name the different platforms which deal with Big Data with there features?

Answer»

Name the different platforms which deal with Big Data with there features?
Below are the 8 different platform which deal with Big Data:-
(1)Hadoop:-Hadoop is open source, High Scalable, Runs on commodity hardware and has a good ecosystem.
(2)HPCC:-HPCC is a open source, Good alternative to Hadoop, Parallelism at data, pipeline and System Level, High-Performance Online QUERY Applications
(3)Storm:-Storm is open source, Log Processing, Distributed Stream processing and Real-Time analytics.
(4)CDH:-CDH is licence based but also AVAILABLE free with limited version, Easy implementation, It is more secure and Cloudera Manager for easy administration
(5)HDP:-HDP is licence based but free with limited free version, It have Data Analytics Studio, It have DASHBOARD with Ambari UI, HDP Sandbox available for VirtualBox, VMware, Docker
(6)MapR:-MapR is licence based but free with limited version, It have features of AI and ML, It Open APIs and it have on-premise and cloud support.
(7)Cassandra:-Cassandra is open source, EXAMPLE of this is NoSQL Database, It have feature of Log-structured storage and this will includes cassandra struture language(CQL).
(8)MongoDB:-MongoDB is licence based and also have option of open source, This have NoSQL database, this have a feature of document oriented and have feature of Aggregation pipeline.