Explore topic-wise InterviewSolutions in .

This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

1.

Where can the metastore database be hosted?

Answer»
2.

Which database the sqoop metastore runs on?

Answer»
3.

Give the sqoop command to see the content of the job named myjob?

Answer»
4.

How can you see the list of stored jobs in sqoop metastore?

Answer»
5.

What is the purpose of sqoop-merge?

Answer»
6.

What is a sqoop metastore?

Answer»
7.

Give a command to execute a stored procedure named proc1 which exports data to from MySQL db named DB1 into a HDFS directory named Dir1.

Answer»
8.

Give a sqoop command to import data from all tables in the MySql DB DB1.

Answer»
9.

Give a Sqoop command to import all the records from employee table divided into groups of records by the values in the column department_id.

Answer»
10.

What does the following query do?

Answer»
11.

Give a sqoop command to run only 8 mapreduce tasks in parallel

Answer»
12.

Give a sqoop command to import the columns employee_id,first_name,last_name from the MySql table Employee

Answer»
13.

What are the two file formats supported by sqoop for import?

Answer»
14.

How to import only the updated rows form a table into HDFS using sqoop assuming the source has last update timestamp details for each row?

Answer»
15.

How can you control the mapping between SQL data types and Java types?

Answer»
16.

What happens when a table is imported into a HDFS directory which already exists using the –apend parameter?

Answer»
17.

What does this sqoop command achieve?

Answer»
18.

What is the importance of --split-by clause in running parallel import tasks in sqoop?

Answer»
19.

In a sqoop import command you have mentioned to run 8 parallel Mapreduce task but sqoop runs only 4. What can be the reason?

Answer»
20.

How can you force sqoop to execute a free form Sql query only once and import the rows serially.

Answer»
21.

What do you mean by Free Form Import in Sqoop?

Answer»
22.

Give a sqoop command to show all the databases in a MySql server.

Answer»
23.

Sqoop imported a table successfully to HBase but it is found that the number of rows is fewer than expected. What can be the cause?

Answer»
24.

How can you schedule a sqoop job using Oozie?

Answer»
25.

How can we load to a column in a relational table which is not null but the incoming value from HDFS has a null value?

Answer»
26.

How can you export only a subset of columns to a relational table using sqoop?

Answer»
27.

How can you sync a exported table with HDFS data in which some rows are deleted?

Answer»
28.

How will you update the rows that are already exported?

Answer»
29.

How do you clear the data in a staging table before loading it by Sqoop?

Answer»
30.

How will you implement all-or-nothing load using sqoop?

Answer»
31.

What is the difference between the parameters sqoop.export.records.per.statement and sqoop.export.statements.per.transaction

Answer»
32.

Before starting the data transfer using mapreduce job, sqoop takes a long time to retrieve the minimum and maximum values of columns mentioned in –split-by parameter. How can we make it efficient?

Answer»
33.

How can you choose a name for the mapreduce job which is created on submitting a free-form query import?

Answer»
34.

How can we slice the data to be imported to multiple parallel tasks?

Answer»
35.

How do you fetch data which is the result of join between two tables?

Answer»
36.

Is it possible to add a parameter while running a saved job?

Answer»
37.

What is the usefulness of the options file in sqoop.

Answer»
38.

When the source data keeps getting updated frequently, what is the approach to keep it in sync with the data in HDFS imported by sqoop?

Answer»
39.

How can you avoid importing tables one-by-one when importing a large number of tables from a database?

Answer»
40.

How can you control the number of mappers used by the sqoop command?

Answer»
41.

What is a disadvantage of using --direct parameter for faster data load by sqoop?

Answer»
42.

What is the significance of using --compress-codec parameter?

Answer»
43.

What is the default extension of the files produced from a sqoop import using the --compress parameter?

Answer»
44.

What is the advantage of using --password-file rather than -P option while preventing the display of password in the sqoop import statement?

Answer»
45.

How can we import a subset of rows from a table without using the where clause?

Answer»
46.

How can you import only a subset of rows form a table?

Answer»
47.

When to use --target-dir and when to use --warehouse-dir while importing data?

Answer»
48.

Is JDBC driver enough to connect sqoop to the databases?

Answer»
49.

What is the role of JDBC driver in a Sqoop set up?

Answer»