

InterviewSolution
1. |
Where can the metastore database be hosted? |
Answer» | |
2. |
Which database does the Sqoop metastore run on? |
Answer» | |
3. |
Give the sqoop command to see the contents of the job named myjob. |
Answer» | |
4. |
How can you see the list of stored jobs in sqoop metastore? |
Answer» | |
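As a minimal sketch for the two questions above, the `sqoop job` tool accepts `--list` and `--show <job-id>`. The helper below only builds and prints the argument lists; the function name `job_command` is hypothetical, and a shared metastore would additionally need a `--meta-connect` JDBC URI that is not shown here.

```python
import shlex


def job_command(action, job_name=None):
    """Build the argv for `sqoop job`; action is 'list' or 'show'."""
    if action == "list":
        return ["sqoop", "job", "--list"]
    if action == "show":
        if not job_name:
            raise ValueError("'show' needs a job name")
        return ["sqoop", "job", "--show", job_name]
    raise ValueError(f"unsupported action: {action}")


if __name__ == "__main__":
    # Print the commands instead of running them, so no Sqoop install is needed.
    print(shlex.join(job_command("list")))
    print(shlex.join(job_command("show", "myjob")))
```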
5. |
What is the purpose of sqoop-merge? |
Answer» | |
6. |
What is a sqoop metastore? |
Answer» | |
7. |
Give a command to execute a stored procedure named proc1 that exports data from an HDFS directory named Dir1 into a MySQL database named DB1. |
Answer» | |
8. |
Give a sqoop command to import data from all tables in the MySQL database DB1. |
Answer» | |
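One way to approach this is the `import-all-tables` tool, sketched below as an argument list. The host `db.example.com`, the user `etl_user`, and the HDFS path are assumptions made up for illustration; only the database name DB1 comes from the question.

```python
import shlex


def import_all_tables(jdbc_url, username, warehouse_dir):
    """Build the argv for importing every table of one database."""
    return [
        "sqoop", "import-all-tables",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                              # prompt for the password at run time
        "--warehouse-dir", warehouse_dir,  # parent directory; one subdirectory per table
    ]


if __name__ == "__main__":
    # Hypothetical host, user, and HDFS path.
    cmd = import_all_tables("jdbc:mysql://db.example.com/DB1", "etl_user", "/user/etl/DB1")
    print(shlex.join(cmd))
```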
9. |
Give a Sqoop command to import all the records from the employee table, split into groups of records by the values in the column department_id. |
Answer» | |
10. |
What does the following query do? |
Answer» | |
11. |
Give a sqoop command that runs only 8 MapReduce tasks in parallel. |
Answer» | |
12. |
Give a sqoop command to import the columns employee_id, first_name, and last_name from the MySQL table Employee. |
Answer» | |
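The two questions above (limiting parallelism and selecting columns) can be sketched together with the `--num-mappers` and `--columns` options of `sqoop import`. The connection URL and user name are hypothetical; the table and column names come from the question.

```python
import shlex


def import_columns(jdbc_url, username, table, columns, num_mappers):
    """Build the argv for importing selected columns with a fixed mapper count."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",
        "--table", table,
        "--columns", ",".join(columns),     # comma-separated, no spaces
        "--num-mappers", str(num_mappers),  # short form: -m
    ]


if __name__ == "__main__":
    cmd = import_columns(
        "jdbc:mysql://db.example.com/DB1",  # hypothetical host and database
        "etl_user",                         # hypothetical user
        "Employee",
        ["employee_id", "first_name", "last_name"],
        8,
    )
    print(shlex.join(cmd))
```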
13. |
What are the two file formats supported by sqoop for import? |
Answer» | |
14. |
How can you import only the updated rows from a table into HDFS using Sqoop, assuming the source has a last-update timestamp for each row? |
Answer» | |
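A hedged sketch of this scenario uses Sqoop's incremental import in `lastmodified` mode, which takes a timestamp column via `--check-column` and a starting point via `--last-value`. The table name `orders`, the column `last_update_ts`, and the connection details below are assumptions for illustration only.

```python
import shlex


def incremental_import(jdbc_url, table, check_column, last_value, target_dir):
    """Build the argv for importing rows modified after last_value."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--table", table,
        "--incremental", "lastmodified",
        "--check-column", check_column,  # timestamp column updated on every change
        "--last-value", last_value,      # rows newer than this are imported
        "--target-dir", target_dir,
    ]


if __name__ == "__main__":
    cmd = incremental_import(
        "jdbc:mysql://db.example.com/DB1",  # hypothetical connection
        "orders",                           # hypothetical table
        "last_update_ts",                   # hypothetical timestamp column
        "2024-01-01 00:00:00",
        "/user/etl/orders",
    )
    print(shlex.join(cmd))
```

Saving this as a Sqoop job is a common design choice, since a saved job keeps track of the last value between runs instead of requiring it on each invocation.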
15. |
How can you control the mapping between SQL data types and Java types? |
Answer» | |
16. |
What happens when a table is imported into an HDFS directory that already exists, using the --append parameter? |
Answer» | |
17. |
What does this sqoop command achieve? |
Answer» | |
18. |
What is the importance of --split-by clause in running parallel import tasks in sqoop? |
Answer» | |
19. |
In a sqoop import command you asked for 8 parallel MapReduce tasks, but Sqoop runs only 4. What can be the reason? |
Answer» | |
20. |
How can you force Sqoop to execute a free-form SQL query only once and import the rows serially? |
Answer» | |
21. |
What do you mean by Free Form Import in Sqoop? |
Answer» | |
22. |
Give a sqoop command to show all the databases in a MySQL server. |
Answer» | |
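A minimal sketch, assuming a reachable MySQL server: the `list-databases` tool takes a server-level JDBC URL. The host `db.example.com` and the user `etl_user` are hypothetical.

```python
import shlex


def list_databases(jdbc_url, username):
    """Build the argv for listing the databases visible to the given user."""
    return [
        "sqoop", "list-databases",
        "--connect", jdbc_url,
        "--username", username,
        "-P",  # prompt for the password instead of putting it on the command line
    ]


if __name__ == "__main__":
    print(shlex.join(list_databases("jdbc:mysql://db.example.com/", "etl_user")))
```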
23. |
Sqoop imported a table successfully to HBase but it is found that the number of rows is fewer than expected. What can be the cause? |
Answer» | |
24. |
How can you schedule a sqoop job using Oozie? |
Answer» | |
25. |
How can we load data into a NOT NULL column of a relational table when the incoming value from HDFS is null? |
Answer» | |
26. |
How can you export only a subset of columns to a relational table using sqoop? |
Answer» | |
27. |
How can you sync an exported table with HDFS data in which some rows are deleted? |
Answer» | |
28. |
How will you update the rows that are already exported? |
Answer» | |
29. |
How do you clear the data in a staging table before loading it by Sqoop? |
Answer» | |
30. |
How will you implement all-or-nothing load using sqoop? |
Answer» | |
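For the two staging-related questions above, a hedged sketch: `sqoop export` accepts `--staging-table`, so rows land in an intermediate table first and are moved to the target only if the export succeeds, and `--clear-staging-table` empties that intermediate table before the load. The table names `sales` and `sales_stage` and the paths are assumptions for illustration.

```python
import shlex


def staged_export(jdbc_url, table, staging_table, export_dir):
    """Build the argv for an export that goes through a staging table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--table", table,
        "--staging-table", staging_table,  # must have the same structure as the target
        "--clear-staging-table",           # delete leftover rows before loading
        "--export-dir", export_dir,
    ]


if __name__ == "__main__":
    cmd = staged_export(
        "jdbc:mysql://db.example.com/DB1",  # hypothetical connection
        "sales",                            # hypothetical target table
        "sales_stage",                      # hypothetical staging table
        "/user/etl/sales",
    )
    print(shlex.join(cmd))
```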
31. |
What is the difference between the parameters sqoop.export.records.per.statement and sqoop.export.statements.per.transaction? |
Answer» | |
32. |
Before starting the data transfer with a MapReduce job, Sqoop takes a long time to retrieve the minimum and maximum values of the column named in the --split-by parameter. How can we make this more efficient? |
Answer» | |
33. |
How can you choose a name for the MapReduce job that is created when a free-form query import is submitted? |
Answer» | |
34. |
How can we slice the data to be imported to multiple parallel tasks? |
Answer» | |
35. |
How do you fetch data that is the result of a join between two tables? |
Answer» | |
36. |
Is it possible to add a parameter while running a saved job? |
Answer» | |
37. |
What is the use of the options file in Sqoop? |
Answer» | |
38. |
When the source data keeps getting updated frequently, what is the approach to keep it in sync with the data in HDFS imported by sqoop? |
Answer» | |
39. |
How can you avoid importing tables one-by-one when importing a large number of tables from a database? |
Answer» | |
40. |
How can you control the number of mappers used by the sqoop command? |
Answer» | |
41. |
What is a disadvantage of using the --direct parameter for faster data loads with Sqoop? |
Answer» | |
42. |
What is the significance of using the --compression-codec parameter? |
Answer» | |
43. |
What is the default extension of the files produced from a sqoop import using the --compress parameter? |
Answer» | |
44. |
What is the advantage of using --password-file rather than the -P option to keep the password out of the sqoop import statement? |
Answer» | |
45. |
How can we import a subset of rows from a table without using the where clause? |
Answer» | |
46. |
How can you import only a subset of rows from a table? |
Answer» | |
47. |
When should you use --target-dir and when --warehouse-dir while importing data? |
Answer» | |
48. |
Is a JDBC driver enough to connect Sqoop to a database? |
Answer» | |
49. |
What is the role of the JDBC driver in a Sqoop setup? |
Answer» | |