InterviewSolution
This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.
| 1. |
What do you understand about a data cube in the context of data warehousing? |
|
Answer» A data cube is a multidimensional data MODEL that stores optimized, SUMMARIZED, or aggregated data for quick and EASY ANALYSIS using OLAP technologies. The precomputed data is stored in a data cube, which makes online analytical processing easier. We all think of a cube as a three-dimensional structure, however in data warehousing, an n-dimensional data cube can be implemented. A data cube stores information in terms of dimensions and facts. Data Cubes have two categories. They are as follows :
|
|
| 2. |
What do you mean by snowflake schema in the context of data warehousing? |
|
Answer» Snowflake Schema is a MULTIDIMENSIONAL model that is also used in data WAREHOUSES. The FACT TABLES, dimension tables, and sub dimension tables are all contained in the snowflake schema. With fact tables, dimension tables, and sub-dimension tables, this schema forms a snowflake. |
|
| 3. |
Explain what you mean by a star schema in the context of data warehousing. |
|
Answer» Star schema is a sort of MULTIDIMENSIONAL model and is USED in a data warehouse. The fact tables and dimension tables are both contained in the star schema. There are fewer foreign-key JOINS in this design. With fact and dimension tables, this schema forms a star. |
|
| 4. |
Enlist some of the renowned ETL tools currently used in the industry. |
|
Answer» Some of the renowned ETL tools currently used in the INDUSTRY are as FOLLOWS :
|
|
| 5. |
Enlist a few data warehouse solutions that are currently being used in the industry. |
|
Answer» Some of the major data WAREHOUSE SOLUTIONS currently being used in the INDUSTRY are as follows : |
|
| 6. |
What do you understand about metadata and why is it used for? |
|
Answer» Metadata is defined as information about data. Metadata is the context that provides data a more complete identity and serves as the foundation for its interactions with other data. It can also be a useful tool for saving TIME, staying organised, and getting the most out of the files you're working with. Structural Metadata describes how an object should be classified in ORDER to fit into a wider system of things. Structural Metadata makes a LINK with other files that allows them to be categorized and used in a variety of WAYS. Administrative Metadata contains information about an object's history, who owned it previously, and what it can be used for. Rights, licences, and permissions are EXAMPLES. This information is useful for persons who are in charge of managing and caring for an asset. When a piece of information is placed in the correct context, it takes on a whole new meaning. Furthermore, better-organized Metadata will considerably reduce search time. |
|
| 7. |
What are the characteristics of a data warehouse? |
|
Answer» Following are the characteristics of a data warehouse:-
|
|
| 8. |
What do you mean by Active Data Warehousing? |
|
Answer» The technical capacity to collect TRANSACTIONS as they change and integrate them into the WAREHOUSE, as well as maintaining batch or planned cycle refreshes, is known as active data warehousing. Automating routine processes and choices is possible with an active data warehouse. The active data warehouse sends decisions to the On-Line Transaction Processing (OLTP) systems AUTOMATICALLY. An active data warehouse is designed to capture and distribute data in real time. They give you a unified VIEW of your customers across all of your BUSINESS lines. Business Intelligence Systems are linked to it. |
|
| 9. |
What do you mean by Real time data warehousing? |
|
Answer» A system that reflects the condition of the warehouse in REAL time is REFERRED to as real-time data WAREHOUSING. If you perform a query on the real-time data warehouse to learn more about a specific aspect of the company or entity described by the warehouse, the result reflects the status of that entity at the time the query was run. Most data warehouses contain data that is highly latent — that is, data that reflects the BUSINESS at a specific point in time. A real-time data warehouse provides current (or real-time) data with LOW latency. |
|
| 10. |
What do you mean by a factless fact table in the context of data warehousing? |
|
Answer» A fact table with no measures is known as a factless fact table. It's essentially a crossroads of dimensions (it contains nothing but dimensional KEYS). One form of factless table is used to capture an event, while the other is used to describe conditions. In the first type of factless fact table, there is no measured value for an event, but it develops the relationship among the dimension members from several dimensions. The existence of the relationship is itself the fact. This type of fact table can be utilised to create valuable reports on its own. Various CRITERIA can be used to count the number of occurrences. The second type of factless fact table is a tool that's used to back up negative ANALYTICAL reports. Consider a store that did not sell a product for a period of time. To create such a report, you'll need a factless fact table that CAPTURES all of the conceivable product COMBINATIONS that were on offer. By comparing the factless table to the sales table for the list of things that did sell, you can figure out what's missing. |
|
| 11. |
Differentiate between data warehouse and database. |
||||||||||||||||||
|
Answer» Database: A database is a logically organized collection of structured data kept electronically in a computer SYSTEM. A database MANAGEMENT system is usually in charge of a database (DBMS). The data, the DBMS, and the applications that go with them are referred to as a database system, which is commonly abbreviated to just a database. The following table ENLISTS the difference between data warehouse and database:-
|
|||||||||||||||||||
| 12. |
What are the different types of data marts in the context of data warehousing? |
|
Answer» Following are the different types of data mart in data warehousing:
|
|
| 13. |
What are the different types of data warehouse? |
|
Answer» Following are the different types of data warehouse:
|
|
| 14. |
What are the disadvantages of using a data warehouse? |
|
Answer» Following are the disadvantages of using a data warehouse:-
|
|
| 15. |
What are the advantages of a data warehouse? |
|
Answer» Following are the advantages of using a data warehouse:
|
|
| 16. |
Differentiate between fact table and dimension table. |
||||||||||||||||||||||||
|
Answer» The record of a reality or fact table could be made up of attributes from various dimension tables. The Fact Table, also known as the Reality Table, assists the user in investigating the business aspects that aid him in call taking in order to improve his FIRM. Dimension Tables, on the other hand, make it easier for the reality table or fact table to collect dimensions from which measurements must be taken. The following table enlists the difference between a fact table and a dimension table:-
|
|||||||||||||||||||||||||
| 17. |
What are the different types of dimension tables in the context of data warehousing? |
|
Answer» Following are the different types of dimension tables in the context of data warehousing:-
|
|
| 18. |
What do you mean by dimension table in the context of data warehousing? What are the advantages of using a dimension table? |
|
Answer» A table in a DATA warehouse's star SCHEMA is referred to as a dimension table. Dimensional data models, which are made up of fact and dimension tables, are used to create data warehouses. Dimension tables contain dimension keys, values, and attributes and are used to describe dimensions. It is usually of a tiny size. The number of rows might range from a few to thousands. It is a DESCRIPTION of the objects in the fact table. The term "dimension table" refers to a collection or group of data pertaining to any quantifiable occurrence. They serve as the foundation for dimensional modelling. It includes a column that serves as a primary key, allowing each dimension ROW or record to be uniquely identified. Through this key, it is linked to the fact tables. When it's constructed, a system-generated key called the surrogate key is used to uniquely identify the rows in the dimension. Following are the advantages of using a dimension table :
|
|
| 19. |
What do you understand about a fact table in the context of a data warehouse? What are the different types of fact tables? |
|
Answer» In a Data Warehouse system, a Fact table is simply a table that holds all of the facts or business information that can be exposed to reporting and analysis when needed. Fields that reflect direct facts, as well as foreign fields that connect the fact table to other dimension tables in the Data Warehouse system, are stored in these tables. Depending on the model type used to construct the Data Warehouse, a Data Warehouse system can have ONE or more fact tables. Following are the three types of fact tables:-
|
|
| 20. |
What do you mean by OLAP in the context of data warehousing? What guidelines should be followed while selecting an OLAP system? |
|
Answer» OLAP is an acronym for On-Line Analytical Processing. OLAP is a software technology classification that allows analysts, managers, and executives to get insight into information through quick, reliable, interactive access to data that has been converted from raw data to reflect the true dimensionality of the company as perceived by the clients. OLAP allows for multidimensional examination of corporate data while also allowing for complex estimations, trend analysis, and advanced data modelling. It's rapidly improving the foundation for Intelligent Solutions, which includes Business Performance Management, Strategy, Budgeting, Predicting, Financial Documentation, Analysis, Modeling, Knowledge Discovery, and Data Warehouses Reporting. End-clients can use OLAP to perform ad hoc record analysis in several dimensions, giving them the information and understanding they need to make better choices. Following guidelines must be followed while selecting an OLAP system:-
|
|
| 21. |
What do you mean by data mining? Differentiate between data mining and data warehousing. |
||||||||||||
|
Answer» Data mining is the process of collecting information in order to find PATTERNS, trends, and usable data that will help a company to make data-driven decisions from large amounts of data. In other WORDS, Data Mining is the method of analysing hidden patterns of data from VARIOUS perspectives for categorization into useful data, which is gathered and assembled in specific areas such as data warehouses, EFFICIENT analysis, data mining algorithm, assisting decision making, and other data requirements, ultimately resulting in cost-cutting and revenue generation. Data mining is the process of automatically examining enormous amounts of data for patterns and trends that go beyond simple analysis. Data mining estimates the probability of future events by utilising advanced mathematical algorithms for data segments. Following are the differences between data warehousing and data mining:-
|
|||||||||||||