InterviewSolution
This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.
| 1. |
Explain in short about Snowflake Clustering. |
|
Answer» In SNOWFLAKE, clustering is a type of DATA partitioning, where unique cluster keys are SPECIFIED for each table. Cluster keys are subsets of a table's columns that are used to co-locate data within the table. These keys are APPROPRIATE for comprehensive TABLES. The process of managing clustered data in a table is known as re-clustering. |
|
| 2. |
Snowflake is what kind of database? |
|
Answer» All Snowflake's FEATURES are built on top of SQL (Structured Query Language) databases. The data in this relational database SYSTEM is stored in columns and it is compatible with other tools, including Excel and Tableau. As a SQL database, Snowflake contains a query TOOL, supports multi-statement transactions, PROVIDES role-based security, etc. |
|
| 3. |
Is snowflake OLTP (Online Transactional Processing) or OLAP (Online Analytical Processing)? |
|
Answer» Snowflake is developed as an OLAP (Online Analytical Processing) database system, not as an OLTP (Online TRANSACTION Processing) database system. In OLTP (Online Transaction Processing), data is collected, stored, and processed from real-time transactions, but in OLAP (Online Analytical Processing), complex queries are used to evaluate aggregated historical data from OLTP systems. Snowflake is not designed to handle much updating and inserting of SMALL amounts of data like a transactional database would. Snowflake, for INSTANCE, cannot handle referential integrity because, even though it supports integrity and other constraints, they are not ENFORCED except the NOT NULL CONSTRAINT that is always enforced. Other constraints than NOT NULL are created as disabled constraints. However, depending on the use, we may also use it for online transaction processing (OLTP). |
|
| 4. |
What do you mean by Horizontal and Vertical Scaling? |
Answer»
|
|
| 5. |
Which ETL tools are compatible with Snowflake? |
|
Answer» SNOWFLAKE is compatible with the following ETL TOOLS:
|
|
| 6. |
Is Snowflake an ETL (Extract, Transform, and Load) tool? |
|
Answer» Yes, Snowflake is an ETL (Extract, TRANSFORM, and Load) tool, since it is performed in three steps, including:
|
|
| 7. |
In Snowflake, how are data and information secured? |
|
Answer» Every organization considers DATA security to be one of its TOP priorities. The Snowflake platform adheres to the best security standards in the industry to encrypt and safeguard customer data. The platform provides the best key management features at no additional charge. To protect client data, Snowflake EMPLOYS the FOLLOWING security measures:
|
|
| 8. |
Which cloud platforms does Snowflake currently support? |
|
Answer» SNOWFLAKE currently supports the following cloud platforms: |
|
| 9. |
What do you mean by Snowflake Computing? |
|
Answer» The TERM SNOWFLAKE computing refers to Snowflake's ability to provide instant, secure, and governed access to all data networks, ALONG with its core architecture that enables multiple types of data workloads and offers a unified platform for modern data applications. In contrast to other data warehouses, Snowflake does not use a database or "big data" software platform such as Hadoop. Snowflake, however, combines an entirely new SQL QUERY engine with a natively cloud-based architecture. |
|
| 10. |
Explain Snowpipe. |
|
Answer» In simple terms, Snowpipe is a continuous data ingestion service provided by Snowflake that LOADS files within minutes as soon as they are added to a stage and submitted for ingestion. Therefore, you can load data from files in micro-BATCHES (organizing data into small groups/matches), allowing users to access the data within minutes (very less response time), rather than MANUALLY running COPY statements on a schedule to load large batches. By loading the data into micro-batches, Snowpipe makes it EASIER to analyze it. Snowpipe uses a combination of filenames and file checksums to ensure that only new data is PROCESSED. Advantages of Snowpipe -
|
|
| 11. |
Explain stages in Snowflake. |
|
Answer» STAGES are locations in Snowflake where data is stored, and staging is the process of uploading data into a stage. Data that needs to be loaded or stored within Snowflake is stored either elsewhere in the other cloud regions LIKE in AWS (Amazon Web Service) S3, GCP (Google Cloud Platform), or AZURE, or is stored internally within Snowflake. When data is stored in another cloud region, this is known as an EXTERNAL stage; when it is stored inside a snowflake, it is known as an internal stage. Internal stages can be further categorized as follows:
|
|
| 12. |
What is the difference between Snowflake and Redshift? |
||||||||||
|
Answer» Cloud-based data warehouses are becoming increasingly popular, with Redshift and Snowflake being two of the biggest players. These are large data analytics databases capable of analyzing and READING vast amounts of data. Snowflake vs Redshift -
|
|||||||||||
| 13. |
Can you tell me how to access the Snowflake Cloud data warehouse? |
|
Answer» Snowflake's data warehouse can be accessed using the following ways:
|
|
| 14. |
What do you mean by virtual warehouse? |
|
Answer» A VIRTUAL warehouse is basically a collection of computing resources (like CPU, memory, Solid state drive, etc.) that customers can access to run queries, load data, and perform other DML (Data Manipulation Language) and SQL (Structured Query Language) OPERATIONS. For example, it provides memory, temporary storage, and CPU resources that can be used for DML operations and SQL execution. You can use this independent compute cluster at any time and then TURN it off when not needed. You are charged (paid) for each virtual warehouse you run, their size, and how long they run. Virtual warehouses do not SHARE their compute resources with each other, and therefore, their performance is independent of each other. As shown in the FOLLOWING diagram, different groups of users can be assigned separate and dedicated virtual warehouses. Therefore, ETL processes can continuously load and execute complex transformation procedures on separate warehouses, ensuring no impact on data scientists or finance reports. |
|
| 15. |
Explain Snowflake Architecture. |
|
Answer» The Snowflake architecture is a hybrid of shared-disk (A common disk or storage device is shared by all computing nodes) and shared-nothing (Each computing node has a private MEMORY and storage space) database architecture in order to combine the best of both. Snowflake utilizes a central data repository for persistent data, which is available to all compute nodes similar to a shared-disk architecture. But, equally, as with shared-nothing architectures, Snowflake uses massively parallel computing (MPP) clusters for query processing, in which each node stores part of the whole data set locally. The Snowflake architecture is divided into three key layers as shown below:
|
|
| 16. |
What are the essential features of Snowflake? |
|
Answer» Snowflake has the following key FEATURES:
|
|