50 + Interview Questions in Netezza Interview Questions in Netezza Tutorial

1.	What Are 4 Environment Variables That Are Required. What Are Different States On Netezza?
Answer» Environment VARIABLES: NZ_HOST, NZ_DATABASE, NZ_USER and NZ_PASSWORD Online: Normal or usual state. Stopped: Netezza will shutdown after completing current queries, no new queries allowed. Offline: WAITS for COMPLETION of current queries, new or queries in queue receive error. Paused: Same as above, but no error displayed. TYPICALLY caused during Netezza bootup or startup. Down: Just plain down, could be due to Netezza server problem or user initiated. Environment variables: NZ_HOST, NZ_DATABASE, NZ_USER and NZ_PASSWORD

1.

What Are 4 Environment Variables That Are Required. What Are Different States On Netezza?

Answer»

Environment VARIABLES: NZ_HOST, NZ_DATABASE, NZ_USER and NZ_PASSWORD

Online: Normal or usual state.
Stopped: Netezza will shutdown after completing current queries, no new queries allowed.
Offline: WAITS for COMPLETION of current queries, new or queries in queue receive error.
Paused: Same as above, but no error displayed. TYPICALLY caused during Netezza bootup or startup.
Down: Just plain down, could be due to Netezza server problem or user initiated.

Environment variables: NZ_HOST, NZ_DATABASE, NZ_USER and NZ_PASSWORD

Discussion

2.	Explain Fpga And How Is It Useful For Query Performance?
Answer» FPGA: Field Programmable GATE Array (FPGA) is located on each SPU. Netezza is different from other architectures. Netezza can do a “hardware upgrade” through software by using FPGA. Hardware is reconfigured during install. While reading data from disk, FPGA on each SPU ALSO HELPS in ‘filtering’ unnecessary data before GETTING loaded into memory on each SPU. This way, FPGA does not overwhelm with all the data from disk. FPGA: Field Programmable Gate Array (FPGA) is located on each SPU. Netezza is different from other architectures. Netezza can do a “hardware upgrade” through software by using FPGA. Hardware is reconfigured during install. While reading data from disk, FPGA on each SPU also helps in ‘filtering’ unnecessary data before getting loaded into memory on each SPU. This way, FPGA does not overwhelm with all the data from disk.

2.

Explain Fpga And How Is It Useful For Query Performance?

Answer»

FPGA: Field Programmable GATE Array (FPGA) is located on each SPU. Netezza is different from other architectures. Netezza can do a “hardware upgrade” through software by using FPGA. Hardware is reconfigured during install.

While reading data from disk, FPGA on each SPU ALSO HELPS in ‘filtering’ unnecessary data before GETTING loaded into memory on each SPU. This way, FPGA does not overwhelm with all the data from disk.

FPGA: Field Programmable Gate Array (FPGA) is located on each SPU. Netezza is different from other architectures. Netezza can do a “hardware upgrade” through software by using FPGA. Hardware is reconfigured during install.

While reading data from disk, FPGA on each SPU also helps in ‘filtering’ unnecessary data before getting loaded into memory on each SPU. This way, FPGA does not overwhelm with all the data from disk.

Discussion

3.	What Is A Zone Map?
Answer» Zone map in Netezza is similar (concept wise) to partitions in Oracle. Netezza maintains map for DATA so that it does relies on zone map to pull only the range it is interested in. For example, if we need to pull out data from Jan 2009 TILL June 2009 from a table that is distributed on date column, zone map helps us to achieve this. Zone map is MAINTAINED by Netezza automagically, no user intervention needed. Zone mapping is done at a block (EXTENT) level. Netezza has zone maps for all COLUMNS (not just distributed column) and includes information such as minimum, maximum, total number of records. Zone map in Netezza is similar (concept wise) to partitions in Oracle. Netezza maintains map for data so that it does relies on zone map to pull only the range it is interested in. For example, if we need to pull out data from Jan 2009 till June 2009 from a table that is distributed on date column, zone map helps us to achieve this. Zone map is maintained by Netezza automagically, no user intervention needed. Zone mapping is done at a block (extent) level. Netezza has zone maps for all columns (not just distributed column) and includes information such as minimum, maximum, total number of records.

3.

What Is A Zone Map?

Answer»

Zone map in Netezza is similar (concept wise) to partitions in Oracle. Netezza maintains map for DATA so that it does relies on zone map to pull only the range it is interested in. For example, if we need to pull out data from Jan 2009 TILL June 2009 from a table that is distributed on date column, zone map helps us to achieve this. Zone map is MAINTAINED by Netezza automagically, no user intervention needed. Zone mapping is done at a block (EXTENT) level. Netezza has zone maps for all COLUMNS (not just distributed column) and includes information such as minimum, maximum, total number of records.

Zone map in Netezza is similar (concept wise) to partitions in Oracle. Netezza maintains map for data so that it does relies on zone map to pull only the range it is interested in. For example, if we need to pull out data from Jan 2009 till June 2009 from a table that is distributed on date column, zone map helps us to achieve this. Zone map is maintained by Netezza automagically, no user intervention needed. Zone mapping is done at a block (extent) level. Netezza has zone maps for all columns (not just distributed column) and includes information such as minimum, maximum, total number of records.

Discussion

4.	How Do You Deal With Historical Data, With Respect To Zone Maps?
Answer» Sort DATA first, BASED on historical data (for EXAMPLE, DATE) and load this in using NZLOAD. Sort data first, based on historical data (for example, date) and load this in using nzload.

4.

How Do You Deal With Historical Data, With Respect To Zone Maps?

Answer»

Sort DATA first, BASED on historical data (for EXAMPLE, DATE) and load this in using NZLOAD.

Sort data first, based on historical data (for example, date) and load this in using nzload.

Discussion

5.	Does Everything Gets Cached In Netezza (or Any Other Data Appliance)?
Answer» Typically only schema and other DATABASE objects are CACHED in appliances. Data is not cached, in general. In most cases, data is not saved any where (in any cache or on host COMPUTER) and is STREAMED directly from SPU to client software. Typically only schema and other database objects are cached in appliances. Data is not cached, in general. In most cases, data is not saved any where (in any cache or on host computer) and is streamed directly from SPU to client software.

5.

Does Everything Gets Cached In Netezza (or Any Other Data Appliance)?

Answer»

Typically only schema and other DATABASE objects are CACHED in appliances. Data is not cached, in general. In most cases, data is not saved any where (in any cache or on host COMPUTER) and is STREAMED directly from SPU to client software.

Typically only schema and other database objects are cached in appliances. Data is not cached, in general. In most cases, data is not saved any where (in any cache or on host computer) and is streamed directly from SPU to client software.

Discussion

6.	How Is Load Achieved In Netezza And Why Is That Quick / Fast?
Answer» Loads by pass a few steps that typically a query would go through (a query GOES through plan generation, optimization and transaction management). Loads are done in terms of “sets” and this set is based on underlying table structure (thus loads for two different tables are different as their sets are based on table structures). Data is processed to check format and distribution of records calculated very QUICKLY (in one step), fills into ‘set’ structure and writes to storage structure. Storage ALSO PERFORMS space availability and other admin tasks, all these operations go pretty quick (think of them as UNIX named pipes that streams data and SPU STORES these records). Loads by pass a few steps that typically a query would go through (a query goes through plan generation, optimization and transaction management). Loads are done in terms of “sets” and this set is based on underlying table structure (thus loads for two different tables are different as their sets are based on table structures). Data is processed to check format and distribution of records calculated very quickly (in one step), fills into ‘set’ structure and writes to storage structure. Storage also performs space availability and other admin tasks, all these operations go pretty quick (think of them as UNIX named pipes that streams data and SPU stores these records).

6.

How Is Load Achieved In Netezza And Why Is That Quick / Fast?

Answer»

Loads by pass a few steps that typically a query would go through (a query GOES through plan generation, optimization and transaction management). Loads are done in terms of “sets” and this set is based on underlying table structure (thus loads for two different tables are different as their sets are based on table structures). Data is processed to check format and distribution of records calculated very QUICKLY (in one step), fills into ‘set’ structure and writes to storage structure. Storage ALSO PERFORMS space availability and other admin tasks, all these operations go pretty quick (think of them as UNIX named pipes that streams data and SPU STORES these records).

Loads by pass a few steps that typically a query would go through (a query goes through plan generation, optimization and transaction management). Loads are done in terms of “sets” and this set is based on underlying table structure (thus loads for two different tables are different as their sets are based on table structures). Data is processed to check format and distribution of records calculated very quickly (in one step), fills into ‘set’ structure and writes to storage structure. Storage also performs space availability and other admin tasks, all these operations go pretty quick (think of them as UNIX named pipes that streams data and SPU stores these records).

Discussion

7.	When Are We Likely To Receive Incorrect (aggregate) Results?
Answer» Very rarely a driver may return aggregated results that are STILL getting processed BACK to client. In this case, client may assume that calculation is complete, INSTEAD of updating with latest or final results. Obviously, driver has to wait for Netezza to complete operation on host COMPUTER, before delivering results. Very rarely a driver may return aggregated results that are still getting processed back to client. In this case, client may assume that calculation is complete, instead of updating with latest or final results. Obviously, driver has to wait for Netezza to complete operation on host computer, before delivering results.

7.

When Are We Likely To Receive Incorrect (aggregate) Results?

Answer»

Very rarely a driver may return aggregated results that are STILL getting processed BACK to client. In this case, client may assume that calculation is complete, INSTEAD of updating with latest or final results. Obviously, driver has to wait for Netezza to complete operation on host COMPUTER, before delivering results.

Very rarely a driver may return aggregated results that are still getting processed back to client. In this case, client may assume that calculation is complete, instead of updating with latest or final results. Obviously, driver has to wait for Netezza to complete operation on host computer, before delivering results.

Discussion

8.	Does Netezza Support Concurrent Update Of The Same Record?
Answer» In CASE of conflict in which the same RECORD is set for MODIFICATION, Netezza rolls back recent TRANSACTION that is attempted on the same record, in fact same table. This is generally acceptable in DW environments. Netezza does support serialization transactions and does not permit dirty READS. In case of conflict in which the same record is set for modification, Netezza rolls back recent transaction that is attempted on the same record, in fact same table. This is generally acceptable in DW environments. Netezza does support serialization transactions and does not permit dirty reads.

8.

Does Netezza Support Concurrent Update Of The Same Record?

Answer»

In CASE of conflict in which the same RECORD is set for MODIFICATION, Netezza rolls back recent TRANSACTION that is attempted on the same record, in fact same table. This is generally acceptable in DW environments. Netezza does support serialization transactions and does not permit dirty READS.

In case of conflict in which the same record is set for modification, Netezza rolls back recent transaction that is attempted on the same record, in fact same table. This is generally acceptable in DW environments. Netezza does support serialization transactions and does not permit dirty reads.

Discussion

9.	Can A Group Become A Member Of Another Group In Netezza User Administration. Can We Use Same Group Name For Databases?
Answer» In Netezza, PUBLIC group is created automatically and every one is a memeber of this group by default. We can CREATE as many groups and any user can be MEMBER of any group(s). Group can not be a member of another group. Group NAMES, user names and database names are unique. That is, we can not have a database called sales and a group also called sales. In Netezza, public group is created automatically and every one is a memeber of this group by default. We can create as many groups and any user can be member of any group(s). Group can not be a member of another group. Group names, user names and database names are unique. That is, we can not have a database called sales and a group also called sales.

9.

Can A Group Become A Member Of Another Group In Netezza User Administration. Can We Use Same Group Name For Databases?

Answer»

In Netezza, PUBLIC group is created automatically and every one is a memeber of this group by default. We can CREATE as many groups and any user can be MEMBER of any group(s). Group can not be a member of another group. Group NAMES, user names and database names are unique. That is, we can not have a database called sales and a group also called sales.

In Netezza, public group is created automatically and every one is a memeber of this group by default. We can create as many groups and any user can be member of any group(s). Group can not be a member of another group. Group names, user names and database names are unique. That is, we can not have a database called sales and a group also called sales.

Discussion

10.	What Happens To Records That Are Loaded During Nzload Process, But Were Not Committed?
Answer» They are LOGICALLY deleted and ADMINISTRATOR can RUN nzreclaim, we MAY also TRUNCATE table. They are logically deleted and administrator can run nzreclaim, we may also truncate table.

10.

What Happens To Records That Are Loaded During Nzload Process, But Were Not Committed?

Answer»

They are LOGICALLY deleted and ADMINISTRATOR can RUN nzreclaim, we MAY also TRUNCATE table.

They are logically deleted and administrator can run nzreclaim, we may also truncate table.

Discussion

11.	How Can We Give A Global Permission To User Joe So That He Can Create Table In Any Database.?
Answer» Login into SYSTEM DATABASE and give that permission to USER by SAYING “grant CREATE table to joe;” Login into system database and give that permission to user by saying “grant create table to joe;”

11.

How Can We Give A Global Permission To User Joe So That He Can Create Table In Any Database.?

Answer»

Login into SYSTEM DATABASE and give that permission to USER by SAYING “grant CREATE table to joe;”

Login into system database and give that permission to user by saying “grant create table to joe;”

Discussion

12.	What Permission Will You Give To Connect To A Database?
Answer» List. Grant list, select on table to PUBLIC (if logged into SALES database, this allows all users to QUERY tables in sales database). List. Grant list, select on table to public (if logged into sales database, this allows all users to query tables in sales database).

12.

What Permission Will You Give To Connect To A Database?

Answer»

List. Grant list, select on table to PUBLIC (if logged into SALES database, this allows all users to QUERY tables in sales database).

List. Grant list, select on table to public (if logged into sales database, this allows all users to query tables in sales database).

Discussion

13.	Do We Need To Drop All Tables And Objects In That Database, Before Dropping A Database?
Answer» No, DROP DATABASE will TAKE CARE of it. No, drop database will take care of it.

13.

Do We Need To Drop All Tables And Objects In That Database, Before Dropping A Database?

Answer»

No, DROP DATABASE will TAKE CARE of it.

No, drop database will take care of it.

Discussion

14.	What Constraints On A Table Are Enforced?
Answer» Not NULL and default. NETEZZA does not apply PK and FK. Not null and default. Netezza does not apply PK and FK.

14.

What Constraints On A Table Are Enforced?

Answer»

Not NULL and default. NETEZZA does not apply PK and FK.

Not null and default. Netezza does not apply PK and FK.

Discussion

15.	Why Not Null Specification Is Better In Netezza?
Answer» Specifying not NULL RESULTS in better PERFORMANCE as NULL VALUES are tracked at rowheader level. Having NULL values results in STORING references to NULL values in header. If all columns are NOT NULL, then there is no record header. Specifying not null results in better performance as NULL values are tracked at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header.

15.

Why Not Null Specification Is Better In Netezza?

Answer»

Specifying not NULL RESULTS in better PERFORMANCE as NULL VALUES are tracked at rowheader level. Having NULL values results in STORING references to NULL values in header. If all columns are NOT NULL, then there is no record header.

Specifying not null results in better performance as NULL values are tracked at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header.

Discussion

16.	Create Table As (ctas), Does It Distribute Data Randomly Or Based On Table On Which It Received Data?
Answer» RESPONSE: Newly created table from CTAS gets DISTRIBUTION from the ORIGINAL table. Response: Newly created table from CTAS gets distribution from the original table.

16.

Create Table As (ctas), Does It Distribute Data Randomly Or Based On Table On Which It Received Data?

Answer»

RESPONSE: Newly created table from CTAS gets DISTRIBUTION from the ORIGINAL table.

Response: Newly created table from CTAS gets distribution from the original table.

Discussion

17.	Why Do You Prefer Truncate Instead Of Drop Table Command?
Answer» Just empties DATA from table, keeping table structure and PERMISSION intact. Just empties data from table, keeping table structure and permission intact.

17.

Why Do You Prefer Truncate Instead Of Drop Table Command?

Answer»

Just empties DATA from table, keeping table structure and PERMISSION intact.

Just empties data from table, keeping table structure and permission intact.

Discussion

18.	When No Distribution Clause Is Used While Creating A Table, What Distribution Is Used By Netezza?
Answer» FIRST COLUMN (same as in TERADATA). First column (same as in Teradata).

18.

When No Distribution Clause Is Used While Creating A Table, What Distribution Is Used By Netezza?

Answer»

FIRST COLUMN (same as in TERADATA).

First column (same as in Teradata).

Discussion

19.	Can We Update All Columns In A Netezza Table.?
Answer» No, the column that is USED in distribution clause cannot be used for UPDATES. Remember, up to four columns can be used for distribution of data on SPU. From practical sense, updating distribution columns RESULT in redistribution of data; the single most PERFORMANCE hit when large table is involved. This restriction makes sense. No, the column that is used in distribution clause cannot be used for updates. Remember, up to four columns can be used for distribution of data on SPU. From practical sense, updating distribution columns result in redistribution of data; the single most performance hit when large table is involved. This restriction makes sense.

19.

Can We Update All Columns In A Netezza Table.?

Answer»

No, the column that is USED in distribution clause cannot be used for UPDATES. Remember, up to four columns can be used for distribution of data on SPU. From practical sense, updating distribution columns RESULT in redistribution of data; the single most PERFORMANCE hit when large table is involved. This restriction makes sense.

No, the column that is used in distribution clause cannot be used for updates. Remember, up to four columns can be used for distribution of data on SPU. From practical sense, updating distribution columns result in redistribution of data; the single most performance hit when large table is involved. This restriction makes sense.

Discussion

20.	What Data Type Works Best For Zone Maps?
Answer» ZONE MAPS work BEST for integer data TYPES. Zone maps work best for integer data types.

20.

What Data Type Works Best For Zone Maps?

Answer»

ZONE MAPS work BEST for integer data TYPES.

Zone maps work best for integer data types.

Discussion

21.	How Netezza Updates A Row In A Table?
Answer» NETEZZA logically DELETES the ORIGINAL ROW by flagging the deletexid column with the current transaction id and inserts a new row with the updated VALUES. Netezza logically deletes the original row by flagging the deletexid column with the current transaction id and inserts a new row with the updated values.

21.

How Netezza Updates A Row In A Table?

Answer»

NETEZZA logically DELETES the ORIGINAL ROW by flagging the deletexid column with the current transaction id and inserts a new row with the updated VALUES.

Netezza logically deletes the original row by flagging the deletexid column with the current transaction id and inserts a new row with the updated values.

Discussion

22.	What Are The Different Ways To Load Data From A File Into A Table?
Answer» USE nzload to load the data from a file into a table CREATE an EXTERNAL table and then load the original table USING the external table. Use nzload to load the data from a file into a table Create an external table and then load the original table using the external table.

22.

What Are The Different Ways To Load Data From A File Into A Table?

Answer»

USE nzload to load the data from a file into a table

CREATE an EXTERNAL table and then load the original table USING the external table.

Use nzload to load the data from a file into a table

Create an external table and then load the original table using the external table.

Discussion

23.	What Are The Ways To Load A Data From A Table Into A File?
Answer» CREATE an EXTERNAL table. USE nzsql UTILITY with -o OPTION. Create an external table. Use nzsql utility with -o option.

23.

What Are The Ways To Load A Data From A Table Into A File?

Answer»

CREATE an EXTERNAL table.

USE nzsql UTILITY with -o OPTION.

Create an external table.

Use nzsql utility with -o option.

Discussion

24.	What Is Nzload?
Answer» NZLOAD UTILITY is USED load data from a file into a table. It is used to load bulk data quickly and simultaneously rejects ERRONEOUS content. Nzload utility is used load data from a file into a table. It is used to load bulk data quickly and simultaneously rejects erroneous content.

24.

What Is Nzload?

Answer»

NZLOAD UTILITY is USED load data from a file into a table. It is used to load bulk data quickly and simultaneously rejects ERRONEOUS content.

Nzload utility is used load data from a file into a table. It is used to load bulk data quickly and simultaneously rejects erroneous content.

Discussion

25.	How Do You Remove Logically Deleted Records?
Answer» WHENEVER you delete a ROW in a table, it is not physically deleted. It is logically deleted by FLAGGING the deletexid field in the table. NZRECLAIM utility is used to REMOVE the logically deleted RECORDS. Whenever you delete a row in a table, it is not physically deleted. It is logically deleted by flagging the deletexid field in the table. NZRECLAIM utility is used to remove the logically deleted records.

25.

How Do You Remove Logically Deleted Records?

Answer»

WHENEVER you delete a ROW in a table, it is not physically deleted. It is logically deleted by FLAGGING the deletexid field in the table. NZRECLAIM utility is used to REMOVE the logically deleted RECORDS.

Whenever you delete a row in a table, it is not physically deleted. It is logically deleted by flagging the deletexid field in the table. NZRECLAIM utility is used to remove the logically deleted records.

Discussion

26.	When Netezza Redistributes A Table And When It Broadcasts A Table?
Answer» Whenever it is not POSSIBLE to do a COLLOCATED JOIN, NETEZZA EITHER redistributes the tables or broadcasts the table. When the table is a small one, then netezza broadcasts the table. Otherwise netezza redistributes the table. Whenever it is not possible to do a collocated join, netezza either redistributes the tables or broadcasts the table. When the table is a small one, then netezza broadcasts the table. Otherwise netezza redistributes the table.

26.

When Netezza Redistributes A Table And When It Broadcasts A Table?

Answer»

Whenever it is not POSSIBLE to do a COLLOCATED JOIN, NETEZZA EITHER redistributes the tables or broadcasts the table. When the table is a small one, then netezza broadcasts the table. Otherwise netezza redistributes the table.

Whenever it is not possible to do a collocated join, netezza either redistributes the tables or broadcasts the table. When the table is a small one, then netezza broadcasts the table. Otherwise netezza redistributes the table.

Discussion

27.	What Is Collocated Join?
Answer» When you join tables which are distributed on the same key and USED these key columns in the join condition, then each SPU in netezza WORKS 100% INDEPENDENT of the other, as the required data is available in itself. This type of joins is CALLED collocated joins. When you join tables which are distributed on the same key and used these key columns in the join condition, then each SPU in netezza works 100% independent of the other, as the required data is available in itself. This type of joins is called collocated joins.

27.

What Is Collocated Join?

Answer»

When you join tables which are distributed on the same key and USED these key columns in the join condition, then each SPU in netezza WORKS 100% INDEPENDENT of the other, as the required data is available in itself. This type of joins is CALLED collocated joins.

When you join tables which are distributed on the same key and used these key columns in the join condition, then each SPU in netezza works 100% independent of the other, as the required data is available in itself. This type of joins is called collocated joins.

Discussion

28.	How Do You Check The Rows In A Table Are Equally Distributed In All Spu’s Or Not?
Answer» To CHECK the distribution of rows run the FOLLOWING QUERY SELECT datasliceid, COUNT(*)FROM &LT;table name> GROUP BY datasliceid

28.

How Do You Check The Rows In A Table Are Equally Distributed In All Spu’s Or Not?

Answer»

To CHECK the distribution of rows run the FOLLOWING QUERY
SELECT datasliceid, COUNT(*)FROM &LT;table name> GROUP BY datasliceid

Discussion

29.	If You Did Not Specify Any Distribution Clause, How The Create Table As (ctas) Will Distribute The Rows?
Answer» CTAS will GET DISTRIBUTION from the ORIGINAL table. CTAS will get distribution from the original table.

29.

If You Did Not Specify Any Distribution Clause, How The Create Table As (ctas) Will Distribute The Rows?

Answer»

CTAS will GET DISTRIBUTION from the ORIGINAL table.

CTAS will get distribution from the original table.

Discussion

30.	How Do You Redistribute A Table?
Answer» USE Create Table As (CTAS) to redistribute the data in a table. While creating the NEW table specify the DISTRIBUTE on CLAUSE to distribute the data on the new COLUMNS. Use Create Table As (CTAS) to redistribute the data in a table. While creating the new table specify the distribute on clause to distribute the data on the new columns.

30.

How Do You Redistribute A Table?

Answer»

USE Create Table As (CTAS) to redistribute the data in a table. While creating the NEW table specify the DISTRIBUTE on CLAUSE to distribute the data on the new COLUMNS.

Use Create Table As (CTAS) to redistribute the data in a table. While creating the new table specify the distribute on clause to distribute the data on the new columns.

Discussion

31.	What Data Types Are Most Suited For The Columns Specified In Distribution Clause?
Answer» Integer Integer

31.

What Data Types Are Most Suited For The Columns Specified In Distribution Clause?

Answer»

Integer

Discussion

32.	Can You Update The Columns Used In Distribution Clause?
Answer» No, the COLUMN that is USED in DISTRIBUTION CLAUSE cannot be used for UPDATES. No, the column that is used in distribution clause cannot be used for updates.

32.

Can You Update The Columns Used In Distribution Clause?

Answer»

No, the COLUMN that is USED in DISTRIBUTION CLAUSE cannot be used for UPDATES.

No, the column that is used in distribution clause cannot be used for updates.

Discussion

33.	If You Did Not Specify Any Distribute On Clause While Creating A Table, What Distribution Netezza Uses?
Answer» NETEZZA DISTRIBUTES the DATA on the first column and it USES Hash partitioning Netezza distributes the data on the first column and it uses Hash partitioning

33.

If You Did Not Specify Any Distribute On Clause While Creating A Table, What Distribution Netezza Uses?

Answer»

NETEZZA DISTRIBUTES the DATA on the first column and it USES Hash partitioning

Netezza distributes the data on the first column and it uses Hash partitioning

Discussion

34.	Up To How Many Columns You Can Specify In Distribute On Clause?
Answer» You can SPECIFY up to FOUR COLUMNS in the DISTRIBUTION CLAUSE. You can specify up to four columns in the distribution clause.

34.

Up To How Many Columns You Can Specify In Distribute On Clause?

Answer»

You can SPECIFY up to FOUR COLUMNS in the DISTRIBUTION CLAUSE.

You can specify up to four columns in the distribution clause.

Discussion

35.	What Are The Partitioning Methods Available In Netezza?
Answer» There are two partitioning METHODS AVAILABLE in netezza: RANDOM partitioning: DISTRIBUTES the DATA randomly. Hash Partitioning: Distributes the data on the specified columns. There are two partitioning methods available in netezza:

35.

What Are The Partitioning Methods Available In Netezza?

Answer»

There are two partitioning METHODS AVAILABLE in netezza:

RANDOM partitioning: DISTRIBUTES the DATA randomly.
Hash Partitioning: Distributes the data on the specified columns.

There are two partitioning methods available in netezza:

Discussion

36.	What Are The Best Practices Of Creating Materialized Views?
Answer» Create materialized views with few columns which are frequently queried. Specify ORDER by clause on the most RESTRICTIVE columns (columns USED in where clause). Periodically or manually REFRESH the materialized views.

36.

What Are The Best Practices Of Creating Materialized Views?

Answer»

Create materialized views with few columns which are frequently queried.
Specify ORDER by clause on the most RESTRICTIVE columns (columns USED in where clause).
Periodically or manually REFRESH the materialized views.

Discussion

37.	What Are The Limitations Of Materialized Views?
Answer» You cannot INSERT, update, delete or truncate a MATERIALIZED view. Any CHANGES on the base tables will REFLECT into materialized views. You can specify only one base table in the from clause. Base table can’t be a external table, system table or a temporary table. You cannot use a where clause in the materialized view. Expressions are not allowed as columns.

37.

What Are The Limitations Of Materialized Views?

Answer»

You cannot INSERT, update, delete or truncate a MATERIALIZED view. Any CHANGES on the base tables will REFLECT into materialized views.
You can specify only one base table in the from clause.
Base table can’t be a external table, system table or a temporary table.
You cannot use a where clause in the materialized view.
Expressions are not allowed as columns.

Discussion

38.	What Is The Distribution Of Materialized Views?
Answer» A MATERIALIZED VIEW has the same DISTRIBUTION KEY as the BASE table. A materialized view has the same distribution key as the base table.

38.

What Is The Distribution Of Materialized Views?

Answer»

A MATERIALIZED VIEW has the same DISTRIBUTION KEY as the BASE table.

A materialized view has the same distribution key as the base table.

Discussion

39.	What Is The Use Of Creating Materialized Views?
Answer» A materialized view reduces the width (number of columns) of DATA being scanned in the BASE table by creating a thin version (FEWER columns) of the base table that contains a SMALL subset of frequently queried columns. A materialized view reduces the width (number of columns) of data being scanned in the base table by creating a thin version (fewer columns) of the base table that contains a small subset of frequently queried columns.

39.

What Is The Use Of Creating Materialized Views?

Answer»

A materialized view reduces the width (number of columns) of DATA being scanned in the BASE table by creating a thin version (FEWER columns) of the base table that contains a SMALL subset of frequently queried columns.

A materialized view reduces the width (number of columns) of data being scanned in the base table by creating a thin version (fewer columns) of the base table that contains a small subset of frequently queried columns.

Discussion

40.	What Is Generate Statistics And Generate Express Statistics Or What Is The Difference Between Generate Statistics And Generate Express Statistics?
Answer» Generate statistics is USED to gather statistics about each table column’s proportion of duplicate values, MINIMUM values, maximum values, null values, unique values and updates the system catalog tables. The difference between ‘generate statistics’ and ‘generate EXPRESS statistics’ is based on how the column uniqueness is calculates. The ‘generate express statistics’ calculates ESTIMATED dispersion values based on the sampling of rows in the table. ‘Generate express statistics’ uses approximation in generating the stats where as ‘generate statistics’ uses all the rows in the table. Generate statistics is used to gather statistics about each table column’s proportion of duplicate values, minimum values, maximum values, null values, unique values and updates the system catalog tables. The difference between ‘generate statistics’ and ‘generate express statistics’ is based on how the column uniqueness is calculates. The ‘generate express statistics’ calculates estimated dispersion values based on the sampling of rows in the table. ‘Generate express statistics’ uses approximation in generating the stats where as ‘generate statistics’ uses all the rows in the table.

40.

What Is Generate Statistics And Generate Express Statistics Or What Is The Difference Between Generate Statistics And Generate Express Statistics?

Answer»

Generate statistics is USED to gather statistics about each table column’s proportion of duplicate values, MINIMUM values, maximum values, null values, unique values and updates the system catalog tables.

The difference between ‘generate statistics’ and ‘generate EXPRESS statistics’ is based on how the column uniqueness is calculates. The ‘generate express statistics’ calculates ESTIMATED dispersion values based on the sampling of rows in the table. ‘Generate express statistics’ uses approximation in generating the stats where as ‘generate statistics’ uses all the rows in the table.

Generate statistics is used to gather statistics about each table column’s proportion of duplicate values, minimum values, maximum values, null values, unique values and updates the system catalog tables.

The difference between ‘generate statistics’ and ‘generate express statistics’ is based on how the column uniqueness is calculates. The ‘generate express statistics’ calculates estimated dispersion values based on the sampling of rows in the table. ‘Generate express statistics’ uses approximation in generating the stats where as ‘generate statistics’ uses all the rows in the table.

Discussion

41.	How The Zonemaps Are Created And Updated?
Answer» ZONEMAPS are created and REFRESHED for every SPU when you GENERATE statistics, Nzload operation, INSERT, update operations, Nzreclaim operation. Zonemaps are created and refreshed for every SPU when you Generate statistics, Nzload operation, Insert, update operations, Nzreclaim operation.

41.

How The Zonemaps Are Created And Updated?

Answer»

ZONEMAPS are created and REFRESHED for every SPU when you GENERATE statistics, Nzload operation, INSERT, update operations, Nzreclaim operation.

Zonemaps are created and refreshed for every SPU when you Generate statistics, Nzload operation, Insert, update operations, Nzreclaim operation.

Discussion

42.	What Are Zonemaps?
Answer» An extent is the smallest unit of disk allocation on a SPU. Zonemaps is internal mapping structures to the extents that take ADVANTAGE of the internal ORDERING of data to eliminate extents that do not need to be SCANNED. Zonemaps TRANSPARENTLY avoid scanning of unreferenced rows. Zonemaps are CREATED for every column in the table and contain the minimum and maximum values for every extent. An extent is the smallest unit of disk allocation on a SPU. Zonemaps is internal mapping structures to the extents that take advantage of the internal ordering of data to eliminate extents that do not need to be scanned. Zonemaps transparently avoid scanning of unreferenced rows. Zonemaps are created for every column in the table and contain the minimum and maximum values for every extent.

42.

What Are Zonemaps?

Answer»

An extent is the smallest unit of disk allocation on a SPU. Zonemaps is internal mapping structures to the extents that take ADVANTAGE of the internal ORDERING of data to eliminate extents that do not need to be SCANNED. Zonemaps TRANSPARENTLY avoid scanning of unreferenced rows. Zonemaps are CREATED for every column in the table and contain the minimum and maximum values for every extent.

An extent is the smallest unit of disk allocation on a SPU. Zonemaps is internal mapping structures to the extents that take advantage of the internal ordering of data to eliminate extents that do not need to be scanned. Zonemaps transparently avoid scanning of unreferenced rows. Zonemaps are created for every column in the table and contain the minimum and maximum values for every extent.

Discussion

43.	What Is A Snippet?
Answer» A snippet is a small UNIT of work that are CARRIED out in SPU. A snippet is a small unit of work that are carried out in SPU.

43.

What Is A Snippet?

Answer»

A snippet is a small UNIT of work that are CARRIED out in SPU.

A snippet is a small unit of work that are carried out in SPU.

Discussion

44.	How Fpga Can Be Helpful In Improving Query Performance?
Answer» While reading data from the disk, the FIELD PROGRAMMABLE Gate Array (FPGA) on each SPU FILTERS out unwanted data. This process of data elimination removes IO bottlenecks and frees up DOWNSTREAM components such as the CPU, MEMORY and network from processing extra data. While reading data from the disk, the Field Programmable Gate Array (FPGA) on each SPU filters out unwanted data. This process of data elimination removes IO bottlenecks and frees up downstream components such as the CPU, memory and network from processing extra data.

44.

How Fpga Can Be Helpful In Improving Query Performance?

Answer»

While reading data from the disk, the FIELD PROGRAMMABLE Gate Array (FPGA) on each SPU FILTERS out unwanted data. This process of data elimination removes IO bottlenecks and frees up DOWNSTREAM components such as the CPU, MEMORY and network from processing extra data.

While reading data from the disk, the Field Programmable Gate Array (FPGA) on each SPU filters out unwanted data. This process of data elimination removes IO bottlenecks and frees up downstream components such as the CPU, memory and network from processing extra data.

Discussion

45.	How The Not Null Specification On A Column Improves The Netezza Performance?
Answer» Specifying Not NULL on each COLUMN in table RESULTS in BETTER PERFORMANCE. Netezza tracks the NULL values at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header. Specifying Not Null on each column in table results in better performance. Netezza tracks the NULL values at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header.

45.

How The Not Null Specification On A Column Improves The Netezza Performance?

Answer»

Specifying Not NULL on each COLUMN in table RESULTS in BETTER PERFORMANCE. Netezza tracks the NULL values at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header.

Specifying Not Null on each column in table results in better performance. Netezza tracks the NULL values at rowheader level. Having NULL values results in storing references to NULL values in header. If all columns are NOT NULL, then there is no record header.

Discussion

46.	Can You Insert Duplicate Rows In Netezza Table?
Answer» Yes. As there are no PRIMARY KEY constraints in netezza you can insert duplicate ROWS. Yes. As there are no primary key constraints in netezza you can insert duplicate rows.

46.

Can You Insert Duplicate Rows In Netezza Table?

Answer»

Yes. As there are no PRIMARY KEY constraints in netezza you can insert duplicate ROWS.

Yes. As there are no primary key constraints in netezza you can insert duplicate rows.

Discussion

47.	What Are The Constraints On A Table Are Enforced?
Answer» The only CONSTRAINT NETEZZA supports is Not NULL. There are no PRIMARY key and foreign key constraints in netezza. The only constraint netezza supports is Not null. There are no primary key and foreign key constraints in netezza.

47.

What Are The Constraints On A Table Are Enforced?

Answer»

The only CONSTRAINT NETEZZA supports is Not NULL. There are no PRIMARY key and foreign key constraints in netezza.

The only constraint netezza supports is Not null. There are no primary key and foreign key constraints in netezza.

Discussion

48.	What Are The Different States Of Netezza?
Answer» Online: Normal or USUAL state. Stopped: Netezza will shut down after completing CURRENT queries, no new queries allowed. Offline: Waits for completion of current queries, new queries in queue receives error. Paused: Same as above, but no error DISPLAYED. TYPICALLY caused during Netezza boot up or STARTUP. Down: Just plain down, could be due to Netezza server problem or user initiated.

48.

What Are The Different States Of Netezza?

Answer»

Online: Normal or USUAL state.
Stopped: Netezza will shut down after completing CURRENT queries, no new queries allowed.
Offline: Waits for completion of current queries, new queries in queue receives error.
Paused: Same as above, but no error DISPLAYED. TYPICALLY caused during Netezza boot up or STARTUP.
Down: Just plain down, could be due to Netezza server problem or user initiated.

Discussion

49.	What Are The Environment Variables That Are Required To Connect To Netezza?
Answer» The ENVIRONMENT VARIABLES REQUIRED are: NZ_HOST, NZ_DATABASE, NZ_USER, NZ_PASSWORD The environment variables required are: NZ_HOST, NZ_DATABASE, NZ_USER, NZ_PASSWORD

49.

What Are The Environment Variables That Are Required To Connect To Netezza?

Answer»

The ENVIRONMENT VARIABLES REQUIRED are: NZ_HOST, NZ_DATABASE, NZ_USER, NZ_PASSWORD

The environment variables required are: NZ_HOST, NZ_DATABASE, NZ_USER, NZ_PASSWORD

Discussion

50.	What Are The Data Warehouse Appliances You Know?
Answer» NETEZZA GREENPLUM Vertica TERADATA Dataupia ORACLE EXADATA

50.

What Are The Data Warehouse Appliances You Know?

Answer»

NETEZZA
GREENPLUM
Vertica
TERADATA
Dataupia
ORACLE EXADATA

Discussion

Explore topic-wise InterviewSolutions in Current Affairs.

What Are 4 Environment Variables That Are Required. What Are Different States On Netezza?

Explain Fpga And How Is It Useful For Query Performance?

What Is A Zone Map?

How Do You Deal With Historical Data, With Respect To Zone Maps?

Does Everything Gets Cached In Netezza (or Any Other Data Appliance)?

How Is Load Achieved In Netezza And Why Is That Quick / Fast?

When Are We Likely To Receive Incorrect (aggregate) Results?

Does Netezza Support Concurrent Update Of The Same Record?

Can A Group Become A Member Of Another Group In Netezza User Administration. Can We Use Same Group Name For Databases?

What Happens To Records That Are Loaded During Nzload Process, But Were Not Committed?

How Can We Give A Global Permission To User Joe So That He Can Create Table In Any Database.?

What Permission Will You Give To Connect To A Database?

Do We Need To Drop All Tables And Objects In That Database, Before Dropping A Database?

What Constraints On A Table Are Enforced?

Why Not Null Specification Is Better In Netezza?

Create Table As (ctas), Does It Distribute Data Randomly Or Based On Table On Which It Received Data?

Why Do You Prefer Truncate Instead Of Drop Table Command?

When No Distribution Clause Is Used While Creating A Table, What Distribution Is Used By Netezza?

Can We Update All Columns In A Netezza Table.?

What Data Type Works Best For Zone Maps?

How Netezza Updates A Row In A Table?

What Are The Different Ways To Load Data From A File Into A Table?

What Are The Ways To Load A Data From A Table Into A File?

What Is Nzload?

How Do You Remove Logically Deleted Records?

When Netezza Redistributes A Table And When It Broadcasts A Table?

What Is Collocated Join?

How Do You Check The Rows In A Table Are Equally Distributed In All Spu’s Or Not?

If You Did Not Specify Any Distribution Clause, How The Create Table As (ctas) Will Distribute The Rows?

How Do You Redistribute A Table?

What Data Types Are Most Suited For The Columns Specified In Distribution Clause?

Can You Update The Columns Used In Distribution Clause?

If You Did Not Specify Any Distribute On Clause While Creating A Table, What Distribution Netezza Uses?

Up To How Many Columns You Can Specify In Distribute On Clause?

What Are The Partitioning Methods Available In Netezza?

What Are The Best Practices Of Creating Materialized Views?

What Are The Limitations Of Materialized Views?

What Is The Distribution Of Materialized Views?

What Is The Use Of Creating Materialized Views?

What Is Generate Statistics And Generate Express Statistics Or What Is The Difference Between Generate Statistics And Generate Express Statistics?

How The Zonemaps Are Created And Updated?

What Are Zonemaps?

What Is A Snippet?

How Fpga Can Be Helpful In Improving Query Performance?

How The Not Null Specification On A Column Improves The Netezza Performance?

Can You Insert Duplicate Rows In Netezza Table?

What Are The Constraints On A Table Are Enforced?

What Are The Different States Of Netezza?

What Are The Environment Variables That Are Required To Connect To Netezza?

What Are The Data Warehouse Appliances You Know?