Explore topic-wise InterviewSolutions in .

This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

51.

What Are Normalization, First Normal Form, Second Normal Form And Third Normal Form?

Answer»

Normalization is the process of efficiently organizing data in a DATABASE. The two goals of the normalization process are eliminate redundant data (storing the same data in more than one table) and ensure data dependencies make SENSE (only storing related data in the table).

FIRST normalization form:

  • Eliminate duplicate columns from the same table.
  • Create separate tables for each group of related data and identify each row with a unique column or set of columns (primary key).

Second normal form:

  • Removes sub set of data that apply to multiple rows of table and place them in separate table.
  • Create relationships between these new tables and their predecessors through the use of FOREIGN keys.

THIRD normal form:

  • Remove column that are not dependent upon the primary key.

Normalization is the process of efficiently organizing data in a database. The two goals of the normalization process are eliminate redundant data (storing the same data in more than one table) and ensure data dependencies make sense (only storing related data in the table).

First normalization form:

Second normal form:

Third normal form:

52.

What Type Of Indexing Mechanism Do We Need To Use For A Typical Data Warehouse?

Answer»

Primary INDEX MECHANISM is the ideal type of index for DATA warehouse.

Primary Index mechanism is the ideal type of index for data warehouse.

53.

Why Are Oltp Database Designs Not Generally A Good Idea For A Data Warehouse?

Answer»

OLTP designs are for real time DATA and they are not normalized and preaggregated. They are not GOOD for decision SUPPORT SYSTEMS.

OLTP designs are for real time data and they are not normalized and preaggregated. They are not good for decision support systems.

54.

Difference Between Star And Snowflake Schemas?

Answer»

STAR SCHEMA is De-NORMALIZED and SNOWFLAKE schema is normalized.

Star schema is De-normalized and snowflake schema is normalized.

55.

What Is Spool Space And When Running A Job If It Reaches The Maximum Spool Space How You Solve The Problem?

Answer»

Spool space is used to hold intermediate rows during processing, and to hold the rows in the answer SET of a transaction. Spool space REACHES maximum when the query is not properly optimized. Use APPROPRIATE conditions in WHERE clause of the query to LIMIT the answer set.

Spool space is used to hold intermediate rows during processing, and to hold the rows in the answer set of a transaction. Spool space reaches maximum when the query is not properly optimized. Use appropriate conditions in WHERE clause of the query to limit the answer set.

56.

When Tpump Is Used Instead Of Multiload?

Answer»

TPump provides an alternative to MultiLoad for the low VOLUME batch maintenance of LARGE databases under control of a Teradata system. Instead of updating Teradata databases overnight, or in batches throughout the day, TPump updates information in real TIME, acquiring every BIT of data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the TRADITIONAL batch updates. Continuous updates results in more accurate, timely data. And, unlike most load utilities, TPump uses row hash locks rather than table level locks. This allows you to run queries while TPump is running. This also means that TPump can be stopped instantaneously. As a result, businesses can make better decisions that are based on the most current data.

TPump provides an alternative to MultiLoad for the low volume batch maintenance of large databases under control of a Teradata system. Instead of updating Teradata databases overnight, or in batches throughout the day, TPump updates information in real time, acquiring every bit of data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the traditional batch updates. Continuous updates results in more accurate, timely data. And, unlike most load utilities, TPump uses row hash locks rather than table level locks. This allows you to run queries while TPump is running. This also means that TPump can be stopped instantaneously. As a result, businesses can make better decisions that are based on the most current data.

57.

What Is Inner Join And Outer Join?

Answer»

An INNER join gets data from both tables where the specified data exists in both tables. An OUTER join gets data from the SOURCE table at all times, and RETURNS data from the outer JOINED table ONLY if it matches the criteria.

An inner join gets data from both tables where the specified data exists in both tables. An outer join gets data from the source table at all times, and returns data from the outer joined table ONLY if it matches the criteria.

58.

Can We Collect Statistics On Table Level?

Answer»

YES we can COLLECT STATISTICS on table LEVEL. The syntax is COLLECT STAT ON TAB_A;

Yes we can collect statistics on table level. The syntax is COLLECT STAT ON TAB_A;

59.

Can We Collect Statistics On Multiple Columns?

Answer»

YES we can COLLECT STATISTICS on MULTIPLE COLUMNS.

Yes we can collect statistics on multiple columns.

60.

What Is Collect Statistics?

Answer»

Collects demographic DATA for one or more columns of a table, hash index, or join index, computes a statistical profile of the COLLECTED data, and STORES the synopsis in the data dictionary. The OPTIMIZER uses the synopsis data when it generates its table access and join PLANS.

Collects demographic data for one or more columns of a table, hash index, or join index, computes a statistical profile of the collected data, and stores the synopsis in the data dictionary. The Optimizer uses the synopsis data when it generates its table access and join plans.

61.

What Are The Things To Be Considered While Creating Secondary Index?

Answer»

Creating a SECONDARY index causes TERADATA to build a sub-table to contain its index rows, thus adding another set of rows that requires updating each time a table row is inserted, deleted, or UPDATED. Secondary index sub-tables are also duplicated WHENEVER a table is DEFINED with FALLBACK, so the maintenance overhead is effectively doubled.

Creating a secondary index causes Teradata to build a sub-table to contain its index rows, thus adding another set of rows that requires updating each time a table row is inserted, deleted, or updated. Secondary index sub-tables are also duplicated whenever a table is defined with FALLBACK, so the maintenance overhead is effectively doubled.

62.

What Is Primary Index And Secondary Index?

Answer»

PRIMARY index is the mechanism for assigning a data row to an AMP and a LOCATION on the AMP’s DISKS. Indexes also used to ACCESS rows from a table without having to search the ENTIRE table. Secondary indexes enhance set selection by specifying access paths less frequently used than the primary index path. Secondary indexes are also used to facilitate aggregate operations.

If a secondary index covers a query, then the Optimizer determines that it would be less costly to accesses its rows directly rather than using it to access the base table rows it points to. Sometimes multiple secondary indexes with low individual selectivity can be overlapped and bit mapped to provide enhanced.

Primary index is the mechanism for assigning a data row to an AMP and a location on the AMP’s disks. Indexes also used to access rows from a table without having to search the entire table. Secondary indexes enhance set selection by specifying access paths less frequently used than the primary index path. Secondary indexes are also used to facilitate aggregate operations.

If a secondary index covers a query, then the Optimizer determines that it would be less costly to accesses its rows directly rather than using it to access the base table rows it points to. Sometimes multiple secondary indexes with low individual selectivity can be overlapped and bit mapped to provide enhanced.

63.

What Is The Purpose Of Indexes?

Answer»

An index is a MECHANISM that can be used by the SQL query optimizer to make table access more performant. Indexes enhance DATA access by providing a moreor- less direct path to stored data and AVOIDING the necessity to perform full table scans to LOCATE the small number of ROWS you typically want to retrieve or update.

An index is a mechanism that can be used by the SQL query optimizer to make table access more performant. Indexes enhance data access by providing a moreor- less direct path to stored data and avoiding the necessity to perform full table scans to locate the small number of rows you typically want to retrieve or update.

64.

What Is A Clique?

Answer»

Clique is a group of DISK ARRAYS PHYSICALLY cabled to a group of NODES.

Clique is a group of disk arrays physically cabled to a group of nodes.

65.

What Is Node? How Many Nodes And Amps Used In Your Previous Project?

Answer»

Node is a database RUNNING in a server. We USED 318 nodes and each node has 2 to 4 AMPS.

Node is a database running in a server. We used 318 nodes and each node has 2 to 4 AMPS.

66.

What's The Difference Between Timestamp (0) And Timestamp (6)?

Answer»

TIMESTAMP (0) is CHAR (19) and TIMESTAMP (6) is CHAR (26)
EVERYTHING is same EXCEPT that TIMESTAMP (6) has MICROSECONDS too.

TIMESTAMP (0) is CHAR (19) and TIMESTAMP (6) is CHAR (26)
Everything is same except that TIMESTAMP (6) has microseconds too.

67.

Which Is More Efficient Group By Or Distinct To Find Duplicates?

Answer»

With more DUPLICATES GROUP BY is more EFFICIENT, if only a few duplicates EXIST DISTINCT is more efficient.

With more duplicates GROUP BY is more efficient, if only a few duplicates exist DISTINCT is more efficient.

68.

How Do You See A Ddl For An Existing Table?

Answer»

By USING SHOW table COMMAND.

By using show table command.

69.

How Many Tables Can You Join In V2r5?

Answer»

Up to 64 TABLES.

Up to 64 tables.

70.

How Do You Verify A Complicated Sql?

Answer»

I USE explain statement to CHECK if the QUERY is doing what I wanted it to do.

I use explain statement to check if the query is doing what I wanted it to do.

71.

My Table Got Locked During Mload Due To A Failed Job. What Do I Do To Perform Other Operations On It?

Answer»

Using RELEASE MLOAD. It removes access LOCKS from the target tables in Teradata. It MUST be ENTERED from BTEQ and not from MultiLoad. To proceed, you can do RELEASE MLOAD <table name>.

Using RELEASE MLOAD. It removes access locks from the target tables in Teradata. It must be entered from BTEQ and not from MultiLoad. To proceed, you can do RELEASE MLOAD <table name>.

72.

While Creating Table My Dba Has Fallback Or No Fallback In His Ddl. What Is That?

Answer»

FALLBACK requests that a SECOND COPY of each row inserted into a table be stored on another AMP in the same CLUSTER. This is done when AMP goes down or disk FAILS.

FALLBACK requests that a second copy of each row inserted into a table be stored on another AMP in the same cluster. This is done when AMP goes down or disk fails.

73.

What's The Syntax Of Sub String?

Answer»

SUBSTRING (string_expression, N1 [N2]).

SUBSTRING (string_expression, n1 [n2]).

74.

Is It Necessary To Add? Quit Statement After A Bteq Query When I Am Calling It In A Unix Environment?

Answer»

Not NECESSARY but it is GOOD to add a QUIT statement after a QUERY.

Not necessary but it is good to add a QUIT statement after a query.

75.

What Is The Use Of Having Index's On Table?

Answer»

For FASTER RECORD SEARCH.

For faster record search.

76.

Can You Fastexport A Field, Which Is Primary Key By Putting Equality On That Key?

Answer»

No.

No.

77.

Is Multi Insert Ansi Standard?

Answer»

No.

No.

78.

What Is Multi Insert?

Answer»

Inserting data records into the table using multiple insert statements. Putting a SEMI COLON in front of the KEY word INSERT in the next statement rather than terminating the first statement with a semi colon ACHIEVES it.

Insert into Sales “select * from CUSTOMER”; Insert into Loan “select * from customer”;

Inserting data records into the table using multiple insert statements. Putting a semi colon in front of the key word INSERT in the next statement rather than terminating the first statement with a semi colon achieves it.

79.

Difference Between Inner Join And Outer Join?

Answer»

An inner join gets data from both tables where the specified data exists in both tables. An outer join gets data from the source table at all TIMES, and RETURNS data from the outer JOINED table ONLY if it matches the CRITERIA.

An inner join gets data from both tables where the specified data exists in both tables. An outer join gets data from the source table at all times, and returns data from the outer joined table ONLY if it matches the criteria.

80.

Which Is Faster Fastload Or Multiload?

Answer»

FastLoad.

FastLoad.

81.

What Is The Difference Between Fastload And Multiload?

Answer»

FASTLOAD uses multiple sessions to quickly load large AMOUNT of data on empty table. MULTILOAD is used for high-volume maintenance on tables and VIEWS. It works with non-empty tables also. Maximum 5 tables can be used in MultiLoad.

FastLoad uses multiple sessions to quickly load large amount of data on empty table. MultiLoad is used for high-volume maintenance on tables and views. It works with non-empty tables also. Maximum 5 tables can be used in MultiLoad.