50 + Interview Questions in Database Similarity Searching in Bioinformatics Page 1 InterviewSolution

1.	The rigorous dynamic programming method is normally not used for database searching because it is slow and computationally expensive.(a) True(b) FalseI have been asked this question by my college professor while I was bunking the class.This interesting question is from Database Searching with the Smith topic in division Database Similarity Searching of Bioinformatics
Answer» The correct option is (a) True Explanation: HEURISTICS such as BLAST and FASTA are developed for faster speed. However, the heuristic METHODS are limited in SENSITIVITY and are not GUARANTEED to find the optimal alignment. They often fail to find alignment for distantly related sequences.

Discussion

2.	In heuristic methods, speed doesn’t vary with the size of database.(a) True(b) FalseI have been asked this question during an interview.This key question is from Heuristic Database Searching topic in section Database Similarity Searching of Bioinformatics
Answer» The correct option is (b) False For explanation I WOULD say: The speed is the TIME it takes to get results from database searches. Depending on the size of the database, speed sometimes can be a PRIMARY concern in the SEARCH methods.

Discussion

3.	Sensitivity refers to the ability to find as many correct hits as possible.(a) True(b) FalseThe question was asked by my school principal while I was bunking the class.My question is based upon Heuristic Database Searching topic in division Database Similarity Searching of Bioinformatics
Answer» The correct CHOICE is (a) True For explanation I would say: Among the unique requirements for implementing algorithms for sequence database Searching, the FIRST criterion is sensitivity, which REFERS to the ability to FIND as many correct hits as possible. It is MEASURED by the extent of inclusion of correctly identified sequence members of the same family. These correct hits are considered ‘true positives’ in the database searching exercise.

Discussion

4.	The positional difference for each word between the two sequences is obtained by ________ the position of the ______ sequence from that of the _______ sequence and is expressed as the offset.(a) subtracting, second, first(b) adding, second, first(c) adding, first, second(d) subtracting, first, secondThis question was posed to me during an interview for a job.This key question is from FASTA topic in section Database Similarity Searching of Bioinformatics
Answer» Right option is (d) subtracting, first, second Easiest explanation: The positional difference for each word between the two SEQUENCES is OBTAINED by subtracting the POSITION of the first sequence from that of the second sequence and is EXPRESSED as the offset. The ktups that have the same offset values are then linked to reveal a contiguous IDENTICAL sequence region that corresponds to a stretch of diagonal in a two-dimensional matrix.

Discussion

5.	Score can be negative in Smith–Waterman algorithm.(a) True(b) FalseThe question was asked during an internship interview.I'd like to ask this question from Database Searching with the Smith in portion Database Similarity Searching of Bioinformatics
Answer» The CORRECT answer is (B) False To explain: Negative score is SET to 0. In Needleman–Wunsch ALGORITHM, the Score can be negative. ALSO, in Smith–Waterman algorithm, in tracing back step, it begins with the highest score, ends when 0 is encountered.

Discussion

6.	The gapped portion in the diagonals represents matches in FASTA.(a) True(b) FalseThis question was addressed to me in an international level competition.My question comes from Comparison of FASTA and BLAST topic in chapter Database Similarity Searching of Bioinformatics
Answer» Right option is (B) False For explanation I would say: The DIAGONAL’s nature indicates the matching of the SEQUENCES. After all diagonals are found, it tries to JOIN diagonals by adding gaps. Further, it COMPUTES alignments in regions of best diagonals.

Discussion

7.	BLAST often produces several short HSPs rather than a single aligned region.(a) True(b) FalseI have been asked this question by my school principal while I was bunking the class.My query is from Comparison of FASTA and BLAST topic in portion Database Similarity Searching of Bioinformatics
Answer» Correct answer is (a) True Explanation: The results of the word MATCHING and attempts to extend the alignment are segments. They are called as HSPs (High-Scoring Segment PAIRS). BLAST often PRODUCES SEVERAL SHORT HSPs rather than a single aligned region.

Discussion

8.	The final step involves pairwise alignment by extending from the words in both directions while counting the alignment score using the same substitution matrix.(a) True(b) FalseI have been asked this question during an interview for a job.My question is based upon Basic Local Alignment Search Tool (BLAST) topic in portion Database Similarity Searching of Bioinformatics
Answer» RIGHT answer is (a) True Easy explanation: The extension continues until the score of the alignment drops below a threshold due to mismatches (the drop threshold is twenty-two for PROTEINS and twenty for DNA). The resulting contiguous aligned segment pair without GAPS is called high-scoring segment pair. In the ORIGINAL version of BLAST, the highest scored HSPs are presented as the FINAL report. They are also called maximum scoring pairs.

Discussion

9.	Which of the following is not a variant of BLAST?(a) BLASTN(b) BLASTP(c) BLASTX(d) TBLASTNXI have been asked this question in an online interview.My question is taken from Basic Local Alignment Search Tool (BLAST) topic in section Database Similarity Searching of Bioinformatics
Answer» Correct answer is (d) TBLASTNX Explanation: BLAST is a family of programs that includes BLASTN, BLASTP, BLASTX TBLASTN, and TBLASTX. BLASTN queries nucleotide sequences with a nucleotide sequence database. The alignment scoring is BASED on the BLOSUM62 matrix.

Discussion

10.	The initiation of FASTA format has ____ symbol.(a) >(b)
Answer» The correct option is (a) > The best explanation: Its FORMAT is simple as used by almost all programs. Header line has > at the beginning. ALSO no specific requirements are there for line length, CHARACTERS, etc.

Discussion

11.	In FASTA, For a Z-score > 15, the match can be considered extremely ______ with _____ of a homologous relationship.(a) insignificant, uncertainty(b) significant, uncertainty(c) significant, certainty(d) insignificant, certaintyI got this question during an internship interview.The above asked question is from Comparison of FASTA and BLAST in division Database Similarity Searching of Bioinformatics
Answer» The CORRECT answer is (c) significant, certainty To elaborate: If Z is in the range of 5 to 15, the sequence pair can be DESCRIBED as highly probable homologs. If Z<5, their relationship is described as less CERTAIN.

Discussion

12.	TBLASTX queries protein sequences to anucleotide sequence database with the sequences translated in all six reading frames.(a) True(b) FalseI have been asked this question in an interview.The origin of the question is Basic Local Alignment Search Tool (BLAST) topic in division Database Similarity Searching of Bioinformatics
Answer» Right option is (b) False The explanation: TBLASTN queries protein sequences to anucleotide SEQUENCE database with the sequences translated in all six reading frames. TBLASTX uses nucleotide sequences, which are translated in all six frames, to search against a nucleotide sequence database that has all the sequences translated in six frames. In addition, there is also a bl2seq program that performs local alignment of two user-provided input sequences. The graphical OUTPUT INCLUDES horizontal bars and a diagonal in a two-dimensional diagram SHOWING the OVERALL extent of matching between the two sequences.

Discussion

13.	An increase in sensitivity is associated with _______ in selectivity.(a) no specific change(b) increase(c) decrease(d) exponential increaseI got this question in an interview.This question is from Heuristic Database Searching topic in section Database Similarity Searching of Bioinformatics
Answer» Correct choice is (c) decrease To explain I would say: Ideally, one wants to have the GREATEST sensitivity, selectivity, and speed in database searches. However, satisfying all three requirements is DIFFICULT in reality. What generally happens is that an increase in sensitivity is associated with decrease in selectivity. A very inclusive search tends to include many false POSITIVES. Similarly, an improvement in speed often comes at the cost of lowered sensitivity and selectivity. A compromise between the three criteria often has to be made.

Discussion

14.	Dynamic programming method is the fastest and most practical method.(a) True(b) FalseI had been asked this question at a job interview.The question is from Heuristic Database Searching topic in chapter Database Similarity Searching of Bioinformatics
Answer» The correct choice is (b) False For explanation: DYNAMIC programming method is SLOW and impractical to USE in most cases. Special SEARCH METHODS are needed to speed up the computational process of sequence comparison.

Discussion

15.	The use of low-complexity masking in the BLAST procedure means that it may have higher specificity than FASTA because potential false positives are reduced.(a) True(b) FalseThis question was addressed to me in examination.My question comes from Comparison of FASTA and BLAST in division Database Similarity Searching of Bioinformatics
Answer» Right answer is (a) True Best EXPLANATION: In addition to the given STATEMENT, BLAST SOMETIMES gives MULTIPLE best-scoring ALIGNMENTS from the same sequence. FASTA returns only one final alignment.

Discussion

16.	The main application of pairwise alignment is retrieving biological sequences in databases based on similarity.(a) True(b) FalseThe question was asked in a national level competition.My doubt stems from Heuristic Database Searching in chapter Database Similarity Searching of Bioinformatics
Answer» Correct option is (a) True Easiest EXPLANATION: This process involves submission of a query SEQUENCE and PERFORMING a pairwise comparison of the query sequence with all individual sequences in a database. THUS, database similarity searching is pairwise alignment on a large scale. This type of searching is one of the most effective ways to assign putative functions to newly DETERMINED sequences.

Discussion

17.	The specificity refers to the ability to include incorrect hits.(a) True(b) FalseThis question was posed to me in a job interview.My question is based upon Heuristic Database Searching topic in chapter Database Similarity Searching of Bioinformatics
Answer» The correct option is (b) False For EXPLANATION I would say: In heuristic DATABASE searching methods, The SECOND requirement criterion is 1 also called specificity, which refers to the ability to exclude incorrect hits. These incorrect hits are UNRELATED sequences mistakenly identified in database searching and are considered ‘false POSITIVES.’

Discussion

18.	Z-score describes the number of standard deviations from the mean score for the database search.(a) True(b) FalseThis question was posed to me in an interview.The above asked question is from FASTA topic in division Database Similarity Searching of Bioinformatics
Answer» Correct answer is (a) True To explain: Because most of the alignments with the query sequence are with unrelated sequences, the higher the Z-score for a reported match, the further away from the mean of the score distribution, hence, the more significant the match. For a Z-score > 15, the match can be considered extremely significant, with a CERTAINTY of a homologous relationship. If Z is in the range of 5 to 15, the sequence pair can be described as highly probable homologs. If Z<5, their relationship is described as less CERTAIN.

Discussion

19.	Par Align is a web-based server that uses parallel processors to perform exhaustive sequence comparisons using either a parallelized version of the Smith–Waterman algorithm or a heuristic program for further speed gains.(a) True(b) FalseI got this question in semester exam.My question is taken from Database Searching with the Smith topic in division Database Similarity Searching of Bioinformatics
Answer» Correct choice is (a) True Easy explanation: The heuristic subprogram first finds exact ungapped alignments and uses them as anchors for extension into gapped alignments by combining the SCORES of SEVERAL diagonals in the alignment MATRIX. The search speed of ParAlign approaches to that of BLAST, but with higher sensitivity.

Discussion

20.	Currently, there are two major heuristic algorithms for performing database searches: BLAST and FASTA.(a) True(b) FalseI had been asked this question in my homework.The doubt is from Heuristic Database Searching in division Database Similarity Searching of Bioinformatics
Answer» The correct OPTION is (a) True Best EXPLANATION: These methods are not guaranteed to find the optimal alignment or true homologs, but are 50–100 times faster than dynamic programming. The increased computational speed COMES at a moderate expense of sensitivity and specificity of the SEARCH, which is easily tolerated by working molecular biologists. Both programs can provide a REASONABLY good indication of sequence similarity by identifying similar sequence segments.

Discussion

21.	Scan PS is a web-based program that implements a modified version of the Needleman-Wunsch algorithm.(a) True(b) FalseThe question was posed to me by my college professor while I was bunking the class.Question is taken from Database Searching with the Smith in chapter Database Similarity Searching of Bioinformatics
Answer» Right OPTION is (b) False Explanation: ScanPS (Scan Protein Sequence) is a web-based PROGRAM that implements a modified version of the Smith–Waterman algorithm optimized for parallel processing. The MAJOR feature is that the program allows iterative searching similar to PSI-BLAST, which builds profiles from one round of search results and uses them for the second round of database searching. Full dynamic programming is used in each cycle for ADDED sensitivity.

Discussion

22.	Which of the following is not a benefit of BLAST?(a) Handling of gaps(b) Speed(c) More sensitive(d) Statistical rigorThe question was asked during a job interview.This intriguing question comes from Comparison of FASTA and BLAST in division Database Similarity Searching of Bioinformatics
Answer» The CORRECT CHOICE is (a) Handling of gaps For explanation: In addition to this, user friendly UI of BLAST is ALSO one of its benefits. HOWEVER, it does not handle gaps well. In that case gapped BLAST is better.

Discussion

23.	Local sequence alignments are necessary for many cases out of which one is repeated.(a) True(b) FalseThe question was asked during an interview for a job.This interesting question is from Database Searching with the Smith in chapter Database Similarity Searching of Bioinformatics
Answer» The CORRECT CHOICE is (a) True The explanation: It can ALSO be USED for modular organization of genes and proteins (exons, domains, etc.) Also it can be used in cases of sequences diverged so that similarity was retained, or can be detected, just in some sub-regions.

Discussion

24.	FASTA doesn’t use bit scores.(a) True(b) FalseThis question was posed to me in exam.My query is from FASTA in chapter Database Similarity Searching of Bioinformatics
Answer» Correct choice is (b) False To explain I would say: FASTA uses E-values and bit scores as well. Estimation of the TWO PARAMETERS in FASTA is ESSENTIALLY the same as in BLAST. HOWEVER, the FASTA output provides one more statistical parameter, the Z-score.

Discussion

25.	In Smith–Waterman algorithm, in initialization Step, the _________ row and ________ column are subject to gap penalty.(a) first, first(b) first, second(c) second, First(d) first, lastI have been asked this question in semester exam.My query is from Database Searching with the Smith topic in section Database Similarity Searching of Bioinformatics
Answer» The CORRECT answer is (a) first, first The explanation: In Smith–Waterman ALGORITHM, first ROW and first column are set to 0. In the Needleman Wunsch algorithm, First row and first column are subject to gap PENALTY.

Discussion

26.	The web-based FASTA program is offered by the European Bioinformatics Institute.(a) True(b) FalseThis question was posed to me by my college director while I was bunking the class.The question is from FASTA topic in section Database Similarity Searching of Bioinformatics
Answer» The CORRECT option is (a) True The best explanation: Similar to BLAST, FASTA has a number of subprograms. The web-based FASTA program offered by the European Bioinformatics INSTITUTE ALLOWS the use of either DNA or protein sequences as the QUERY to search against a protein database or nucleotide database.

Discussion

27.	A recent improvement in the implementation of BLAST is the ability to provide gapped alignment.(a) True(b) FalseThe question was asked in quiz.My doubt stems from Basic Local Alignment Search Tool (BLAST) in chapter Database Similarity Searching of Bioinformatics
Answer» The correct option is (a) True The explanation: In gapped BLAST, the HIGHEST scored segment is chosen to be extended in both directions using dynamic programming where gaps may be introduced. The EXTENSION continues if the alignment score is above a certain THRESHOLD; otherwise it is terminated. However, the overall score is ALLOWED to drop below the threshold only if it is temporary and rises again to attain above threshold values. Final trimming of terminal regions is needed before producing a report of the final alignment.

Discussion

28.	In sequence alignment by BLAST, each word from query sequence is typically _______ residues for protein sequences and _______ residues for DNA sequences.(a) ten, eleven(b) three, three(c) three, eleven(d) three, tenI got this question in final exam.My question is taken from Basic Local Alignment Search Tool (BLAST) topic in division Database Similarity Searching of Bioinformatics
Answer» The correct option is (c) THREE, eleven For explanation: The FIRST STEP is to create a list of words from the QUERY sequence. Each word is typically three residues for protein sequences and eleven residues for DNA sequences. The list includes every possible word extracted from the query sequence. This step is also CALLED seeding.

Discussion

29.	One of the challenges in SWA is obtaining correct alignments in regions of low similarity between distantly related biological sequences.(a) True(b) FalseThis question was posed to me during an online interview.Question is from Database Searching with the Smith topic in division Database Similarity Searching of Bioinformatics
Answer» Correct option is (a) True Best EXPLANATION: It is because mutations have added too much ‘NOISE’ over evolutionary time to ALLOW for a meaningful comparison of those regions. Local alignment AVOIDS such regions ALTOGETHER and focuses on those with a positive score, i.e. those with an evolutionarily conserved signal of similarity.

Discussion

30.	In SW algorithm, to align two sequences of lengths of m and n _________ time is required.(a) O(mn)(b) O(m^2n)(c) O(m^2n^3)(d) O(mn^2)The question was posed to me by my college professor while I was bunking the class.I would like to ask this question from Database Searching with the Smith in portion Database Similarity Searching of Bioinformatics
Answer» RIGHT option is (b) O(m^2n) Explanation: The Smith–Waterman algorithm is quite demanding of time. HENCE if two SEQUENCES of lengths of m and nhave to be aligned, the REQUIRED time is O(m^2n). It requires O(mn) calculation STEPS.

Discussion

31.	FASTA and BLAST are __________ but __________ for larger datasets.(a) faster, more sensitive(b) faster, less sensitive(c) slower, less sensitive(d) slower, more sensitiveThis question was addressed to me by my college director while I was bunking the class.My question is based upon Database Searching with the Smith topic in division Database Similarity Searching of Bioinformatics
Answer» Correct option is (B) faster, less sensitive Easy explanation: Empirical tests have indeed shown that the exhaustive METHOD PRODUCES superior results over the heuristic methods like BLAST and FASTA. But heuristic methods are better and PRACTICAL when it comes to assess larger datasets with comparatively low sensitivity.

Discussion

32.	The last step is to perform a statistical evaluation of the final alignment as in BLAST, which produces the E-value.(a) True(b) FalseI got this question by my college director while I was bunking the class.This is a very interesting question from FASTA in section Database Similarity Searching of Bioinformatics
Answer» The correct ANSWER is (a) True For explanation I WOULD say: The LAST step of FASTA is similar to that of BLAST. The determination of E-value is the most important part of the whole analysis as it gives the degree of alignment between the SEQUENCES.

Discussion

33.	Which of the following is not correct about BLAST?(a) The BLAST web server has been designed in such away as to simplify the task of program selection(b) The programs are organized based on the type of query sequences(c) The programs are organized based on the type of nucleotide sequences, or nucleotide sequence to be translated(d) BLAST is not based on heuristic searching methodsI got this question in an interview.My question is based upon Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics
Answer» CORRECT choice is (d) BLAST is not based on heuristic searching methods The explanation: BLAST and FASTA are based on heuristic searching methods. In addition, programs for special purposes are grouped separately; for example, bl2seq, immunoglobulin BLAST, and VECSCREEN, a program for REMOVING contaminating vector SEQUENCES.

Discussion

34.	BLASTX uses protein sequences as queries to search against a protein sequence database.(a) True(b) FalseThe question was posed to me in exam.I'd like to ask this question from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics
Answer» CORRECT choice is (B) False Explanation: BLASTP, and not BLASTX, uses PROTEIN SEQUENCES as queries to search against a protein sequence database. BLASTX uses nucleotide sequences as queries and translates them in all six reading frames to produce translated protein sequences, which are used to QUERY a protein sequence database.

Discussion

35.	In sequence alignment by BLAST, the second step is to search a sequence database for the occurrence of these words.(a) True(b) FalseI got this question in my homework.My question is from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics
Answer» Correct choice is (a) True The EXPLANATION: This step is to identify database sequences containing the MATCHING words. The matching of the words is SCORED by a GIVEN substitution matrix. A word is considered a match if it is above a THRESHOLD.

Discussion

36.	Which of the following is incorrect?(a) Smith–Waterman algorithm is the fastest(b) Smith–Waterman algorithm is comparatively slower method(c) To speedup up comparison, heuristic methods are used(d) Heuristic algorithms perform faster searchesThe question was posed to me during an interview.This intriguing question originated from Heuristic Database Searching topic in portion Database Similarity Searching of Bioinformatics
Answer» The correct choice is (a) Smith–Waterman algorithm is the fastest For explanation: Searching a LARGE DATABASE using the dynamic programming methods, such as the Smith–Waterman algorithm, ALTHOUGH accurate and reliable, is too slow and impractical when computational resources are limited. To speed up the comparison, heuristic methods have to be used. The heuristic algorithms perform faster searches because they examine only a fraction of the possible ALIGNMENTS EXAMINED in regular dynamic programming.

Discussion

37.	Which of the following is not a benefit or a factual of FASTA over BLAST?(a) FASTA scans smaller window sizes(b) It gives more sensitive results(c) It gives less sensitive results(d) It gives results with a better coverage rate for homologsThis question was addressed to me in homework.The above asked question is from Comparison of FASTA and BLAST in portion Database Similarity Searching of Bioinformatics
Answer» The correct option is (c) It gives less sensitive results To EXPLAIN I WOULD say: By default, FASTA scans SMALLER window SIZES. Thus, it gives more sensitive results than BLAST, with a better coverage rate for homologs. However, it is usually slower than BLAST.

Discussion

38.	The second step in FASTA is to narrow down the high similarity regions between the two sequences.(a) True(b) FalseThe question was posed to me in an interview.This key question is from FASTA in division Database Similarity Searching of Bioinformatics
Answer» Right option is (a) True To explain I would SAY: Normally, MANY diagonals between the two SEQUENCES can be identified in the hashing STEP. The top ten regions with the highest density of diagonals are identified as high SIMILARITY regions. The diagonals in these regions are scored using a substitution matrix.

Discussion

39.	BLAST uses a _______ to find matching words, whereas FASTA identifies identical matching words using the _____(a) substitution matrix, hashing procedure(b) substitution matrix, blocks(c) hashing procedure, substitution matrix(d) ktups, substitution matrixThis question was addressed to me in a national level competition.The query is from Comparison of FASTA and BLAST in section Database Similarity Searching of Bioinformatics
Answer» The CORRECT choice is (a) substitution matrix, HASHING procedure Easy explanation: BLAST and FASTA have been shown to perform almost equally well in regular DATABASE searching; However, there are some NOTABLE differences between the two approaches. The major difference is in the seeding step– BLAST uses a substitution matrix to find matching words, whereas FASTA identifies identical matching words using the hashing procedure.

Discussion

40.	The first step in FASTA alignment id to arrange the sequences in matrices’ rows and columns in order to be analyzed.(a) True(b) FalseThe question was asked in an interview for internship.The query is from FASTA in section Database Similarity Searching of Bioinformatics
Answer» CORRECT choice is (b) False The explanation: The first step in FASTA alignment is to identify ktups between TWO sequences by USING the hashing STRATEGY. This strategy works by constructing a look up table that shows the position of each ktup for the two sequences under consideration. The mentioned METHOD is similar to that of BLAST.

Discussion

41.	If one is looking for protein homologs encoded in newly sequenced genomes, one may use TBLASTN, which translates nucleotide database sequences in all six open reading frames.(a) True(b) FalseI have been asked this question in an interview.My doubt stems from Basic Local Alignment Search Tool (BLAST) topic in section Database Similarity Searching of Bioinformatics
Answer» CORRECT choice is (a) True To explain I would say: This may help to identify protein coding genes that have not YET been annotated. If a DNA SEQUENCE is to be used as the query, a protein-level comparison can be DONE with TBLASTX. However, both PROGRAMS are very computationally intensive and the search process can be very slow.

Discussion

42.	The BLAST program was developed in _______(a) 1992(b) 1995(c) 1990(d) 1991This question was posed to me in quiz.My question comes from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics
Answer» Correct option is (c) 1990 Easiest EXPLANATION: The BLAST program was DEVELOPED by Stephen Altschul of NCBI in 1990 and hassince become one of the most popular programs for SEQUENCE analysis. BLAST uses heuristics to align a query sequence with all SEQUENCES in a database.

Discussion

43.	In FASTA, neighboring high-scoring segments along the same diagonal are selected and joined to form a single alignment.(a) True(b) FalseI had been asked this question in unit test.This is a very interesting question from FASTA in portion Database Similarity Searching of Bioinformatics
Answer» The correct option is (a) True The explanation is: This step allows INTRODUCING gaps between the diagonals while applying gap penalties. The score of the gapped alignment is calculated again. In step 3, the gapped alignment is refined further using the Smith–Waterman ALGORITHM to PRODUCE a final alignment.

Discussion

44.	Which of the following is not correct about FASTA?(a) Its stands for FAST ALL(b) It was in fact the first database similarity search tool developed, preceding the development of BLAST(c) FASTA uses a ‘hashing’ strategy to find matches for a short stretch of identical residues with a length of k(d) The string of residues is known as blocksThe question was posed to me in my homework.This interesting question is from FASTA in chapter Database Similarity Searching of Bioinformatics
Answer» Correct answer is (d) The string of residues is known as blocks The explanation: The string of residues is known as ktuples or ktups, which are EQUIVALENT to words inBLAST, but are normally shorter than the words. Typically, a ktup is composed of TWO residues for PROTEIN sequences and six residues for DNA sequences.

Discussion

45.	Which of the following is not one of the requirements for implementing algorithms for sequence database searching?(a) Size of the dataset(b) Sensitivity(c) Specificity(d) SpeedThis question was posed to me in my homework.I'm obligated to ask this question of Heuristic Database Searching in chapter Database Similarity Searching of Bioinformatics
Answer» Correct answer is (a) Size of the dataset For explanation I would say: There are unique requirements for IMPLEMENTING algorithms for sequence database SEARCHING out of which, the LATER three play an important role. HOWEVER, speed can vary with the size of database. ACHIEVING all three at a time is nearly impossible.

Discussion

46.	FASTA is derived from logic of the dot plot.(a) True(b) FalseThis question was addressed to me in unit test.This intriguing question originated from Comparison of FASTA and BLAST topic in section Database Similarity Searching of Bioinformatics
Answer» Correct answer is (a) True BEST explanation: Because of this, it computes best diagonals from all frames of ALIGNMENT. The method LOOKS for EXACT matches between words in query and test sequence.

Discussion

47.	Which of the following is incorrect the ‘word’ method?(a) Both BLAST and FASTA use a heuristic word method(b) Word method is used for fast pairwise sequence alignment in BLAST and FASTA(c) The basic assumption is that two related sequences must have at least one word in common(d) Two related sequences must have at zero word in common while assumingThe question was asked during an interview.The question is from Heuristic Database Searching topic in division Database Similarity Searching of Bioinformatics
Answer» Right option is (d) TWO related sequences must have at zero word in common while assuming Explanation: This is the third method of pairwise sequence alignment. It works by finding SHORT stretches of identical or nearly identical letters in two sequences. These short strings of characters are called words, which are similar to the windows used in the DOT matrix method. The basic assumption is that two related sequences must have at least one word in common. By FIRST identifying word matches, a longer alignment can be obtained by EXTENDING similarity regions from the words. Once regions of high sequence similarity are found, adjacent high-scoring regions can be joined into full alignment.

Discussion

48.	The function of the scoring matrix is to conduct one-to-one comparisons between all components in two sequences and record the optimal alignment results.(a) True(b) FalseI have been asked this question in unit test.This intriguing question comes from Database Searching with the Smith topic in section Database Similarity Searching of Bioinformatics
Answer» RIGHT choice is (a) True For explanation: The scoring PROCESS reflects the concept of dynamic programming. The FINAL optimal alignment is FOUND by iteratively expanding the growing optimal alignment.

Discussion

49.	BLAST might not find matches for very short sequences.(a) True(b) FalseThis question was addressed to me in a job interview.I want to ask this question from Comparison of FASTA and BLAST topic in chapter Database Similarity Searching of Bioinformatics
Answer» Correct choice is (a) True The BEST explanation: In BLAST, SIMILARITY matching of words is involved. If no words are found SIMILAR, then no alignment is detected and HENCE it might not find matches for very short sequences.

Discussion

50.	FASTX, which compares a protein query sequence to a translated DNA database.(a) True(b) FalseThis question was posed to me in an interview for job.The doubt is from FASTA topic in section Database Similarity Searching of Bioinformatics
Answer» RIGHT answer is (B) False The best I can EXPLAIN: Some available variants of the program are FASTX and TFASTX. FASTX translates a DNA SEQUENCE and USES the translated protein sequence to query a protein database. TFASTX compares a protein query sequence to a translated DNA database.

Discussion

Explore topic-wise InterviewSolutions in .

In heuristic methods, speed doesn’t vary with the size of database.(a) True(b) FalseI have been asked this question during an interview.This key question is from Heuristic Database Searching topic in section Database Similarity Searching of Bioinformatics

Sensitivity refers to the ability to find as many correct hits as possible.(a) True(b) FalseThe question was asked by my school principal while I was bunking the class.My question is based upon Heuristic Database Searching topic in division Database Similarity Searching of Bioinformatics

Score can be negative in Smith–Waterman algorithm.(a) True(b) FalseThe question was asked during an internship interview.I'd like to ask this question from Database Searching with the Smith in portion Database Similarity Searching of Bioinformatics

The gapped portion in the diagonals represents matches in FASTA.(a) True(b) FalseThis question was addressed to me in an international level competition.My question comes from Comparison of FASTA and BLAST topic in chapter Database Similarity Searching of Bioinformatics

BLAST often produces several short HSPs rather than a single aligned region.(a) True(b) FalseI have been asked this question by my school principal while I was bunking the class.My query is from Comparison of FASTA and BLAST topic in portion Database Similarity Searching of Bioinformatics

Which of the following is not a variant of BLAST?(a) BLASTN(b) BLASTP(c) BLASTX(d) TBLASTNXI have been asked this question in an online interview.My question is taken from Basic Local Alignment Search Tool (BLAST) topic in section Database Similarity Searching of Bioinformatics

The initiation of FASTA format has ____ symbol.(a) >(b)

An increase in sensitivity is associated with _______ in selectivity.(a) no specific change(b) increase(c) decrease(d) exponential increaseI got this question in an interview.This question is from Heuristic Database Searching topic in section Database Similarity Searching of Bioinformatics

Dynamic programming method is the fastest and most practical method.(a) True(b) FalseI had been asked this question at a job interview.The question is from Heuristic Database Searching topic in chapter Database Similarity Searching of Bioinformatics

The main application of pairwise alignment is retrieving biological sequences in databases based on similarity.(a) True(b) FalseThe question was asked in a national level competition.My doubt stems from Heuristic Database Searching in chapter Database Similarity Searching of Bioinformatics

The specificity refers to the ability to include incorrect hits.(a) True(b) FalseThis question was posed to me in a job interview.My question is based upon Heuristic Database Searching topic in chapter Database Similarity Searching of Bioinformatics

Z-score describes the number of standard deviations from the mean score for the database search.(a) True(b) FalseThis question was posed to me in an interview.The above asked question is from FASTA topic in division Database Similarity Searching of Bioinformatics

Currently, there are two major heuristic algorithms for performing database searches: BLAST and FASTA.(a) True(b) FalseI had been asked this question in my homework.The doubt is from Heuristic Database Searching in division Database Similarity Searching of Bioinformatics

Which of the following is not a benefit of BLAST?(a) Handling of gaps(b) Speed(c) More sensitive(d) Statistical rigorThe question was asked during a job interview.This intriguing question comes from Comparison of FASTA and BLAST in division Database Similarity Searching of Bioinformatics

Local sequence alignments are necessary for many cases out of which one is repeated.(a) True(b) FalseThe question was asked during an interview for a job.This interesting question is from Database Searching with the Smith in chapter Database Similarity Searching of Bioinformatics

FASTA doesn’t use bit scores.(a) True(b) FalseThis question was posed to me in exam.My query is from FASTA in chapter Database Similarity Searching of Bioinformatics

The web-based FASTA program is offered by the European Bioinformatics Institute.(a) True(b) FalseThis question was posed to me by my college director while I was bunking the class.The question is from FASTA topic in section Database Similarity Searching of Bioinformatics

A recent improvement in the implementation of BLAST is the ability to provide gapped alignment.(a) True(b) FalseThe question was asked in quiz.My doubt stems from Basic Local Alignment Search Tool (BLAST) in chapter Database Similarity Searching of Bioinformatics

BLASTX uses protein sequences as queries to search against a protein sequence database.(a) True(b) FalseThe question was posed to me in exam.I'd like to ask this question from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics

In sequence alignment by BLAST, the second step is to search a sequence database for the occurrence of these words.(a) True(b) FalseI got this question in my homework.My question is from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics

The second step in FASTA is to narrow down the high similarity regions between the two sequences.(a) True(b) FalseThe question was posed to me in an interview.This key question is from FASTA in division Database Similarity Searching of Bioinformatics

The first step in FASTA alignment id to arrange the sequences in matrices’ rows and columns in order to be analyzed.(a) True(b) FalseThe question was asked in an interview for internship.The query is from FASTA in section Database Similarity Searching of Bioinformatics

The BLAST program was developed in _______(a) 1992(b) 1995(c) 1990(d) 1991This question was posed to me in quiz.My question comes from Basic Local Alignment Search Tool (BLAST) in section Database Similarity Searching of Bioinformatics

In FASTA, neighboring high-scoring segments along the same diagonal are selected and joined to form a single alignment.(a) True(b) FalseI had been asked this question in unit test.This is a very interesting question from FASTA in portion Database Similarity Searching of Bioinformatics

FASTA is derived from logic of the dot plot.(a) True(b) FalseThis question was addressed to me in unit test.This intriguing question originated from Comparison of FASTA and BLAST topic in section Database Similarity Searching of Bioinformatics

BLAST might not find matches for very short sequences.(a) True(b) FalseThis question was addressed to me in a job interview.I want to ask this question from Comparison of FASTA and BLAST topic in chapter Database Similarity Searching of Bioinformatics

FASTX, which compares a protein query sequence to a translated DNA database.(a) True(b) FalseThis question was posed to me in an interview for job.The doubt is from FASTA topic in section Database Similarity Searching of Bioinformatics