Explore topic-wise InterviewSolutions in .

This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

51.

Using cDNA sequence with the __________ it is much simpler to locate protein-encoding sequences in these molecules.(a) exons taken out(b) exons removed(c) introns added(d) introns removedThis question was addressed to me in class test.Question is taken from Sequencing cDNA Libraries of Expressed Genes, Submission of Sequences to the Databases topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct answer is (d) introns removed

To EXPLAIN: The only possible DIFFICULTY is that a gene of interest MAY be developmentally expressed or regulated in such a way that the mRNA is not present. This problem has been CIRCUMVENTED by pooling mRNA preparations from tissues that express a large proportion of the genome, from a variety of tissues and developing organs or from organisms subjected to several environmental INFLUENCES.

52.

Annotation is based on finding significant alignment to sequences of known function in database similarity searches.(a) True(b) FalseI had been asked this question during an internship interview.This intriguing question comes from Comparative Genomics topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right answer is (a) True

Explanation: Accurate annotation of genome sequences is an important first step in genome analysis. Matches of lesser significance provide only a TENTATIVE or hypothetical prediction and should be used as a working HYPOTHESIS of FUNCTION.

53.

New gene functions are thought to be gained by duplication of an existing gene creating two tandem copies.(a) True(b) FalseThe question was posed to me in semester exam.I would like to ask this question from Sequence Assembly and Gene Identification in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct answer is (a) True

The BEST I can explain: Functional differentiation then occurs between the COPIES by mutation and selection. However, because most mutations are deleterious, and because only one GENE copy may be needed for function, there is a strong tendency of one copy to ACCUMULATE mutations that render the gene nonfunctional.

54.

After sequencing the Hemophilus genome and Organisms were selected for sequencing based on minimum criteria. Which of the following is not one of them?(a) They had been subjected to a good deal of biological analysis(b) They were model eukaryotic organisms(c) They were an important human pathogen, e.g., Mycobacterium tuberculosis (tuberculosis)(d) They were of phylogenetic interestThis question was addressed to me in semester exam.I want to ask this question from Genome Anatomy topic in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The CORRECT ANSWER is (b) They were MODEL EUKARYOTIC organisms

Explanation: There were model prokaryotic organisms. For E.g., E. coli and Bacillus subtilis. The success of sequencing the Hemophilus genome in a relatively short time and with a modest budget heralded the sequencing of a large number of additional prokaryotic organisms.

55.

The higher the E value, the more significant the alignment between a pair of matching sequences.(a) True(b) FalseI had been asked this question during an interview for a job.Query is from Sequence Assembly and Gene Identification in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct choice is (b) False

Explanation: The lower the E value, the more SIGNIFICANT the ALIGNMENT between a PAIR of matching SEQUENCES. The E value of an alignment score is the probability that an alignment score as good as the one found would be observed between two random or unrelated sequences in a search of a database of the same size.

56.

Gene identification in prokaryotic organisms is simplified by their lacking _________(a) exons(b) introns(c) coding segments(d) useful nucleotide sequencesI had been asked this question by my school teacher while I was bunking the class.Question is taken from Sequence Assembly and Gene Identification topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct choice is (b) introns

To explain I would say: Once the sequence patterns that are characteristic of the GENES in a particular prokaryotic organism (e.g, CODON usage, codon neighbor preference) have been FOUND, gene locations in the genome sequence can be predicted quite accurately. The presence of introns in eukaryotic genomes makes gene PREDICTION more involved because, in addition to the above FEATURES, locations of intron–exon and exon–intron splice junctions must also be predicted.

57.

Which of the following information is not directly obtained by microarray analysis?(a) Which genes are expressed at a particular stage of the cell cycle(b) Which genes are expressed at a particular stage of developmental cycle of an organism(c) Which genes are depleted at what time(d) Genes that respond to a given environmental signal to the same extentI got this question in an online interview.My doubt is from Sequence Assembly and Gene Identification topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right choice is (c) Which genes are depleted at what time

To explain: For chronological information there are other numerous techniques that can be FOLLOWED. This type of information provides an indication as to which genes share a related biological function or may act in the same biochemical pathway and may thereby give clues that will assist in gene identification.

58.

The results of searches in separate fields may be combined to narrow down the choices.(a) True(b) FalseThe question was posed to me in a national level competition.Origin of the question is Using the Database Access Program ENTREZ topic in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer» CORRECT choice is (a) True

Explanation: The number of terms to be searched for and the field to be searched is the main decisions to be MADE. In doing so, it is important to be as SPECIFIC as possible, or ELSE there may be a great many possibilities.
59.

The databases Genbank, EMBL and DDBJ are updated daily.(a) True(b) FalseThis question was posed to me by my college professor while I was bunking the class.The doubt is from Using the Database Access Program ENTREZ topic in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct choice is (a) True

The best I can explain: The mentioned database centers are updated DAILY and exchange NEW sequences daily, so that it is only necessary to access one of them. The EMBL STANDS for European MOLECULAR Biology Laboratory and DDBJ for DNA DATABANK of Japan.

60.

Which of the following is wrong about FASTA Sequence Format?(a) The FASTA sequence format includes a comment line identified by a “>” character in the first column followed by the name and origin of the sequence(b) The FASTA sequence format includes the sequence in standard one-letter symbols(c) This format provides a very convenient way to copy just the sequence part from one window to another because there are no numbers or other nonsequence characters within the sequence(d) The presence of ‘*’ is not quite essential for reading the sequence correctly by some sequence analysis programsThe question was posed to me in quiz.This interesting question is from Sequence Formats & Computer Storage of Sequences in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»
61.

Which of the following is an incorrect statement?(a) The last column contains the sequences themselves(b) It is quite tough making an index of the information in each of these fields so that a search query can locate all the occurrences through the index(c) If one imagines a large table with each sequence entry occupying one row, then each column will include one of the above types of information for each sequence, and each column is called a FIELD(d) The DNA, protein, and reference databases have all been cross-referenced so that moving between them is readily accomplishedThis question was posed to me during an interview for a job.My enquiry is from Multiple Sequence Formats & Storage of Information in a Sequence Database topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct choice is (b) It is quite tough making an index of the information in each of these fields so that a search QUERY can locate all the OCCURRENCES through the index

To explain: It is very easy to MAKE an index of the information in each of these fields so that a search query can locate all the occurrences through the index. Even related sequences are cross-referenced. In addition, the information in one DATABASE can be cross-referenced to that in ANOTHER database.

62.

The translated sequence can then be compared to a database of protein sequences with the hope of finding a strong similarity to a protein of known function, and hence to identify the function of the cloned EST.(a) True(b) FalseThis question was posed to me by my college professor while I was bunking the class.My enquiry is from Sequencing cDNA Libraries of Expressed Genes, Submission of Sequences to the Databases in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct choice is (a) True

Easiest EXPLANATION: The translated sequence can then be COMPARED in the MENTIONED way HENCE to identify the function of the cloned EST. The corresponding cDNA clone of the gene of interest can then be obtained and the gene completely SEQUENCED.

63.

An important development for computational purposes was the decision by Craig Venter to prepare databases of partial sequences of the expressed genes, called expressed sequence tags or ESTs.(a) True(b) FalseThis question was posed to me in an online quiz.I'm obligated to ask this question of Sequencing cDNA Libraries of Expressed Genes, Submission of Sequences to the Databases topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct option is (a) True

For explanation I WOULD say: This was an important development from resolution point of view. This has just enough DNA sequence to give a GOOD idea of the protein sequence.

64.

Which of the given statements is incorrect about Ancient Conserved Regions?(a) The method involves database similarity searches of the SwissProt database with human, worm, yeast, or E. coli genes and identification of matches with sequences from a different phylum than the query sequence(b) An analysis of ACRs that predate the radiation of the major animal phyla some 580–540 million years ago suggested that 50–60% of coding sequences are ACRs(c) These ACRs may represent proteins present at the time of the prokaryotic–eukaryotic divergence(d) Phylogenetically diverse groups of organisms have been analyzed for the presence of conserved proteins and protein domains that have been conserved over long periods of evolutionary time, called ancient conserved regions or ACRsThe question was asked during an internship interview.This interesting question is from Comparative Genomics topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct answer is (b) An analysis of ACRs that predate the radiation of the major animal PHYLA some 580–540 million years ago SUGGESTED that 50–60% of coding sequences are ACRs

The explanation is: The analysis of ACRs 580–540 million years ago suggested that 20–40% of coding sequences are ACRs. For example, a search with 1916 E. coli PROTEINS DETECTED 266 ACRs found in 439 sequences, roughly one-quarter of the SwissProt database.

65.

In protein/domain analysis, each protein in the predicted proteome is again used as a query of a curated protein sequence database such as ____ in order to locate similar domains and sequences. To find orthologs, very low E value scores (E

Answer»

Correct answer is (d) SwissProt

To elaborate: The domain composition of each protein is also determined by searching for matches in domain DATABASES such as Interpro. The analysis reveals how MANY domains and domain COMBINATIONS are present in the proteome, and reveals any unusual representation that might have biological significance. The number of expressed genes in each family can also be compared to the number in other ORGANISMS to DETERMINE whether or not there has been an expansion of the family in the genome.

66.

Comparative genomics includes a comparison of gene number, gene content, and gene location in both prokaryotic and eukaryotic groups of organisms.(a) True(b) FalseThis question was posed to me in an interview for internship.Asked question is from Comparative Genomics topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct option is (a) True

Easiest explanation: The availability of COMPLETE genome sequences makes POSSIBLE a comparison of all of the PROTEINS ENCODED by one genome, the PROTEOME of that organism, with those of another. Because the genome sequence provides both the sequence and the map location of each gene, both the sequence and location can be compared.

67.

In Protein 2D gel Electrophoresis, Individual proteins produced by the genome can be separated to _____ by this method and specific ones identified by various _______(a) smaller extent, biochemical and immunological tests(b) a large extent, biochemical and immunological tests(c) a large extent, biochemical tests only(d) smaller extent, purely mechanical testsI have been asked this question in an interview.I'm obligated to ask this question of Sequence Assembly and Gene Identification topic in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct choice is (b) a LARGE EXTENT, biochemical and immunological tests

Easy explanation: Moreover, changes in levels of proteins in response to an environment signal can be monitored in much the same way as a microarray analysis is performed. Microarrays only DETECT untranslated mRNAs, whereas a two-dimensional GEL protein analysis detects translation products, THUS revealing an additional level of regulation.

68.

The ________ are _______ by genetic structure to retroviruses.(a) STR retrotransposons, related(b) LTR retrotransposons, related(c) STR transposons, related(d) LTR retrotransposons, not relatedI had been asked this question in my homework.I would like to ask this question from Sequence Assembly and Gene Identification topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct option is (b) LTR RETROTRANSPOSONS, related

To explain I would say: There are THREE main SUBCLASSES of these TEs—the LONG terminal repeat (LTR) retrotransposons, retroposons, and retrovirus-like elements with LTRs. Class I elements encode reverse transcriptase and use RNA mediated mechanisms of transcription.

69.

Yeast is about ______ compact than E. coli.(a) fivefold, less(b) threefold, more(c) twofold, less(d) twofold, moreThe question was posed to me during an online interview.My query is from Sequence Assembly and Gene Identification in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The CORRECT choice is (c) twofold, less

The explanation: Of the REMAINING genomes, C. elegans and A. thaliana have approximately the same density of genes (one gene per 6 kb). DROSOPHILA is the LEAST DENSE in this comparison (one gene per 14 kb).

70.

In Proteolysis and fragment sequencing, Protein spots may be excised from a two-dimensional protein gel and subjected to a combination of amino acid sequencing and cleavage analyses using the techniques of mass spectrometry and high-pressure liquid chromatography.(a) True(b) FalseI have been asked this question during an interview.My doubt is from Sequence Assembly and Gene Identification topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The CORRECT answer is (a) True

Easy explanation: GENOME regions that encode these sequences can then be identified and the corresponding gene located. A similar METHOD MAY be used to identify the gene that ENCODES a particular protein that has been purified and characterized in the laboratory.

71.

The human genome contains about _________ of class II of elements that probably predate human evolution (Smit 1996).(a) 2,000 copies(b) 200,000 copies(c) 2,00,00,000 copies(d) 20,00,000 copiesThe question was asked in unit test.My doubt stems from Genome Anatomy topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct answer is (b) 200,000 copies

To EXPLAIN I would say: The class of TEs, class II, is MADE up of ELEMENTS that employ a DNA-based MECHANISM of transposition. Class II elements also include the Activation-Dissociation (Ac-Ds) family in maize and the P element in Drosophila.

72.

Which of the given statements is incorrect?(a) The simplest and newest way of submitting sequences is through the Web site on a Web form page called BankIt(b) The sequence can also be annotated with information about the sequence, such as mRNA start and coding regions(c) The submitted form is transformed into GenBank format and returned to the submitter for review before being added to GenBank(d) Sequin does not run on UNIXI have been asked this question in an interview for internship.Enquiry is from Sequencing cDNA Libraries of Expressed Genes, Submission of Sequences to the Databases topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct option is (d) SEQUIN does not run on UNIX

To explain I would say: The other method of SUBMISSION is to use Sequin (formerly called Authorin), which RUNS on personal computers and UNIX machines. The program provides an easy-to-use GRAPHIC interface and can manage LARGE submissions such as genomic sequence information.

73.

Which of the given statements is incorrect?(a) The processes of domain assortment and gene duplication produce families of proteins in organisms(b) Following speciation, a newly derived genome will inherit the families of ancestor organisms, but will also develop new ones to meet evolutionary challenges(c) Comparison of each of the proteins encoded by an organism with every protein, an all-against-all comparison, reveals which protein families have been amplified and what rearrangements have occurred as steps in the evolutionary process(d) When two or more proteins in the proteome share a high degree of similarity they are least likely to be paralogsThis question was addressed to me in examination.Question is from Comparative Genomics in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right answer is (d) When two or more proteins in the proteome share a high degree of similarity they are least likely to be paralogs

To explain I would say: When two or more proteins in the proteome share a high degree of similarity because they share the same SET of domains, they are likely to be paralogs, genes that arose by gene DUPLICATION events. Proteins that ALIGN over shorter regions share some domains, but also may not share OTHERS. Although gene duplication events could have CREATED such variation, other rearrangements may have also occurred, blurring the evolutionary history.

74.

In Metabolic pathways and regulation, as genes are identified in a new genome sequence, some will be found that are known to act sequentially in a metabolic pathway or to have a known role in gene regulation in other organisms.(a) True(b) FalseThis question was posed to me in an interview for job.The query is from Sequence Assembly and Gene Identification topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer» CORRECT option is (a) True

Best EXPLANATION: From this information, the metabolic pathways and metabolic activities of the organism will become apparent. In some cases, the apparent absence of a gene in a well-represented pathway may lead to a more DETAILED search for the gene. Clustering of genes in the pathway on the genome of a related organism can PROVIDE a further hint as to where the gene may be located.
75.

Introns can remain at a corresponding position in a eukaryotic gene for long periods of evolutionary time.(a) True(b) FalseThis question was addressed to me during an interview.This intriguing question originated from Genome Anatomy in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right option is (a) True

To explain: The origin of introns in EUKARYOTIC genes is not understood but has been accounted for by two models. The “introns-early” VIEW proposes that introns were used to assemble the first genes from sets of ANCIENT conserved exons, whereas the “introns-late” view proposes that introns broke up previously continuous genes by inserting into them.

76.

Each genome sequence is scanned for protein-encoding genes using gene models trained on known gene sequences from the same organism.(a) True(b) FalseI have been asked this question in an internship interview.My doubt is from Sequence Assembly and Gene Identification in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct option is (a) True

To explain: For a NEW genome, each predicted gene is translated into a PROTEIN sequence; the COLLECTION of protein sequences ENCODED by the genome is the proteome of the organism. every protein in the proteome is then used as a query sequence in a database similarity SEARCH. Matching database sequences are realigned with the query sequence to evaluate the extent and significance of the alignment.

77.

Most of the repetitive DNA is found near the open ends of chromosomes.(a) True(b) FalseI got this question in examination.My enquiry is from Genome Anatomy in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct answer is (B) False

Explanation: Some of it is found NEAR the open ENDS of chromosomes- telomeres but Most of this repetitive DNA is found near the centromere. It is because they serve as a point of ATTACHMENT for pulling the chromosomes APART during cell division.

78.

The chromosomes comprised linear DNA molecules in a tightly compact form that was wrapped around protein complexes, called the nucleosome.(a) True(b) FalseThe question was posed to me in my homework.Asked question is from Genome Anatomy in section Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct option is (a) True

The explanation is: Nuclei and chromosomes were not observed in bacteria (a prokaryotic cell), but when BACTERIAL DNA was eventually detected, the molecule was usually circular and was ALSO in a compacted FORM. The FOLLOWING sections outline the structure and composition of prokaryotic and EUKARYOTIC genomes.

79.

Using boolean logic, the search looks for database entries that include the first term __________ the second, and subsequent terms repeated until the last term.(a) AND(b) OR(c) ExOR(d) NANDThe question was posed to me in an interview for job.Asked question is from Using the Database Access Program ENTREZ in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct choice is (a) AND

Easiest explanation: On the ENTREZ form, MAKE a SELECTION in the data entry window after the TERM “Search,” then ENTER search terms in the longer data entry window after “for.” The database will be searched for sequence database entries that contain all of these terms or related ONES.

80.

For a protein search, which of the following is the wrong choice for fields?(a) Accession (number)(b) E. C. number(c) Issue(d) Journal numberI have been asked this question in examination.This key question is from Using the Database Access Program ENTREZ topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct option is (d) Journal number

The best I can EXPLAIN: Other fields being- author name, journal name, keyword, modification date. Also, it includes organism, page number, primary ACCESSION (number), properties, protein name, publication date (of reference), seqID string, sequence length, substance name, text word, title word, volume, and sequence ID. Similar fields are SHOWN for the DNA DATABASE search.

81.

The FASTA format is readily converted into other formats and also is smaller and simpler.(a) True(b) FalseThe question was posed to me by my college director while I was bunking the class.The doubt is from Multiple Sequence Formats & Storage of Information in a Sequence Database topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The CORRECT option is (a) True

To EXPLAIN I would SAY: It CONTAINS just a line of sequence identifiers FOLLOWED by the sequence without numbers, is very useful for browsing and analyzing purposes. One browser window may retrieve sequences from a database and a second may analyze these sequences.

82.

Which of the given statements is incorrect about READSEQ?(a) It is an extremely useful sequence formatting program developed by D. G. Gilbert at Indiana University, Bloomington(b) It was developed at Indiana University, Bloomington(c) It can recognize a DNA or protein sequence file in any of the formats(d) It can recognize a DNA or protein sequence file in some particular formatsThe question was asked during an internship interview.The above asked question is from Multiple Sequence Formats & Storage of Information in a Sequence Database topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct choice is (d) It can RECOGNIZE a DNA or protein sequence file in some particular formats

The explanation: It can identify the FORMAT, and WRITE a NEW file with an alternative format. Some of these formats are used for special types of analyses such as multiple sequence alignment and phylogenetic analysis.

83.

Pseudogenes are DNA sequences that were derived from distinct genes but which have acquired mutations that are deleterious to function in the same period of time.(a) True(b) FalseThe question was posed to me during an interview.Enquiry is from Sequence Assembly and Gene Identification in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct answer is (b) False

The best explanation: PSEUDOGENES are DNA sequences that were derived from a FUNCTIONAL copy of a gene but which have acquired mutations that are deleterious to FUNCTION. For e.g. the PSEUDOGENE TRY5 is similar to the NEARBY functional gene TRY4.

84.

WU-BLAST produces P scores and BLAST (NCBI) produces E scores where _________(a) E = ln (1 + P^2)(b) E = ln (1 – P^2)(c) E = ln (1 + P)(d) E = ln (1 – P)I got this question in an interview.The query is from Sequence Assembly and Gene Identification in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct option is (d) E = ln (1 – P)

Best EXPLANATION: For values less than 0.05, E = P. The choice of a < 10-20 SCORE is a conservative one for IDENTIFICATION of orthologs that should have a similar DOMAIN structure.

85.

Eukaryotic genes that encode proteins are interrupted by ________(a) exons of varying length and number(b) introns of varying length and number(c) exons of varying length and but same number(d) introns of varying number but same lengthThis question was addressed to me in a national level competition.My doubt stems from Genome Anatomy in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right answer is (b) INTRONS of VARYING length and number

The best I can explain: In S. cerevisiae (budding yeast), only a small fraction of the genes contain introns, and there are a total of 239 introns in the entire GENOME. In contrast, in individual human genes, introns may be present in NUMBERS exceeding 100 and COMPRISE more than 95% of the gene.

86.

Telomeres hold newly replicated daughter chromosomes together.(a) True(b) FalseI have been asked this question during a job interview.My doubt is from Genome Anatomy topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct option is (b) False

To explain I would say: Centromeres hold newly replicated daughter CHROMOSOMES together. They serve as a point of ATTACHMENT for pulling the chromosomes APART during CELL division.

87.

The first bacterial genome to be sequenced was that of ________ a mild human pathogen.(a) Hemophilus influenzae(b) Lactobacillus(c) Vibrio cholarae(d) Clostridium botulinumI have been asked this question in class test.Asked question is from Genome Anatomy in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct OPTION is (a) Hemophilus influenzae

Explanation: This project was CARRIED out at the Institute of GENOMICS Research. It was carried out in PART to prove a new GENOME sequencing method—the shotgun method.

88.

Which of the given statements is incorrect about ASCII and Hexadecimal?(a) Computers store sequence information as simple rows of sequence characters called strings, which are similar to the sequences shown on the computer terminal(b) Each character is stored in binary code in the smallest unit of memory, called a byte(c) Each character is stored in binary code in the smallest unit of memory, called a bit(d) By convention, many of these combinations have a specific definition, called their ASCII equivalentThis question was addressed to me in my homework.Question is taken from Sequence Formats & Computer Storage of Sequences in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right option is (b) Each character is stored in binary CODE in the smallest unit of memory, called a byte

Explanation: Each byte comprises 8 bits, with each bit having a POSSIBLE VALUE of 0 or 1, producing 255 possible combinations. Some ASCII values are defined as keyboard characters, others as special control characters, such as signaling the end of a LINE (a line feed and a carriage return), or the end of a FILE full of text (end-of-file character). A file with only ASCII characters is called an ASCII file.

89.

An alternative method is to sequence all the subclones, produce a computer database of the sequences, and then have the computer assemble the sequences from the overlaps that are found.(a) True(b) FalseThis question was posed to me in unit test.This interesting question is from DNA & Genomic Sequencing topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer» RIGHT answer is (a) True

Best EXPLANATION: Up to 10 levels of redundancy are used to get around the PROBLEM of a small fraction of abnormal clones. This procedure was first used to obtain the sequence of the 4- MB chromosome of the bacterium Haemophilus influenzae by The Institute of Genetics Research (TIGR) TEAM. Only a few regions could not be joined because of a problem subcloning those regions into plasmids, requiring manual sequencing of these regions from another library of phage subclones.
90.

Once the correct map has been obtained, unique overlapping clones are chosen for sequencing.(a) True(b) FalseThis question was posed to me during an interview.My doubt stems from DNA & Genomic Sequencing topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right option is (a) True

The explanation: However, these MOLECULES are too large for direct sequencing. One PROCEDURE for sequencing these clones is to subclone them further into smaller fragments that are of SIZES SUITABLE for sequencing, make a map of these clones and then sequence overlapping clones. However, this method is EXPENSIVE because it requires a great deal of time to keep track of all the subclones.

91.

Which of the given statements is incorrect about Clustering by single linkage?(a) In First step, a group of related sequences found in the all-against-all proteome comparison is subjected to a multiple sequence alignment usually by CLUSTALW(b) A neighbor-joining algorithm is rarely used in this method(c) This procedure and the algorithms are the same as those used to make a phylogenetic tree by the distance methods(d) A distance matrix that shows the number of amino acid changes between each pair of sequences is madeThis question was addressed to me at a job interview.My query is from Comparative Genomics topic in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Correct CHOICE is (b) A neighbor-joining algorithm is rarely used in this method

For explanation I would say: The matrix is then used to cluster the sequences by a neighbor-joining algorithm. These methods produce a tree or a different REPRESENTATION of the tree CALLED a dendrogram, which MINIMIZES the number of amino ACID changes that would generate the group of sequences.

92.

Evolutionary modeling can include various types of analyses. Which is mostly not one of them?(a) The prediction of chromosomal rearrangements(b) Eu/Hetero-chromatin structures(c) Duplications at gene, chromosomal and full genome level(d) Duplications at the protein domain levelThis question was addressed to me in an international level competition.My doubt stems from Sequence Assembly and Gene Identification topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct OPTION is (B) Eu/Hetero-chromatin STRUCTURES

The best explanation: Option Eu/Hetero-chromatin structures indicate structural studies. Also as mentioned, analysis of the prediction of CHROMOSOMAL rearrangements that preceded the present arrangement is done (e.g., a comparison of mouse and human chromosomes).

93.

Which of the given statement is incorrect about proteome analyses?(a) For BLAST, setting an effective database size appropriate for each search and program is important(b) Due to the large number of comparisons that must be made in these types of analyses and due to the volume of program output, the procedure must be automated on a local machine using Perl scripts or a similar method and a database system(c) BLAST is used for obtaining a correct statistical evaluation of alignment scores(d) BLAST does not give statistical evaluation of alignment scoresI got this question in examination.This interesting question is from Sequence Assembly and Gene Identification topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer» RIGHT option is (d) BLAST does not GIVE statistical evaluation of alignment scores

The best I can EXPLAIN: Each protein encoded by the genome is used as a query in DATABASE similarity searches to identify similar database proteins, some having a KNOWN structure or function. Additional searches of EST databases can be used to identify additional relatives of the query sequence.
94.

_______ of the Drosophila sequence is composed of TEs and _____ is heterochromatic regions that do not include genes.(a) one-fourth, one-third(b) one-fifth, one-fourth(c) one-sixth, one-eighth(d) one-sixth, one-thirdThis question was posed to me in quiz.The question is from Sequence Assembly and Gene Identification topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The correct ANSWER is (d) one-sixth, one-third

To explain: Hence, in the euchromatic regions, the gene density in the Drosophila GENOME is one gene per 9 kb. DESPITE the fact that the lower number of predicted genes in Drosophila is smaller than that of the other genomes, the amount of functional DIVERSITY, as evidenced by protein family representation, is SIMILAR.

95.

Screening the predicted protein sequences against ______ library confirms the prediction and expression of the gene.(a) expressed sequence tag (EST)(b) tags(c) palindromes(d) proteomesI got this question in semester exam.My doubt stems from Sequence Assembly and Gene Identification topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»
96.

(Transposable Elements) TEs have the potential to _____ in number until they comprise a ____ proportion of the genome sequence.(a) decrease, large(b) decrease, micro(c) increase, micro(d) increase, largeThe question was asked in semester exam.This interesting question is from Genome Anatomy in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right ANSWER is (d) increase, large

Explanation: It is a feature already observed in MANY plants and animals. They remain detectable in the GENOME until they blend into the background SEQUENCE by mutation.

97.

A simple way to retrieve sequences of viral and other extra-chromosomal genetic elements such as organelles is through the National Center for Biotechnology Information (NCBI).(a) True(b) FalseThe question was asked in an online quiz.This is a very interesting question from Genome Anatomy topic in chapter Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer» RIGHT option is (a) True

To explain: Prior to the sequencing of H. influenzae, the first free-living organism to be SEQUENCED, a large NUMBER of viruses had been sequenced. Many of these organisms also serve as model systems for STUDYING replication and gene expression. As an example, the nucleotide sequence of bacteriophage lambda was completed by Sanger.
98.

Which of the given is wrongly matched?(a) Escherichia coli – Bacteria(b) Methanococcus jannaschii – Archaea(c) Synechocystis sp. – Archaea(d) Aquifex aeolicus – BacteriaI had been asked this question in an interview for internship.My question is from Genome Anatomy topic in portion Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right CHOICE is (c) Synechocystis sp. – Archaea

For explanation: Synechocystis sp. – Bacteria is the CORRECT PAIR. It is an ancient organism that PRODUCES OXYGEN by light-harvesting.

99.

The program returns the number of matches found and provides an opportunity to narrow this list by including more terms.(a) True(b) FalseI had been asked this question in an internship interview.My question is based upon Using the Database Access Program ENTREZ in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

Right answer is (a) True

The best I can explain: When the number of matching sequences has been NARROWED to a REASONABLE number, the SEQUENCE may be retrieved in a CHOSEN FORMAT in several straightforward steps. This helps in getting to the required data in less number of steps.

100.

The format of an entry in the SwissProt protein sequence database is very similar to the EMBL format.(a) True(b) FalseThis question was addressed to me in an online quiz.The query is from Sequence Formats & Computer Storage of Sequences in division Collecting & Storing Sequences in Laboratory of Bioinformatics

Answer»

The CORRECT choice is (a) True

The best I can explain: The format is quite similar to the EMBL format, EXCEPT that considerably more information about the physical and biochemical PROPERTIES of the protein is provided. Also, the OUTPUT of a DDBJ DNA SEQUENCE entry is almost identical to that of GenBank.