Explore topic-wise InterviewSolutions in .

This section includes InterviewSolutions, each offering curated multiple-choice questions to sharpen your knowledge and support exam preparation. Choose a topic below to get started.

1.

A genome database may also be interfaced with other types of data, such as clinical data.(a) True(b) FalseI got this question in homework.The doubt is from Prediction of Gene Function Based on a Composite Analysis topic in division Genome Analysis of Bioinformatics

Answer»

Correct answer is (a) True

The best explanation: These types of organization, termed DATA warehousing, can facilitate the search for novel RELATIONSHIPS AMONG the data by data-mining methods. These methods include genetic algorithms, neuronetworks, and others.

2.

Once a set of genes that are co-regulated has been found, the promoter regions of these genes may be analyzed for conserved patterns that represent sites of interaction with specific transcription factors.(a) True(b) FalseThis question was addressed to me in quiz.My question is from Global Gene Regulation topic in division Genome Analysis of Bioinformatics

Answer»

The correct answer is (a) True

To explain I would say: Automatic methods for CLUSTERING related sets of genes have been devised. The first of these methods, hierarchical clustering, is COMMONLY used, but the other two methods are better DESIGNED to DETECT differences in patterns over a set of time POINTS or samples.

3.

SVMs (Support vector machines) are a binary classification method to discriminate one set of data points from another.(a) True(b) FalseI have been asked this question during an interview.This key question is from Global Gene Regulation in section Genome Analysis of Bioinformatics

Answer»

The correct CHOICE is (a) True

The EXPLANATION: They are similar to the types of discriminant analyses. For microarray analysis, sets of genes are identified that REPRESENT a target PATTERN of GENE expression.

4.

Two species that have recently diverged from a common ancestor might be expected to have a ________ set of genes and ________ chromosomes with these genes positioned along the chromosomes in the same order.(a) distinct, similar(b) similar, distinct(c) similar, dissimilar(d) similar, similarI had been asked this question in examination.The origin of the question is Functional Classification of Genes in chapter Genome Analysis of Bioinformatics

Answer»

The CORRECT CHOICE is (d) similar, similar

Easiest explanation: Over evolutionary TIME, the sequence of each pair of GENES will slowly diverge, as the species diverge and other changes such as gene duplication and gene loss change the gene content. In addition, the order of genes ALSO changes over evolutionary time as a result of chromosomal rearrangements.

5.

Which of the given statements is incorrect about Microarray (or microchip) analysis?(a) It is a new technology in which all of the genes of an organism are represented by oligonucleotide sequences spread out in an 80 x 80 array on microscope slides(b) The oligonucleotide sequences cannot be synthesized directly on the slide(c) The oligonucleotides are collectively hybridized to a labeled cDNA library prepared by reverse-transcribing mRNA from cells(d) The amount of label binding to each oligonucleotide spot reflects the amount of mRNA in the cellI had been asked this question during an interview.Question is from Global Gene Regulation in division Genome Analysis of Bioinformatics

Answer»

Right option is (b) The oligonucleotide sequences cannot be synthesized directly on the slide

For explanation I would say: The oligonucleotide sequences can also be synthesized directly on the slide at DENSITIES of up to one million per square centimeter. Genes that are responding the same way to an environmental signal, in this case the ADDITION of SERUM to serum-starved skin cells are clustered together in a display. From this analysis, a SET of genes that responds in an identical manner may be identified.

6.

Which of the given statement is incorrect regarding MAGPIE?(a) It analyzes the genome using a set of automated processes(b) It is designed for high-throughput genome sequence analysis(c) It is unable to locate potential promoters(d) It automatically annotates genomic sequence data and maintains a daily up-to-date record in response to user queries about one or more genomesThe question was posed to me in an interview.My question is from Functional Classification of Genes in section Genome Analysis of Bioinformatics

Answer»

Correct choice is (c) It is unable to locate potential promoters

For explanation: The system also uses a set of RULES in LOGIC PROGRAMMING to make decisions that may be used to interpret information from various sources. It has been used to locate potential promoters, terminators, start CODONS, Shine-Dalgarno sites, DNA motif sites, co-transcription units, and putative operons in microbial GENOMES. These sites are shown on a map display of the genome that may be edited.

7.

When two proteins share a considerable degree of sequence identity throughout the sequence alignment, they are least likely to share the same function.(a) True(b) FalseI had been asked this question in an online quiz.The doubt is from Prediction of Gene Function Based on a Composite Analysis topic in division Genome Analysis of Bioinformatics

Answer»

Correct CHOICE is (b) False

To explain I would say: In the mentioned CASE they are more likely to SHARE the same function. A considerable fraction of a genome may ENCODE proteins whose function may not be identified in this manner because the proteins are not related to another of known function.

8.

In cluster analysis of microarray data– For n genes, the process is repeated ________ times until a single element remains.(a) n^2(b) n(c) n^-1(d) n^-4The question was posed to me by my college professor while I was bunking the class.The doubt is from Global Gene Regulation topic in chapter Genome Analysis of Bioinformatics

Answer» CORRECT answer is (C) n^-1

For EXPLANATION: This NUMBER of iterations gives the best results. In the final dendrogram, the order of genes within a cluster is determined by simple weighting schemes, e.g., average dendrogram level.
9.

Which of the given statement is incorrect?(a) Paralogous sequences, frequently are found to have dissimilar functions(b) An early classification scheme for eight related groups of E. coli genes included categories for enzymes, transport elements(c) An early classification scheme for eight related groups of E. coli genes included categories for regulators, membranes, structural elements, protein factors, leader peptides, and carriers(d) Ninety percent of E. coli genes related by significant sequence similarity fell into these same broad categoriesThis question was posed to me in an internship interview.My question is from Functional Classification of Genes in chapter Genome Analysis of Bioinformatics

Answer»

Right choice is (a) Paralogous SEQUENCES, frequently are found to have DISSIMILAR functions

The explanation: GENES that are significantly similar in an organism, i.e., paralogous sequences, frequently are found to have a related biological FUNCTION. This DISCOVERY follows the expected origin of paralogs by gene duplication events, leaving one copy to perform the original function and producing a second copy to develop a new function not too distant from the original one under evolutionary selection.

10.

Which of the given statement is incorrect about the Chromosomal Rearrangements?(a) Comparison of the number of rearrangements in a given period of evolutionary history may vary significantly from one organism to the next(b) If gene A has a neighboring gene B, then if an ortholog of A occurs in another genome, there is an increased probability of an ortholog of B also occurring in the other organism(c) If gene A has a neighboring gene B, then if an ortholog of A occurs in another genome, the B ortholog is more likely to be a neighbor of the A ortholog of the genome of the second species if the two species are more divergent(d) In general, the order of orthologs is not well conserved in prokaryotes when the genomes have diverged sufficiently that the orthologs have < 50% identityI have been asked this question in a national level competition.Question is from Functional Classification of Genes in chapter Genome Analysis of Bioinformatics

Answer»

Right option is (c) If gene A has a neighboring gene B, then if an ortholog of A occurs in another genome, the B ortholog is more likely to be a NEIGHBOR of the A ortholog of the genome of the SECOND species if the two species are more DIVERGENT

Best explanation: The B ortholog is less likely to be a neighbor of the A ortholog of the genome of the second species if the two species are more divergent. By classifying genes using a nine class functional classification scheme, several genes falling into the same functional CATEGORY are clustered together on the chromosomes of both of these organisms, and the clusters are in a similar order.

11.

Other functional classification schemes for genes include a broader category for genes involved in the same biological process, e.g., a three-group scheme for energy-related, information-related, and communication-related genes has also been used.(a) True(b) FalseThis question was addressed to me in an internship interview.This is a very interesting question from Functional Classification of Genes topic in chapter Genome Analysis of Bioinformatics

Answer»

The correct option is (a) True

The best I can explain: By this scheme, plants devote more than one-half of their genome to ENERGY metabolism. WHEREAS, ANIMALS devote one-half of their genome to communication-related FUNCTIONS.

12.

The ultimate step in genome analysis is to collect the information found on gene and protein sequences, alignments, gene function and location, protein families and domains, relationships of genes to those in other organisms, chromosomal rearrangements, and so on, into a comprehensive database.(a) True(b) FalseI have been asked this question in homework.Asked question is from Prediction of Gene Function Based on a Composite Analysis in portion Genome Analysis of Bioinformatics

Answer»

Correct choice is (a) True

Easiest explanation: This database should be logically organized so that all types of information are readily ACCESSIBLE and easily retrievable by USERS who have widely divergent KNOWLEDGE of the organism. This goal is best ACHIEVED by using controlled vocabularies that can identify the same genetic or biochemical function in different organisms without ambiguity.

13.

In Genome-wide prediction of protein functions by a combinatorial method– Each point represents a protein, and branches between proteins indicate a relationship by one of several criteria indicated in the legend.(a) True(b) FalseThe question was asked in class test.My question is taken from Prediction of Gene Function Based on a Composite Analysis topic in division Genome Analysis of Bioinformatics

Answer»

Correct choice is (a) True

Explanation: BRANCH lengths are shorter for closely RELATED proteins and thicker when two or more prediction methods indicate a relationship. The links are based on experimental data, proteins WHOSE homologs are known to operate sequentially in metabolic pathways, proteins that evolved in a correlated FASHION as evidenced by presence in fully sequenced genomes, proteins whose homologs are fused into a single protein in another organism, and proteins whose mRNA expression PROFILES are similar under a range of cellular and environmental conditions.

14.

Other types of evidence for a relationship between two genes are also given that are not dependent in sequence similarity. Which of the following is a wrong statement?(a) genes are closely linked on the same chromosomes(b) genes are transcribed from the same DNA strand(c) gene fusions are observed between otherwise separate genes(d) phylogenetic profiles show the genes are not that commonly present in organismsI got this question in semester exam.My query is from Prediction of Gene Function Based on a Composite Analysis in section Genome Analysis of Bioinformatics

Answer»

Correct option is (d) phylogenetic profiles show the GENES are not that commonly present in organisms

To elaborate: Phylogenetic profiles reveal the genes are both commonly present in many organisms implying they have interdependent metabolic functions. Option a and b imply coordinated regulation in an operon-like structure. Option c SUGGESTS the encoded proteins are PHYSICALLY associated in a COMMON complex.

15.

The designation ECa.b.c.d conveys information. Which of the following is not one of it?(a) One of twelve main classes of biochemical reactions(b) The group of substrate molecule(c) The nature of chemical bond that is involved in the reaction(d) Designation for acceptor molecules (cofactors)I got this question in an interview for internship.This interesting question is from Functional Classification of Genes in division Genome Analysis of Bioinformatics

Answer»

Correct choice is (a) One of TWELVE MAIN classes of BIOCHEMICAL reactions

For explanation I would say: Option “One of twelve main classes of biochemical reactions” should be ‘one of six main classes of biochemical reactions’. The Enzyme Commission numbers FORMULATED by the Enzyme Commission of the International Union of Biochemistry and Molecular Biology provide a detailed way to classify enzymes based on the biochemical reactions they catalyze.

16.

Which of the given statements is incorrect about Microarray Analysis?(a) It is designed to detect global changes in transcription in a genome(b) It provides information about the levels of protein products of the genes(c) The proteins are first separated in a column on the basis of size and then across a second dimension on a slab on the basis of charge(d) Labeled protein samples may also be extracted from treated cells and separated by two-dimensional gel electrophoresisThis question was addressed to me by my college professor while I was bunking the class.The origin of the question is Global Gene Regulation in division Genome Analysis of Bioinformatics

Answer»

Right option is (b) It provides information about the LEVELS of protein products of the genes

Easy explanation: MICROARRAY analysis is designed to detect global changes in TRANSCRIPTION in a genome but does not provide information about the levels of protein products of the genes, which may also be subject to translational regulation. This method also can resolve THOUSANDS of proteins based on size and charge. There are databases of the patterns FOUND in different organisms.

17.

In addition to the care needed in organizing genome databases, a great deal of human input is needed to annotate the genome manually with information.(a) True(b) FalseThe question was posed to me in quiz.My query is from Prediction of Gene Function Based on a Composite Analysis in chapter Genome Analysis of Bioinformatics

Answer»

Correct answer is (a) True

The EXPLANATION: This INFORMATION can be about individual GENES and proteins, effects of mutations in these genes, and other TYPES of genome variations that cannot be READILY incorporated into the database by automated methods. For the human genome, this activity will occupy the time of many scientists for many years to come.

18.

In Reverse-genetics analysis of gene function– Even though a particular gene may be _____ ortholog of a gene of known function in another organism, that gene may be acquired by a _____ function.(a) a highly predicted, similar(b) a highly predicted, same(c) a highly predicted, novel(d) less predicted, novelThe question was asked during an interview.My question is taken from Prediction of Gene Function Based on a Composite Analysis topic in section Genome Analysis of Bioinformatics

Answer»

Right answer is (c) a highly predicted, novel

The best I can explain: For example, a defect in a plant or animal gene that is a homolog of a yeast gene may have an effect on a developmental PROCESS or other biologically unique FUNCTION of multicellular organisms. Information on knockout MUTANTS in model organisms is available through the GENOME WEB sites.

19.

Which of the given statement is incorrect?(a) In a given organism or species, genes are found in a given order that is maintained on the chromosomes from one generation to the next(b) Genes with a related function are frequently found to be distorted on a chromosome(c) A possibility is that there is genetic variation (alleles) within each gene in a cluster of a given species and that only certain allelic combinations of different genes are compatible(d) Clustering of related genes presumably provides an evolutionary advantage to a speciesI have been asked this question in an international level competition.My enquiry is from Functional Classification of Genes topic in section Genome Analysis of Bioinformatics

Answer»

Right option is (B) Genes with a related FUNCTION are frequently found to be distorted on a chromosome

To elaborate: Genetic analysis has revealed that genes with a related function are frequently found to be clustered at one chromosomal location. As genome-by-genome comparisons of the chromosomes of related species are made and the rearrangements are discovered, a further challenge to COMPUTATIONAL and evolutionary BIOLOGISTS is to estimate the number and types of rearrangements that have occurred and also to DETERMINE when they occurred. For example, a comparison of the mouse and human chromosomes reveals many rearrangements.

20.

In case of functional genomics– Two general types of approaches are used—one in which a genetic construct is made that interferes with the expression of a particular gene (and sometimes a set of related genes) and a second in which a large number of random mutations are generated in a population of organisms.(a) True(b) FalseI had been asked this question in an international level competition.My enquiry is from Prediction of Gene Function Based on a Composite Analysis topic in division Genome Analysis of Bioinformatics

Answer» CORRECT option is (a) True

Explanation: The individual with a mutation in a particular gene is then identified. Once mutants are obtained, the effect of the mutant genes on phenotype is determined. The gene function MAY then be predicted on the BASIS of the OBSERVED alterations. Because such extreme genetic experiments cannot be performed with humans, the mouse model for the human genome serves the same PURPOSE.
21.

Which of the given statements is incorrect about global gene regulation?(a) One way to obtain useful information about a genome is to determine which genes are induced or repressed in response to a phase of the cell cycle(b) Sets of a gene whose expression rises and falls under the same condition are likely to have a related function(c) Sets of a gene whose expression rises and falls under the same condition are likely to have dissimilar functions(d) Cell cycle is a developmental phase, or a response to the environmentI have been asked this question in an interview.My question is from Global Gene Regulation topic in division Genome Analysis of Bioinformatics

Answer»

The correct option is (c) Sets of a GENE whose expression rises and falls under the same condition are likely to have dissimilar functions

Best explanation: In addition, a pattern of gene expression may also be an indicator of ABNORMAL cellular regulation and is a useful tool in cancer diagnosis. Because GENOMES, especially eukaryotic genomes, are so large, a new technology has been DEVELOPED for studying the regulation of thousands of genes on a microscope slide.

22.

In SVMs (Support vector machines) Data points are log-transformed and normalized as in method A, where for N observations of a gene i, the log transform Xi of the expression level Ei and reference level Ri is?(a) Xi = \(\frac{Log (E_i/R_i)}{\sqrt{\sum_{j=1,N} Log_{z-2} (E_j/R_j)}}\)(b) Xj = \(\frac{Log (E_j/R_i)}{\sqrt{\sum_{j=1,N} Log_z (E_j/R_j)}}\)(c) Xi = \(\frac{Log (E_i/R_i)}{\sqrt{\sum_{j=1,(N-1)} Log_z (E_j/R_j)}}\)(d) Xi = \(\frac{Log (E_i/R_i)}{\sqrt{\sum_{j=1,N} Log_z (E_j/R_j)}}\)This question was addressed to me in an interview for job.The doubt is from Global Gene Regulation in chapter Genome Analysis of Bioinformatics

Answer»

Correct answer is (a) Xi = \(\frac{Log (E_i/R_i)}{\sqrt{\sum_{j=1,N} Log_{z-2} (E_j/R_j)}}\)

Explanation: SVMs were used to categorize genes based on 79 different sets of data points from studies of the yeast cell cycle and are PARTICULARLY useful for such complex data sets. Gene combinations averaged over all EXPERIMENTAL CONDITIONS are then EXAMINED by a multidimensional ANALYSIS.

23.

In cluster analysis of microarray data– If Xi is the log odds value for gene X at time i, then for two genes X and Y and N observations, a similarity score is calculated. S(X,Y) is also known as the Pearson correlation coefficent. Xoffset and Yoffset can be the mean of the observations on X or Y, respectively, in which case is the standard deviation, or else Xoffset and Yoffset can be set to zero when a reference state is used. Which of the following best represents it?(a) S(X,Y) = \(\frac{1}{N-2}\) ∑i=1,N . (Xi – Xoffset) (Yi + Yoffset)/ϕxQY(b) S(X,Y) = \(\frac{1}{N}\) ∑i=1,N . (Xi – Xoffset) (Yi – Yoffset)/ϕxQY(c) S(X,Y) = \(\frac{1}{N-1}\) ∑i=1,N . (Xi + Xoffset) (Yi + Yoffset)/ϕxQY(d) S(X,Y) = \(\frac{1}{N}\) ∑i=1,N+2 . (Xi + Xoffset) (Yi – Yoffset)/ϕxQYThe question was asked in an internship interview.This intriguing question comes from Global Gene Regulation in section Genome Analysis of Bioinformatics

Answer»

The correct option is (b) S(X,Y) = \(\frac{1}{N}\) ∑i=1,N . (Xi – Xoffset) (Yi – YOFFSET)/ϕxQY

The explanation: After values of S(X,Y) have been calculated for all gene combinations, the most closely related pairs are identified in an above-diagonal scoring matrix. The object of CLUSTERING is to identify GENES that RESPOND the same way to the environmental treatment. Each gene is compared to every other gene and a gene similarity score (metric) is produced.

24.

In Self-organizing maps a choice is made of a number of clusters by which to organize the data.(a) True(b) FalseThe question was posed to me by my school teacher while I was bunking the class.The origin of the question is Global Gene Regulation in division Genome Analysis of Bioinformatics

Answer»

Right answer is (a) True

Best EXPLANATION: The object is to MOVE each node to the center of a cluster of DATA points. At each iteration a data point P is SELECTED, and the node closest to that point is identified.

25.

The hierarchical clustering method generates a similarity score [S(X,Y)] for all gene combinations, places the scores in a matrix, joins those genes that have the highest score, and then continues to join progressively less similar pairs.(a) True(b) FalseThe question was posed to me in an interview for internship.My enquiry is from Global Gene Regulation in section Genome Analysis of Bioinformatics

Answer»

The correct answer is (a) True

The best explanation: The DISADVANTAGE of this method is that it fails to discriminate between different patterns of variation. For example, a GENE expression pattern for which a HIGH value is found at an intermediate time point will be clustered with ANOTHER for which a high value is found at a late time point in the experiment. These variations have to be separated in a subsequent STEP.

26.

Which of the given statement is incorrect about the observations made with regard to gene order?(a) Order is highly conserved in closely related species(b) Order in closely related species becomes changed by rearrangements over evolutionary time(c) As more and more rearrangements occur, there will no longer be any correspondence in the order of orthologous genes on the chromosome of one organism with that of a second organism(d) Order is less conserved in closely related speciesI have been asked this question in an international level competition.This question is from Functional Classification of Genes topic in chapter Genome Analysis of Bioinformatics

Answer» CORRECT option is (d) Order is less conserved in closely related SPECIES

The explanation is: Order is more conserved in closely related species. Another observation is that the groups of genes that have a similar biological FUNCTION tend to remain LOCALIZED in a GROUP or cluster.
27.

An approach to classification of genes that encode enzymes is to examine relationships among multiple enzymes that perform the same biochemical function in the same organism.(a) True(b) FalseI got this question by my college professor while I was bunking the class.This interesting question is from Functional Classification of Genes topic in division Genome Analysis of Bioinformatics

Answer»

Right answer is (a) True

For EXPLANATION: Although catalyzing the same reaction, these enzymes showed VARIATIONS in metabolic REGULATION of their activity. More than one-half of multiple enzymes in E. COLI share significant sequence similarity; i.e., they are paralogs. However, the remainder do not share any sequence similarity.

28.

Which of the given statement is untrue about functional genomics?(a) Known functions are derived from experimental evidence in molecular biology and genetic studies with model organisms(b) Non-Orthologous genes between biologically distinct species can be identified, and it is strong evidence for a related function(c) Sequence-based methods of gene prediction can be augmented by the types of genome comparisons that are designed to identify related genes based on common patterns of expression, evolutionary profiles, chromosomal locations, and other features(d) Genome analysis depends to a large extent on sequence analysis methods that identify gene function based on similarity between proteins of unknown function and proteins of known functionI had been asked this question at a job interview.The above asked question is from Prediction of Gene Function Based on a Composite Analysis topic in chapter Genome Analysis of Bioinformatics

Answer»

The CORRECT CHOICE is (b) Non-Orthologous genes between biologically distinct SPECIES can be identified, and it is strong evidence for a related function

The explanation is: Orthologous genes between biologically distinct species (for example, yeast and fruit flies) can be identified, and the high sequence similarity between them is strong evidence for a related function. GIVEN the more complex multicellular biology of flies, the fly gene could have an additional function that is not predictable by the yeast model. In other cases, the occurrence of families of paralogous genes that share common domains can make a precise guess of function of one of these proteins more DIFFICULT because all match a model protein to some degree.

29.

In cluster analysis of microarray data– A node is created between the _________ scoring pair, and the gene expressed profiles of these two genes are averaged and the joined elements are weighted by the _________ of elements they contain.(a) lowest, frequency(b) average, sequence(c) lowest, number(d) highest, numberI have been asked this question in my homework.Asked question is from Global Gene Regulation topic in division Genome Analysis of Bioinformatics

Answer»

Correct answer is (d) highest, number

To elaborate: The node is CREATED as mentioned. The matrix is then UPDATED replacing the TWO joined ELEMENTS by the node.

30.

GeneQuiz focuses on deriving a predicted protein function, based on a variety of available evidence, including the evaluation of the similarity to the closest homolog in a database.(a) True(b) FalseThe question was posed to me in an online quiz.This key question is from Functional Classification of Genes topic in section Genome Analysis of Bioinformatics

Answer»

The correct option is (a) True

For explanation: GeneQuiz is an integrated system for large-scale BIOLOGICAL sequence analysis that uses a variety of SEARCH and analysis methods using current sequence DATABASES. By APPLYING expert rules to the results of the different methods, GeneQuiz creates a compact SUMMARY of findings.