1.

To view the first 3 rows of the dataset, which of the following commands is used?

Answer»

To view the first 3 rows of the dataset, which of the following commands is used?
Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)iris.top(3)
(2)iris.head(3)
(3)iris.select(3)
(4)iris.get(3)

Answer:-(2)iris.head(3)
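
As a minimal sketch of what head(3) does, the snippet below loads a small inline sample instead of the downloaded file. The five rows and the column names are assumptions standing in for iris.csv, and pandas is assumed to be installed.

```python
import io

import pandas as pd

# Small stand-in sample in the assumed column layout of iris.csv.
csv_text = """sepal_length,sepal_width,petal_length,petal_width,species
5.1,3.5,1.4,0.2,setosa
4.9,3.0,1.4,0.2,setosa
4.7,3.2,1.3,0.2,setosa
7.0,3.2,4.7,1.4,versicolor
6.3,3.3,6.0,2.5,virginica
"""
iris = pd.read_csv(io.StringIO(csv_text))

# head(n) returns the first n rows as a new DataFrame.
first_three = iris.head(3)
print(first_three)
```

Calling head() with no argument returns the first 5 rows, which is the pandas default.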

2.

Choose the correct sequence from the following.

Answer»

Choose the correct sequence from the following.
Choose the correct option from below list
(1)Data Analysis -> Preprocessing -> Model Building -> Predict
(2)PreProcessing -> Model Building -> Predict
(3)Data Analysis -> Preprocessing -> Predict -> Train
(4)Preprocessing -> Predict -> Train

Answer:-(1)Data Analysis -> Preprocessing -> Model Building -> Predict

3.

True Negative is when the predicted instance and the actual instance are positive.

Answer»

True Negative is when the predicted instance and the actual instance are positive.
Choose the correct option from below list
(1)False
(2)True

Answer:-(1)False

4.

A process used to identify unusual data points is _________

Answer»

A process used to identify unusual data points is _________
Choose the correct option from below list
(1)Anomaly Detection
(2)Over Fitting
(3)Under fitting

Answer:-(1)Anomaly Detection

5.

Is there a class imbalance problem in the given data set?

Answer»

Is there a class imbalance problem in the given data set?
Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)Yes
(2)No

Answer:-(2)No
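
One way to check for class imbalance is to count the rows per class. The sketch below uses a stand-in label list with 50 rows per species, which is the class distribution of the classic iris data; with pandas, iris['species'].value_counts() would give the same counts on the real column.

```python
from collections import Counter

# Stand-in label column: 50 rows for each of the three species.
species = ["setosa"] * 50 + ["versicolor"] * 50 + ["virginica"] * 50

counts = Counter(species)

# Ratio of the rarest class to the most common one; 1.0 means perfectly balanced.
balance_ratio = min(counts.values()) / max(counts.values())
print(counts, balance_ratio)
```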

6.

How many new columns does the following command return?

Answer»

How many new columns does the following command return?
iris_series = pd.get_dummies(iris['species'])
Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)1
(2)3
(3)4
(4)2

Answer:-(2)3
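
The count of 3 follows from one-hot encoding: pd.get_dummies creates one indicator column per distinct value. A minimal sketch on a four-row stand-in Series (the values are illustrative, not the full dataset):

```python
import pandas as pd

# Stand-in for the species column: three distinct values.
species = pd.Series(["setosa", "versicolor", "virginica", "setosa"], name="species")

# One indicator column is created per distinct value.
dummies = pd.get_dummies(species)
print(list(dummies.columns))
print(dummies.shape)
```

Each row has exactly one indicator set, marking the species of that row.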

7.

The classification where each data is mapped to more than one class is called _________

Answer»

The classification where each data is mapped to more than one class is called _________
Choose the correct option from below list
(1)Multi Label Classification
(2)Multi Class Classification
(3)Binary Classification

Answer:-(1)Multi Label Classification

8.

Images and documents are examples of _________

Answer»

Images and documents are examples of _________
Choose the correct option from below list
(1)Structured Data
(2)Unstructured Data

Answer:-(2)Unstructured Data

9.

Which command is used to identify the unique values of a column?

Answer»

Which command is used to identify the unique values of a column?
Choose the correct option from below list
(1)distinct()
(2)unique()
(3)value_counts()
(4)shape

Answer:-(2)unique()
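
To illustrate the difference between the plausible options, the sketch below calls unique() and value_counts() on a small made-up pandas Series; the values are hypothetical.

```python
import pandas as pd

column = pd.Series(["a", "b", "a", "c", "b", "a"])

# unique() returns the distinct values in order of first appearance.
distinct = column.unique()

# value_counts() returns how often each value occurs, not just which values exist.
frequencies = column.value_counts()

print(list(distinct))
print(frequencies)
```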

10.

What kind of classification is our case study 'Churn Analysis'?

Answer»

What kind of classification is our case study 'Churn Analysis'?
Choose the correct option from below list
(1)Binary
(2)Multi class
(3)Multi label

Answer:-(1)Binary

11.

Identify the structured data from the following.

Answer»

Identify the structured data from the following.
Choose the correct option from below list
(1)Data from mySQL DB
(2)Image
(3)Excel data
(4)Data from mySQL DB and Excel
(5)Video clip

Answer:-(4)Data from mySQL DB and Excel

12.

The fit(X, y) is used to _________

Answer»

The fit(X, y) is used to _________
Choose the correct option from below list
(1)Evaluate the classifier
(2)Train the classifier
(3)Test the classifier
(4)Initialize the classifier

Answer:-(2)Train the classifier
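
A minimal sketch of where fit(X, y) sits in a typical scikit-learn workflow (initialize, train, predict, evaluate). The tiny one-feature dataset is made up for illustration, and GaussianNB is just one possible classifier choice.

```python
from sklearn.metrics import accuracy_score
from sklearn.naive_bayes import GaussianNB

# Hypothetical, well-separated training and test data with one feature.
X_train = [[1.0], [1.2], [0.8], [5.0], [5.2], [4.8]]
y_train = [0, 0, 0, 1, 1, 1]
X_test = [[1.1], [4.9]]
y_test = [0, 1]

clf = GaussianNB()                          # initialize the classifier
clf.fit(X_train, y_train)                   # train it on features X and labels y
y_pred = clf.predict(X_test)                # predict labels for unseen rows
accuracy = accuracy_score(y_test, y_pred)   # evaluate against the true labels
print(list(y_pred), accuracy)
```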

13.

Clustering is an example of ___________

Answer»

Clustering is an example of ___________
Choose the correct option from below list
(1)Unsupervised classification
(2)Supervised classification

Answer:-(1)Unsupervised classification

14.

A technique used to depict the performance in a tabular form that has 2 dimensions namely actual and predicted sets of data is ________

Answer»

A technique used to depict the performance in a tabular form that has 2 dimensions namely actual and predicted sets of data is ________
(1)Classification Accuracy
(2)Confusion Matrix
(3)Classification Report
(4)Cross Validation

Answer:-(2)Confusion Matrix
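
A confusion matrix for a binary problem can be sketched in plain Python by counting the four actual/predicted combinations. The labels below are made up; rows are actual classes and columns are predicted classes, which is one common layout.

```python
# Hypothetical ground-truth and predicted labels (1 = positive, 0 = negative).
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

pairs = list(zip(actual, predicted))
tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # true positives
tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # true negatives
fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # false positives
fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # false negatives

# Rows: actual negative, actual positive. Columns: predicted negative, predicted positive.
confusion = [[tn, fp], [fn, tp]]
print(confusion)
```

A true negative is therefore the cell where both the actual and the predicted instance are negative, and a true positive is the cell where both are positive.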

15.

Which type of cross-validation is used for an imbalanced dataset?

Answer»

Which type of cross-validation is used for an imbalanced dataset?
Choose the correct option from below list
(1)Stratified Shuffle Split
(2)Leave One Out
(3)K-Fold

Answer:-(1)Stratified Shuffle Split
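
The sketch below shows why a stratified split suits imbalanced data: each test fold keeps the class proportions of the full label set. It uses scikit-learn's StratifiedShuffleSplit on made-up labels with a 90/10 imbalance.

```python
import numpy as np
from sklearn.model_selection import StratifiedShuffleSplit

# Hypothetical imbalanced labels: 90 samples of class 0, 10 of class 1.
y = np.array([0] * 90 + [1] * 10)
X = np.arange(100).reshape(-1, 1)

splitter = StratifiedShuffleSplit(n_splits=3, test_size=0.2, random_state=0)

# Count minority-class samples in each 20-sample test fold.
minority_counts = []
for train_idx, test_idx in splitter.split(X, y):
    minority_counts.append(int(y[test_idx].sum()))

print(minority_counts)
```

Every test fold receives 2 minority samples out of 20, matching the 10% share in the full data.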

16.

Identify the command used to view the dataset size, and what is the value returned?

Answer»

Identify the command used to view the dataset size, and what is the value returned?
Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)iris.size,(150,5)
(2)iris.size(),(150,6)
(3)iris.shape,(150,6)
(4)iris.shape(),(150,5)

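Answer:-(4)iris.shape(),(150,5)
Note: in pandas, shape is an attribute rather than a method, so the command that actually runs is iris.shape (without parentheses), which returns (150, 5) for this dataset; iris.shape() raises a TypeError.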
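
A short sketch of how shape behaves in pandas, using a synthetic stand-in DataFrame with the same dimensions as the dataset (150 rows, 5 columns). The column names and constant values are assumptions for illustration only.

```python
import pandas as pd

n_rows = 150
# Synthetic stand-in with 150 rows and 5 columns.
iris = pd.DataFrame({
    "sepal_length": [5.0] * n_rows,
    "sepal_width": [3.0] * n_rows,
    "petal_length": [1.5] * n_rows,
    "petal_width": [0.2] * n_rows,
    "species": ["setosa"] * n_rows,
})

dims = iris.shape  # attribute access: a (rows, columns) tuple

# Calling the tuple like a function fails, because shape is not a method.
try:
    iris.shape()
    call_failed = False
except TypeError:
    call_failed = True

print(dims, iris.size, call_failed)
```

Here iris.size is the total number of cells (rows times columns), not a (rows, columns) pair.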

17.

Select the pre-processing technique(s) from the following.

Answer»

Select the pre-processing technique(s) from the following.
Choose the correct option from below list
(1)One-hot encoding
(2)Normalization
(3)All the options
(4)Standardization
(5)Dimensionality reduction

Answer:-(3)All the options

18.

Which classifier converges easily with less training data?

Answer»

Which classifier converges easily with less training data?
Choose the correct option from below list
(1)Decision Tree Classifier
(2)Random Forest Classifier
(3)Naive Bayes Classifier
(4)SVM Classifier

Answer:-(3)Naive Bayes Classifier

19.

Email spam detection is an example of ________

Answer»

Email spam detection is an example of ________
Choose the correct option from below list
(1)Unsupervised classification
(2)Supervised classification

Answer:-(2)Supervised classification

20.

Which preprocessing technique is used to make the data Gaussian with zero mean and unit variance?

Answer»

Which preprocessing technique is used to make the data Gaussian with zero mean and unit variance?
Choose the correct option from below list
(1)Normalization
(2)Standardization
(3)Binarization

Answer:-(2)Standardization
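
Standardization can be sketched in plain Python as subtracting the mean and dividing by the standard deviation. Note that this rescales the data to zero mean and unit variance; it does not by itself change the shape of the distribution. The sample values are made up.

```python
import statistics

# Hypothetical feature values.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

mean = statistics.fmean(values)   # population mean
std = statistics.pstdev(values)   # population standard deviation

# z-score: centre on the mean, then scale by the standard deviation.
standardized = [(v - mean) / std for v in values]
print(mean, std, standardized)
```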

21.

Choose the correct sequence for the classifier building from the following.

Answer»

Choose the correct sequence for the classifier building from the following.
Choose the correct option from below list
(1)Initialize -> Train -> Predict -> Evaluate
(2)Train -> Test -> Initialize -> Predict
(3)None of the options
(4)Initialize -> Evaluate -> Train -> Predict

Answer:-(1)Initialize -> Train -> Predict -> Evaluate

22.

How many classes will the following command return?

Answer»

How many classes will the following command return?
(target classes in the dataset) : classes=list(iris['species'].unique())

Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)3
(2)2
(3)4
(4)1

Answer:-(1)3

23.

Pruning is a technique associated with _________

Answer»

Pruning is a technique associated with _________
Choose the correct option from below list
(1)Logistic regression
(2)SVM
(3)Linear regression
(4)Decision tree

Answer:-(4)Decision tree

24.

Model Tuning helps to increase the accuracy.

Answer»

Model Tuning helps to increase the accuracy.
Choose the correct option from below list
(1)True
(2)False

Answer:-(1)True

25.

The commonly used package for machine learning in Python is _________

Answer»

The commonly used package for machine learning in Python is _________
Choose the correct option from below list
(1)bottle
(2)django
(3)sklearn
(4)pillow

Answer:-(3)sklearn

26.

Cross-validation causes over-fitting.

Answer»

Cross-validation causes over-fitting.
Choose the correct option from below list
(1)False
(2)True

Answer:-(1)False

27.

Let's assume you are solving a classification problem with a highly imbalanced class.

Answer»

Let's assume you are solving a classification problem with a highly imbalanced class.
The majority class is observed 99% of the time in the training data.
Which of the following is true when your model has 99% accuracy after taking the predictions on test data?
Choose the correct option from below list
(1)For imbalanced class problems, precision and recall metrics are not good.
(2)For imbalanced class problems, the accuracy metric is not a good idea.
(3)For imbalanced class problems, the accuracy metric is a good idea.

Answer:-(2)For imbalanced class problems, the accuracy metric is not a good idea.
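
The reasoning can be sketched with a trivial model that always predicts the majority class. On made-up test labels with a 99/1 split it reaches 99% accuracy while never finding a single minority instance, which recall exposes.

```python
# Hypothetical test labels: 99 majority (0) and 1 minority (1) instance.
actual = [0] * 99 + [1] * 1

# A "model" that ignores its input and always predicts the majority class.
predicted = [0] * len(actual)

accuracy = sum(1 for a, p in zip(actual, predicted) if a == p) / len(actual)

tp = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
fn = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)
recall = tp / (tp + fn)  # share of actual positives that were found

print(accuracy, recall)
```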

28.

Imputing is a strategy to handle ____________

Answer»

Imputing is a strategy to handle ____________
Choose the correct option from below list
(1)Class Imbalance
(2)Standardization
(3)Missing Values

Answer:-(3)Missing Values
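
A minimal sketch of mean imputation in plain Python, where missing entries are represented by None. The column values are hypothetical, and the mean is only one of several possible fill strategies (median or most frequent value are common alternatives).

```python
import statistics

# Hypothetical numeric column with two missing entries.
ages = [22.0, None, 30.0, None, 38.0]

# Compute the fill value from the observed entries only.
observed = [a for a in ages if a is not None]
fill_value = statistics.fmean(observed)

# Replace each missing entry with the fill value.
imputed = [fill_value if a is None else a for a in ages]
print(imputed)
```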

29.

The cross-validation technique is used to evaluate a classifier by dividing the data set into a training set to train the classifier and a testing set to test the same.

Answer»

The cross-validation technique is used to evaluate a classifier by dividing the data set into a training set to train the classifier and a testing set to test the same.
Choose the correct option from below list
(1)True
(2)False

Answer:-(1)True

30.

Supervised learning differs from unsupervised learning as supervised learning requires __________

Answer»

Supervised learning differs from unsupervised learning as supervised learning requires __________
Choose the correct option from below list
(1)Labeled data
(2)None of the options
(3)Unlabeled data
(4)Raw data

Answer:-(1)Labeled data

31.

Ordinal variables have __________

Answer»

Ordinal variables have __________
Choose the correct option from below list
(1)No logical order
(2)A clear logical order

Answer:-(2)A clear logical order

32.

True Positive is when the predicted instance and the actual instance are positive.

Answer»

True Positive is when the predicted instance and the actual instance are positive.
Choose the correct option from below list
(1)True
(2)False

Answer:-(1)True

33.

Ensemble learning is used when you build component classifiers that are more accurate and independent of each other.

Answer»

Ensemble learning is used when you build component classifiers that are more accurate and independent of each other.
Choose the correct option from below list
(1)False
(2)True

Answer:-(2)True

34.

What is the number of categorical attributes in the original dataset?

Answer»

What is the number of categorical attributes in the original dataset?
Download the dataset from https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Choose the correct option from below list
(1)1
(2)2
(3)3
(4)0

Answer:-(1)1

35.

What are the advantages of Naive Bayes?

Answer»

What are the advantages of Naive Bayes?
Choose the correct option from below list
(1)None of the options
(2)It will converge quicker than the discriminative models like logistic regression
(3)It requires less training data
(4)Both the options

Answer:-(4)Both the options

36.

A classifier that can compute using numeric as well as categorical values is ________

Answer»

A classifier that can compute using numeric as well as categorical values is ________
Choose the correct option from below list
(1)Naive Bayes Classifier
(2)Random Forest Classifier
(3)SVM Classifier
(4)Decision Tree Classifier

Answer:-(2)Random Forest Classifier