1.

Which of the following function can be used to create balanced splits of the data?(a) newDataPartition(b) createDataPartition(c) renameDataPartition(d) none of the mentionedThis question was posed to me in final exam.The doubt is from caret in division Machine Learning of Data Science

Answer»

Correct choice is (b) createDataPartition

Easiest EXPLANATION - If the y argument to this function is a factor, the random sampling OCCURS within each class and should PRESERVE the overall class distribution of the DATA.



Discussion

No Comment Found

Related InterviewSolutions