1.

Provide at least three ways to detect outliers in a dataset?

Answer»

There are various methods.

  1. The most COMMON method is to use visualization using box-plots, HISTOGRAMS and scatter plots to detect outliers.
  2. Another way - Any VALUE, which is beyond the range of -1.5 x IQR to 1.5 x IQR (where IQR stands for Inter-Quartile Range).
  3. Use capping methods. Any value which out of range of 5th and 95th percentile can be considered an outlier.

Others could be as follows: DATA points, three or more standard deviations away from the mean are considered as outlier.



Discussion

No Comment Found

Related InterviewSolutions