InterviewSolution
| 1. |
How do you handle the class imbalance problem in a binary classification context? Explain briefly. |
|
Answer» Imbalanced data refers to classification problems in which the classes are not represented equally. For example, in a credit card fraud detection scenario, we may have a two-class (binary) classification problem with 100 instances (rows): 95 instances are labelled Class-1 (genuine transactions) and the remaining 5 are labelled Class-2 (fraudulent transactions). This is an imbalanced dataset, with a Class-1 to Class-2 ratio of 95:5. Class imbalance can occur in two-class as well as multi-class classification problems, and it can be handled in various ways.
Many different aspects can be considered, and the right approach depends on the context, the dataset and the domain being analyzed. |
|
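The answer does not name specific techniques, so the following is only a minimal sketch of two commonly used ones, applied to the 95:5 example above: weighting classes inversely to their frequency, and randomly oversampling the minority class. The helper names `balanced_class_weights` and `random_oversample` and the placeholder `rows` are hypothetical and chosen for illustration; the weight formula `n_samples / (n_classes * count)` is the "balanced" heuristic that scikit-learn also uses.

```python
import random
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class inversely to its frequency: n_samples / (n_classes * count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until all classes match the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, count in counts.items():
        # Candidate rows belonging to this class
        pool = [row for row, label in zip(rows, labels) if label == cls]
        for _ in range(target - count):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# The 95:5 example from the answer: 1 = genuine, 2 = fraudulent.
labels = [1] * 95 + [2] * 5
rows = list(range(100))  # placeholder feature rows (hypothetical)

weights = balanced_class_weights(labels)
resampled_rows, resampled_labels = random_oversample(rows, labels)

print(weights)
print(Counter(resampled_labels))
```

With these weights, a misclassified fraudulent transaction costs far more than a misclassified genuine one when the weights are passed to a learner that supports per-class weighting. If oversampling is used instead, it should be applied only to the training split so that duplicated rows do not leak into evaluation data.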