1.

What is CRISP-DM? Explain its various stages.

Answer»

CRISP-DM stands for Cross-Industry Standard Process for Data Mining. It is a process methodology for data mining and data science projects. It has the following six phases:

  1. Business understanding – (Typical tasks are: Determine business objectives, Assess situation, Determine data mining goals, Produce project plan)
  2. Data understanding – (Collect initial data, Describe data, Explore data, Verify data quality)
  3. Data preparation – (Select data, Clean data, Construct data, Integrate data, Format data)
  4. Modelling or Model development – (Select modelling techniques, Generate test design, Build model, Assess model)
  5. Model evaluation – (Evaluate results, Review process, Determine next steps)
  6. Deployment – (Plan deployment, Plan monitoring & maintenance, Produce final report, Review project)

The phases are not strictly sequential: some are iterative, and work often moves back and forth between them. Most end-to-end data science projects or programs follow this methodology.
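To make the iterative flow concrete, here is a minimal Python sketch of the six phases and one loop-back. The names `Phase`, `TASKS` and `run_project`, and the rule "return to business understanding until evaluation passes", are illustrative assumptions and not part of the CRISP-DM specification.

```python
from enum import IntEnum
from typing import Dict, List


class Phase(IntEnum):
    """The six CRISP-DM phases, in their nominal order."""
    BUSINESS_UNDERSTANDING = 1
    DATA_UNDERSTANDING = 2
    DATA_PREPARATION = 3
    MODELLING = 4
    EVALUATION = 5
    DEPLOYMENT = 6


# Typical tasks per phase, as listed in the answer above.
TASKS: Dict[Phase, List[str]] = {
    Phase.BUSINESS_UNDERSTANDING: [
        "Determine business objectives", "Assess situation",
        "Determine data mining goals", "Produce project plan",
    ],
    Phase.DATA_UNDERSTANDING: [
        "Collect initial data", "Describe data",
        "Explore data", "Verify data quality",
    ],
    Phase.DATA_PREPARATION: [
        "Select data", "Clean data", "Construct data",
        "Integrate data", "Format data",
    ],
    Phase.MODELLING: [
        "Select modelling techniques", "Generate test design",
        "Build model", "Assess model",
    ],
    Phase.EVALUATION: [
        "Evaluate results", "Review process", "Determine next steps",
    ],
    Phase.DEPLOYMENT: [
        "Plan deployment", "Plan monitoring & maintenance",
        "Produce final report", "Review project",
    ],
}


def run_project(evaluation_passes_on_attempt: int = 2) -> List[Phase]:
    """Return the sequence of phases visited.

    Hypothetical rule for illustration: evaluation fails until the given
    attempt number, and each failure sends the project back to business
    understanding. Real projects decide this case by case.
    """
    visited: List[Phase] = []
    attempt = 1
    phase = Phase.BUSINESS_UNDERSTANDING
    while True:
        visited.append(phase)
        if phase is Phase.DEPLOYMENT:
            return visited
        if phase is Phase.EVALUATION and attempt < evaluation_passes_on_attempt:
            attempt += 1
            phase = Phase.BUSINESS_UNDERSTANDING  # loop back and iterate
        else:
            phase = Phase(phase + 1)  # move on to the next phase


if __name__ == "__main__":
    for step in run_project():
        print(step.name, "->", ", ".join(TASKS[step]))
```

Calling `run_project(1)` walks straight through the six phases, while a larger value repeats phases 1 to 5 before deployment. This mimics only one of the possible iterations; in practice teams also move back and forth between neighbouring phases such as data preparation and modelling.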

Below is a diagrammatic view for better understanding.


