9 Monitoring

Why do we need monitoring?

Logging vs Monitoring

Model monitoring happens at two different levels:

9.1 Drift for univariate features

  • Continuous data: use a statistical test such as the Kolmogorov-Smirnov (KS) test to compare the distribution of a feature in the training data with its distribution in production data.
    • The two-sample KS test statistic is the maximum absolute difference between the two samples' empirical cumulative distribution functions (CDFs).
  • Categorical data: use the chi-squared test to compare the distribution of a categorical feature in the training data with its distribution in production data.
    • The chi-squared test compares the observed category frequencies in production data with the expected frequencies derived from the training data (training proportions scaled to the production sample size).
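
The two tests above can be sketched in Python with SciPy. This is a minimal illustration, not a prescribed implementation: the function names `ks_drift` and `chi2_drift`, the 0.05 significance level, and the choice to raise an error on categories unseen in training are all assumptions made for the example.

```python
from collections import Counter

import numpy as np
from scipy import stats


def ks_drift(train_values, prod_values, alpha=0.05):
    """Two-sample KS test for one continuous feature.

    Returns (statistic, p_value, drifted). `alpha` is an assumed threshold.
    """
    result = stats.ks_2samp(train_values, prod_values)
    return result.statistic, result.pvalue, bool(result.pvalue < alpha)


def chi2_drift(train_labels, prod_labels, alpha=0.05):
    """Chi-squared goodness-of-fit test for one categorical feature.

    Observed counts come from production; expected counts are the training
    proportions scaled to the production sample size.
    """
    train_counts = Counter(train_labels)
    prod_counts = Counter(prod_labels)

    # A category never seen in training has expected frequency zero, which
    # the test cannot handle. Raising here is an assumed policy.
    unseen = set(prod_counts) - set(train_counts)
    if unseen:
        raise ValueError(f"categories not seen in training: {sorted(unseen)}")

    categories = sorted(train_counts)
    observed = np.array([prod_counts.get(c, 0) for c in categories], dtype=float)
    train_freq = np.array([train_counts[c] for c in categories], dtype=float)
    expected = train_freq / train_freq.sum() * observed.sum()

    result = stats.chisquare(f_obs=observed, f_exp=expected)
    return result.statistic, result.pvalue, bool(result.pvalue < alpha)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    train = rng.normal(loc=0.0, scale=1.0, size=2000)
    prod = rng.normal(loc=1.0, scale=1.0, size=2000)  # mean shifted by 1
    print(ks_drift(train, prod))

    train_cat = ["a"] * 500 + ["b"] * 500
    prod_cat = ["a"] * 190 + ["b"] * 10
    print(chi2_drift(train_cat, prod_cat))
```

With large production samples both tests flag even very small differences as significant, so in practice the test statistic itself is often tracked alongside the p-value.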

Kinds of drift detection tests