Resources

Max Kuhn - The Post-Modeling Model to Fix the Model

The Post-Modeling Model to Fix the Model by Max Kuhn

Visit https://rstats.ai/nyr to learn more.

Abstract: It's possible to get a model that has good numerical performance but has predictions that are not really consistent with the data. Model calibration is a tool that can fix this. We'll show some examples of poor predictions and how different calibration tools can re-align them to the data.

Bio: Max Kuhn is a software engineer at RStudio. He is currently working on improving R's modeling capabilities. He was previously a Senior Director of Nonclinical Statistics at Pfizer Global R&D in Connecticut, where he applied models in the pharmaceutical and diagnostic industries for over 18 years. Max has a Ph.D. in Biostatistics and is the author of numerous R packages for techniques in machine learning and reproducible research. He and Kjell Johnson wrote the book Applied Predictive Modeling, which won the Ziegel award from the American Statistical Association, recognizing the best book reviewed in Technometrics in 2015. Their second book, Feature Engineering and Selection, was published in 2019.

Twitter: https://twitter.com/topepos

Presented at the 2023 New York R Conference (July 14, 2023)

Aug 4, 2023
18 min

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Thanks for waiting around for the second-to-last talk here at the conference; I appreciate that. Nobody will be surprised that I'm going to be talking about modeling. I think I said that last year too.

So the idea is this, something I've been working on lately. I should say up front that a person I work with at Posit named Edgar Ruiz did most of the programming on this; he and I worked together on it for about six months. The idea is the model behind the model to fix the first model, and hopefully that'll come across well here.

So I was trying to think of how I could start this. The idea is that you have a data set, you do all this work, and sometimes the data are very difficult and you find a small handful of models that, if you tune them just right, actually get pretty good performance, which is not really how it usually is. This particular data set had about 1,500 data points and 56 predictors, and a model that seemed to work pretty well was something called a naive Bayes model. It had an area under the ROC curve of 0.86, and there it is right there, so you think, oh yeah, this is really awesome, it's great. But a college professor of mine told me that the only way to be comfortable with your data is never to look at it, so then you start plotting things.

And what I did, as you'll see in a minute, is some resampling, and these are the out-of-sample predictions. That ggplot on the right basically has things faceted by the true class of the data. On the top you see that there's a big bar at 1 and a smaller bar at 0; it's basically bimodal, and the converse happens for the no-event data on the bottom. But it's kind of weird, it's a strange distribution you wouldn't expect to see, and in fact in a lot of cases when the model predicts incorrectly, it's really confidently incorrect.

So this is not good. The model is separating the classes well, but not in any realistic way once you start looking at the probabilities it generates. The way to think about this is that the model is not very well calibrated. Calibration essentially means that if you had an individual data point and its prediction is 80%, then, if you had a Marvel MCU way to alternate-reality this thing a thousand times, you would expect the actual event to occur about 80% of the time.

Visualizing calibration

All right, so before I continue, let me show some tidymodels code. I'm not going to go through all of it; the only things I want to highlight are that we did cross-validation using vfold_cv(), and at the bottom here is the code where we actually fit, or resampled, the model. You'll see this Bayes results object used; it is basically the results of cross-validation, so it contains all of our holdout predictions from each round of the 10-fold cross-validation. Just a little technical note.

So a while back, I wrote this package called probably, and probably was mostly a placeholder that had a few things in it: some tools for equivocal zones, and some tools to help you estimate appropriate probability thresholds for classification models. It sat for a while because we didn't really have time to do much with it, but we've done a lot with it over the last year. As I mentioned, Edgar and I worked on the calibration tools that are in probably, and that's what we're going to talk about. I also more recently added some conformal inference things to get prediction intervals, which is a hot research topic right now.

So we're going to look at calibration, and all this stuff is in the probably package, and the URL for this is at the bottom, so if you want to reproduce all this and see how it works, you can get the code.

All right, so let's look back at the histogram I had earlier. As I mentioned, you'd need an alternate reality to see if any one individual data point is well calibrated, but you can look at the collection of data points and find ways to figure out whether they are well calibrated. This is a really old technique, so it's nothing new. Those vertical dotted lines mark the data points with estimated probabilities of less than 10%. If we order the data by their probability of the event and, I hate binning, but if we bin them into 10 groups based on their estimated probabilities, then for the bin that runs from zero to 10%, the midpoint is 5%, and if the model is well calibrated, you expect about 5% of the actual labeled data in that bin to be the event of interest.
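The binning bookkeeping can be sketched in a few lines of base R; the simulated data here are made up just to show the mechanics, not the data from the talk.

```r
# Simulate predicted probabilities and outcomes that happen to be
# well calibrated, then compare each bin's midpoint to its event rate.
set.seed(1)
n <- 1500
prob_event <- runif(n)                  # hypothetical predicted probabilities
obs_event  <- rbinom(n, 1, prob_event)  # simulated 0/1 outcomes

# Ten equal-width probability bins: [0, 0.1], (0.1, 0.2], ...
bins <- cut(prob_event, breaks = seq(0, 1, by = 0.1), include.lowest = TRUE)
midpoints   <- tapply(prob_event, bins, mean)  # average prediction per bin
event_rates <- tapply(obs_event, bins, mean)   # observed event rate per bin

# For a well-calibrated model, these two columns track each other closely.
round(cbind(midpoint = midpoints, observed = event_rates), 2)
```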

And so what you can do is repeat this for 10 different bins and see if each lines up with what you would expect. This gives the typical calibration plot; you see the code on the left. The code is really boring. I wanted to have some really cool-looking pipe things, but we made the code pretty simple and optimized the function names for tab completion, so all these things start with cal_ for calibration. We have a bunch of plotting functions, and you can see on the right that the first dot on the left is the result of that bin I showed. If the model were well calibrated, that data point should sit on the diagonal, but it's not, so the model is really overestimating the probability. Then it meanders in the middle and gets a little better calibrated towards the end, but this is about as bad as it gets, honestly. And no shocker: I chose a naive Bayes model for this particular reason. With naive Bayes, if you have too many predictors and those predictors are highly correlated, which they are in this data set, you're likely to get this sort of bathtub-shaped or U-shaped distribution of your probabilities, so that wasn't an accident on my part.

So one issue with this plot is: well, that's 10 bins; should I have used 20, should I have used five? Another way to do this is to take 10% bins and use a moving window. We have that too, which I think is a little better than the other plot, and you can see that the bad trend is still there regardless of how we bin. And another way is to take the original data and build a logistic regression model: if the original model is well calibrated, the predicted probabilities from that logistic model should fall on the diagonal line. We have another function that does that using splines, so it's basically a generalized additive model, and you can see the same trend here with some confidence bounds on it.
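For reference, the three plotting helpers just described live in the probably package. This sketch assumes a data frame preds of held-out predictions with a factor column class (the truth) and a probability column .pred_event; those column names are illustrative, not from the talk, so adjust them to your own data.

```r
library(probably)

# Binned calibration plot (the "should I have used 10 or 20 bins?" version)
cal_plot_breaks(preds, truth = class, estimate = .pred_event, num_breaks = 10)

# Moving-window version, which avoids committing to one fixed binning
cal_plot_windowed(preds, truth = class, estimate = .pred_event)

# Model-based version: a spline/GAM logistic fit with confidence bounds
cal_plot_logistic(preds, truth = class, estimate = .pred_event, smooth = TRUE)
```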

So in other words, this model's performance is good in terms of separation, but in terms of probabilities, it's not great at all.

Calibration methods

So what do we do about that? Well, this wouldn't be my first choice; we were discussing this in the workshop the other day. My first choice would be to ask whether there's a model that can give me performance close to this but with good calibration properties. But if you can't find one, or if there's some reason this model is more efficient, or you just like it, you can build another model on top of it that will basically fix the problem. What all these tools do is try to estimate these trends. For example, you could take this logistic regression, and when you get new samples, if the curve shows that at some point you're overestimating the probability of the event by 50%, you can pull the prediction down by roughly that amount. You can coerce these probabilities to fall along that diagonal line.

And so one of the classic ways of doing this is logistic calibration, where you fit this model, and when you get new probabilities, you basically invert the relationship so the result actually complies with the event rate you think should be there. Another thing that's pretty useful is isotonic regression, which is a way to enforce monotonicity: this trend should be increasing, right? It shouldn't be wiggling around. Isotonic regression is a regression model that ensures a completely increasing relationship, or a completely decreasing one. One downside of isotonic regression is that if you have, say, 1,000 unique probabilities from your model, isotonic regression may convert that to something like 50 in the end, so it can discretize your model output.

So one thing that we did that seemed to help is just to bootstrap that, say, 50 times. If you use isotonic regression with a bootstrap, you can reduce the problem of it only giving you a few predicted values. And another tool: if any of you know much Bayesian analysis, the beta distribution is frequently used for data between zero and one, as the conjugate prior for the binomial, and there are tools that estimate certain quantities of beta distributions to fix this calibration problem. Even though I won't go into its details, we'll mostly look at the results of beta calibration.
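As a rough base-R illustration of these ideas (this is not the probably package's implementation, and the simulated data are made up): logistic calibration can be fit with glm(), isotonic regression with the base isoreg() function, and the bootstrap just averages isotonic fits over resamples.

```r
set.seed(2)
n <- 1000
raw  <- plogis(rnorm(n))                        # hypothetical raw probabilities
true <- rbinom(n, 1, plogis(2 * qlogis(raw)))   # events more extreme than raw says

# Logistic (Platt-style) calibration: regress the outcome on the raw
# probabilities on the logit scale, then use the fitted curve to rescale.
logit_cal <- glm(true ~ qlogis(raw), family = binomial)
calibrated_logit <- predict(logit_cal, type = "response")

# Isotonic regression: isoreg() fits a monotone increasing step function,
# which pools observations and so discretizes the output.
ord <- order(raw)
iso <- isoreg(raw[ord], true[ord])
calibrated_iso <- iso$yf[order(ord)]   # map fitted values back to input order

# Bootstrapping the isotonic fit ~50 times smooths out that discreteness.
boot_iso <- replicate(50, {
  idx <- sample(n, replace = TRUE)
  o   <- order(raw[idx])
  fit <- isoreg(raw[idx][o], true[idx][o])
  # evaluate each bootstrap fit at the original raw values
  approx(fit$x, fit$yf, xout = raw, rule = 2, ties = mean)$y
})
calibrated_boot <- rowMeans(boot_iso)
```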

And so one other question, as I was thinking about this, is: wait, what data are we using? Because we're always saying, don't re-predict your training set and start doing things with that. What we'll do here is use the holdout sets from cross-validation. Those predictions should be pretty accurate, and we'll use them to estimate the trends that we want to remove from the predictions. Ideally you'd like to pre-plan for this and, if you have enough data, hold some of it back just for calibration.

The first job I had was at a molecular diagnostics company. I worked on assay development, but I was also one of the people who built what they called the algorithms for the instruments, and "algorithms" may sound a little more sophisticated than what we actually did most of the time. What we would do is run clinical trials to collect all the data that was the substrate for our algorithms, and while we did that work, they would continually accrue samples that we would use for calibration, because when we went to the FDA we had to show them that the output was well calibrated with the actual event. So if you do have the chance to set aside data for calibration, that's probably the best thing you can do.

Measuring calibration with the Brier score

So then the other question is: how do we measure calibration? Clearly the ROC curve didn't do that. One thing you can do is use something called the Brier score, which has been around for quite a long time. If you look at its equation, it's almost like a sum of squared errors for binary data, and when I first saw it, I thought: what? Why would you do that? That doesn't seem like a good idea. But it turns out that there's a lot of theory behind it, and the nature of this statistical measure of performance is actually quite good. In fact, it's better than a lot of other things.

And so I've come around to the Brier score maybe being something we should use by default to measure how well classification models do. It's almost like an error estimate for classification. The best value you can get is zero, so you want to minimize it, and where the "oh God, this is awful" line sits depends on how many classes you have. For two classes, if you get a Brier score of about 0.25, you're not doing well. That's the situation we're in here. So that's sort of the line of disaster.
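For two classes, the Brier score is just the mean squared difference between the 0/1 outcome and the predicted probability of the event; a minimal base-R sketch (the example numbers are made up):

```r
# Brier score for binary outcomes: mean squared error between the
# observed 0/1 outcome and the predicted probability of the event.
brier_score <- function(truth, prob_event) {
  mean((truth - prob_event)^2)
}

truth <- c(1, 0, 1, 1, 0)

# An uninformative model that always predicts 0.5 scores exactly 0.25,
# which is why ~0.25 is the two-class "line of disaster".
brier_score(truth, rep(0.5, 5))                  # 0.25

# Confident, mostly-correct probabilities score much closer to zero.
brier_score(truth, c(0.9, 0.1, 0.8, 0.7, 0.2))   # 0.038
```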

Applying calibration in tidymodels

So all right, a little more tidymodels code. We can look at how these methods work, and we can actually do the beta calibration. But what we didn't want is for people to do the calibration and just start applying it to their data set, because we need a second way to evaluate whether the calibration works or not. We don't want them continually re-predicting the test set and things like that.

So what we did is we have these cal_validate functions for the different methods of calibration. You give one your original data in that first argument, and then optionally some metrics to measure. What it does is, in this case, since we did tenfold cross-validation, we have all those predictions, so it uses 90% of those predictions to estimate the beta calibration, applies that to the other 10%, computes the performance metrics, and then goes round robin. It's basically cross-validation for the calibration.
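That round-robin idea can be sketched in base R. Here plain logistic calibration stands in for the beta calibration in the talk, and the simulated predictions are made up; the point is just the fit-on-nine-folds, score-on-the-tenth bookkeeping.

```r
set.seed(3)
n <- 1500
raw  <- plogis(rnorm(n))                        # hypothetical held-out CV predictions
true <- rbinom(n, 1, plogis(2 * qlogis(raw)))   # simulated miscalibrated outcomes
fold <- sample(rep(1:10, length.out = n))       # assign each prediction to a fold

brier <- function(y, p) mean((y - p)^2)

# For each fold: fit the calibration on the other nine folds,
# apply it to this fold, and compute the metric before and after.
scores <- sapply(1:10, function(k) {
  train <- fold != k
  fit   <- glm(true[train] ~ qlogis(raw[train]), family = binomial)
  new_p <- plogis(coef(fit)[1] + coef(fit)[2] * qlogis(raw[!train]))
  c(uncal = brier(true[!train], raw[!train]),
    cal   = brier(true[!train], new_p))
})

rowMeans(scores)  # cross-validated Brier score, before vs. after calibration
```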

And when you use the collect_metrics() function, it gives you the results for both the Brier score and the ROC curve. You can see the Brier score goes from about 0.21 before calibration to about 0.145 after. Numerically, that's a nice improvement, roughly 30% better. You might notice the ROC curve doesn't really change at all, even though the standard error is a little different. Most of these calibration methods don't really change the rank order of the predictions; they change the probability values, but usually in a fairly monotone way.

So it doesn't surprise me that things like accuracy, or maybe not accuracy, but the log-likelihood or the area under the ROC curve, aren't drastically affected, or affected at all, by calibration. The thing we care about, the calibration, looks like it has improved, and we haven't hurt the separation aspect of the model by doing this.

And so we have these validation functions for all the different methods. Typically, although I didn't do it here, you would go round robin and see which one you like, or which one gives you the best performance using resampling, and that's probably the best way to choose. In this case, beta calibration did the best. What we do in the last line there, line 12, is a final estimate: it takes the entire set of held-out training predictions, computes all the beta calibration statistics from that data, and that's what you use going forward whenever you process the test set or new samples.

So looking at the test set, this is sort of the manual way to do it in tidymodels. You fit your naive Bayes model on the entire training set, and then augment() makes the predictions and attaches them to the test set, so we have a data frame there on the left with the raw predictions from the naive Bayes model. You can see our statistics are very consistent with what the resampling gave us: the Brier score is about 0.22, which is verging on disaster, and the ROC curve is actually pretty good.

When we want to apply the calibration, we have a function called cal_apply(): you give it any of the calibration objects along with your predictions, and you get the fixed version of those probabilities. When we compute the same statistics now, the Brier score goes down to 0.15; we estimated it from resampling to be about 0.14, and again the ROC curve looks pretty good. These are the test set results. And if we look at the plots, it looks really well calibrated at this point. It's a little off in a few places, but it's head and shoulders above what we had earlier.
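Put together, the estimate-then-apply steps look roughly like this with the probably package. The cv_preds and test_preds data frames and their column names are assumptions for illustration: cv_preds is the held-out cross-validation predictions, test_preds is the augmented test set, and both are assumed to contain a factor column class and the usual tidymodels .pred_ probability columns. brier_class() is the yardstick metric.

```r
library(probably)
library(yardstick)

# Fit the final beta calibration on all of the held-out CV predictions
beta_cal <- cal_estimate_beta(cv_preds, truth = class)

# Apply that calibration to the raw test-set predictions
test_calibrated <- cal_apply(test_preds, beta_cal)

# Compare the Brier score before and after calibration
brier_class(test_preds, truth = class, .pred_event)
brier_class(test_calibrated, truth = class, .pred_event)
```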

And so basically, that's the gist of how you do this in tidymodels. Somebody in the workshop asked me if I've ever done this, and I said, well, no, I've never done this for other models, but then I've never really had the tools to do it either. So now you have tools to do this when it works well.

We also have tools for multiclass models, with three or more classes, and also for regression models. So if you have a regression model where you plot the observed versus predicted values and they sort of fan out at the end, and you can't find a way to fix that, then, as maybe not a last resort, you could use a similar approach to unbend that curve using calibration.

What's next: post-processing in workflows

So now you have this calibration object and your original model fit. The code I showed is how you would apply the calibration right now, and that's a little more work than we want you to have to do.

So what we're doing next involves these objects in tidymodels called workflows. You can think of them as being like scikit-learn pipelines, but Hadley was like, no, don't use the word pipeline. What workflows let you do is bind together a preprocessor, like a recipe, Emily talked about recipes a little earlier, and whatever model you're trying to fit, and combine them into one object.

And so next on our list is to add post-processing objects to the workflow. Whether it's calibration, optimizing your probability threshold for a two-class system, or a variety of other things you might do to your predictions after you fit your model, you'll be able to attach something like a calibration object. Then when you use the fit or predict functions at the very end, you don't have to do the extra step of holding on to a separate object to transform your probabilities. It'll be built into the workflow, and that'll be a nice little usability feature for people.

We're in a bit of minutiae here. We try to convince everybody to resample everything, right? That's our mantra: if you want good performance statistics, you have to resample everything. So when you use a workflow and you resample it, if the workflow does any interesting estimation, like feature hashing or some complicated preprocessor, we want to make sure that gets executed freshly each time during resampling, just the same way the model does.

And for the post-processor, that's mostly true, but we're not exactly sure yet how to make that happen with calibration. When we cross-validated the calibration earlier, we used the 90% of the predictions that were held out to fit the calibration, then applied it to the 10% most recently held out and computed performance. Normal cross-validation for models and preprocessors is a little bit the opposite: during regular cross-validation, you only have 10% held out at any time, so you don't have the other data to fit the calibration and then measure it. It might take a little work for us to get this going with resampling, but I think calibration in workflow objects should land, hopefully, before the end of the year.

So that's what's on deck. Thanks for sticking around. And just to remind you, Edgar Ruiz did most of the grunt work on this, and so we appreciate all the help that he gave.