Resources

Isabel Zimmerman | Demystifying MLOps | Posit (2022)

Data scientists have an intuition of what goes into training a machine learning model, but building an MLOps strategy to deploy that model can sound daunting for data science teams. Model services are not one-size-fits-all, so it is imperative to know the range of tools available. One option, Vetiver, is a framework for R and Python created to make model deployment feel like a natural extension of a data scientist’s skill set. This talk offers a high-level overview of the MLOps options available for model operationalization, and also shows a practical example of an end-to-end MLOps deployment of a model-aware REST API using Vetiver.

Session: Updates from the tidymodels team

Oct 24, 2022
18 min

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

All right. Hello, everyone. I am Isabel Zimmerman. I am here to demystify machine learning operations. Mostly because machine learning operations are very hard. So infamously hard that in a past life, I was deploying a lot of models, and I found these tools so unfriendly for data scientists that I left that job, came to the company previously known as RStudio to build tools for data scientists to deploy their models. But maybe more important than this fact is I really love baking. I love baking. I love eating chocolate chips by the handful. I find them to be so delicious, especially like the dark chocolate chunks. Those are my favorite. And as many chocolate chips as I eat, it's never as good as the cookie at the end of my baking session.

And machine learning models are a lot like chocolate chips. They're really good on their own, but they're never as good as they could be, and you lose a lot of their value, when they're not deployed as part of some sort of larger system.

What is MLOps?

So what is MLOps? It's a set of practices to deploy and maintain machine learning models in production reliably and efficiently. It's a set of guidelines to help your model live outside of a notebook. So what are some MLOps practices? If we were in the kitchen together, I'd tell you to write down your recipe and remember to put your cookies in the oven and don't just eat the cookie dough and make sure you don't burn your cookies at the end. But in the data science world, this looks a little different. Some practices that you might want to keep in mind are to version your model, to deploy your model and to monitor your model.

If you were at the keynote earlier, this might look pretty familiar, except I like to think my version is a little more delicious. So it's important to realize that MLOps is not a disjoint piece of this data science life cycle, where you build your model, throw it off to your IT folks, give them a thumbs up, and tell them to deal with it. No, this is our job, too. We collect the data. We understand and clean our data with fantastic tools like the tidyverse or data.table. If you use Python like I secretly do, you might use something like siuba or pandas or NumPy. Once you've cleaned and understood your data, it's time to train and evaluate your model. Once again, we have great tools for this, like tidymodels, torch, and caret, and on the Python side, we have scikit-learn, PyTorch, and XGBoost. Great tools. But then it gets a little hazy. You know, what comes next? And that's where vetiver steps in.

So vetiver is offered in both Python and in R, which means you can pip install vetiver or install.packages("vetiver"), and it helps you do these things of versioning, deploying, and monitoring a model.

Versioning your model

So we'll start with versioning. How do we track and manage change? When I bake, I might try out a bit of a new recipe, like throwing in some gluten-free flour, or, oops, I bought the wrong ingredients and I'm going to have to make it work. And I'll write down my changes so I can reproduce it later. But in the data science world, it looks more like this: you save a model. You do your training and your tuning and all of this other fun stuff we've heard so much about, and you end up with your final model. And then you get some new data, or you realize you did something wrong, and you have your, like, actual final model. But this is not over, because models are living objects and there's still more to do, and now we have our actual final model, for real this time. So this lacks context, and it's not scalable.

We can see it gets kind of crazy for one model, so what happens when you have 5 or 10 or 50 or 100? Versioning is useful for tracking changes across time, so you're not naming things model 1, 2, 3, 17. But it's also important if you have different implementations, say a staging server and a production server. If you have multiple models, strong versioning procedures let you take them down, put them up, or roll back to old versions very easily. Really, any time you have multiple models, it's important to have a strong structure around your versioning procedures.
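
The workflow described above can be sketched with a toy registry, a hypothetical stand-in for what pins and vetiver actually provide: every write gets an immutable version id plus some metadata, and rolling back is just pointing at an old version. All names here are made up for illustration.

```python
import hashlib
import pickle
from datetime import datetime, timezone

class ToyModelRegistry:
    """A toy stand-in for a pins-style model board: every write gets
    an immutable version id, and rollback is just a lookup."""

    def __init__(self):
        self._versions = {}   # version id -> (metadata, payload)
        self._active = None   # version currently "in production"

    def write(self, name, model):
        payload = pickle.dumps(model)
        version = hashlib.sha1(payload).hexdigest()[:10]
        self._versions[version] = (
            {"name": name,
             "created": datetime.now(timezone.utc).isoformat(),
             "size_bytes": len(payload)},
            payload,
        )
        self._active = version
        return version

    def read(self, version):
        _, payload = self._versions[version]
        return pickle.loads(payload)

    def rollback(self, version):
        if version not in self._versions:
            raise KeyError(f"no such version: {version}")
        self._active = version

    @property
    def active(self):
        return self._active

registry = ToyModelRegistry()
v1 = registry.write("mpg_model", {"coef": 1.0})
v2 = registry.write("mpg_model", {"coef": 1.3})  # retrained model
registry.rollback(v1)  # new model misbehaving? roll back instantly
```

The point is that "model 1, 2, 3, 17" filenames carry none of this: no timestamps, no sizes, no guaranteed-unique identifiers, and no cheap rollback.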

So in vetiver, we actually have a secret weapon called pins. And pins is, once again, available in R and Python. So you'll load this package and you start with a model board. Boards hold, organize, and create metadata for almost any object, and when you use them in conjunction with vetiver, you get some special features. But before you get to look at those special features, you also have to make a vetiver model, which is a deployable model object. You make a vetiver model from your trained model, and you can give it some other information, like a name, or maybe an input data prototype. That's a little bit of training data, so that when you deploy your model, it knows what to expect from your incoming data. So we have our model board, board, and our vetiver model, v, and then we can call vetiver_pin_write() to write v to our board, and your model is versioned. And it's also really cool because this can live on many different platforms. You can make a board on AWS, maybe, or on RStudio Connect, so you can collaborate and bring these models into memory even when your teammate is pushing them up to your board from a different server.

So this helps with scale, but what about context? If we want to look at the model we just pinned, we can see some very descriptive information: what type of model we created, its size, its hash, its title, and we can also peek into that data prototype to see what this model is expecting, as well as how to run the model with its required packages.

Deploying your model

So we have some infrastructure to version our model, and now it is time to deploy. When we were building out vetiver, we looked at many different ways you can deploy a model, but not all of them are made equal. So we'll start with a few different flavors. One way you can deploy a model is as XML, with PMML. And if I were baking, I would tell you that this is a lot like baking over an open flame. Fires are notoriously portable, but they're not very customizable in temperature. So while XML can be put literally anywhere on the internet, PMML only gives you a few options for model types, and we've seen today that there are so many amazing ways to build a model; we don't want to restrict data scientists. So we knew PMML was not the right place.

We also peeked into, you know, deploying into databases. So this is kind of like baking cookies in a waffle iron. It works really well for some recipes, but others you kind of end up with this weird half-baked goopy mess. So deploying in databases is great for people who have a workflow based around a database, and, you know, you have easy access to it, but this isn't the best case scenario for every team, and we wanted to give as many people an option to deploy their models as possible.

So if you're a baker, you're probably not carrying around matches, and you're probably not carrying around a waffle iron either. You're probably best friends with your oven, best known for its predictable temperatures and easy-to-use interface that make baking a great experience for all skill levels. REST APIs are the same way. They are interfaces that connect to many different types of applications in a very standardized way. Any model that you can write, you can deploy into a REST API. You can interact with it in the browser to debug your API, and you can deploy and run it locally on your computer, on an on-prem server, or on AWS SageMaker. If you're interested in AWS SageMaker, there is a birds-of-a-feather session about vetiver and SageMaker. Or maybe you're interested in deploying to Connect, and there's another talk, called Yes, You Can Use Python with RStudio Team, that shows vetiver inside an amazing integrated application that's all deployed on Connect.

All right, we have one last reason to be excited about APIs. They are really accessible for all skill levels, so maybe your teammate who isn't as modeling-obsessed as present company can still discover and explore and interact with your model, and they don't even have to download R or Python.

This is a lot of hype, so how do we even deploy this model? On the R side, you start up your plumber router, pipe it into vetiver_api(), and plug in our vetiver model v from earlier. On the Python side, once again, you create a VetiverAPI and plug in your model v. And once you have run that, you get prompted with this great visual documentation that lives at your API. It gives you a little bit of metadata, like the name, the version of vetiver you're deploying with, what kind of model you're using, as well as where this API is running, so you can see this is a local server. If you scroll down, there's also a ping endpoint to see if your model is up and running. If you keep scrolling, you can see there's a predict endpoint where you can actually interact with this model, so we see I'm making a prediction, and then I can edit some things and rerun it in the browser. And I was typing slower than I'm talking right now, but we can try, and, voila, our prediction changed. We can see response headers and curl request information to make your IT people happy.
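
To make the shape of a model-aware REST API concrete, here is a minimal sketch using only the Python standard library. This is not vetiver's actual implementation (which builds on FastAPI and plumber); it only mimics the two endpoints described above, a ping for health checks and a predict that accepts JSON rows. The model coefficients are invented for illustration.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# A hypothetical "model": predicted mpg falls as vehicle weight rises.
def predict_mpg(wt):
    return 37.3 - 5.3 * wt

class ModelHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Health-check endpoint, like the ping endpoint the talk shows.
        if self.path == "/ping":
            self._reply(200, {"status": "online"})
        else:
            self._reply(404, {"error": "not found"})

    def do_POST(self):
        # Prediction endpoint: accepts JSON rows, returns predictions.
        if self.path != "/predict":
            self._reply(404, {"error": "not found"})
            return
        length = int(self.headers["Content-Length"])
        rows = json.loads(self.rfile.read(length))
        preds = [predict_mpg(row["wt"]) for row in rows]
        self._reply(200, {"predict": preds})

    def _reply(self, code, body):
        payload = json.dumps(body).encode()
        self.send_response(code)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def log_message(self, *args):  # keep the demo quiet
        pass

# Serve on a free local port in a background thread, then call it.
server = HTTPServer(("127.0.0.1", 0), ModelHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
base = f"http://127.0.0.1:{server.server_port}"

req = urllib.request.Request(
    f"{base}/predict",
    data=json.dumps([{"wt": 2.62}, {"wt": 3.44}]).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    predictions = json.loads(resp.read())["predict"]
server.shutdown()
```

Because the interface is just HTTP and JSON, any client, a browser, curl, or another notebook, can consume the model the same standardized way.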

But if you're not interested in looking at this endpoint too much, and you just want to interact with a model that somebody else has deployed, or that you have deployed, you can also just stay inside the notebook you're already in. You use predict, just like with all of these tidymodels packages, so it's an expected function; give it the endpoint where your model is running, and the data that you'd like to predict with. Here we can see I'm doing batch predictions on mtcars, so you don't have to leave your computational environment if you don't want to.

Monitoring your model

So, we have versioned our model, we've deployed our model, and our cookies are baking in our API oven, but we have to keep an eye on them so they don't burn. It's important to monitor a few different things. The first is data drift: does your data look the same today as it did two months ago? You also want to monitor for model drift, which is when your model's performance metrics start to decay, and this is so, so important to track. Models fail silently: a model will continue running without error even if, you know, its accuracy is zero. If you're not monitoring your model in some way, you are oblivious to model decay in production.
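
A crude data-drift check can be sketched in a few lines: compare today's feature distribution against the one the model was trained on, and flag it when the mean has moved far more than sampling noise would explain. This is a hypothetical illustration, not vetiver's method; real monitoring would use richer statistics and more data.

```python
from statistics import mean, stdev

def mean_shift_zscore(baseline, current):
    """Crude drift signal: how many baseline standard errors the
    current mean has moved away from the baseline mean."""
    standard_error = stdev(baseline) / len(baseline) ** 0.5
    return abs(mean(current) - mean(baseline)) / standard_error

# Hypothetical feature values: vehicle weights seen at training time
# versus the weights streaming in today.
training_wt = [2.6, 3.2, 2.9, 3.4, 3.1, 2.8, 3.0, 3.3]
todays_wt   = [4.1, 4.4, 3.9, 4.6, 4.2, 4.0, 4.5, 4.3]

drift_score = mean_shift_zscore(training_wt, todays_wt)
drifted = drift_score > 3.0  # flag a shift beyond ~3 standard errors
```

Here today's weights sit well outside the training distribution, so the check fires; a model scoring this data silently extrapolates unless something like this is watching.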

A good example of this is, you know, I listened to way more Jonas Brothers in 2012 than I do now, but my Spotify algorithm doesn't still suggest that I listen to Jonas Brothers because my tastes have adapted. If this, you know, recommendation algorithm hadn't kept up with my changing choices, they would have lost a customer.

Finally, it's important to know what to do when things are going wrong. If your model is declining, the answer might be retraining with new data, and this works in a lot of use cases, or you might need to try out a new model type altogether. But it all goes back to this versioning: if you have strongly versioned models, and your API is connected to a version, it makes it so much easier to take down and put up new models without much pain on your end.

So, in vetiver, we are going to start by computing metrics. This takes a data frame that has a few different columns. We want to give it the date column, which is the date the prediction was made. We also want to give the time frame we're aggregating over; for this one, we're looking at one week, so maybe this model has been running for years and years and years, but we want to look at it one week at a time. Then we want to give the actual miles per gallon, the truth value, and we want to give the predicted value as well. You can send in different metric sets. R does this secretly behind the scenes for you, but you can also customize them. On the Python side, you have to write a little bit of "I want to use RMSE" and send it in as a list.
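
The aggregation described above, bucket predictions into fixed windows by the date they were made, then score each window, can be sketched with the standard library. This is a simplified illustration in the spirit of vetiver's metric computation, not its actual code; the prediction log values are invented.

```python
import math
from datetime import date, timedelta

# Hypothetical prediction log: (date prediction was made, truth, estimate).
log = [
    (date(2022, 10, 3), 21.0, 20.1),
    (date(2022, 10, 5), 22.8, 23.5),
    (date(2022, 10, 10), 21.4, 19.0),
    (date(2022, 10, 12), 18.7, 16.0),
]

def windowed_rmse(log, period=timedelta(weeks=1)):
    """Bucket predictions into fixed windows and compute RMSE per window."""
    start = min(d for d, _, _ in log)
    buckets = {}
    for d, truth, estimate in log:
        window = start + period * ((d - start) // period)
        buckets.setdefault(window, []).append((truth, estimate))
    return {
        window: math.sqrt(sum((t - e) ** 2 for t, e in pairs) / len(pairs))
        for window, pairs in sorted(buckets.items())
    }

metrics = windowed_rmse(log)
```

With this toy log, the second week's RMSE is noticeably worse than the first's, which is exactly the kind of trend a running log of metrics is meant to surface.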

So, we've computed our metrics, and then we want to pin our metrics. This is, once again, very important: having a running log of your model's performance. With vetiver_pin_metrics(), you pass in the metrics you've just created, you give it a name for your pin, and, if you have overlapping dates, you can choose to overwrite them and let vetiver do all the heavy lifting of dealing with date columns. Finally, this is the beauty of it all: you get an out-of-the-box function to plot your metrics.

Best practices and wrap-up

So, putting it all together, some best practices for deployment are versioning your model, deploying your model, and monitoring your model. But there are more best practices to keep in mind. One of them is model cards. These are partially automated R Markdown templates that come inside the vetiver package, where you can help other people on your team better understand your model. You can document the environmental, technical, or ethical factors that you thought so hard about when you were developing your model.

There are also things to help with data validation. There's something called an input data prototype, or ptype, in vetiver that will validate your data at prediction time, at the API level. Because sometimes the world messes up your data, and as it streams into your API, it does not look the way your model thought it was going to. And vetiver is able to give you helpful error messages to make sure you can identify these points of failure very easily.
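
The idea of prototype-based validation can be sketched as follows: capture the expected field names and types from training data, then check every incoming row against them and fail with an error that names the exact point of failure. The prototype and field names here are hypothetical, and this is a simplified stand-in for what a vetiver API does.

```python
# Hypothetical input prototype: field name -> expected type,
# captured from a slice of the training data.
PROTOTYPE = {"cyl": int, "wt": float, "hp": int}

def validate(row, prototype=PROTOTYPE):
    """Check one incoming row against the prototype, raising a
    helpful error that names the exact field that failed."""
    missing = sorted(prototype.keys() - row.keys())
    if missing:
        raise ValueError(f"missing fields: {missing}")
    for field, expected in prototype.items():
        if not isinstance(row[field], expected):
            raise TypeError(
                f"field {field!r}: expected {expected.__name__}, "
                f"got {type(row[field]).__name__}"
            )
    return True

ok = validate({"cyl": 6, "wt": 2.62, "hp": 110})   # well-formed row
try:
    # The world messed up wt: it arrived as a string, not a number.
    validate({"cyl": 6, "wt": "2.62", "hp": 110})
    error = None
except TypeError as exc:
    error = str(exc)
```

Failing loudly at the API boundary like this is much friendlier than letting a malformed value flow into the model and produce a confusing error, or worse, a silent wrong prediction.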

And there are many more best practices to keep in mind. But our cookies are done baking, so it must be the end of our time. And I challenge you all to think about what you can do with your own data science workflow. Where are you or your team today, and where can you add in MLOps best practices to make it a little better? I also encourage you to check out vetiver in Python or R. And if you have any questions, Julia Silge and I and the rest of the vetiver team and tidymodels team will be right outside after my talk. Thank you.

Q&A

Thank you, Isabel. We have time for a few questions. So, you said briefly, but how well does vetiver play with RStudio or Posit Connect? So well. There are really easy one-liners, like vetiver_deploy_rsconnect(), that will ship your model off into the Connect world.

Great. Can you deploy multiple vetiver models to different routes on the same API server? It depends on what you're doing; there are a few different ways to approach this question. One, you can have multiple different APIs running at once, which might be a better solution than trying to serve multiple models from the same API at different endpoints. You can also write custom endpoints. So maybe you want your model at one level, and then some preprocessing, or, you know, multiplying by 100 to get a percentage or something like that. That is also part of vetiver.

Great. Does vetiver work with any Python packages besides scikit-learn and Torch? So, currently those are our two supported packages for deploying models out of the box. But you can deploy any model because we have kind of an escape hatch of a custom handler. So, if you know how your model needs to make predictions, there's documentation on the Python side. I think it's like advanced usage custom handlers. Click on that and it will give you all the information you need.

One last quick question. What is the backend for deploying a REST API on the Python side? That's FastAPI. I'm a big fan. Okay. Thank you, Isabel. Thank you all.