Resources

Isabel Zimmerman - Practical MLOps for better models | PyData Global 2022

Machine learning operations (MLOps) is often synonymous with large and complex applications, but many MLOps practices help practitioners build better models, regardless of the size. This talk shares best practices for operationalizing a model and practical examples using the open-source MLOps framework vetiver to version, share, deploy, and monitor models. PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States.


Transcript#

This transcript was generated automatically and may contain errors.

Hello, everyone. I am Isabel Zimmerman. I work for a company called Posit. If you're wondering where you've heard that before, there was a talk from my colleague Hadley Wickham yesterday. He was one of the keynotes. We are all about R and Python and thinking about multilingual teams and how to make their lives easier. I specifically work with MLOps and I kind of started my career as a software engineer slash data scientist and I worked a lot deploying models with Kubernetes.

Of course, if you know anything about Kubernetes, it's a little frustrating, so I decided to take out some of my Kubernetes stress by teaching my dog silly tricks. The first trick was teaching him to sit. When I first taught him to sit, I'd call him to stand right in front of me, I'd tell him to sit, he'd figure it out, and I'd give him a treat. But then I started taking him on walks and bringing him out into the real world, and, you know, he's walking next to me on my side, and I would tell him to sit and he had no idea what I meant. And, of course, the data scientist in me is like, oh, I overfit my model. I overtrained my dog. Because I realized that I'd only ever trained him to sit in front of me, never off to the side.

And, you know, that's kind of a hard lesson for a data scientist to learn, but all I knew was I was training for the right outcome to sit. And in my cozy living room, it totally made sense and it worked really well. He knew my task and he would sit on command. But when we went out into the real world on our silly little walks, there was a new set of challenges and he behaved differently. I realized I needed to, you know, expand my tool set and expand my mindset for how to train my dog. But this is not a dog training conference.

The real world value of models oftentimes comes from integrating them into some larger ecosystem. So, my advice for you is to bring your model on a walk. You can learn to operationalize a model using practices called MLOps. And MLOps is a set of practices to deploy and maintain machine learning models in production reliably and efficiently. And these practices can be hard. Especially with the Kubernetes-based deployments I had started out with, I felt like with a lot of the tools I was using, I had to be kind of a cloud architect as well as a data scientist.

The real world value of models oftentimes comes from integrating them into some larger ecosystem.

And I don't think that data scientists should be oblivious to everything in the DevOps or MLOps world. But there's definitely a space for tools that help data scientists more effectively communicate with their IT or DevOps teams. And these tools can still feel ergonomic for data scientists. So, I actually changed career paths. And a package called vetiver was made. Vetiver is an MLOps tool for data scientists specifically to help them version, monitor, and deploy models in production in both R and Python.

The data science workflow gap

And we thought it was really important to build a tool like this because when you start learning about data science, you see an image that looks something like this. You collect data. You understand and clean your data. If you're using R, that's with tools like the tidyverse or data.table. If you're using Python, that's using tools like pandas or NumPy or siuba. And then you get to training and evaluating your model. Once again, in R you're using tools like tidymodels, or in Python, scikit-learn or PyTorch. And these different tools have best practices built in in a really ergonomic way.

If you're a data scientist, you write code that looks probably something like this. You're going to load in your data and, you know, select columns or whatever. But in almost every data science script, you have a line at the top that sets the random seed for reproducibility of your model. And this feels good. People know how to use it. They know it's a best practice of data science. When you get to training your model, you split it into test and training sets. So, you're not giving your model the answers to the questions you're about to ask it beforehand. And this is built into the tools we use. And it feels good and it's ergonomic and we know that this is a best practice.
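As a sketch of those two habits, assuming scikit-learn and pandas are available (the column names and numbers here are invented for illustration, not taken from the talk):

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

np.random.seed(123)  # set the random seed so the whole run is reproducible

# Invented housing-style data standing in for whatever you loaded
df = pd.DataFrame({
    "sqft": np.random.randint(500, 4000, size=100),
    "beds": np.random.randint(1, 6, size=100),
    "year_built": np.random.randint(1900, 2022, size=100),
})
df["price"] = df["sqft"] * 150 + np.random.normal(0, 10_000, size=100)

# Hold out a test set so the model never sees the answers
# to the questions we're about to ask it
X_train, X_test, y_train, y_test = train_test_split(
    df.drop(columns="price"), df["price"], test_size=0.2, random_state=123
)
```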

And then you get into the actual, like, modeling part of your code. And you think about the right feature engineering for the job. And you think about how to train your model. And maybe putting these together in a pipeline to make sure that they're all, you know, organized and running in the right order. And this feels ergonomic and it's part of the tools you're using. But oftentimes, this is where you start. You know the data science code, but then you realize that working with a larger team, there's more to data science than training models.
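A minimal version of that pattern in scikit-learn, again on invented data, puts a feature engineering step and the model together in one Pipeline object so the steps always run in the right order:

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

np.random.seed(123)
X = pd.DataFrame({
    "sqft": np.random.randint(500, 4000, size=100),
    "beds": np.random.randint(1, 6, size=100),
})
y = X["sqft"] * 150.0  # toy target for illustration only

# Feature engineering and the model live in one object, so the same
# preprocessing runs at training time and at predict time
rf_pipe = Pipeline([
    ("scale", StandardScaler()),
    ("rf", RandomForestRegressor(n_estimators=50, random_state=123)),
])
rf_pipe.fit(X, y)
preds = rf_pipe.predict(X)
```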

You have to think about how you're sharing these models with others in a way that's not emailing joblib files back and forth. You have to think about how maybe this needs to be integrated into a larger application: how are you going to get the model from your local environment to there? And how do you make sure that this model is still performing well a month from now, or even a week from now? And you realize that these ergonomic practices and the tools that you love to use aren't quite enough. And that's where machine learning operations comes in. The project I work on helps you version, deploy, and monitor models, specifically to help complete that circle.

MLOps components

So, what are some pieces used in MLOps? Most open source projects fall into one, or maybe multiple, of these pieces: orchestration, experiment tracking, versioning, serving, and monitoring. Things like orchestration are going to put your models in larger pipelines. Experiment tracking is going to make sure all the pieces that are going into the model are organized. Versioning, serving, and monitoring we'll go into a little bit deeper right now.

So, vetiver helps with versioning, serving, and monitoring. And when people think about versioning, a lot of times you're thinking about Git. But we actually version a lot of things, and mostly very badly. Think about making a model: you train it, and you save it as model. Doing a little bit more work with it, you have model final. And you feel pretty good. But maybe you get new data, or you want to try out a new algorithm and it works better, and then you realize there's actually another iteration. And you can see how this data science workflow of versioning locally might not be the right fit.

We can see that this model final, model final final naming doesn't scale for one model, let alone tens or hundreds of models. If we're thinking about what we want out of versioning, it would be really nice if these could all live in a central location that we can access, and maybe our teammates can too. And especially if they could load right into memory, and have a little bit more context, so I don't have to guess what model final really means.

Versioning with pins and vetiver

And there's actually a secret weapon that the project I work on uses. It's another open source library called pins, which is also available in Python and R, and it helps you organize and store your models. It does this by creating a board. The mental model for a board is that it's a place for your models: you can pin your models onto your board. So, it's creating a place for models to be stored. We can see here this is a temporary board, but this could also be board_s3 for Amazon S3, board_azure, board_gcs for Google Cloud Storage, or board_connect for Connect, which is Posit's homegrown pro product.

So, now that we have a space for our model to go, we can create a vetiver model object. This is a deployable model object, because there's a lot of information that you have at training time that you can store that's useful for your model later. Here, all we're going to put in is the random forest pipeline we made before, and we're going to give it a name, ads. From there, we can vetiver_pin_write to our temporary board: we're going to put our vetiver model, v, onto the temporary board.

And with that, we have some more context as well. Not only is this organized within the board for us automatically, we can see a description of the model. We can see the size of it, what kind of file it is, and some other required packages. And if we wanted to take this a step further, we could even give our model more context. So, we could save a tiny little piece of our training data to better debug when things go wrong in production.

If you think about it like a puzzle, it's a lot easier to make a puzzle when you know what the finished product is supposed to look like. And this is essentially giving that finished product to our model. And it's pretty easy to add this in. This is the same line of code as before, but we just have one more argument: ptype_data, for prototype data. And we're going to give it our X train.

Model cards

But we also want to think more holistically about tracking and versioning models. And there's the idea of something called a model card. Model cards are there to make sure you're not only making good statistical models, but good ethical models that you've thought really wholly about. Model cards are kind of like recipes. You know, you never know as much about your cake, like noticing that the batter is a little watery, as when you're making it. Model cards use that same mental model. They make sure that you have a designated place to write down all the information you know when training your model that you can look back at later. So, that's summary statistics, that's general documentation, as well as nods to fairness.

So, we saw this line of code earlier as well: vetiver_pin_write, putting our vetiver model v onto our model board. But there's actually a little informative message that pops up after. And you can read the informative message, or you can do what every software engineer does and just copy and paste the code, run it, and see what happens. That's vetiver.model_card(). And when you run vetiver.model_card(), a Quarto document is generated. Quarto is a flavor of Markdown used for documentation that you can actually run live code in. My slides were made in Quarto, as well as my blog. If you're interested in learning more, tomorrow there is a talk on Quarto by my colleague Tom Mock to explore the wonderful world of this new tool.

So, if you open up your Quarto card, you can see there's some information that's automatically generated for you. And that's things like, you know, it's a scikit-learn pipeline. It's a model using four features. Here's when the model was created. And if you scroll down, there's also some information on that input data prototype. So, this model has got four features looking at houses, as well as a quantitative analysis looking at things like mean absolute error, mean squared error, and so on. And if you scroll all the way to the end, there's also places for you to document ethical considerations that you have and caveats and recommendations that you've uncovered while training this model.

And, of course, if you don't really have information for either of these off the top of your head, it's important to keep these on the model card because imprecise or incomplete information is better than none at all. My dad had always told me growing up that if you haven't written it down, you haven't thought it out. And model cards are a great place for you to write out all of the things that you've been thinking about at training time for your model.

My dad had always told me growing up that if you haven't written it down, you haven't thought it out.

Deploying models

So, we know about the beginning of this life cycle, and we know what versioning a model is. So, what happens when we have to deploy it? Deployment means a lot of different things for a lot of different people, but the way that our team has defined it is bringing a model off a local laptop into some sort of other architecture. We do this by creating a REST API endpoint. It makes your software engineer friends happy because REST APIs are pretty robust and testable, and it makes your data scientists happy because there's a lot of great tooling to help you spin them up and maintain them.

To do this with vetiver, if you want to run an API locally, our vetiver model named V from earlier can be put into a vetiver API and you can run it. But, of course, the end goal of this is to not have a local API endpoint. So, if you were to want to deploy this onto our pro products, which is Connect, you could set up a Connect server and then just send it a model board, the name of your model, and then the specific version that you'd like to deploy. And it will do the rest of the work for you.

If you're looking to move this into maybe a Dockerfile, or anywhere that ingests Dockerfiles, such as AWS or Azure or Google Cloud Platform, you can do this in two steps. First is to write out an app.py file. As long as you give your board and pin name to this function, it will generate the app.py file for you and get you at least most of the way there. Then you can write out a Dockerfile, passing in the app.py file you just created. This Dockerfile is usable out of the box, but you can also edit it to customize your own deployment. And that's what deployment looks like using vetiver. These helper functions are really made to make it feel accessible to get this model off your laptop, or at least to have a Dockerfile in hand to pass off to somebody else.

Monitoring models

Now, once your model is off your laptop, a data scientist's work is not done. And monitoring means something unique in this context. We're not necessarily monitoring things like CPU usage or runtime. Here we're specifically looking at the statistical properties of the input data or predictions. And vetiver helps with this through a few helper functions. I won't go too deep into them right now, but essentially they will help you compute metrics over a rolling window timeframe that you specify; here it'll be one week. They'll help you pin your metrics, especially if you get into that awkward situation where some of the dates overlap: vetiver's pin metrics will sort that out for you, overwriting with the newest data if necessary. And then finally, vetiver also helps you plot your metrics, in the same format that comes out of vetiver's compute metrics, to get kind of an out-of-the-box quick peek into how your model is performing.

And this is really important to track. If you are not monitoring your model, you are oblivious to model decay. And of course, that makes sense. You need some data to make sure you're doing the right thing. But this is especially important because models break quietly. They'll continue to run and really just proudly give you very wrong answers. Even if your accuracy is like zero percent. Whereas on the other side, you know, applications will often give you big red Xs. Models will continue to give you bad answers. So, if you're not monitoring your model, you are oblivious to this decay.

Models break quietly. They'll continue to run and really just proudly give you very wrong answers. Even if your accuracy is like zero percent.

And that has completed our cycle in a very fast way. We have gone over versioning a model, deploying a model, and monitoring a model. But if we think about vetiver as a whole, why should I be excited about vetiver? Well, why am I excited about vetiver?

I think the first piece of this is composability. I've shown you some strong building blocks, vetiver API and vetiver model, that are pretty simple to use right out of the box. But they're also composable within themselves, to add new endpoints to your API or to make more complex or custom models. So, not only is it composable internally with itself, it's also composable externally with the larger ecosystem. And this is because vetiver is built on really well-tested tools from the community. It's built on things like FastAPI and Pydantic. And there's such a community around these different tools that you can leverage all of the fun and amazing other projects that people have created.

The other reason to be excited about vetiver is the ergonomics. It feels good to use. It's pretty lightweight. And it works with the tools that you like to use. It's supposed to feel like a really natural extension of the data science workflow that you're already using. So, overall, vetiver helps you version, deploy, and monitor models in a composable and ergonomic way. Thank you all for joining me here today. It has been my pleasure to present for you all.

Q&A

Thank you, Isabel, for your presentation. At this point, if anybody has any comments, please put them into your YouTube comment section and we will bring them up here.

In the meantime, I just wanted to say I think that vetiver is definitely filling a very important gap in the current data science ecosystem. I feel like this year we've seen a lot more talks compared to last year and the year before on sort of how to do the monitoring, whether it is maybe doing better testing to make sure that the model doesn't decay or whether it is a full solution like what you guys have here. So, thank you very much.

Yeah, it is my pleasure. I think we've taken a lot of knowledge from the Python and R ecosystems. We've talked to a lot of data scientists and tried to understand what they're really looking for and what they want from an MLOps solution. And a lot of that was: I like my workflow the way it is, can I just have something that adds on to it? There are a lot of different solutions where maybe you have to declare something really early on and change your workflow. So, this felt like the right way to go for us to help serve our customers and the needs that we had heard.

So, we have a couple questions. Let's start with the first one. Seems like vetiver is built on top of many dependencies. Isn't it hard to manage them all? Okay, I'll try to not get too deep into the weeds on this one. This was a really fun task for my team and me, because it's hard. You know, if you're deploying a scikit-learn model, you don't want to have to download a framework that has PyTorch installed and statsmodels installed and all of these other things. So, how do you make sure you don't bloat your own project but also serve so many different people?

And we're able to do that with something called single dispatch. So, we can actually break things down so that you only install what you need. The core dependencies that we do have to manage are things like FastAPI and Pydantic, which are manageable and pretty lightweight. Where things really do get difficult is when you're trying to manage multiple machine learning model platforms at once, so we're able to break those apart. If you ever want to chat about how weird and funky this is, I would love to hop on Discord and chat with you about that. It was so cool.

Cool. Thank you. And just as a reminder, there is on Discord a channel called Talks-Discussions, which Isabel and other speakers will be monitoring.

A question from Patrick. It looks like Vetiver is a decentralized version of some of these other platforms that require spinning up an on-prem or a cloud server to track this stuff. Is that true? So, let me make sure I answer this correctly. To have something that is shareable with your larger team, a lot of times the right tool is to be on-prem or in a cloud server that's using Docker files. So, Vetiver is able to bring your model onto those different places as well. I think one of the exciting things that we like to play around with with Vetiver is it makes it really easy to spin things up locally. So, that might be where you're feeling this decentralized bit. And that's really important because a lot of times when you're testing or development versions are very different from what your deployed versions feel like, things break very quickly. So, being able to quickly, rapidly prototype a small API and scale it safely is something that Vetiver focuses on a lot. So, yes, it can be easy to put up locally in a really decentralized way. But it also makes it easy to ship it to an on-prem or a cloud server instance.

And a question from Neil. How does Vetiver compare with MLflow? When would you prefer one over the other? Yes, this is a great question. So, there are a lot of MLOps tools out there, and MLflow is one that we've looked at a lot. What they've done really well is they have a lot of investment in experiment tracking. So, if you are looking to track all of the hyperparameters that you've used when training your model, like every single time you've trained your model, that's something that MLflow does very, very well. So, if you're looking for experiment tracking, I think that's a good place to go. If you're looking for making APIs quickly and safely, I think that's where Vetiver shines, kind of for the reasons I just mentioned. Because it is like three lines of code to spin up a very simple model within an API. So, I'd say if you're looking for experiment tracking, MLflow is a great place to start. If you're looking for deployment of APIs, Vetiver might be your better option.