
SatRdays London 2023: Julia Silge - What is "production" anyway? MLOps for the curious
What is "production" anyway? MLOps for the curious by Julia Silge (Posit) at the SatRdays London 2023 conference, hosted by Jumping Rivers! Abstract: Many data scientists understand what goes into training a machine learning or statistical model, but creating a strategy to deploy and maintain that model can be daunting. You may have even heard that R is not appropriate for production use. In this talk, learn what the practice of machine learning operations (MLOps) is, what principles can be used to create a practical MLOps strategy, what people mean when they say “production”, and what kinds of tasks and components are involved. See how to get started with vetiver, a framework for MLOps tasks in R (and Python) that provides fluent tooling to version, deploy, and monitor your models. This event was sponsored by: - CUSP London - Jumping Rivers - Posit - R Consortium
Transcript
This transcript was generated automatically and may contain errors.
I'm going to go over here, try that, and I might take the clicker. Well, thank you so much for that introduction, and thank you to Jumping Rivers for organizing this. I have been to a couple of these SatRdays conferences, and what I love about them is the variety of people in the community you get to hear from, so I'm really happy to be here participating in a SatRdays conference.
I am hoping that together we can ask this question: what is production? What does this mean? As people who work with R, I bet you have heard that R is no good for production, or you may have run into the perception that there's a lack of production knowledge or experience in our community, that people think we can't do it. So we're going to ask and answer this question, and talk about what MLOps is.
I want to tell you a little about me and who I am, so you can understand the perspective I bring to asking and answering these kinds of questions. My academic background is in physics and astronomy. I moved around in my career and eventually landed in data science. I worked as a data scientist in tech at a few organizations, and now I work on open source software full-time.
If you think about that path, one thing I want to highlight is that I spent a lot of that time writing code, but I was always writing code to answer a question, to ask a scientific question, to do a scientific analysis. Like many of you, I don't have a formal computer science degree; that's not my background or my training. And if we think about who you all are in this room: a lot of you have titles like data scientist, I bet a lot of you are statisticians by background, maybe some of you are data analysts. You write code, but you probably don't primarily think of yourself as a software developer.
As of today, my title is actually software engineer, and there may be people here who have a title like research software engineer or plain software engineer. But a lot of us who are here because we like to use R for our data practice probably think of our identity primarily from the data point of view, and less so from the software engineering point of view.
The case for model developers owning deployment
What that means is that you are probably someone who has developed a model at some point. You know how to use code to do exploratory data analysis and to make plots, and you have probably trained a model at some point. If you take one big takeaway from this talk, it's that if you are someone who has developed a model, then you can be the person to operationalize that model.
The other thing I really want to call out and communicate is that if you are someone who knows how to develop a model, if you are the person in your org who builds, trains, and develops models, then you are the right person to do the operationalizing of that model. If you have spent time on EDA for that data, if you have chosen which kind of model to use, if you've spent time tuning and evaluating that model, then you are the one with the context, the domain expertise in the data and in the model. You know the caveats about that model. You know what makes it more or less appropriate in different situations.
If you have a dynamic in a company where one person trains the models and then kicks them over a wall to someone else, maybe to be rewritten in another language, that leads to inappropriate use of model predictions. If you, as a model developer, a model practitioner, can add these tools to your toolkit, then we move towards more reliable, more appropriately used, and fairer use of models.
A concrete example: house price prediction
We're going to look at a concrete example throughout, just to have something to hold in our heads. The data set is a house price data set: houses in Seattle in the U.S. We have the price each house sold at and information about the house, like the number of bedrooms, the number of bathrooms, how big it is, and when it was built. There's also a date of sale. You can see it's a bit of an older data set at this point; I don't think you can buy anything in Seattle for these prices anymore.
So you've got this data. Picture yourself starting with it. You start on the process of exploratory data analysis, then you start on the process of building a model, and this would be your output. This uses tidymodels right here, but you may use some other modeling framework in R, or maybe you're someone who likes to do EDA in R and then build a model in Python. Either way, think about having a trained, fitted model that you feel good about and that's ready to go.
Is your job done? In which cases is your job done, and in which is it not? In my experience, most of the time when you get to this point where the model is trained, there's a forking path for what you do next. You might need to communicate about the model: say you train a model and learn something you will tell people about, so maybe you write a report, or maybe what you learned is used to make some kind of decision. That's the communication path.
The other path is that the reason you trained the model was so that you could generate predictions on new data. A new house in Seattle comes along with a certain number of bedrooms and bathrooms, and I need to predict the price for that new house. If that is the goal, if the main purpose of training the model is predictive, then you are not done when you get to this point. Instead, it is time to think about how to put that model into production, how to deploy that model.
What is MLOps?
So we say, okay, I built this model for predictive purposes, I need to do something MLOps-y. What is it that I do? Let's talk about what people mean when they say MLOps. Maybe you go on LinkedIn and you see a lot of things about MLOps, maybe mixed in with the ChatGPT stuff right now, and you say: okay, what is MLOps? Is MLOps this sea of tools and startups that say they will do MLOps for you for money? It's just overwhelming.
Is MLOps just a word that is full of hype and does not mean anything? I really don't like things that are poorly defined or full of hype, so I'm going to say no, that is not what MLOps is. Instead, let's write out a working definition: MLOps is a set of practices to deploy and maintain machine learning models in production reliably and efficiently. MLOps is not about any specific tool, including the tool I've been working on that I'm going to talk about today. MLOps is about the set of practices we use when we have a model and we need to go down the deployment path, the production path, rather than the communication path.
The model life cycle
Let's use a different kind of visual for this process and think about a model life cycle, a cycle of modeling, if you will. We start by collecting data at the top, and as I've said, the first thing we do after collecting the data is exploratory data analysis. We understand the data and prepare it for the modeling process, and here there are great tools. I'd guess a lot of us in here, since we're largely R users, like to use the tidyverse for this; maybe you like to use data.table.
There are tools for the same kinds of tasks available in Python. Notice that these are open source tools that are widely adopted. There may be differences of opinion about exactly which tool is best, but all of them have robust user bases and books written about them. The same is largely true when it's time to train and evaluate a model, whether you use something like tidymodels or caret, or build your models in Python. Again, there are robust open source tools you can read whole books about.
On the right side of this cycle, you as a data scientist or modeling practitioner have options: tools that you probably love, that you're familiar with, that feel comfortable to you. This becomes much less true when we get over to the left side of the cycle. People might not even be sure what the tasks on that left side are.
Introducing Vetiver
I've been working on a new-ish project called vetiver, and vetiver sits on the left side of this cycle, where you already have a trained model. You trained your model in the way you decided was best, using the tools you are comfortable with and that you love, and now it is time to walk through the next tasks. This is what we're talking about as the MLOps set of tasks, and we'll dig into it just a smidge more: we need to version our model, we need to deploy our model, and we need to monitor the model that we have.
You may see this word vetiver and think, wait, I feel like I've seen that word somewhere. Vetiver, also known as the oil of tranquility, is a plant used in fancy candles and perfumery, a very good-smelling plant that serves as a stabilizing ingredient in things like perfume and candles. The metaphor here is that the volatile fragrances that are important, that people love, are your models, and vetiver is the stabilizing ingredient that helps you feel tranquil about your deployed models.
Let me show you our code for how to do this. Think back to that model predicting house prices. What we're doing here is creating a deployable model object. When we create this at the end of a modeling process or workflow, we can capture a lot of information that turns out to be very important later. You can see some of what this deployable model object contains in what gets printed out here. I don't have to tell my model bundle creation process any of this, because at the time of model training we have a lot of information about the model: we know that in this particular case we used ranger, that we're predicting a numeric value, that it's a regression, and we know the number and names of the features. We can capture all of that and store it as metadata on the model.
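As a sketch of what this step might look like in R (the fitted workflow, the `housing_train` data, and the variable names here are hypothetical stand-ins, not the exact code from the talk's slides):

```r
library(tidymodels)
library(vetiver)

# Hypothetical fitted workflow for the Seattle house price example:
# a ranger random forest predicting sale price from house features
rf_fit <-
  workflow(
    price ~ bedrooms + bathrooms + sqft_living + yr_built,
    rand_forest(mode = "regression", engine = "ranger")
  ) |>
  fit(data = housing_train)

# Wrap the fitted model as a deployable vetiver object; vetiver records
# the model type, the required packages, and a prototype of the input
# data as metadata, at the moment all that information is at hand
v <- vetiver_model(rf_fit, model_name = "seattle-housing")
v
```

Printing `v` shows the model class and the number and names of features it expects, which is the metadata being described here.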
Vetiver is specifically designed to carry metadata that supports you, or protects you, against common failure modes for deployed models. This is all found pretty much automatically, because it turns out that when you are in the process of training the model, you have all this information; we just need a little bit of support in getting it recorded in a way that is useful.
A really fundamental design goal for vetiver is to set things up so it is easy to do the right thing, to protect against common failure modes. When I talk to people who deploy models, one of the most common failure modes is this: the model gets put into production, new data comes in, and something changes about the new data. Maybe it's an error somewhere, or maybe some other team changed it on purpose. The best-case scenario in that situation is that your model errors and you know what happened. But that's not always what happens; models are very capable of just chugging along and continuing to generate predictions when the input data is now entirely wrong. It is not unheard of for this to happen, and it's the worst-case scenario, because your model is generating nonsense and you have no idea.
Another really common struggle people have when putting their models into production is tracking, and then communicating to other teams, what exactly the model needs to generate new predictions. What are the software dependencies of the model? Making it easy to do the right thing is not just about what it takes to actually put the model into production; it's also about how we document models.
In vetiver, we have a template for writing a model card, an idea taken from the paper "Model Cards for Model Reporting". It's a template, available in either R or Python, with all these different sections. Some sections can be automated, because we know some things about the model, like what metrics were used to tune it and what type of model it is. But some sections of this model reporting framework cannot be automated: they take input from you as a model developer, the process of you thinking and writing things out. We want to make it easy to do the right thing by providing both software that does the right thing and opportunities for documenting.
Versioning models
Okay, let's talk about these different tasks. First, versioning. Much like many of you have probably adopted version control practices for your code, we need some approach to managing versions of models. This starts to become important as the number of models you're dealing with grows, or when you're retraining models frequently enough that you need a process that is, for models, the equivalent of moving from emailing co-workers files named underscore-final, underscore-final-final, to using Git.
We need to manage change in models well, and we need to deploy the model. In vetiver, we adopt practices around using REST APIs for models. There are different ways to deploy a model, to put a model into production, but the machine learning community, the MLOps community, has largely identified REST APIs as the good option for the big middle of models in production. There are of course other ways to do this; if you are in a very simple situation, or a very high-scale one, you may need something that's not a REST API. But certainly if you're getting started, this is where to go.
The other big category of MLOps tasks is monitoring the model once it is deployed and in production. Is the model performing the way you expected it to? Is it time to retrain the model with new data? And at what point do you need to find out whether it's appropriate to start over from scratch with the whole model development process and make essentially a new model?
Let's dig into these a little more and look at some code for how you would do this. When we version the model, we use a metaphor, an abstraction, that comes from the pins package. The metaphor is that you have a board. Here I'm showing a board that uses Posit Connect, one of Posit's pro products, but instead of Posit Connect this could say board_s3 for an S3 bucket, or board_folder if you're using a network drive. Depending on what your organization uses for storage, the idea is that the workflow stays the same; you just change out what the board is. And the metaphor is that I come along and pin the model to my board, and say: okay, here it is.
Think of pins as a fairly lightweight storage, versioning, and sharing system. If I retrain the same model on different data, I can pin it to the board too, and then both versions are there; I can get to both if I need to, but one of them is the latest, and that latest one can be the default one I get.
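A minimal sketch of that pins workflow, assuming `v` is the deployable vetiver model object created earlier (here `board_temp()` stands in for whatever board your organization would really use):

```r
library(pins)
library(vetiver)

# Swap in board_connect(), board_s3("my-bucket"), or
# board_folder("/shared/models") depending on where you keep things;
# the rest of the workflow stays the same
board <- board_temp(versioned = TRUE)

# Pin the model to the board; retraining and pinning again under the
# same name adds a new version rather than overwriting the old one
vetiver_pin_write(board, v)

# List the stored versions, and read back the latest by default
pin_versions(board, "seattle-housing")
v_latest <- vetiver_pin_read(board, "seattle-housing")
```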
Deploying models with REST APIs
This is on our Connect demo server, and we have things set up so we can compare exactly the contents of what we have here. A lot of this information is generated automatically, so the model is stored there with its metadata. Notice it is a binary blob; if you're going to use REST APIs, binary model objects are the easiest way to go for the vast majority of use cases.
Once we have that, we can create a REST API. Here's the code you would write to set up a REST API running locally, say on your laptop or whatever your development environment is. One thing we notice when we talk to people trying to get started with MLOps is that many MLOps tools focus so much on the production environment that it's really hard to use them locally. And you have to use them locally, because you have to debug problems, set things up, and do dry runs. So as we thought about the common problems of the people we were talking to, we worked on making sure the local experience of developing, debugging, and solving problems feels fluent when you move from local to production.
If we were to run this, piping it to pr_run(), then in the RStudio IDE it pops up in a window; if you like to use something like Visual Studio Code, it will give you a URL that you can paste into a browser. Then you can see what the HTML interface to the REST API looks like.
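The local API setup being described looks roughly like this, assuming `v` is the vetiver model object from earlier (the port number is arbitrary):

```r
library(plumber)
library(vetiver)

# Serve the model as a REST API on your own machine for development
# and debugging; a /predict endpoint and the visual documentation
# are generated automatically from the model's metadata
pr() |>
  vetiver_api(v) |>
  pr_run(port = 8088)
```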
Okay, that sounds great, but I want to actually deploy the model into some new kind of computational environment. What does it mean to put something into production? The easiest way for me to think about it is that we develop a model in one computational environment. Think of this as maybe your laptop, or maybe a server environment you work on, but it's one place, and the software you need installed there is about tuning and training. Putting something into production means getting it out of that computational environment: lifting it out and successfully carrying it over to a new one. For many people, this might be a cloud computing environment, or some kind of server your organization has. We need to lift it over and have it working successfully there.
In vetiver, we have two main approaches for how you do this and where it works. The first is the pro products that Posit has. Here it's literally one function, because it is our own product, so we can make sure it works really well. This function actually deploys the model: after you run it, there is literally an API on your server that will generate predictions.
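That one function is `vetiver_deploy_rsconnect()`; a hedged sketch, where `board` is the pins board from earlier and the account name in the pin is a placeholder:

```r
library(vetiver)

# Deploy the pinned model straight to Posit Connect; after this runs,
# a live prediction endpoint exists on the server
vetiver_deploy_rsconnect(board, "user.name/seattle-housing")
```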
But vetiver is open source software; it is not only for Posit customers. For working in a different way, we make it as easy as possible to generate a Docker container that can serve your model. Notice this function has a different verb: vetiver prepare Docker. What it does is generate a Dockerfile, a renv.lock file, and an app file, a plumber file in this case, specially tailored to your model. You don't want to be known as the data scientist who makes enormous Docker containers when they don't need to be enormous, so this is tailored to include just what your model needs to run. You get this bundle of artifacts, you use Docker to build it, and then you have a container that can go wherever it needs to go. Say you use AWS: you can put that container in an ECR registry and then serve it in any of the five or so ways AWS has to serve Docker containers.
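A sketch of the Docker path, again with `board` and the pin name as placeholders:

```r
library(vetiver)

# Writes a Dockerfile, a renv lockfile, and a plumber app file into the
# working directory, tailored to just this model's dependencies
vetiver_prepare_docker(board, "user.name/seattle-housing")
```

From there, something like `docker build -t seattle-housing .` builds a container you can push to a registry such as ECR and serve however your platform serves containers.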
One way we like to think about this is that people at the beginning of their model deployment or MLOps journey have options, some of which are super fluent, like Posit's pro products, which are really built for people from the data science persona, for whom that's the main toolkit. But we also give people opportunities to go anywhere else they need to with these models, and support for growing, moving to bigger scale, or whatever else they need to do.
Let me show you what one of these looks like. This is a model deployed on our Connect demo server, and you may be thinking, interesting, you're clicking around here. I think it's waking back up; it had fallen asleep. There it goes.
This is not something built primarily for human users, the way a Shiny app is. What this is, is visual, interactive documentation for the REST API. I can go to this GET endpoint, and it's fine if you don't know a lot about APIs; think of it as two computers talking to each other, and I say, I need to get something. If I go to the get-pin-URL endpoint, I get exactly where the binary, versioned, metadata-rich model object lives. This is all self-documenting, all automatically created from the information you had about the model when you trained it, and I can actually interact with it.
The purpose of showing this is not that it's the absolute best way for a person to interact with a model, but rather that this is how your model gets documented: how to interact with your model. Say you're working with a software engineer collaborator, and you need to tell them how to make an API call to this API you made for your model. You can say: look, here are exactly the curl commands you would need; let's make a request, POST some new data for new observations, and get back the response. What this is about is being a good collaborator, showing that you are the person who can take the model the last mile and get it into the hands of the people who need to interact with it.
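For an R-using collaborator, calling the deployed API can also be done directly from R; a sketch in which the endpoint URL and the new observation are hypothetical:

```r
library(vetiver)

# Point at the /predict endpoint of the deployed model
endpoint <- vetiver_endpoint(
  "https://connect.example.com/seattle-housing/predict"
)

# New observations must match the feature prototype the model expects
new_house <- data.frame(
  bedrooms = 3, bathrooms = 2, sqft_living = 1800, yr_built = 1964
)

# Sends a POST request with the new data and returns the predictions
predict(endpoint, new_house)
```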
Monitoring deployed models
Okay, so we talked about versioning, and we talked about deploying, putting into production, lifting the model and getting it somewhere else. Now let's talk about monitoring. Vetiver has functions and code to help you get set up, starting with default metrics, but you can use custom metrics too. One thing we found when we talked to people working on monitoring problems is that monitoring is often very specific to people's business problems; it's not uncommon for people to want to monitor something that isn't, say, RMSE, but rather something related to a KPI in their organization. This really highlighted for us that a code-first approach to model monitoring is almost required; it's basically what we have to offer people. So there again is a template.
Vetiver generates code for you based on your own model, but it is code, so you take it and go with it. If you've heard discussions lately about how LLMs can make you faster, because you start with something and then edit it, that's the kind of mental model here, although to be clear there are no LLMs involved: it gives you generated code that runs and works, and shows you things. Say you have that feedback loop where you get true values and can compute some kind of statistical metric; or maybe you don't, and what you monitor is just the input data, the statistical properties of the predictors going into the model. Of course we want to be able to show this to our coworkers so they know how these things work, but this is all code that is generated and that you have access to, so you can customize it in the way that is appropriate to your work.
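The generated monitoring code is built around functions like these; a sketch assuming `new_results` is a hypothetical data frame of dated predictions (`.pred`) joined with the true sale prices, and `board` is the board from earlier:

```r
library(vetiver)
library(yardstick)

# Compute regression metrics over monthly windows of new data;
# swap in a custom yardstick metric (e.g. one tied to a KPI) as needed
metrics <- vetiver_compute_metrics(
  new_results,
  date_var = date, period = "month",
  truth = price, estimate = .pred,
  metric_set = metric_set(rmse, mae)
)

# Version the metrics on the same board, then visualize them over time
vetiver_pin_metrics(board, metrics, "seattle-housing-metrics")
vetiver_plot_metrics(metrics)
```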
Who Vetiver is built for
Vetiver is designed to be a good option for people who are just getting started with MLOps. It is designed with a user persona top of mind: someone who has never deployed a model before, whom we want to enable to deploy their first model. We want the barrier to entry to be low and the learning curve to start out shallow. At the same time, vetiver is not meant to be a toy project or a tool only for beginners. We want to give people a tool with great defaults that is easy to get started with, but that can meet more complicated needs, whatever that might mean in your case: maybe high compliance needs, maybe high scale, maybe a lot of different tiny models. Whatever is specific about your use case, it's important to choose a tool that can scale with you and your org as they grow. This is what vetiver is built to do, and it makes vetiver somewhat unique compared to other MLOps tools, which might focus much more on high scale and on a software engineer persona as the person deploying the models. We really do think that if you develop models, you can be the one to deploy them, and your skills can grow as your needs change.
I'll remind us here that the things vetiver does are to version, deploy, and monitor. This also makes vetiver a little unique, because many other tools sit in slightly different places. Some of them are more interested in being involved in the model training process, say in hyperparameter tuning, and with some frameworks you can't actually deploy a model if you didn't use that framework to train it. But a lot of people don't want to use that tool to train their model, so this puts vetiver in kind of a unique place. The other thing I think is unique about vetiver is that it was built from the ground up for R and Python models, so there's an R package and a Python package. A lot of the tools out there are almost unusable for R, and because of the way they were designed, they sometimes don't give a great experience for Python people either.
I don't know what language we'll be using in 10 or 20 years, but the design choices underlying vetiver use technologies I would be willing to bet on. We're talking about things like carefully thinking about smart blob storage; I'm pretty sure we'll still be doing that. I bet REST APIs will still be here, and HTML and dashboards are here to stay. So I think vetiver can be a good option to learn, because it makes your skills applicable in many settings.
If you're interested: I know this is an R conference, and I pretty much run Python code rather than writing a lot of it, but if you are someone who moves back and forth a little, this is one of our documentation sites that shows the functions you use to do the same kinds of tasks in R and in Python. It has been really interesting to work on a project where we focus on tasks and then build support and functions for people to approach those tasks in ways that feel comfortable. We don't want to write a Python package that no Python people actually want to use, because we have had that experience ourselves as R users. It's been a really interesting project for that reason.
So again, here's what MLOps is. I'll focus on this idea that you deploy and you maintain, and that there are things you need to do to get your model into a good place. One last thing: maybe some of you are sitting here thinking, well, this all sounds totally disconnected from the work that I do. Maybe you largely do data analysis or statistical analysis, or maybe you spend all your time writing Shiny apps, and you think: why should I care at all about MLOps or what it is?
I think the first reason it would be smart for you to learn a little about MLOps is to learn how your work can be lifted and moved: can you start to build some of those muscles around what it means to put your work into production, to deploy your work? And the other thing is that when you are the one who can take your work the last mile, that's how you can really scale the impact of your work in your organization.
You have probably seen this URL at the bottom already; the slides are available there, and you can learn more from the recommended resources, so feel free to go and click through those. And with that, I will say thank you for having me. Thank you so much, and maybe we have some time for questions.

