
David Smith | MLOps for R with Azure Machine Learning | RStudio (2020)
David Smith | January 31, 2020

Azure Machine Learning service (Azure ML) is Microsoft's cloud-based machine learning platform that enables data scientists and their teams to carry out end-to-end machine learning workflows at scale. With Azure ML's new open-source R SDK and R capabilities, you can take advantage of the platform's enterprise-grade features to train, tune, manage and deploy R-based machine learning models and applications. In this talk, attendees will learn how to:

• Carry out ML workflows using the authoring experience of their choice, from no-code to code-first options, including Azure ML's drag-and-drop visual interface for defining workflows and RStudio Server on the Data Science Virtual Machine, a hosted VM workstation, for using the Azure ML R SDK from the RStudio browser-based interface.

• Use the Azure ML R SDK to manage cloud resources and to train, hyperparameter-tune, and log and visualize metrics for their models at scale on Azure compute.

• Build ML Pipelines in R for defining and orchestrating reusable and reproducible ML workflows.

• Deploy, manage, and monitor their R ML models and applications as web services on Azure Container Instances and Azure Kubernetes Service, with an emphasis on robust DevOps and CI/CD for orchestrating and streamlining the end-to-end data science development lifecycle.
image: thumbnail.jpg
Transcript
This transcript was generated automatically and may contain errors.
Welcome back after lunch. It's been a great conference so far. It's been great to be here. My name is David Smith. I'm a cloud advocate with Microsoft. So basically that means I'm Microsoft's connection with the R community. So if there's anything you would like me to communicate back to the development teams back in Redmond, just have a chat with me and I'll pass that on.
But today I would like to talk to you about machine learning operations. And that's the process of efficiently and reliably building machine learning models at scale. We call it MLOps. It's a relatively new term. You might see it being related to something you're more familiar with, which is DevOps. And DevOps itself, I don't have time to get into all the details of that, but a definition that I like, which comes from one of my colleagues at Microsoft, is that DevOps is the union of people, process and products to enable the continuous delivery of value to your end users. And note that word value there. We're not talking about software. We're not talking about applications. We're talking about value in general. So it seems like it's something that we could also apply to machine learning in the same way that it's traditionally applied to regular applications.
So what DevOps gives us is a nice process for delivering applications in general. We plan, we develop and test those applications, we release them to production. And once they're in production, we're not done, because in production we monitor and learn from our users' experiences with those applications and go through that entire cycle again. And so the question that I want to ask here is: can we apply a similar process to building machine learning models, in this case with R in particular?
And I don't know about you, but at least in my experience when working with machine learning models, it's a little bit different than building applications with regular programming languages. You start with some data, you might try a model, you might try another model, you'll evaluate the results, you'll decide that you might need some more data, you might need a different model. It's a very iterative process. But at the end of that process, what you get is a model that you'd like to put into production and you throw it over the wall to the engineering team to do something with it. At least that was my experience up until relatively recently. I don't know if it was yours as well. But you can see that this is kind of very different to the modern way of building applications with continuous integration and continuous delivery. So can we get to that stage with machine learning?
Differences between DevOps and MLOps
And I think one of the ways that we can think about that is to have a look at some of the differences between traditional development processes, DevOps, and machine learning building, MLOps. Now in both cases, we're working with files, you know, that source code. It might be C++ or .NET files on the regular application side of things. But with machine learning, we're also working with other artifacts, things like data files. There's a whole thread I could go down at this point about what it means to have a 55-megabyte data file that you're analyzing and putting that into Git. Short answer: don't do that. But there are lots of issues around managing data that don't come up in traditional development processes. The same goes if you're working with notebooks or R Markdown documents: how do we deal with those, and how are they managed in source control systems?
A good practice when building applications is to manage infrastructure with commands in code, rather than managing it directly by pointing and clicking in consoles and terminals. Those same principles apply on the machine learning side as well, with an additional wrinkle: we often have to manage environments. Think, in the R case, of managing the packages and the package versions that need to exist for your R model to run in a reliable fashion. So we want to be able to manage those as code as well.
I touched a little bit on those issues of working with source control systems. Those also apply in the machine learning operations scenario. But there are other things we need to track changes in as well, not just the source code. We also want to track changes in our experiments. And by that I mean, as we try out different types of models, perhaps different transformations of our data, we're looking for some kind of outcome, perhaps the model with the best accuracy. And we'd like to be able to track that in a reproducible way, so that we can always go back to a prior experiment if we end up using that one instead.
When we're building applications in a DevOps environment, typically we're building binary executables. Those builds might take minutes, maybe hours; that's kind of the typical limit for most types of applications, and you're typically building on commodity computing infrastructure. But on the machine learning side, in addition to building executables, we're training the models that become part of those executables. And that model training, especially in deep learning environments, could easily take hours, might well take days. I've seen examples where model training takes months. So how do we put that into a traditional DevOps type of environment? And how do we incorporate the exotic computing environments, like the GPU environments we need for deep learning?
When it comes to version control, version management, when we're building applications, we like to give versions to the applications that we build and release. On the machine learning side, we would like to assign versions to models so we can track when we change a model and then go back to the underlying data and code that created that model and do so reproducibly. And another sort of side branch that I could go into for hours is this whole concept of tests when it comes to machine learning as opposed to developing applications. On the application side, tests are fairly deterministic. Did it break? Did it cause an error? Did it give the right response? But in machine learning, tests tend to be a little more probabilistic. You know, think of the question of, is this a picture of a cat? That's not so much of a yes or a no type question. It's more of a probabilistic thing that we need to determine. And so the way we do testing is different.
I can't go into all of these topics today, but I'm just going to touch on a few of them. But I just wanted to give you this as just an illustration that there are quite a few differences, in my opinion, between machine learning operations and traditional DevOps. And we need to incorporate those differences into any process that we're using when we build applications with machine learning.
Azure Machine Learning Service
Now, Microsoft has an application to help with that. It's called the Azure Machine Learning Service. It's based on a process that we developed internally for building the machine learning models that we have in things like Bing and Xbox and all sorts of other applications. But now we brought it out so that people can use it externally as well. And the workflow that's built into it has really been driven by a lot of the engagements that we have had with customers that are building machine learning models at scale and building them into very high-scale production applications.
There are lots of things in this that I don't have time to talk about today, things like being able to use notebooks, automated machine learning, being able to just give it a data set and it tries a bunch of models. There's a nice drag-and-drop visual interface that I can't go into today. But instead, I'm going to focus on these concepts, these abstractions, which help us build a nice process for machine learning operations. Things like data sets, being able to version a set of rows and columns of data, and being able to get back to that version of data at any time in a reproducible way when we want to recreate our models in our experiments.
Experiments in this context are training runs, a model, I'm thinking of a statistical model here that we would try against a specific version of data, and the metrics like accuracy that we get back from that model, being able to track all of those. Pipelines, I'm not going to cover too much here, but being able to combine together little pieces of R code, little pieces of Python code, that together form a complete workflow we might want to execute as part of a continuous integration or continuous delivery process. Models, we're all familiar with the concept of a model from a statistical standpoint, but a framework that allows us to register models that we might want to use and have versions associated with those models. And endpoints, this is where the ops part of it comes in. Real-time endpoints that we can use to pass data to those models and get results back, make predictions from models. And also endpoints for those pipelines that I mentioned just a minute ago, so that we can automate the process of kicking off sequences of doing computations in R and do that as part of a larger build process for a complete application.
The last three on this list are more like infrastructure that we need to manage to help us do all of this process. The first of those is compute, manage computing resources that we use for doing our interactive experimentation, the resources that we use to build our large models, and the resources that we use to deploy those large models into applications. The environments I mentioned briefly already, the computing environment that's needed to do all this reproducibly, and then data stores. Think here of the databases or the blob stores, where the fundamental data files that represent those data sets exist.
The Azure ML SDK for R
What I'm going to be using in this talk today is a relatively new package for R, available now on CRAN, called azuremlsdk, which provides a suite of R functions for working with all those artifacts I just told you about. So there are R functions for creating workspaces and experiments and compute targets and so forth. All of this works with any modern version of R; it's not specific to Microsoft at all. You can use any R package, a GitHub package, a private package. It's all completely open in that sense. And it tracks all those associated requirements, like the packages that you're using with your code, so that it can follow those through to make sure they're available at deployment time.
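As a rough sketch of what getting started with the package looks like (the workspace and resource names below are placeholders, not from the talk, and you would need your own Azure subscription):

```r
# Minimal azuremlsdk setup sketch. "myworkspace", the subscription ID,
# and the resource group are placeholders.
install.packages("azuremlsdk")   # from CRAN
library(azuremlsdk)
install_azureml()                # one-time: installs the Python backend

# Connect to an existing Azure ML workspace.
ws <- get_workspace(
  name = "myworkspace",
  subscription_id = "<subscription-id>",
  resource_group = "myresourcegroup"
)
```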
A few other things I don't have time to talk about are things like HyperDrive support, which is mainly used in machine learning for trying lots of different hyperparameters for models. But we are going to look at publishing an R-based model as a web service. I'm going to do it in Azure, but you can also do it on your own servers in exactly the same way. And we're also briefly going to talk about triggering these processes from CI/CD pipelines as we're building applications around R.
Demo: building and deploying a model
So I'm going to show you a little example. This is based on some data from the US traffic service around fatalities in auto accidents. And I want to build a model that predicts the probability of a fatality given the types of variables you can see there on the right. The process is going to be a pretty traditional machine learning process: I'm going to import some data and prepare it, and then I'm going to build a model. To do that, I need a cluster of machines to train that model. In fact, I'm going to fit some GLM, KNN, and glmnet models with caret.
Then we're going to have a look at the results of those models and select the best one according to accuracy, and then deploy an R function that predicts from that model as a container. And that container is then going to expose an endpoint, which we can call from any application. But I'm going to demonstrate that here with a Shiny application. So let me show you how that works.
The first thing I need to do is to create a compute instance, which is a little machine in the cloud that I can use for my interactive work. So here's an example where I've created an RStudio compute instance. This is a standard DS2V2 virtual machine, which is basically a two-core machine with about seven gigs of RAM. A bit better than my laptop, but not much. But the nice thing is it has persistent storage, and I can share that with other people. And I can just launch an RStudio server instance on that directly and use that on the cloud. If I need a beefier machine, I could launch a GPU-based instance like this one here, which has a little over 50 gigabytes of RAM and six processors. So you can just get a machine. If you need a more powerful machine, just to do your experimentation, it's really easy to spin one up. Or you can just use your own laptop. With all of this Azure Machine Learning service that I'm talking about, you can just use all your own infrastructure for all the levels of this, or you can use the Azure cloud-based services if you want as well.
So here's some R code. All of this is available in the GitHub repository, which is linked in the bottom left-hand corner of the screen right there. Here's an example of me creating a compute instance. Then, within the compute instance, there's some code that just imports a regular CSV file, does some preparation, and then stores that R data object into a shared storage service, so that it will be available when I do my training runs in just a moment.
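That prepare-and-store step might look something like this sketch (the file and folder names are illustrative, and `ws` is assumed to be a workspace object obtained with the SDK):

```r
library(azuremlsdk)

# Import a CSV, prepare it, and save it as an R data object.
accidents <- read.csv("accidents.csv")     # illustrative file name
accidents$dead <- factor(accidents$dead)   # illustrative preparation step
saveRDS(accidents, file = "accidents.Rd")

# Upload the prepared object to the workspace's default datastore so
# that training runs on the cluster can read it later.
ds <- get_default_datastore(ws)
upload_files_to_datastore(
  ds,
  files = c("accidents.Rd"),
  target_path = "accidentdata"
)
```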
Here is some R code using create_aml_compute, which creates a cluster for me in the cloud that I'm going to use to do all of my training runs. This is a fairly small cluster: I've defined it with a minimum of one node and a maximum of two. That means one node is always live, which means it's always quick for me to submit jobs into that cluster; everything's always ready to go. You can set the minimum nodes to zero, and it will automatically scale down to nothing when you're not using it, so you're not paying for anything.
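That call might be sketched like this (the cluster name and VM size are illustrative; `ws` is the workspace object from earlier):

```r
# Create a small training cluster. min_nodes = 1 keeps one node warm so
# submitted jobs start immediately; set it to 0 to scale to zero when idle.
cluster <- create_aml_compute(
  workspace = ws,
  cluster_name = "rcluster",
  vm_size = "STANDARD_D2_V2",
  min_nodes = 1,
  max_nodes = 2
)
wait_for_provisioning_completion(cluster, show_output = TRUE)
```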
So now that we have our compute environment ready to go, let's try training some models. I'm going to go ahead and create a model so we can have a look at some results and choose one to deploy. Here's an example of submitting some R code to that cluster, and you can see it's pretty simple. All I need to do is define an R script file. Here it's called accident_glmnet.R, which is some pretty standard caret code: it imports that same data that I stored to the shared data store, does a train call, and then saves the resulting model object back to the data store again. And the submit_experiment command you see right there at the bottom queues up that job on the cluster; it'll wait until a node is available, then run that R code and save all the results for me.
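The submission step might look roughly like this sketch (the script name, experiment name, and package list are illustrative, and `ws` is the workspace object):

```r
# Define how the script should run on the cluster, then submit it as an
# experiment run; azuremlsdk captures the logs and metrics for the run.
est <- estimator(
  source_directory = ".",
  entry_script = "accident_glmnet.R",
  compute_target = "rcluster",
  cran_packages = c("caret", "glmnet", "e1071")
)
exp <- experiment(ws, "accident-prediction")
run <- submit_experiment(exp, est)
wait_for_run_completion(run, show_output = TRUE)
```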
If you need to, you can also modify the behavior of these scripts with command-line parameters. One option I have there is percent_train, to define the test-versus-train proportion in the actual modeling, so you have a lot of control over how those runs behave as well.
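Passing and parsing such a parameter might be sketched like this; the option name mirrors the percent_train example above, and parsing with the optparse package is a standard R pattern rather than anything Azure-specific:

```r
# On the submitting side, the parameter goes through the estimator, e.g.:
# est <- estimator(..., script_params = list("--percent_train" = 0.75))

# Inside the training script, parse it with optparse:
library(optparse)
opts <- parse_args(OptionParser(option_list = list(
  make_option("--percent_train", type = "double", default = 0.75,
              help = "fraction of data used for training")
)))
train_fraction <- opts$percent_train
```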
The environment where R runs in that cluster comes preloaded with just about every package on CRAN, but if you want to use a GitHub package or a private package, it's easy to have that available there as well. It tracks all of the results, so I can always go back and have a look at them. On the top right-hand side, we can see the accuracy from all these models, and then pick the best one to deploy. To deploy it, the first thing I need to do is register that model in the system. That registers it with a version, so this is where the versioning of models comes in.
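Registering the chosen model might be sketched like this, assuming the run-download helpers in azuremlsdk (the output path and model name are illustrative):

```r
# Download the model artifact that the training run saved to its
# outputs/ folder, then register it in the workspace, which assigns
# it a version number.
download_files_from_run(run, prefix = "outputs/")
model <- register_model(
  ws,
  model_path = "outputs/model.rds",
  model_name = "accident-model"
)
```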
The R script that runs the actual prediction gets the data as a JSON object, so you just need to deserialize that JSON and then use regular R code to run the prediction right there. And then finally, we can deploy that model as a service. You can see the deploy_model function right there, which takes the model that we just registered along with its R code, and that becomes a live container, in this case in Azure Container Instances, which exposes a REST endpoint. I can send data to it, the container runs the R code, and it sends the prediction back to me.
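Put together, the entry script and the deployment might look like this sketch. In azuremlsdk, the entry script's init() loads the model and returns the scoring function; all names here are illustrative:

```r
# --- accident_predict.R: entry script run inside the container ---
init <- function() {
  # The registered model is mounted into the container at this path.
  model_path <- Sys.getenv("AZUREML_MODEL_DIR")
  model <- readRDS(file.path(model_path, "model.rds"))

  # Return the scoring function: deserialize JSON, predict, re-serialize.
  function(data) {
    input <- jsonlite::fromJSON(data)
    prediction <- predict(model, input, type = "prob")
    jsonlite::toJSON(prediction)
  }
}

# --- deployment code, run from the workstation ---
inf_config <- inference_config(entry_script = "accident_predict.R")
aci_config <- aci_webservice_deployment_config(cpu_cores = 1, memory_gb = 1)
service <- deploy_model(
  ws, "accident-service",
  models = list(model),
  inference_config = inf_config,
  deployment_config = aci_config
)
wait_for_deployment(service, show_output = TRUE)
```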
Here is the R code; this is running on my local laptop right now. When I run the Shiny application, hopefully the internet is working here, there we go, this is the prediction based on all these variables. If I increase the car occupant's age, and particularly the impact speed of the accident, you can see that the predicted probability of a fatality gets much higher. That calculation is being done within that inference cluster that I created in the cloud. I'm doing it in this case from Shiny and R, but you could do it from any application you like, calling out directly to that REST endpoint.
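Calling the endpoint from R, for example inside a Shiny server function, might be sketched as follows (the input variable names are illustrative):

```r
library(jsonlite)

# Build a one-row data frame of inputs and send it to the web service;
# the container deserializes it, scores it, and returns the prediction.
newdata <- data.frame(age = 35, speed = 60)   # illustrative variables
result <- invoke_webservice(service, toJSON(newdata))

# Any other application could POST the same JSON directly to the
# service's REST scoring URI instead of going through azuremlsdk.
```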
CI/CD integration and summary
And that's where the next stage comes in. I don't have time to go into all the details of this, but if you're using a CI/CD service like Azure Pipelines or GitHub Actions, you can control all those processes through the command-line interface that's provided with the Azure Machine Learning service. And if you're using Azure Pipelines, which is part of Azure DevOps, there is a plugin that makes it really easy to do things like this: when I, as a data scientist, register a new model into the system, a CI/CD process is immediately kicked off within Azure DevOps to build the entire application around that new model and then deploy it into production.
I've got a link to a talk I've given, in the AIML50 repo, that shows that entire process. It's about using Python to deploy a vision model into a website application, but all those same principles apply equally to R.
So just to summarize: what we just did is build a pipeline in the Azure Machine Learning service to prepare data, train a model, and register it into the service. Then we have a CI/CD pipeline in Azure DevOps, which recognizes when that model is registered and deploys it as a container instance, a single container, for testing. Typically we then do some testing on that container before going to production, and for production we would deploy that same model to a cluster like Kubernetes, for a production-level facility for integrating that model into your application.
And because you've got that pipeline set up, it's really easy to do things like retraining your model. For example, if after a certain amount of time, or perhaps after reviewing the metrics from your live model, you want to retrain it, you can just kick off those same pipelines to update your model completely automatically. So all the information you need about Azure Pipelines and the Azure Machine Learning service is available at those links right there, and those links are also available at the repository that I've put up for this talk, at the link you see on your screen right there. Thank you very much.
Thank you, David. We have time for one question. Sure. Are there existing options for authenticating with DevOps from RStudio Server Pro? If not, is it something that will be coming?
In fact, that's what I did within RStudio Server, not even the Pro version. All the authentication happens directly within the R package itself. For example, when you first try and register a workspace, it pops up a little prompt for you to authenticate directly with Azure Active Directory through a web page. That already exists. Great. Thanks so much. Thank you.
