Resources

Kevin Kuo | Introducing mlflow | RStudio (2019)

We introduce the R API for MLflow, an open source platform for managing the machine learning lifecycle. We demonstrate each component of the platform (Tracking, Projects, and Models) and describe how they can be leveraged in practical data science workflows.

About the Author: Kevin Kuo is a software engineer working on open source packages for big data analytics and machine learning. He has held data science positions in a variety of industries and was a credentialed actuary. He likes mixing cocktails and studying wine.


Transcript

This transcript was generated automatically and may contain errors.

All right, thank you guys for coming to this session and I'm so glad that Javier was the one that ran into AV issues and not me.

So one of the things I learned from giving talks is that there's a very strong positive correlation between the number of memes and pictures you put in your slides and how happy the audience is, you know, from the session reviews I've been seeing. So I'm going to just put a bunch of pictures on here, I won't do too much talking, but at the end I'm going to do some demos because you do need some content in a tech talk.

So we're going to be talking about mlflow and it's an open source platform for managing the machine learning lifecycle, whatever that means, we're going to find out in a little bit and it's a community driven project mostly led by the folks at Databricks with RStudio providing the R interface.

Motivation for MLflow

So let's talk a little bit about the motivation for a project like this. One of the things that data scientists have trouble with sometimes is that you don't remember every model that you tried. So let's say you're building a neural net, you've got a few layers and maybe a residual connection here and there, and you stumble across some hyperparameters that seem promising, so you go in that direction and tweak some numbers. And then maybe an hour later you realize that the performance isn't great, so you want to roll back to the previous version, but you were so excited about your results that you didn't bother committing any of your code, so you don't remember what hyperparameters you used.

So, you know, chuckle, please chuckle.

So we're going to talk about how mlflow might help with that situation by logging your metrics and your hyperparameters. Another thing that people sometimes have issues with is replicating results. You might want to reproduce your teammates' numbers, or you might just want to reproduce your own numbers from yesterday or last week or last quarter. Sometimes the code will run and you'll get slightly different results, maybe due to random initialization of weights or whatnot, but sometimes it just doesn't work at all: missing libraries, mismatched architectures. And not every data science project ends with a PowerPoint deck and a steering committee meeting.

Sometimes you actually have to take the models to production, and, you know, it's hard to find common ground, depending on where you are, between the folks who prototype the models and the folks who actually need to take the models to production, and this is exacerbated by the fact that there are just so many different machine learning libraries out there, and there are also different deployment targets, right? So there are different libraries you can use to train the models, and then there are different ways to sort of deploy these models. So if you got, you know, the number of ways you can sort of combine the prototyping and the deployment just expands exponentially.

So if you are at some specific companies, mostly out there in Silicon Valley, you might say, oh, you know, we got this all solved, but then what about everyone else? What do you do? So you'll go online, you'll do some Google searches, you'll be like, oh, what's everyone else doing? And then you'll find that, especially for model deployment, there are just so many different things out there, and it's kind of tough.

So over the past couple years, there's been a lot of efforts in the machine learning community and also within the art community to try to solve these problems, or at least attempt to solve these problems. And I personally think that we won't ever come up with one framework that's just going to work for everyone, but I think we can try to segment the problems such that you can have a set of standard practices that sort of will apply to most people.


MLflow components

So back to the topic, MLflow. It's got three different components. It's got Tracking, which is going to help you keep track of your hyperparameters, some notes, and some metrics from your experiments. The Projects component bundles your project environment so others can reproduce your results. And the Models component allows you to serialize and package your models so you can deploy them. So of course, you know, this doesn't make any sense yet, so we're going to do some demos to show what this thing actually looks like.

Demo: tracking

So we can load the library. By the way, this is on CRAN, so you can just install it off of CRAN. And I'm just going to do something really, really general right now. So we can say MLflow log param, and let's say we want to log the foo param, and the value happens to be 42. What this is doing in the background is it's basically launching a server, starting an experiment, and actually logging this parameter.
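The call from the demo looks roughly like this (a minimal sketch; the mlflow R package drives a Python backend, so the package's install helper must have been run once beforehand):

```r
library(mlflow)  # install.packages("mlflow"), then run the package's install helper once

# Logging a parameter implicitly starts an experiment run if one isn't
# active, then records the key/value pair against that run.
mlflow_log_param("foo", 42)
```

Behind the scenes this starts a local tracking server the first time it is called, which is why the UI shown next has something to display.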

So this is like the default view of the MLflow interface. We can go into our experiment run here, and then we see that our parameter foo has been logged. The cool thing is that this is not only for machine learning. If you have other experiments that you want to keep track of, you can also use this mechanism.

So let's say now we want to log a metric. We come back here. We see that we have that metric that's been recorded. And there are other things you can do. There's tags. There are notes. You can write whatever you want. So that's sort of the tracking functionality.
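The metric and tag calls mentioned here follow the same pattern as the parameter call (a sketch; the metric value is illustrative):

```r
library(mlflow)

# Metrics are numeric values you expect to track over time, e.g. loss or accuracy.
mlflow_log_metric("accuracy", 0.95)

# Tags are free-form annotations attached to the run.
mlflow_set_tag("model_type", "baseline")
```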

Demo: model logging and serving

And we can now go to Models. So Javier wrote a little XGBoost model to try to classify iris, and what we have here is a model to try to solve that same problem just using Keras and TensorFlow. So I'm going to run this real quick. So let's go through the code: standardizing values, normalizing values, and then here we can, oh, actually, one thing I forgot to mention.

When we log this parameter right here, we started an experiment run, and then after you're done with your experiment run, you need to end the run, and then once we do that, you can see that there's also a duration associated with your experiment run. So that might be helpful when you're trying to compare different ways to train the models. You might spend 10 hours to just get a little bit of accuracy, and that's not worth it.

We can also use this context-manager pattern with the with() function, and we can train this very simple neural net for a few epochs and see what happens. So you can see in real time that the loss is going down, yeah, and the accuracy is going up, so we're doing great.
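The context-manager pattern looks roughly like this (a sketch; the parameter and metric values are placeholders for whatever the training code produces):

```r
library(mlflow)

# with() scopes the run: the run is started here and ended automatically
# when the block finishes (even on error), so the run's duration is
# recorded without an explicit mlflow_end_run() call.
with(mlflow_start_run(), {
  mlflow_log_param("epochs", 50)
  # ... train the model here ...
  mlflow_log_metric("accuracy", 0.97)
})
```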

And then we can go back to our UI, and we see that we have our neural net training run that has also been logged, and because we added a callback in the Keras model fit call, we can see that MLflow has also logged the accuracy for each epoch.
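One way to wire per-epoch logging into a Keras fit call is a lambda callback (a sketch under the assumption that the model was compiled with an accuracy metric; the exact name of the field in `logs` depends on the Keras version):

```r
library(keras)
library(mlflow)

# Log the training accuracy to MLflow at the end of every epoch.
log_epoch <- callback_lambda(
  on_epoch_end = function(epoch, logs) {
    mlflow_log_metric("accuracy", logs$acc)
  }
)

# Passed to fit() alongside the training data, e.g.:
# model %>% fit(x_train, y_train, epochs = 50, callbacks = list(log_epoch))
```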

So one other thing we did was we saved the model as an artifact. You can basically save anything you want and associate it with an experiment. So what we can do right now is attempt to serve this model locally using a web server, using this function mlflow_rfunc_serve. Our model directory is the Keras model, because that's where we logged it, and then we can provide the run identifier. So we'll do that, and then we've got a model being served right now on port 8090. And I've got this prepared: we can basically post a bunch of predictor data to this predict endpoint and see if we get some results back. So these are the probabilities of each class.
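The serving call sketched below follows the R API as it existed around the time of this talk (newer MLflow releases replaced this function, and the run identifier here is a placeholder):

```r
library(mlflow)

# Serve the logged Keras model as a local prediction web service.
# model_path is the artifact directory the model was logged under;
# run_uuid identifies which experiment run to pull the artifact from.
mlflow_rfunc_serve(
  model_path = "keras_model",
  run_uuid   = "<run id from the tracking UI>",
  port       = 8090
)
```

Predictions can then be requested by POSTing JSON-encoded predictor data to the server's predict endpoint, for example with `curl` or `httr::POST()`.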

Demo: running projects from GitHub

You can also run projects from GitHub. So let's just do this real quick, and I can show what's going on there. Basically what this is doing is it's pulling code from GitHub, spinning up an environment to execute this code, and then logging the results to a training run. If we actually go to this URL, and our entry point was this train.R file, we see that we're now using a random forest classifier with Spark to try to classify the flowers. So we see that the run has succeeded.
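The project run is a single call along these lines (a sketch; the repository URL is hypothetical, and the argument names follow the R API of the era):

```r
library(mlflow)

# Fetch an MLflow project from GitHub, set up its environment, execute the
# named entry point, and log parameters and metrics to a new tracking run.
mlflow_run(
  entry_point = "train.R",
  uri         = "https://github.com/<user>/<repo>"
)
```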

So we can go back to the MLflow interface, and we see that here we have the parameters that were passed to the project run and also some metrics associated with it. And the cool thing about this is that it records the exact commit from which this code was pulled, so you can actually click on it to navigate the repository at that specific point in time.


So I think that's all I have to show you guys, so, you know, definitely think about what sort of applications you can apply this to in your own endeavors.

Q&A

So I think do we have time for questions? Yep, we have some time for questions, if anybody has some.

Kevin, one of the things I was wondering about, do you have to use the MLflow interface in order to retrieve the metrics which are recorded, or is it possible to export it to, you know, a flat file or some kind of log?

Yeah, there's an API that you can use that's currently not exported yet in the R package, but there's a wrapper on the REST interface that you can use to potentially do that. So you don't have to use the interface if you don't want to, and then you can also build your own interface to suit your needs.

What is the state of collaboration between Python and R? So can I have an experiment that has parts of its code in Python, the other part in R, and track them in the same experiment on MLflow?

Yes, yes. So here I demonstrated, for example, this call to pull some code from GitHub, this did not have to be R code, it could have been Python code. And if we actually go back to this run right here, you'll see that there's a run command that's basically a command line command that anyone can use to reproduce your results. So if you're a data scientist working in R, anyone else, even if they have never seen a single line of R code, they can, you know, try to put different hyperparameters into your model and get some results back. So we basically try to abstract away the implementation of the model from the people who want to use the models.
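The reproduction command referred to here is MLflow's own CLI, which works the same regardless of whether the project code is R or Python (a sketch; the repository URL and parameter name are hypothetical):

```shell
# Re-run someone else's project run from the command line, overriding a
# hyperparameter with -P. MLflow fetches the code at the recorded commit,
# recreates the environment, and logs a new run.
mlflow run https://github.com/<user>/<repo> -P num_trees=100
```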

I saw in some of the code that you were making calls to the recipes package, maybe? Is that integrated into MLflow in any way?

So the cool thing about MLflow is that, like in the beginning, I was just logging arbitrary stuff. If you wanted to record specifics about your data preprocessing step, you can definitely do that. There's really no limit on what you can put in there. We logged an entire Keras model in there, too. So you can put in pictures and, you know, audio files if you want.