Resources

Heather & Jacqueline Nolis | Push straight to prod: API development with R and Tensorflow | RStudio

Talk from rstudio::conf(2019)

When tasked with creating the first customer-facing machine learning model at T-Mobile, we were faced with a conundrum. We had been told time and time again that to deploy machine learning models in production you had to use Python, but our very best data scientists were fluent in building neural networks in R with Keras and TensorFlow. Determined to avoid double work, we decided to use R in production for our machine learning models. After months of work, wrangling our containers to meet cloud security compliance and conforming to DevOps standards, we succeeded in creating a containerized API solution using the keras and plumber R packages and Docker. Today R is actively powering tools that our customers directly interact with, and we have open sourced our methods. In this talk, we'll walk through how to deploy R models as container-based APIs, the struggles and triumphs we've had using R in production, and how you can design your teams to optimize for this sort of innovation.

About Heather Nolis: Heather Nolis is a founding member of the AI @ T-Mobile team, focusing on the conversion of cutting-edge analyses into real-time, scalable data-driven products. She began her career in neuroscience, but once she realized how heavily that field relied on software built by other people, she pivoted, deciding to make software herself. You can find her @heatherklus on Twitter, where she speaks about diversity in technology, the ethical implications of data, and cats.

About Jacqueline Nolis: Dr. Jacqueline Nolis is a co-founder of Nolis, LLC, a data science consulting firm. She has over a decade of experience using data to help companies including DSW, Union Bank, Microsoft, and Airbnb. She has a PhD from Arizona State University, where her research focused on electric vehicle route optimization. For fun, she likes to use machine learning for humor.


Transcript

This transcript was generated automatically and may contain errors.

Hi, I'm Jacqueline Nolis. I'm a data scientist. I'm joined here today by Heather Nolis, machine learning engineer from T-Mobile, and this talk is about how we did something we're so excited about: how we got R working in production environments.

So what does it mean to put something into production? And we heard about that a little bit already during the keynotes. But we think about it like this. With data science, there are two types of data science. There's the analysis type, where you're trying to figure out an idea, do an exploration, and your deliverable is an idea.

And then there's the building type, where you make a machine learning model, and you want to run it. You want to run it over and over, like if you're designing a product recommendation engine for a website.

And so at T-Mobile, we really define putting code into production as making it so that customers can interact with it. And this morning, we heard about making dashboards that many users can use at once, maybe 20. Well, we have 70 million customers. And we just merged with Sprint, who has another 60 million customers. So we have lots and lots of people.

And when we're talking about machine learning, we're really talking about making machine learning models that all of those people can use. Hopefully not all at once, but all of them could use.

The AI at T-Mobile project

And so the project was called AI at T-Mobile. So T-Mobile has been doing data science for many years. And we have lots of people who are very good at things like that exploration, building churn models on our customers, that sort of reporting work.

But we didn't have much experience putting the machine learning models we developed into production environments where our customers could use them. So this project was to actually take machine learning and get it in front of our customers in a way that really improves the customer experience. And I say that because T-Mobile makes a really big deal about being the un-carrier.

So we're not going to make you have to go deal with chat bots and have bad experiences. We want things to be pleasant for our customers.

So our first scope was customer care messaging. Right now, today, you can go on your phone and text T-Mobile. And you can say something like, my coverage is bad. And a human being at a desk will respond back to you and help you try to diagnose your problem. We can do that through text message, we can do that through Facebook Messenger (you can slide into T-Mobile's DMs and a human being will respond back to you), or you can use our in-app messaging. So we have all of this text data, all of this stuff that's ripe for natural language processing. And our objective was: let's get some machine learning around that that makes things better for the customer.

So for our first particular use case, consider that our customers talk in all sorts of weird ways. Suppose one particular customer came to us and said, "this high bill shall not pass," which is an utterance we had never seen before. If we have an agent at a computer responding to these messages, maybe with eight conversations going on at once, our goal is to prep them: it would be really helpful if that agent, when they started the conversation, already knew the status of the customer's bill.

So what is our method? We're going to build a classification engine with machine learning. In this case, "this high bill shall not pass" would be classified as a bill breakdown message. And further, we can improve that by actually using customer data. If we know something about this customer, like their recent account activity, their current signal strength, or their bill status (maybe it's overdue), that may change what we want to show the agent. You can imagine that if a customer got their account suspended 20 minutes ago, and they come to T-Mobile and say "hi," well, we probably know what they're talking about.

Building models in R

So we can use that data to help inform our machine learning prediction. So how do we create the models in R? Well, our workflow was like this. One, we would start by using R Markdown for exploratory analysis. We would take those many conversations between customers and agents and try to build an understanding of what happens in them. What are the things people talk about? Then, once we had a better idea, we would build a machine learning model in R and save those models to flat files. In particular, we would do the model building in R Markdown so that we could have a log of exactly how it was built, what data went in, and how well the fit went, and then we'd save all of that. And then, before we even get to actually deploying anything, we need to get the business people excited about this.

So we show our model off with a Shiny demo. After we did the exploratory analysis, we would start building a model. We ended up building neural networks using Keras. Keras is a really, really cool R package from RStudio that lets you very easily build deep learning models. For us, we ended up using a convolutional neural network, which is a particular type of neural network, to do the text parsing and understand the topic of the text. But we also had to bring in all those other data types too, like what's the order status, or is their account currently suspended.

And so we had more traditional neural networks parse that data. And then we put it all together with one final neural network that outputs the classification. So in this case, we would train a neural network that would learn that "unlock my phone," with a recent order, has an 80% chance of being categorized as unlock.
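As a rough sketch of the two-input architecture described above, here is a toy version built with the keras R package. All of the sizes here (vocabulary, input widths, layer units, number of classes) are made-up placeholders, not T-Mobile's actual architecture:

```r
library(keras)

# Text branch: embedded tokens -> one convolutional layer -> pooling
# (the talk notes the real network was similarly shallow)
text_input <- layer_input(shape = 50, name = "message_tokens")
text_branch <- text_input %>%
  layer_embedding(input_dim = 10000, output_dim = 64) %>%
  layer_conv_1d(filters = 64, kernel_size = 3, activation = "relu") %>%
  layer_global_max_pooling_1d()

# Customer-data branch: account activity, signal strength, bill status, etc.
account_input <- layer_input(shape = 8, name = "customer_features")
account_branch <- account_input %>%
  layer_dense(units = 16, activation = "relu")

# Combine both branches into one softmax over the message classes
output <- layer_concatenate(list(text_branch, account_branch)) %>%
  layer_dense(units = 32, activation = "relu") %>%
  layer_dense(units = 12, activation = "softmax", name = "message_class")

model <- keras_model(list(text_input, account_input), output)
model %>% compile(optimizer = "adam", loss = "categorical_crossentropy")

# Save to a flat file, as in the workflow described above
save_model_hdf5(model, "message_classifier.h5")
```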

So we built these networks, and we were pretty excited about them. But we had this problem: we'd go into meetings with the business people and be like, guys, look at these ROC curves, 80.8! And they're like, whatever. Then we had one meeting with some business people, and 10 minutes before, we're like, let's just throw it in a Shiny demo. And this changed the game. The fact that we actually had Shiny demos where a business person could type in "I want to unlock my phone" and actually watch it be classified as unlock, this got us, and I'm not exaggerating, millions of dollars of funding and multiple people added to our team. And it's because we could take this, show it to a business person, who showed it to their director, who showed it to the VP, until eventually we ended up showing this stuff to the CIO, all because we had a really nice Shiny demo that we built around our models.

This changed the game. The fact that we actually had Shiny demos where a business person could type in "I want to unlock my phone" and actually watch it be classified as unlock, this got us, and I'm not exaggerating, millions of dollars of funding and multiple people added to our team.
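To illustrate how little code such a demo takes, here is a toy Shiny app in the same spirit. The `classify_message()` function is a stand-in keyword matcher for illustration only, not the real neural network:

```r
library(shiny)

# Stand-in for calling the real Keras model; purely illustrative
classify_message <- function(text) {
  if (grepl("unlock", tolower(text))) "unlock"
  else if (grepl("bill", tolower(text))) "bill breakdown"
  else "other"
}

ui <- fluidPage(
  titlePanel("Message classifier demo"),
  textInput("msg", "Type a customer message:"),
  textOutput("label")
)

server <- function(input, output) {
  output$label <- renderText({
    paste("Predicted topic:", classify_message(input$msg))
  })
}

# shinyApp(ui, server)  # uncomment to launch the demo locally
```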

Putting R into production

So I'm a data scientist, and I'm very comfortable doing all the things I just showed, but putting things in production, deployments, when I started this project, a lot of that stuff was new for me. But thankfully, our team had machine learning engineers to help.

Okay, am I on? Hello? Can anybody hear me? There we go. Okay. So when it came time to put our models in production, I was handed these R files, and I was a Java developer, and I was like, I really don't know what to do with this or how to get it in a place that can interface with our 70-plus million customers. Every single person had told our team, if you want to do machine learning in production, you have to use Python. But I had just been handed a really cool R project, so hiring data scientists to redo all this work in Python and redo the analysis seemed silly. Since the model was already built, and it was very good, and the business people really liked it, I could have converted it to Python and then put it in production, but that seemed silly too. So I was like, what if we just do it all in R, and take out this double work that people assume you have, where you have to rewrite things in Python for 70 million people?

So, yeah, the radical idea here was: what if we treat R like a real programming language? Because it is a programming language. Here's how things work on our team, and this is the way any of our Java or Node projects would work. We have all of our stuff in repositories, and as soon as you make a commit to a repository, some software called Jenkins builds it up into its final project. Then Marathon, which is kind of like off-brand Kubernetes, orchestrates the container deployment on Mesos hosts and replicates those containers for you. So we have a really nice system in place for Java projects where you can just make a commit and then instantly something's in production and facing your customers. I was like, well, let's make R do that and avoid this whole Python trip entirely.

The radical idea here was what if we treat R like a real programming language? Because it is a programming language.

And so the first thing that we had to do was convert our R models into an API. What's an API? Well, right now, if I were to go on my phone and type weather.com, it would return a website to me, and that's an API: it's just a way of using the Internet to send a message. "I want to go to weather.com and get some information back." "Here's the entire website, that's weather.com." You can also use it for model input and output. "I want you to predict how long this stem length is": send it to the model via an API, and it comes back with "okay, I have a prediction for you." So it's very, very easy. Once you have a REST controller, which defines all the little slashes that you would put on your website URL, you just start plumber on your R model, and now you can go into your browser, type something in, and immediately get information back from the model despite not being anywhere close to it, which is really cool. That's how all of our services talk to each other in engineering: via API calls. This way we can have Java talking to Python talking to R, and it all makes sense because it all uses the Internet.
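A minimal plumber file in the spirit of what's described might look like the following. The endpoint paths and the `preprocess()` helper are hypothetical, sketched for illustration rather than taken from the actual T-Mobile code:

```r
# plumber.R
# Sketch: load the saved model once at startup, then serve predictions
# over HTTP. preprocess() is a hypothetical helper that would tokenize
# and pad the raw text into the shape the model expects.
library(keras)

model <- load_model_hdf5("message_classifier.h5")

#* Health check so the platform can verify the container is alive
#* @get /healthcheck
function() {
  list(status = "ok")
}

#* Classify a customer message
#* @post /classify
#* @param message:character The raw message text
function(message) {
  x <- preprocess(message)        # hypothetical preprocessing step
  probs <- predict(model, x)
  list(confidence = max(probs))
}
```

The API is then started with something like `plumber::plumb("plumber.R")$run(host = "0.0.0.0", port = 8000)`, after which any HTTP client, in any language, can hit those endpoints.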

Containerizing with Docker

And then the way that we package everything up on our team is with containers. So what are containers? If I were to go to Best Buy right now, buy a laptop, and I really wanted to put Jacqueline's model on it, I would have to go home, sit down, set up my Internet, install Outlook, then install RStudio, and then all of the packages needed for RStudio. It gets quite cumbersome to set up the machine, and then of course I'm using a more recent version of a package than she was using, so her model still won't run even though she emailed it to me and I downloaded it. What Docker is, essentially, is just a way to write down all of those setup instructions so that they're repeatable forever. That way, any time a new virtual machine is spun up, a little container on it can just replay those Docker instructions and have the exact same thing that I was looking at on my laptop, or the same thing Jacqueline was looking at on hers. So I can take my code to someone who doesn't have R installed on their computer, and if they have Docker, it will run on their machine.
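A stripped-down Dockerfile in the spirit of this setup might look like the sketch below. The base image tag, system library list, and file paths are illustrative assumptions, not T-Mobile's production file:

```dockerfile
# Start from a version-pinned R image maintained by the rocker project
FROM rocker/r-ver:3.5.2

# System libraries commonly needed by plumber and keras (illustrative list)
RUN apt-get update && apt-get install -y \
        libssl-dev libcurl4-openssl-dev libsodium-dev && \
    rm -rf /var/lib/apt/lists/*

# R packages, plus the Python/TensorFlow backend that R's keras calls into
RUN R -e "install.packages(c('plumber', 'keras'))" && \
    R -e "keras::install_keras()"

# The API definition and the saved model from the modeling workflow
COPY plumber.R message_classifier.h5 /app/

EXPOSE 8000
CMD ["R", "-e", "plumber::plumb('/app/plumber.R')$run(host='0.0.0.0', port=8000)"]
```

Anyone with Docker can then build and run this image and get the same environment, whether or not they have R installed locally.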

And so the first problem that we actually had: I have this container, and we used a rocker-based image. rocker is an organization that makes R-based Docker images, so you can just copy the work that they've done and start from there. So I have R in a Docker image and I have Jacqueline's model, but the big problem is that R's Keras actually requires Python, since it calls Python under the hood. So we're like, okay, we have this Docker image that contains R, it contains RStudio, it contains our model; let's put in the actual Python that's required to make this run.

So, okay, we finally get Python installed, and everything should be good, but it's not, because plumber doesn't support HTTPS, and we're a big enterprise, so we have to have really secure connections between things. HTTP was not sufficient. So inside of our Docker container, we added an Apache 2 server that sits alongside our R code, and all it does is take HTTPS requests, convert them to HTTP, let the model do its processing, and pass the result back through.

So now we have R, and we have the model, and we have Python, and we have this Apache 2 server, and it's getting quite hefty. That was actually the next trouble we ran into. We deployed it, it went through our pipeline, it went out perfect, but it was nearly 5 gigs in size by the time you get all the dependencies and everything you need. Here's a quick breakdown of what our container actually looked like: if you look, the bulk of the container ends up being Python. And our DevOps team was like, you guys can't keep running 5-gig containers, you've taken down entire T-Mobile clusters, huge problems are occurring because you're throwing this out there. So we said, okay, we'll do some work. We switched out the full version of Anaconda Python, we removed RStudio from the Docker image, and we spent a lot of time removing all these unnecessary Linux and R and Python libraries to get the container size small enough that DevOps would say: okay, we're totally fine with you using R now, because you've made it small enough, it contains all of our security requirements, it contains everything it needs to run, it looks good to go.

And we did it. We deployed a model in R, T-Mobile's first customer-facing deep learning model, and it was completely written in R. It's an R-native API, it has TensorFlow, it's super, super small, and it's just as secure as everybody wants and requires it to be. So, really cool.

Lessons learned

But we learned a lot in doing this. I think the first thing we learned is, you know, people came to us in the beginning and said, you have to write this in Python, we won't support you doing it in R. But we tested our APIs at 50 times T-Mobile's maximum load, and R performed just fine. For us, using R and plumber was at very close performance parity with Python and Flask. R was also really good for the quick data exploration that gave us the idea to build this model in the first place. The Shiny demos are what really, really sold our project, and there's not an equivalent for Python; you can't just quickly throw together a few lines of code, take it to a business person, and have it passed around like that. And most importantly, language was never the fail point for our project.

It was always security concerns, or the container being too big, or Python being giant, or some dependency problem, but it was never R that was the reason we were struggling at any point.

And the next thing we learned is that it's really, really critical to work in a flat team. Traditionally, how this works is the data scientists build the model and send it over to the engineers, the engineers rewrite it in Python, and then they deliver it to the business. But the business wants it a little bit different, so it goes back to the data scientists. We stopped that game of telephone and said, what if we all just sit together? So our team was literally two machine learning engineers, three data scientists, a Java engineer, and our business person, all together at the same table every single day. That way the machine learning engineers can be developing the API, and when the data scientists find something new they want to do, they can just talk to each other instead of waiting two months for a finished product and then trying to iterate on that. It made for a very, very agile structure.

Where things stand today

So with all that work, and it was a lot of work, very exciting, fun work, where are we today?

The first thing is that we have R in production right now, in a way that any iPhone user in the audience can hit right now: you can make R code run, which is, I don't know, I never thought there would be that day, never mind that I'd be part of it.

So in particular, there's a product called Apple Business Chat. So about a year ago, Apple released this way for companies to actually use iMessage. So if you search for T-Mobile on your iPhone, you can start a chat with T-Mobile, and it's like a nicely colored branded box, but T-Mobile doesn't get to know your telephone number. All they know is that you're an anonymous person.

And so the first thing T-Mobile used to do was ask you, hey, are you a customer or not? Because they needed to know whether to route you to customer care or sales. So we'd ask the customer and get a yes or no back. And to us, this was ridiculous, right? If someone says, "why is my bill so high?", of course they should go to customer care. If someone says, "I want to switch to T-Mobile," they should not go to customer care. So we built a machine learning model around that. Now, every time someone in iMessage starts a new conversation, it's sent to the R machine learning model. If the model is confident they're a customer, or confident they're not, then they don't see this picker at all. If we're unsure, like if they just say "hi," then they still get the picker like before. So we've actually used machine learning and AI to make the customer experience more streamlined, rather than building an annoying chatbot that's like, "please say unlock," you know?
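The routing decision described above can be sketched as a simple threshold rule. The cutoffs and names below are invented for illustration, not T-Mobile's actual values:

```r
# Decide where a new iMessage conversation goes, based on the model's
# estimated probability that the sender is an existing customer.
# Thresholds here are hypothetical placeholders.
route_conversation <- function(p_customer,
                               customer_cutoff = 0.90,
                               prospect_cutoff = 0.10) {
  if (p_customer >= customer_cutoff) {
    "customer_care"   # confident they're a customer: skip the picker
  } else if (p_customer <= prospect_cutoff) {
    "sales"           # confident they're not a customer: skip the picker
  } else {
    "picker"          # unsure (e.g. the message was just "hi"): ask
  }
}

route_conversation(0.97)  # "customer_care"
route_conversation(0.50)  # "picker"
```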

So we're really proud of that.

And you can have some of this work, too. We took a lot of what we've done, packaged it up, and made it open source. We've written several blog posts on how to get started with plumber, Docker, and our actual code. And we created a repository that is a full Docker container running a TensorFlow neural network in R; you can download it and run it, and it just works. The neural network we used for sample data generates pet names. So it's a lot of fun. I recommend you try it out.

If you're interested in how this works, I highly encourage you to look at it, because you may learn a couple of things that make it easier for you, things we had to learn ourselves over a lot of time.

So thank you so much. The materials from this talk are all located at nolisllc.com: the blog posts, the GitHub repository, these slides. So that's really useful. Our Twitter handles are up there; we love being tweeted at. And thank you so much. I'd also like to say that while Heather and I are up here, James Ellison, our product person, is in the audience, and there are so many other T-Mobile people who really helped with this project, too.

So we like to do a lot of talking, but it was not just us who did this project. And also we could talk about this literally all day. We were like, how can we condense it down into just 20 minutes?

And then also, we are hiring. So if you think that what we're doing sounds really cool, we need really good data scientists, and we prefer R users, obviously. This seems like a really great place to announce that, and you can hit us up on Twitter, whatever you feel like.

Q&A

So you said you show business people Shiny apps to demo your model. Do you have an area where you deploy those kinds of Shiny apps for projects?

So our Shiny projects actually go through the same pipeline that I just showed you. We have one single container right now that contains all of our different Shiny apps. And we have it up so that any time James needs to take this to a business person and show off something that we built, the link is up and always ready for him.

I would say we are going to switch to having a Docker container that has each app in it separately. Because we've run into trouble where like we've updated one app and it's broken another. So going forward, we're switching to one container for each.

Because this specific instance is a Keras model that isn't necessarily specific to R or Python, do you need R or Python at all? Is it because of like the API wrapper or text preprocessing?

So I think that, from my perspective, it was that I was handed a model in R. Why would I rebuild that? It already exists. And the exploration was done in R. So that's why we decided to support the R-native model.

So what we do, you know, with us, we first take the conversation and then we remove the weird characters. And then we do this. And then we feed it to the model. And, you know, some of our engineers are like, wow, we could technically run TensorFlow in Java. But then we'd have to figure out what to do with all those steps. And it's like, well, we'd have to keep them in R anyway.

So I think that normally the argument that people put forth is like, oh, well, it's more scalable and we support Java. And it's like we just used all the Java tools that exist and did it in R and it worked just fine. And then you eliminate the double work.

I have a question about your neural network. Based on the application, it looks to me like it's natural language processing, so it should be a sequence model. But you mentioned a convolutional neural network. Can you give me some explanation here?

So we actually started with a recurrent neural network, and we found that the recurrent neural network did not perform as well as a convolutional one. Our suspicion is that it's because of the nature of the classification: usually someone says "bill so high," and for those three words, it doesn't matter what's around them. That's our suspicion for why a convolutional neural network works slightly better. But we were actually surprised at how shallow the optimal networks ended up being. We were expecting to need five layers and recurrence, but it's really one convolutional layer, one or two dense layers, and that's the whole network.

You spoke about making your app secure. Can you speak a bit about that? Two-factor authentication, and how were you able to achieve that using R?

So all of our stuff right now is accessed from within our network, so we don't have to do Apigee authentication. But if we did, it would be very simple; we would just make the relevant API calls. One thing that we did do is that all of our messages have to be passed encrypted. There's a library called sodium that's really good at encryption, and you can decrypt in Java. So we were being passed things from Java that we then had to decrypt and re-encrypt in R, and we did all of that using the sodium library, which is AES-256 encryption. So it's pretty secure.
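A minimal encrypt/decrypt round trip with the sodium R package, in the spirit of what's described, might look like the following. The key derivation here is purely illustrative; in practice the key would be shared securely with the Java services rather than derived inline:

```r
library(sodium)

# Illustrative 32-byte symmetric key. A real deployment would exchange
# this securely with the Java side, not hard-code a passphrase.
key <- sha256(charToRaw("shared-secret-for-illustration"))

nonce <- random(24)  # fresh nonce per message

plaintext <- charToRaw("why is my bill so high")
cipher <- data_encrypt(plaintext, key, nonce)

# Decrypt (the Java side would do the equivalent with its crypto library)
recovered <- rawToChar(data_decrypt(cipher, key, nonce))
identical(recovered, "why is my bill so high")  # TRUE
```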