Resources

Warren Hearnes @ OptiML AI | Data Science Hangout

We were recently joined by Warren Hearnes, Founder of OptiML AI, to chat about 30+ years using optimization, ML, and AI to solve significant problems for businesses (Best Buy, Home Depot, UPS, and more).

Speaker bio: Dr. Warren Hearnes has held many roles in the areas of optimization, machine learning (ML), and artificial intelligence (AI) over his 30+ year career. He recently started his own company, OptiML AI, operating at the intersection of optimization, machine learning, and artificial intelligence. Prior to that, Warren was Chief Data Scientist at Best Buy, where he led a team of 45+ data scientists that elevated the use of AI, ML, and optimization across the enterprise, including marketing, labor, supply chain, and customer service. His team collaborated with technology partners to build the environment of tools, access, standards, and platforms that enabled increasing scale, sophistication, and impact of data science across the enterprise. His career includes significant roles such as Chief Analytics Officer at Cardlytics, where he played a pivotal role in the company's growth to a multi-billion-dollar entity, and positions at The Home Depot, UPS, and Lucent Technologies.

► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu

Follow Us Here:
Website: https://www.posit.co
LinkedIn: https://www.linkedin.com/company/posit-software

To join future data science hangouts, add to your calendar here: https://pos.it/dsh

We'd love to have you join us in the conversation live! Thanks for hanging out with us!

May 7, 2024
59 min

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Hi everybody, welcome back to the Data Science Hangout. I'm Rachel Dempsey, I lead Customer Marketing here at Posit. And you know what, I just learned that some people are actually hearing about Posit through the Hangouts, so I wanted to add a little bit about Posit in case it's new to you. We're the open source data science company, building tools for individuals, teams, and enterprises. And so I'm so happy to have you joining us here today.

The Hangout is our open space to hear what's going on in the world of data across different industries, get to chat about data science leadership, and connect with others facing similar things as you. And so we get together here every Thursday at the same time, same place. So if you're watching this as a recording on YouTube in the future, and you want to join us live, there'll be details below on how you can add it to your calendar. Just a quick note to make sure it adds it for you at 12 Eastern time.

If this is anybody's first Hangout, I would love to see you say hi in the chat just so we can welcome you in here as well. We're all dedicated to keeping this a friendly and welcoming space for everyone and love hearing from you no matter your years of experience, titles, languages that you work in, or industries. You could be a part of the party happening in the chat. You could also jump in and ask questions or provide your own perspective.

So you can raise your hand on Zoom. If you're wondering how you do that, it's in the reactions bar in the Zoom bar below. And I'll call on you to jump in. You can put questions into the Zoom chat and just put a little asterisk next to it if it's something you want me to read instead. And then third, I see Curtis just shared the Slido link where you could ask questions anonymously too.

But with all that, thank you again so much for joining us here today. I'm excited to be joined by my co-host Warren Hearnes, founder of OptiML AI. Warren's held a variety of data roles across his career, from Chief Data Scientist at Best Buy to roles at Home Depot and UPS. And it's fun that this is actually the first time I'm getting to meet you live on camera here as well. So I'd love to have you introduce yourself first and maybe share a little bit about your new company.

Warren's background and career journey

Sure. I'm glad to be here. So I'll give a plug for Posit. I've been using RStudio for many, many years. So if you don't know about Posit and RStudio and some of the other things that I'm sure I don't know about, talk to Rachel or to Jake. I also like to see, it's a great crowd. I see Curtis is on the treadmill. I like it that he's multitasking and getting some exercise in.

So I'll give you all a little bit longer background than I normally do for a discussion. But I've been in what we know as data science for, I guess, it's 2024, so 32 years now, not including some time in operations research in the Army. So I've always loved using math and data to make better decisions. And my undergrad was at West Point. I started in 1985 and graduated in 1989 and got a degree in mathematics and operations research.

And so operations research is the more traditional things like integer programming, linear programming, statistics, simulation, things like that. So I thought I was going to spend a career in the Army. Did all the fun Army stuff like airborne ranger, field artillery, got to blow some things up, but decided to get out in the early 90s. I got out in 1992.

And that was before we had any search engines or anything like that. I knew I wanted some type of job that used math. So I was visiting someone in Portland, Oregon, and I actually had to take a taxi down to Portland State University to look at a physical bulletin board with physical job postings up on it. And I saw that all of the job postings, and there weren't a whole lot, but all of the job postings in industry required a master's or a PhD. And so I thought, well, maybe I'll get a master's.

I applied to Georgia Tech, got accepted into their program, knew nothing about grad school, got to Atlanta from Hawaii. That's where I was stationed. So I got to Atlanta, found out that they don't fund master's students. They asked me if I wanted to be a PhD student. And I said, sure. So I just jumped into a PhD program. And this is where one of those just chance meetings changes everything.

So at the time, this was the fall of 1992, machine learning was around, but it wasn't as popular as it is these days. There were only two professors at Georgia Tech who were doing funded machine learning research, and one of them was in the industrial engineering school at Georgia Tech. And he was working in reinforcement learning for robotic control. And I was already studying neural networks, fuzzy systems, and things like that. Turns out a chance meeting got me into that research. And so I started doing reinforcement learning for robotic control using adaptive dynamic programming and things like that throughout the mid-90s.

Now, machine learning wasn't as popular. So a lot of the people said, well, if you can't prove it has the optimal answer, then it's not really worthy of publication, things like that. So I decided to get a job. I also met a girl from here in Atlanta. We were getting married. I needed to buy a house, so I needed to have a job.

And so I started working at Lucent Technologies. And there at Lucent Technologies here in Atlanta, at the time, we had the world's largest fiber optic cable factory. So we were using integer programming, mixed integer linear programming, to schedule all the work in the factory and to assign the raw materials to all of the orders. I mean, we were really using a lot of very complex math to make that factory run much more efficiently.

Then I spent about six years doing that. UPS is also headquartered here in Atlanta, so I got a chance to move over to their corporate headquarters, where I started doing machine learning once more. So we were doing forecasting. We were doing fraud detection. We were also using some integer programming to relocate empty assets around the United States. So I spent eight years doing that.

Moved over to the Home Depot, which is headquartered here in Atlanta, and that was my first foray into marketing analytics. So our team was in charge of all the direct mail and all the email that went out. That was in 2011. So if you got something from us back then, that was our team. It's how do we target? Who do we target? How do we execute the marketing campaigns? How do we measure them? And how do we get a good return on investment from that?

That got me noticed by a small startup in Atlanta called Cardlytics. So I switched after a year at Home Depot. They still have a great business model, working with the banks. The banks know that really the best predictor of your future purchases is your past purchases. A lot of companies will use demographics and things like that as a proxy for what you're going to buy in the future, but the best predictor really is what you bought in the past. And with that partnership with the banks, we eventually got 50% of all the card swipes in the United States and the UK, and we were able to grow that company and take it public in 2018.

And I built up an analytics and data science capability at that company that really did some amazing things. And T. Roo was on here. He was leading our team in India, doing some very amazing things with NLP, trying to take these merchant strings and classify them. So, great company, but with some reorgs and some changes, I left in 2022.

And got a chance to become chief data scientist for Best Buy. Best Buy was growing a technology and analytics hub here in Atlanta. So I helped start that. We were basically applying optimization, machine learning, AI to a variety of problems around the enterprise, whether it was supply chain, labor, customer segmentation, media mix optimization, call centers, store operations, you name it.

So then, maybe about one month ago, almost to the day, we reorged at Best Buy. I got a good package. I left that company. And right now I've technically started this company called OptiML AI, and it's spelled O-p-t-i, M-L, A-I, because it's the combination of the things that I've been doing for my entire career: optimization, machine learning, and artificial intelligence. I haven't done much with it yet because I've just been playing a little golf and doing some traveling, but that's really going to be my focus. If I'm doing something either part-time or full-time from this point out, and I'm close to retirement, then it's going to be helping companies learn how to use the newer things that are data-based, like machine learning and artificial intelligence, and combine them with some of the more traditional methods, like integer programming.

AI hype cycles and historical context

Yeah, I think in some ways it's similar. So AI has had its ups and downs for many, many years. You know, in the 80s there was an AI winter, and then something significant will happen, like what's happened with gen AI lately. I want to make sure that people in this field don't gloss over the things that have happened in the past. And I'm not talking about the past couple of months or a few years. I'm talking about decades of thought and research.

So I like to say that, you know, in January, there was a paper called Steps Toward Artificial Intelligence. And in that paper, they talked about basically five things: the problem of search, the problem of learning, pattern recognition, induction, and planning, those types of things. And every single one of us is working in search and pattern recognition and learning. But the point that I make is, and I've got a copy of it right here, because I do this example all the time: that wasn't January of this year, that was January of 1961. That was Marvin Minsky in 1961, talking about reinforcement learning and pattern recognition, and everything that we are still working on today, over 60 years later.

It's not to say that we haven't made some fascinating leaps and bounds. But when I tell people that I was working in reinforcement learning in the 90s, a lot of them are surprised that we were doing that in the 90s. Well, we were doing it in the 50s and 60s. It's just that we're doing it at a much higher scale now.

So I think right now we are in a very, you know, we're riding the wave of gen AI to talk about what we can do with it. And then there's going to be that, you know, standard hype cycle. There's going to be, let's get it into practice. There's going to be a big cost. We're going to figure out how to do it better. There will be a plateau. Then there will be something else. And I guess my last point on that would be, I had heard about large language models from many, many people. So it's not as if just one thing came out. You know, in your career, you're going to see a hundred ideas and maybe one or two of them are big ideas. So you've got to learn to sift through those. And to be honest, there are a couple of people on my team at Best Buy, Jimi Hendrix and Will Armstrong, they were telling me about some of the new advances in large language models. And I was more like, yeah, I've heard about those advances in the past and they haven't really panned out. And it wasn't until basically GPT 3.5 that I saw a big step change.

Job requirements and generative AI skills

Yeah, thank you. Because, as you were mentioning, gen AI and all the more recent developments. I'm curious. So my background is in bioinformatics and I've been in the field for quite a while, like got my master's 14 years ago. So I was doing everything around data science, machine learning, computational biology for a while. And now when I'm looking for a new job, what I see is that a lot of job descriptions suddenly have all these requirements, like, if you don't have, I don't know, a PhD degree in NLP, or know how to do all this stuff with, of course, not just using, but training, fine-tuning, all of this with gen AI LLMs. And I find myself at this point where I'm like, okay, should I invest, I don't know how many hours of my time, and maybe take another course, so at least I can say that I can do this, because I'm pretty sure that if I'm hired for an actual job, I'm able to do this. But I feel that it's frustrating to have all of this in job requirements. And I feel that I have to compete with people who are confident enough to say, oh yeah, I know how to do this. For me, I don't want to say that I know how to do something if I don't.

You've hit on a very important point. I mean, it's very difficult to write a good job description. I guess my thoughts on this are a couple of things. First, I'll hit on the last thing you said: I would rather somebody that I'm hiring tell me what you just said, which is, you know, I'm not going to bullshit you about my experience in this, but look at all the other things that I've done in the past that have been successful. And I have some basic knowledge in generative AI. I can learn and do things.

I don't want somebody basically being as confident as the hallucinations in ChatGPT. I want somebody to be honest with me. So when it comes to the job descriptions, you've got to meet some of the criteria that they put in there, but also realize that the person they hire is not going to meet all of those criteria. So, you know, if you think that you have most of the qualifications, go ahead and apply, and also try to find somebody in that company. And, you know, if you really enjoy generative AI, then study it. But I will say, this isn't going to be the last big thing. So if this is not an area where you want to dive into the details, then know enough to get that role and do some things. There will be other big things that happen in your long career in this.

Bridging statistics and machine learning

So this is related to Svetlana's question and kind of tying into the response, Warren. So like you, earlier on, I had, well, one of the defining points of my own career has been kind of helping folks bridge the gap between inferential statistics and machine learning, the unexplained stuff. And I'm actually finding that that area still holds a lot of interest for me, just intellectually and in some of the things that I like doing. So what are some of the things that you see maybe kind of moving forward happening in that field or in that area of data science that you think might kind of come about in the near or farther out future?

Yeah. So if you're talking about, I guess, generalizing it, not everything's going to be new. We still need to do a lot of the older, more traditional things. And statistics has a lot of great application, because it taught us things when we didn't have the amount of data that we have right now. I know at Best Buy, my team was fairly diverse and large, and some of them were traditional statisticians and some of them had come out of some of the more recent programs. And I'll say that some of the more traditional statisticians would talk about uplift modeling or causal inference, and those two things are still super important. They're getting a resurgence right now, especially causal inference.

Those from the newer programs were kind of dismissive, like, yeah, I read that in a textbook, it's not that important anymore. But then we actually applied it to some of our marketing campaigns and it made a significant difference. So I still think that if that's an area you enjoy, then keep bridging that statistics-to-ML gap, just like I'm talking about bridging optimization to AI and ML. Because machine learning starts with a lot of data, and then you apply some algorithms and it infers a model from the data.

Things like integer programming start with a model of the process. And there are so many things that we can do with a model of the process rather than starting from scratch each time. An example in reinforcement learning: you're teaching a car to drive itself. As humans, we implicitly know things like, hey, if I learned that turning the steering wheel left makes the car do this, then our brain has this model of symmetry that says I can basically do the same thing when I'm turning to the right. But a standard reinforcement learning algorithm doesn't have that built-in symmetry. So you can infuse models of gravity or symmetry and things like that even into reinforcement learning.
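The symmetry idea above can be sketched in a few lines of tabular Q-learning. Everything here (the toy steering state, the reward, and the mirror() helper) is invented for illustration; it is not the actual mid-90s algorithm being described:

```python
from collections import defaultdict

# Toy sketch of infusing a symmetry prior into reinforcement learning.
# State is a signed offset from the lane center; actions steer
# left (-1), straight (0), or right (+1). Each observed transition is
# also mirrored across the centerline before updating, so a left-turn
# experience teaches the symmetric right turn "for free".

ACTIONS = (-1, 0, 1)

def mirror(state, action, next_state):
    """Reflect a transition across the centerline."""
    return -state, -action, -next_state

def q_update(Q, s, a, r, s2, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update."""
    best_next = max(Q[(s2, a2)] for a2 in ACTIONS)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

Q = defaultdict(float)
# One observed transition: at offset +2, steering left moves us to +1.
s, a, r, s2 = 2, -1, 1.0, 1
q_update(Q, s, a, r, s2)
# Infuse the symmetric experience without ever observing it.
ms, ma, ms2 = mirror(s, a, s2)
q_update(Q, ms, ma, r, ms2)
```

After these two updates, the agent values the mirrored state-action pair at offset -2 identically, even though it never experienced it.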

Most proud projects

So, you know, my thought on this is: find what you really enjoy doing. That will be your niche. And hopefully it will also give you a long career as well.

Warren, a question I wanted to ask you is, thinking back over all the different roles you've had, is there a project that comes to mind that you're most proud of in your career?

Well, there's a couple. One is an optimization project at Lucent. So we've heard the famous saying, and I can't remember who said it: all models are wrong; some models are useful. There was this integer programming model that assigned raw materials to the cables. At the time, these cables were made of stacks of ribbons of fiber, so there could be up to 864 fibers in a cable. And the pressure of the circular cable's sides on this rectangular stack caused issues on the edges, so we called them edge ribbons or corner ribbons. And for a while there, you didn't have enough of that really good fiber, so you really had to allocate it the right way.

Then a few months later, well, maybe a year later, our factory processes had gotten so much better that we just had way too much of this, and we were trying to keep it separate. And then we started to not be able to complete orders, because we were taking this premium fiber and never using it. And some of us got together, got on a whiteboard, and found that if you took this substitution matrix of which ribbons could substitute for each other (this one's better than that one, so it can substitute for it), and you put that matrix in there as a minimum and the transpose of that matrix as the maximum, then all of a sudden, just by adding those two matrices, everything could substitute for each other. And we were able to increase the capacity of the factory and reduce waste. And it was all because, you know, we got together and we used a couple of mathematical ideas.
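As a loose sketch of that substitution idea: the grades, quantities, and greedy allocation below are all invented for illustration (the real Lucent model was a mixed integer program), but they show how a substitution matrix lets surplus premium stock fill ordinary demand instead of sitting idle:

```python
# Hypothetical illustration of downward substitution. Grades are
# ranked best (0) to worst; S[i][j] == 1 means stock of grade i may
# be used to satisfy demand for grade j.

def build_substitution_matrix(n_grades):
    """A grade can substitute for itself or any worse grade."""
    return [[1 if i <= j else 0 for j in range(n_grades)]
            for i in range(n_grades)]

def allocate(stock, demand):
    """Greedy allocation honoring the substitution matrix: fill each
    demanded grade with its own stock first, then with better surplus."""
    S = build_substitution_matrix(len(stock))
    stock = list(stock)
    filled = [0] * len(demand)
    for j in range(len(demand)):        # each demanded grade
        for i in range(j, -1, -1):      # own grade first, then better
            if S[i][j]:
                use = min(stock[i], demand[j] - filled[j])
                stock[i] -= use
                filled[j] += use
    return filled, stock

# Premium grade 0 is in surplus; without substitution, 5 units of
# grade-1 demand would go unmet.
filled, leftover = allocate(stock=[10, 2], demand=[3, 7])
```

With substitution allowed, both demands are met in full and only premium stock is left over.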

And then there's the one where sometimes it's just a conversation or something that you read. At UPS, there was this idea that you could use the yeses and nos on the bids that you sent out to your big commercial clients. You'd tell them, we'll give you 10% off the retail rate, and they'd say either yes or no. And you get tens of thousands of these data points, and you create this logistic regression that can give you an idea of what your competitors are charging, because you assume that everybody's rational. If UPS gives you 10% and they turn it down, then likely FedEx or somebody else gave them an 11 or 12.

Well, the results coming out were good, but you couldn't explain them. And it turns out that it was because the data coming in was basically censored. You didn't ask them: I'll give you zero, and they say no; I'll give you one, they say no; I'll give you two, they say no, and then finally you get to 10. You got all these data points right around the answers. Now I'm realizing that I'm describing something that I'm not actually showing you on a graph, but to make a long story short, I read something in a paper where somebody called them shadow data points or mirror data points. And we infused the actual data set: we said, if you accepted at 10%, then you would have accepted at 11 or 12 or 13, and we put all of those in. And if you turned us down at 10%, then you would have turned us down at nine, eight, seven, six, five, four, three, two, one. And when we put all of these data points in there, everything worked and you could explain it.

And we actually patented that idea, that you could infuse data sets in this particular application with what we call shadow data points. So sometimes, you know, you can augment your data set with successes and failures that weren't really there, because you reason that a rational person who would accept at a certain price would certainly accept at a lower price, and one who would turn you down at a certain price would turn you down at a higher price. So that was a really interesting thing that worked for UPS. I saw somebody at a conference who was in hotels, and they described the same problem. And I said, since you're in hotels and you're not competing with UPS, here's our solution. And it worked for them as well.
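The shadow-data-point augmentation described above is simple to express in code. This is a hypothetical sketch (the discount grid and the observations are invented, and the real application used UPS bid data, not this toy), showing how one observed yes/no expands into the implied set before fitting the logistic regression:

```python
# Sketch of "shadow data point" augmentation for censored bid data.
# A rational customer who accepts a 10% discount would also accept
# any deeper discount; one who rejects 10% would also reject any
# shallower one.

DISCOUNTS = list(range(0, 21))   # hypothetical 0%..20% discount grid

def shadow_points(offered, accepted):
    """Expand one observed (discount, yes/no) bid into the implied set."""
    if accepted:
        # would also have accepted any deeper discount
        return [(d, 1) for d in DISCOUNTS if d >= offered]
    # would also have rejected any shallower discount
    return [(d, 0) for d in DISCOUNTS if d <= offered]

def augment(observations):
    """Build the augmented training set from raw (offered, accepted) pairs."""
    rows = []
    for offered, accepted in observations:
        rows.extend(shadow_points(offered, accepted))
    return rows

# One acceptance and one rejection, both at 10%, become 22 rows that
# pin down the response curve on both sides of the offer.
data = augment([(10, 1), (10, 0)])
```

The augmented rows then feed a standard logistic regression, which now sees (implied) evidence across the whole discount range rather than only near the offers actually made.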

Return on investment for data science

Yeah, so I would like to ask about the return on investment on all the technologies that you were using. So from my experience, not always, but quite a lot, we have the situation where we hope for, or we plan for, a certain return on investment, but we won't get as much as we want due to certain things popping up, right? Unexpectedly. So I wonder, how are you measuring return on investment and planning for it, and what is your experience here?

Oh, that's a great question. And as you go up from individual contributor to manager and then to executive or officer at a large company, that becomes an even harder question to answer, in the sense that, you know, you're talking about technology, but we looked at it as: we have resources, we have people. So we had salary and benefits. We also had what we were doing in Google Cloud, so, you know, the compute costs, the storage costs. But there was also a lot of data engineering that wasn't in our data science group, and we didn't really know exactly how much that was costing, because we were having to move things from on-prem into the cloud, this migration.

So when it comes to actual technology tools, we like the open source approaches. We know that you can get a lot more bang for your buck, you know, when it's either open source or you go to that next level. And when it comes to optimization, there are some pretty good open source optimization solvers out there now. Until you have a problem that doesn't solve in a reasonable amount of time, you can prove it out in a proof of concept and a pilot for your business. And then you can go to a larger solver company, like a Gurobi or something like that, where it's going to be more expensive, but by then you'll have proved it out.

The software costs were only part of it. It was the salary and benefits. It was also: what are you deprioritizing? I'll say at the executive level, the first thing you're doing is prioritizing the work of the enterprise. And there can be some really high return on investment projects, you know, 2x, 5x, 10x return on investment, that you can't actually do. And if you're an individual contributor working on that project, you wonder, what the heck, we could spend a million and get 20 million out of this and we're not doing it. But it's because there's a whole lot of things being negotiated at the prioritization level. So it's not always about the return on investment. A lot of times it's about, what are we going to focus on this quarter, this half, or this year? And what did our CEO tell the street that we were going to do?

Defining ML vs AI

Hey, Warren, thanks for all the insights on this. I guess I was starting to get more curious as you talk about, you know, ML, personalization, generative AI. Do you mind explaining the difference between ML and AI for prediction? You know, I imagine there's like different models and stuff like that, but what is, I guess, the difference simply put, and why are companies, I guess, investing so much in AI personalization?

Well, you know, there is a big debate as to what is AI and what is machine learning. I'm on the board of a 12,000-member society called the Institute for Operations Research and the Management Sciences, INFORMS. So we do the integer programming and things like that. There was a debate: should we go from operations research to analytics, you know. So here's the way we put it at Best Buy. I drafted up our AI strategy, and it may not match what you all are doing, but we didn't want the business to have to figure out, is this gen AI? Is this machine learning? Is this optimization? We basically took the easier route of calling it our AI strategy, because that's what's popular right now. Anytime you're using complex math, math algorithms, and some data to make a decision, we're going to call that AI, and you need to come at least brief the AI steering group on what the project is.

But my personal opinion is that AI in the traditional sense is things where a computer acts more like a human: it can understand speech, vision, it can talk to you and interact like a human does. That's my traditional view of AI. Machine learning is any algorithm that gets better over time. ML in the 90s was a very specific set of algorithms like neural networks, reinforcement learning, some genetic algorithms, and things like that. Things like logistic regression and linear regression were statistics. They were not part of machine learning until the layman's term of machine learning came to mean basically any algorithm like that.

And then things like optimization, which are more model-based, I don't see as AI in that traditional sense, but I will call them AI. You all use optimization every single day when you turn on Google Maps and you use GPS. That's not machine learning doing that. Those are mathematical algorithms using some type of shortest path algorithm that you all probably learned in undergrad, you know, Dijkstra's and some other types of algorithms that are probably a lot more complicated than what you learned.
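For the curious, the shortest-path idea mentioned above fits in a few lines. This is the textbook Dijkstra's algorithm; the road graph and weights below are made up for illustration:

```python
import heapq

def dijkstra(graph, start):
    """graph: {node: [(neighbor, weight), ...]} -> {node: shortest distance}"""
    dist = {start: 0}
    heap = [(0, start)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry; a shorter path was already found
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

# Toy road network: edge weights might be travel times in minutes.
roads = {
    "home": [("A", 4), ("B", 1)],
    "B":    [("A", 2), ("store", 7)],
    "A":    [("store", 3)],
}
dist = dijkstra(roads, "home")
```

Here the direct-looking route home → A → store (4 + 3 = 7) loses to home → B → A → store (1 + 2 + 3 = 6), which is exactly the kind of non-obvious routing a GPS computes.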

So those are my personal definitions, but I would say, if you're at your company, don't ask people to try to figure out what's generative AI and what's AI and what's ML. Have some central location, because it's a little bit easier to at least have them come to the experts. Don't be the gatekeeper and say you can't do it, but at least say, come to us, because in finance, you know, there's a lot that generative AI can do. Some of the really neat things about generative AI are that you can do some of that initial exploratory data analysis without having to write the code yourself. You know, you put in some data and it's writing the Python code and doing some of those types of things. And it can do some basic linear regression and logistic regression and some of the basic algorithms.

Pareto distribution of techniques and dormant ideas

I want to make sure I go over to where some of the anonymous questions were asked. And one over there was: in your experience, is the impact of data science techniques Pareto distributed? That is, is there a 20% of techniques that drives 80% of the outcomes?

I would think so, just for the sheer number of techniques that are out there that didn't get traction for some reason. You know, one of the things that I will be doing now that I'm semi-retired is taking my algorithms from the mid-nineties and applying them to some of the OpenAI Gym environments, you know, some of the reinforcement learning out there. We didn't open source back then. There are so many algorithms out there that are not open sourced, from either PhDs or from companies. Some of the techniques out there gain traction, and that 20% is doing 80% of the work. So I would say, yeah, there's just a whole lot of great ideas that we haven't really taken advantage of.

I'll add to that. Just take, for example, neural networks. The idea of a perceptron, I think it was 1948 that they came up with this idea that, you know, we could do a single neuron, a perceptron. Then, you know, people thought about different variations, but in the early seventies, a couple of people came up with the idea that you could do backpropagation. Paul Werbos did it in his PhD thesis in 1974. And I know that because I invited Paul Werbos to a conference in 2001, when he was a director at the National Science Foundation. And even in 2001, the reason he wanted to come talk to this conference was that he had millions of dollars worth of NSF grant money that nobody was asking for in the area of neural networks.

So can you believe it? I mean, nowadays everybody's asking for money and, you know, it's all taken, but he was basically begging everyone, all the researchers there, like, I have millions of dollars, just come ask for it. I will give it to you and you can do some research. So Paul Werbos came up with that in 1974. He couldn't actually get it published, and I was checking the Wikipedia page earlier, because people didn't really buy into it, not until like 1982. And then some others came up with similar ideas in the mid-eighties. So there were several people that came up with it, but that's decades' worth of great ideas that nobody really acted on.

So can you imagine what great ideas are sitting dormant out there right now? And 10 years from now, somebody's going to look at something and say, yeah, that helps. Reinforcement learning from human feedback: I never would have thought that reinforcement learning would be one of the keys to getting ChatGPT to that step change. You know, somebody like me thought it was for robotics, and here they're using it for natural language processing. And those of us that have been studying this for years still didn't see that.

Reinforcement learning in practice and life

So I'll talk about it first at a high level. I got really interested in reinforcement learning because I've had a dog for most of my life and I just like to see how they learn and how you can teach them to do tricks. Actually, while I was a grad student, my dog knew probably 20 different commands. It knew everything from high five to, if you asked it, "Would you rather be married or dead?", it would turn around several times and fall over dead. But you taught the dog through incremental steps.

And so our life, everything, is a sequential decision process. A lot of machine learning algorithms, like reinforcement learning, genetic algorithms, neural networks, have this basis in how humans, animals, or even simple organisms learn and survive. I think it's sometimes instructive to turn that on its head and see what those algorithms can teach us about our life.

And so with reinforcement learning, like I said, your life is a sequence of decisions, some good, some bad. We learn from those and we try to make a better decision in the future. There's a saying attributed to Mark Twain: good judgment comes from experience, and experience comes from bad judgment. And basically, all of us on this call know that we are constantly learning from our decisions. We're in a state, we take an action out of all the possible actions we could take, we get some type of feedback from that, either instant or delayed, and we update our model of the world. That's reinforcement learning.
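That loop — state, action, feedback, update your model of the world — maps directly onto the tabular Q-learning update. Here is a minimal sketch; the states, actions, and reward values are illustrative, not from the conversation:

```python
# One step of tabular Q-learning: observe a state, pick an action,
# receive a reward, and nudge our estimate of that action's value.
def q_update(Q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    # Value of the best follow-up action from the next state.
    best_next = max(Q.get((next_state, a), 0.0) for a in actions)
    old = Q.get((state, action), 0.0)
    # Move the old estimate toward the observed reward plus the
    # discounted value of the best continuation.
    Q[(state, action)] = old + alpha * (reward + gamma * best_next - old)

# Tiny illustration: taking "right" from "start" reaches "goal" and pays off.
actions = ["left", "right"]
Q = {}
q_update(Q, "start", "right", 1.0, "goal", actions)
```

The learning rate `alpha` controls how far each piece of feedback moves the estimate, which is exactly the "learn a little from every decision" idea above.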

And the key to that is feedback. So another thing you can learn from this reinforcement learning algorithm and apply to your life is that feedback is crucial to getting better. Make sure that you get and give timely feedback to your kids, to your pets, to your coworkers, to everyone. I guess the last thing I'll say on this philosophical bent is that failure is absolutely key to learning and improving. Reinforcement learning is not "I'm going to succeed every single time I do something." When you're doing a robotic process in what's called set-point regulation control, there's only one final destination that gets you that reward, and you fail thousands of times more than you succeed. So failure is key to learning. And so don't be afraid to try something and fail at it.

And what I would say for neural networks is, we all know that the more connections you have in a neural network, the better it is, and basically we're like that too. It's the connections that you're making here, the connections that you make in your job. I'm finding now, at the end of my career, it's not necessarily the money that I made or the algorithms that I created. It's the connections that I've made with people that made me successful, and those are also the things that I look back on and say, that's what's going to last.

But I would say for reinforcement learning, if you have any type of sequential decision-making process that you're doing at your work, and you can get some type of feedback from it, it doesn't have to be robotic control. Anytime you make a decision and you can somehow generalize that decision based on something good or bad that happened, that is reinforcement learning. And there's been a big surge over the past five to 10 years; you can look it up under sequential decision-making processes or approximate dynamic programming. There are dozens of algorithms out there that can help, but just think of it as anytime you're making more than one decision and you can get some feedback on it.
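The simplest version of that "decision plus feedback" loop is an epsilon-greedy bandit: try options, track how each one pays off, mostly pick the current best while occasionally exploring. A sketch under illustrative assumptions (each "arm" could stand in for a subject line, a price point, and so on — none of these are from the conversation):

```python
import random

# Epsilon-greedy choice: explore with small probability, otherwise
# exploit the arm with the best estimated payoff so far.
def choose(estimates, epsilon=0.1):
    if random.random() < epsilon:
        return random.randrange(len(estimates))               # explore
    return max(range(len(estimates)), key=lambda i: estimates[i])  # exploit

# Incremental average: pull the arm's estimate toward the new feedback.
def update(estimates, counts, arm, reward):
    counts[arm] += 1
    estimates[arm] += (reward - estimates[arm]) / counts[arm]

estimates, counts = [0.0, 0.0], [0, 0]
update(estimates, counts, 0, 1.0)   # arm 0 paid off
update(estimates, counts, 1, 0.0)   # arm 1 did not
```

With `epsilon=0.0` the chooser always exploits, which after the feedback above means always picking arm 0 — the "generalize from what worked" step in miniature.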

Environmental impact of LLMs

Yeah, so are you familiar with Sasha Luccioni? She's an AI researcher who gave a TED Talk about the environmental impact of ChatGPT. And her analogy was essentially that every time you type a knock-knock joke into ChatGPT, it's essentially the equivalent of driving your car around the globe. So I'm just wondering, do you have any opinion on that, on the environmental impact: the amount of resources required to run these data centers, the amount of time it takes to run them, the CO2 that's being produced?

I don't know, but I do know this: in the hype of generative AI, everybody saw 3.5 and then 4, and we're going to have 5 pretty soon, and that's just with OpenAI. What I was telling the other executives at Best Buy is, we're going to use this to see what's the best that can happen. You're going to use, let's say, GPT-4 to summarize your call center and to figure out the intents and the topics and things like that. Well, my go-to saying was: I don't need an LLM in a call center that can respond in a Shakespearean sonnet the way Snoop Dogg would say it. It is overkill what these very, very large LLMs can do.

So what I think is going to happen is you're going to see a lot of companies really go into generative AI and then, just like with the cloud, they're going to say, holy crap, this is a lot more expensive than I ever thought. And back to the ROI — the ROI is not going to be there. So people will start to figure out, what's the smallest model I can use that still gets the job done? And so I think there's going to be, maybe on Hugging Face, something just for call centers or some other niche, and it will reduce that environmental impact. But I think the hype right now is everybody wants to go into it and try it.

And then, you want to use like a hundred thousand tokens? Well, I don't want to pay for a hundred thousand tokens. I want to pay for 8,000 tokens or 16,000 tokens. I know I can get a better answer with a hundred thousand tokens, but it's probably not going to be worth it. And then lastly, there are some things where I think they're going to pre-compute. For some of the conferences that we're thinking about, instead of letting people use generative AI every single time they ask a question about some abstracts — we have like 6,000 abstracts at one of these conferences — let's pre-compute them. Here's the original, very complex abstract. Now let's do it one time, let's simplify it. The prompt is: simplify this so that a first-year master's student in operations research or statistics would understand it. So for I think $10 worth of tokens, I pre-computed all of these for 6,000 abstracts. Now let's let Elasticsearch do some things that are a little bit more reasonable. So I think capitalism will take care of it: some of these companies will say, I can't afford to spend that much.
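The pre-compute idea is essentially a one-time batch job with a cache: run the simplification prompt once per abstract, store the results, and let cheap search serve them afterwards. A minimal sketch — the prompt text is from the conversation, but `simplify` is a hypothetical stand-in for whatever LLM client you actually use, and the talk ID and abstract text are made up:

```python
PROMPT = ("Simplify this so that a first-year master's student in "
          "operations research or statistics would understand it: ")

def simplify(abstract):
    # Hypothetical stand-in for a real LLM call; a production version
    # would send PROMPT + abstract to a model API. Here it just tags
    # the text so the batch flow is testable without any API.
    return "SIMPLIFIED: " + abstract

def precompute(abstracts):
    # Pay the token cost once per abstract, then serve from this cache
    # (e.g. dumped to JSON and indexed into Elasticsearch) forever.
    return {talk_id: simplify(text) for talk_id, text in abstracts.items()}

cache = precompute({"talk-001": "A very complex abstract about stochastic duality."})
```

The design point is that the expensive model runs O(number of abstracts) times total, instead of once per user question.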

Career advice

Yeah, I've already talked about doing what you love, because if I had switched out of machine learning and gone to what was popular at the time, then I wouldn't have the career that I have. I was fortunate that, even though people told me you're not going to find a job in what you're doing, eventually I did. I guess the other thing is, especially for those of you that have master's degrees or especially PhDs: early on in my career, I let the data do the talking for me. We hear a lot these days about data storytelling, and I think that's an important skill. You're going to have to make sure that you prove your value and the value of the models.

And basically every organization says they want to be data-driven, but that's until the data actually goes against the gut feeling or instinct of the executive that's running store operations or the executive that's running supply chain; then they don't necessarily want to be as data-driven. Because think about it, back to reinforcement learning: they didn't get to be super successful executives by making bad decisions in the past. They've been rewarded because their gut decision is more often right than it is wrong. And all of a sudden you're telling them, well, this is not the best decision.

And so, another plug for RStudio: one of the things that I did when I was at Cardlytics is, using R Shiny was the best way I could find to let somebody — an executive that's not in analytics or data science — visualize what either uncertainty or their assumptions about the process could actually do. So I would say, learn how to incorporate that uncertainty, or visualize it, or do some what-if scenarios, so that you're either going to learn that your model doesn't take everything into account and you're going to fix it, or you're going to get more buy-in from the other executives.

Thank you so much, Warren, for taking the time to join us today and for sharing your experience with all of us. This has been a really fun Hangout. Thank you all for the questions too. I always like to add this reminder at the end for anybody joining right now: just double check that you have the Hangout on your calendar for the right time. If you want to re-add it to your calendar, I'll put the link in the chat, because it's always from 12 to 1 Eastern time.

But I did see a question about, if you decided to follow your idea of testing less common models, people would love to see the community effort around that. Is the best place to connect with you LinkedIn? LinkedIn is the best place right now. And then I'll come up with something else. But yeah, I'm always on LinkedIn. I love it. Okay, perfect. Well, thank you so much. Have a great rest of the day, everybody.