
Failure (and Mistakes) (Laura Gast, USO) | posit::conf(2025)
Failure (and Mistakes) Speaker(s): Laura Gast Abstract: In a field driven by precision, the power of failure is often overlooked. This talk digs into the paradoxical benefit of error in data science, drawing on high-profile missteps in data handling and personal anecdotes of falling short. Using examples ranging from errors big and small, with impacts big and small, to the everyday misinterpretation or misuse of data that happens everywhere, we'll focus on how to get the best out of failure. While some level of error is inevitable in data science, the most resilient and forward-thinking teams realize that errors can drive innovative and creative solutions that may not have been discovered if everything had gone as planned.
Transcript
This transcript was generated automatically and may contain errors.
Magic. Okay. So, we've heard three great talks, and all week we've heard great talks about people failing their way into success, right? And so, I want to talk about failure itself, not just about how we fail to success. So, in my now almost 20-year career, I have failed a lot more than I have succeeded. Like, a lot more than I have succeeded. And this happens for all of us when we're tackling these really interesting or new or complex or stubborn problems, right?
And I've learned, as I've moved from the more hands-on individual contributor role into work leading on data governance in some particularly messy situations, that I've had to think about not just the failures, but how we fail in these situations. And as I've worked through this, I've identified kind of core tenets of failure. And we'll go through those really quickly. I think, first, it's safe to say failure is inevitable, right? And I don't mean this fatalistically or pessimistically. I mean, like, cool. Failure is inevitable. Run with it, right? We're not building easy things. We're not tackling easy problems. We're taking big swings. And so, failures are going to happen. And the second principle is that failure is helpful. Failure is how we learn. The best discoveries are born out of failing, right? The reason we have sticky notes is because someone failed miserably at making a permanent glue. The reason we have artificial sweeteners is because someone failed so badly at making anti-ulcer medications and they didn't wash their hands before having lunch, right? Big failures. Cool things that came out of it.
And I'm going to add a third one here, and that is that failure is a bias. What do I mean by that? I mean that the way you see failure is shaped by your experience, your role, your tools, your teams, and your culture. So, by its nature, it is distorting, not just framing.
The healthcare.gov case study
So, I'm going to do an example on this of a big public failure, and that will be the healthcare.gov website. Some of you may remember it. For those that either don't remember it or were in public health and have blacked it out of your memories, healthcare.gov was a federal insurance marketplace launched in 2013. Its main aim was to be a one-stop shop for Americans looking to compare, review, and purchase individual health insurance under the Affordable Care Act. Instead, the rollout was what could only be described as catastrophic. To give you an idea, on day one, 4.7 million unique users logged on. Only 12 people were able to click the link that said "I want to start an application." Twelve out of 4.7 million.
So, if you look into the ad nauseam levels of review of why this happened, you start to see that each person involved, or each person not involved but looking at this, had a different perspective, a different frame, and a different blame. The engineers see failures in trying to tackle the large, complex, technically difficult systems needed to build this site. The project managers see missed deadlines resulting from really compressed timelines and a lot of uncertainty. The GAO and the finance departments see blown budgets, the results of high-risk contracts. The senior leaders see failures of communication and coordination resulting from the lack of central leadership, and users felt frustrated, let down, annoyed, or outright scammed, right? So, this is the same failure, but with completely different lenses.
So, not every failure is going to come with wall-to-wall press coverage and a dozen congressional hearings and get written into textbooks for us all to study later. Most failures just fade away. So, we have to think about which failures become the stories that we tell, how those stories shape what failure means to us and to our community, and then what patterns those stories help us find.
A taxonomy of failure
So, to answer these questions: people have been trying to understand and categorize failure for as long as they've been trying to avoid failure, and there are a lot of different approaches. Some groups organize by mechanism: how did it fail? Was it technical? Was it a user error? Was it a decision failure? Or by effect: was it systemic, local, contained, catastrophic? Or by what we learned from the failure: you may have heard of intelligent failures, black swan failures, or complexity-induced failures. These are all incredibly useful ways of mapping a failure in any given arena, in any space, and I've used these and other models as I've mapped failures before, and some may be stickier for you in different places. But I've found one that works for me at kind of a 30,000-foot level to categorize all the failures, and that's what I'm calling a taxonomy of failure. I've broken it down into three flavors, as it were, three buckets: structural, symbolic, and ambient failure.
So, structural failure, what do I mean by that? I mean catastrophic brittleness that shows up only under stress. You may know the XKCD comic, this one, where all of modern digital infrastructure is resting precariously on a dependency, a project some random person in Nebraska has been thanklessly maintaining since 2003. That's the idea here, but I'm going to give a case study that some of you in this room might have been involved in, which was the 2022 Southwest Airlines holiday meltdown. To explain why this was a specific type of structural failure, I first have to explain how flagship and budget airlines operate.
So, the flagships, like United, Delta, or American, the big airlines, operate what's called a hub-and-spoke model. They centralize at their big hubs, O'Hare is a United hub, Atlanta is a Delta hub, and they keep extra staff and extra planes there so that they can be really guaranteed to get all their flights out on time. Budget airlines, like Southwest, JetBlue, and Spirit, operate what's called a point-to-point model. They decentralize their crews and assign them to something like a tour, and this enables them, A, to not rent out massive amounts of space at the biggest airports (O'Hare, for example, is expensive to rent space at), but also to serve more smaller-market locations by getting more planes there. So, there are pros to this approach, as well as cons.
So, what happened in 2022 was that, primarily, the Denver airport went down because of ice. Basically, for the whole day, Denver is out of commission. There were a couple of other small outages, but none of the major hubs in the United States went down. So, the flagships were mostly fine. They continued the routes to airports that were open, and as a weird kind of bonus, they had some stranded staff at their hubs, at O'Hare, an extra plane or two, so they could get some more people moving. So, they did pretty fine. But in the point-to-point model, one canceled flight propagates the delay through the network, because each leg is dependent on the aircraft and crew available from previous legs of the tour, right? And because this model decentralizes staff, crew, and materials, airports don't have standby crews available to cover those next points.
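That propagation logic is easy to see in a toy model. This is purely an illustrative sketch, not any airline's real scheduler; the slack and delay numbers are invented for the example.

```python
# Toy comparison (illustrative only): how one delayed leg plays out in a
# point-to-point "tour" versus a hub-and-spoke schedule with standby slack.

def propagate_point_to_point(leg_delays, turnaround_slack):
    """Each leg inherits the previous leg's leftover delay, because the
    same aircraft and crew fly the whole tour; the slack at each stop
    can absorb only part of it."""
    carried = 0
    realized = []
    for delay in leg_delays:
        carried = max(0, carried - turnaround_slack) + delay
        realized.append(carried)
    return realized

def propagate_hub_and_spoke(leg_delays, standby_buffer):
    """At the hub, spare aircraft and crews reset the chain: any delay
    up to the standby buffer is absorbed before the next departure."""
    return [max(0, d - standby_buffer) for d in leg_delays]

# One 90-minute weather delay on the first leg, no further disruptions.
delays = [90, 0, 0, 0]
print(propagate_point_to_point(delays, turnaround_slack=20))
# the delay cascades down the tour: [90, 70, 50, 30]
print(propagate_hub_and_spoke(delays, standby_buffer=60))
# mostly absorbed at the hub: [30, 0, 0, 0]
```

The design point is that point-to-point trades standby slack for coverage and cost, which is fine on a normal day and brittle on a bad one.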
So, this should have been recoverable. Notice that the other point-to-point carriers, Spirit and JetBlue, had no real problems. They came back online kind of slowly with everybody else, but Southwest melted down, just straight-up melted down. And that's because Southwest was operating with really outdated crew-scheduling software, which meant they really just had no idea where their pilots were, where their flight attendants were. Flight attendants were going to the app on their phones trying to check, okay, I'm stuck in Cleveland, now what do you want me to do? And the app wouldn't load. They'd call the help desk and be on hold for over 24 hours waiting to figure out what city to go to. They would wake up in the morning and see, oh, it's 11 a.m., and I'm assigned to the 3 p.m. to Cleveland. They would go to the airport, get there at 1, and be told at the airport, oh, that Cleveland flight got canceled yesterday.
Right? So, that breakdown resulted in about 70% of Southwest's flights being canceled over a week. Reminder: even the other budget airlines came back up pretty quickly, about in line with the flagships. It ended up being about $800 million in losses in quarter four alone for Southwest. And then in 2023, when the DOT investigated and specified that this was a business failure, not weather, they had to pay out another $740 million in fines, reimbursements, refunds, et cetera. So, that meltdown was catastrophic because of brittleness, right? But the brittleness was already there. It was just an accepted failure before. It was known. We knew this was broken, but it was acceptable. Just the year before, in April and October of 2021, Southwest was called out for having particularly high cancellation rates. And if you read into these, it's actually quite funny, because you'll find notes at the bottom of these articles saying Southwest canceled 10% of flights, but other airlines only had to cancel 2%. Very interesting. Even other budget airlines. So, their framing of accepted failure, tolerating this broken software system, led them to a situation where the system can bend, the system can bend, the system can bend, and then the system broke. So, that's structural failure: brittleness that hides until the system collapses.
Symbolic failure
But not every failure is going to bring your system down. Some are going to fail loudly, in public view, and the damage is reputational more than operational. I call that symbolic failure: loud, public, reputational damage. And for that, we're going to talk about the metaverse. Depending on your perspective, the metaverse is either a bold vision for the future of the Internet or the best tech joke of the last decade. Meta poured tens of billions of dollars into this project, and what they got out of it were headlines and endless social media posts about legless avatars and boring experiences. It was parodied endlessly. The system didn't break. There is still a good product underneath it all. It just failed publicly.
So, symbolic failure is not just about being laughed at. It's what that ridicule does. With the metaverse, leaders assumed that because they could envision this grand future, everyone else would see it, too. And hey, our reputation is technically solid. We have widely adopted, popular social media products. That biased them into believing that that stability and that popularity meant they couldn't really fail with this new thing, even though it's a bigger, different thing. As long as the system doesn't collapse, we're going to be a success. Our vision can't fail. Unfortunately, the next time someone tries something like this, the next big swing in this area, they're going to have to fight the ghosts of the metaverse's failure. We're all going to be biased against the idea because of what happened with the metaverse this specific time.
So, that's the symbolic failure, right? It's not about crashing planes or, you know, servers breaking or what have you. It's losing that credibility, and once the credibility erodes, it's really hard to rebuild.
Ambient failure
So, that's the spectacle of failure, and not every failure is that way. Some failures are creepy, insidious, and invisible. They don't break all at once. They erode over time, until suddenly one day you just go, wait, what? This isn't right. That's an ambient failure. And for this, I want to talk about Google Flu Trends. It launched in 2008, and if you worked in public health at the time, it seemed like magic. They were able to predict flu outbreaks weeks before the CDC saw them coming with its traditional clinical surveillance models, and they did this by tracking search terms for things like flu symptoms, or pharmacies near me, or how do I treat a fever, or ordering tissues, what have you. It looked like the future. If you were in public health at this time, you were like, oh, this is great. We're going to track the social movement of something before it shows up in our hospitals. It was a bold vision, and it was a sea change at the time.
But unfortunately, what happened is that it started to drift. The system started to become less reliable and seem weird, and because it had been so successful at the beginning, everyone said, well, it has to be true. What are we missing on the clinical side, then? That became the pivot: this can't be wrong, so that must be wrong. And so it was off by up to 140%, more than double the expected cases in a week compared to CDC surveillance. And the reason was that the definitions underneath were changing. It used those Google search trends, right? But the media started to talk about the flu more. We had big conversations about outbreaks in other places, and people were looking up those articles. The algorithms Google itself used started changing. And so it was no longer measuring people looking for care. It was measuring people paying attention to and hyping up the flu. We were no longer looking at what we thought we were.
The Google Flu Trends team didn't change anything. (Not "we," I was not a member.) But underneath, the definitions changed without anyone understanding. So there was no day when it went bad. It just slowly got worse. People stopped using it, they all moved away from it, and then one day Google just turned it off, in 2015. And this is what makes ambient failures insidious: they rot invisibly. They are hidden from you unless you're really paying attention, and by the time you notice, the damage is embedded in your system. You are already making bad decisions on it. You have already misplaced your confidence. You have already lost opportunities, because it rotted from underneath. The bias here was that overconfidence from those first years, when it looked so good: as it started to drift, like I said, it can't be Flu Trends, it's got to be our clinical surveillance that is incorrect. We right the ship later, but few people thought to ask, as soon as that ship started turning, is the data still measuring what we think the data is measuring? Few people asked that. So that's what I call an ambient failure. It's not loud, it's not spectacular, but it is corrosive, and sometimes this can be worse than a sudden collapse, because it fools you into thinking everything is okay, and you're making decisions on that okay, but it's not.
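The drift mechanism can be sketched in a few lines. This is purely illustrative: the searches-per-case ratio and the media-driven volumes are invented numbers, not anything from the actual Google Flu Trends model. The point is that a proxy calibrated once keeps being read the same way after its meaning changes underneath.

```python
# Toy sketch of ambient failure via proxy drift (invented numbers, not
# the real Google Flu Trends model): a search-volume proxy calibrated
# when searches tracked care-seeking is still read the same way after
# media attention inflates search volume.

def flu_estimate(search_volume, searches_per_case=3.0):
    """Calibrated once, early on: assume every ~3 symptom searches
    correspond to 1 real case."""
    return search_volume / searches_per_case

true_cases = 1000

# Early years: searches really do come from sick people.
early_searches = true_cases * 3.0
early = flu_estimate(early_searches)   # 1000.0, matches reality

# Later: media coverage adds curiosity-driven searches on top.
media_driven_searches = 4200
late_searches = true_cases * 3.0 + media_driven_searches
late = flu_estimate(late_searches)     # 2400.0, far above reality

overshoot = (late - true_cases) / true_cases
print(early, late, f"{overshoot:.0%} over")  # 1000.0 2400.0 140% over
```

The fix is not better math on the same proxy; it is periodically re-asking whether search volume still measures care-seeking at all.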
How the three failure types interconnect
So these types of failure, my three types of failure, don't live in isolation. They are interconnected, and there is a trap: when you overprotect against one, you create exposure to another. Southwest optimized for efficiency and allowed for some forgivable failures, and then the system collapsed. Meta, protecting against structural failure, took a big swing with a public promise and suffered a really bad failure in the symbolic world. And Google Flu Trends was the reverse: the system didn't snap overnight. It slowly drifted, and the proxies became incorrect. So in each case, bias played a role. The leaders never ignored the risk. They were just picking the risks they thought were going to lead to the worst outcomes, given what they knew. And that left them exposed.
So that brings us to today. We're entering a world with AI making risk decisions for us, and it is already doing that. And it's not just reflecting our personal biases, it is compounding them, right? It's automating them, it's scaling them, and do we have any input into that? Do we know all of the places where AI is making those decisions for us? Do you know where your AI is? You keep talking things out with your LLM, you're querying your results, but you're querying your results based on your history of failure and the things that you are worried about most.
So AI is going to start to break things in all three of my categories, in new ways, and in ways that we have not yet imagined. Such is the history of failure. We have imagined failure in many different ways, all the way from moral failing, to normal accidents, to high-efficiency systems where nothing can fail, to move fast and break things, and into wherever we're going now. And it's really important to understand that the challenge isn't whether failure is going to happen; it's whether we're going to recognize which failures we're inheriting, which ones we're creating, and which ones we're ignoring or don't see coming. So I don't want you to walk out of here thinking that failure is something to fear. We heard a great keynote this morning saying, don't be afraid of failure. Running headlong into failure is actually a good thing, because that's how you learn. But pay attention to your own historical bias, your organizational bias, and your societal bias. That doesn't mean eliminating the bias; it means paying attention to it and getting better at learning from it. And so I'm going to leave you with a Ben Franklin quote that I really like: perhaps the history of the errors of mankind, all things considered, is more valuable and interesting than that of their discoveries. Truth is uniform and narrow, but error is endlessly diversified. And that's what makes it really cool. So thank you.
Q&A
Thank you so much. We have a couple of questions here. The first one, in the year 2000, Blockbuster both passed on a $50 million purchase of Netflix and launched a failed video on demand partnership with Enron. Do either of these business failures fall into your taxonomy?
Yes. Well, I'm going to talk about the Blockbuster one, because we're going to go that way. That was a big structural failure, because there was a big swing that brought something down. It incorporates the others too; like I said, they're all interconnected. That's why there are so many taxonomies of failure: it's impossible to put them all cleanly in one bucket. But in taking that big swing, they didn't comprehend where the brittle pieces were. Their new venture into on-demand was that brittle piece, right? And that brought down the company. Had their move been just one part of the company, you know, like the metaverse, we're taking a big swing, but we're not putting all our eggs in one basket, people would lose their jobs and it would not be a good thing, but it wouldn't bring down the system. So I put that in structural.
What is the biggest failure you've seen related to LLMs so far, if you have seen any? Well, the first one that comes to mind is, what's the word, sycophancy: people using ChatGPT as a therapist, and the model just saying whatever keeps things positive, with bad results. There was an unintended consequence there, in that people used the thing in a way that was never really predicted, and it ended up particularly horrible. That's the biggest one I can identify right now. And then there are all the data leakages that we're all prone to, which horrify me and apparently lots of others. Right. Thank you so much.
