Resources

LLMs for Data Science

[2025 - Day 1 - Data Science & Algos] Hadley Wickham shares insights from practical applications of LLMs in data science, exploring three key areas where these tools prove genuinely useful beyond the hype: writing code, writing prose, and rectangling non-rectangular data. For data scientists working with text, images, videos, or audio data, this talk offers valuable perspectives on leveraging LLMs effectively for real workflows and transforming fundamentally unstructured information. ABOUT THE SPEAKER: Hadley Wickham, Chief Scientist, Posit. ABOUT DATA COUNCIL: Data Council brings together the brightest minds in data to share industry knowledge, technical architectures and best practices in building cutting-edge data & AI systems and tools.


Transcript

This transcript was generated automatically and may contain errors.

I wanted to start with a little bit of level-setting just about sort of LLMs and AI in general. I think one of the things that makes them really hard for me to kind of reason about is they sort of flip on their head things that computers are traditionally good at and things that computers are traditionally bad at. So, you know, you can ask ChatGPT, the latest ChatGPT, how many N's are there in unconventional? And you know, usually we expect computers to be pretty good at counting things, but I'm pretty sure there are one, two, three, four N's in there.

Or people have done various experiments asking ChatGPT to multiply large numbers together. And here the color scale shows the accuracy, right? Traditionally computers are pretty good at multiplying numbers together, and LLMs are terrible at it.

But LLMs are also, like, really good at things that computers traditionally could not do. Like, I can ask for a limerick about the R programming language, and maybe it's not the best limerick in the world, but it's better than I could do with 15 seconds of thinking. Or write an acrostic about ggplot2. It really turns on its head the things that we're used to computers being good at and the things that we just expect computers can't do.

That said, however, they might be able to do poetry, but they still cannot do jokes. Why don't data scientists ever go to the beach? Because they're afraid of overfitting their sunscreen and ending up with a collinear tan line. Like, it has kind of all the bits of a joke, but doesn't quite land.

The jagged frontier

And I think a really useful term for understanding the state of AI right now is this idea of the jagged frontier. The difference between the things that LLMs can do well and the things that they can't do well is often very, very narrow, and it makes it hard to get a sense of, like, where can I use this tool? What's it good at?

But today I really want to focus mostly on the things that it is good at, and I'm going to show you, across a few areas, kind of the places where I've had a lot of success. They're really useful for, like, brainstorming: give me a bunch of ideas. Doesn't matter if a bunch of them are shitty. I'm going to find four or five good ideas and go from there. Another thing I find profoundly useful is this idea of the blank page problem. Often getting started is really hard. It's often much easier to look at something that's, like, bad and be like, oh, that's wrong. That's wrong. I'll fix that. And so getting an LLM to spew out something that's kind of in the right ballpark can often be a great way to get started as well.

Clearly they're awesome at rapid prototyping. You can spin up and do experiments very, very quickly. And they're also great at getting rid of kind of boilerplate in your code and your everyday life.


So I think, like, overall, the thing that is hard is trying to figure out: where do we as humans play? What are the things that we are uniquely good at that we want to spend our time on? What are the things we want to hand over to LLMs to do? And what are the things where we're going to use the traditional stuff, you know, the things that computers are traditionally good at?

And my kind of fervent hope is that we will become, like, data science centaurs. I got this from the idea of chess centaurs. I don't know if you've heard this term before, but these are teams of humans and AIs playing chess together who do better than either does individually. So my fervent hope is that we can form this kind of centaur, where the AI is, like, the horse body that can carry us as fast as we need to go, but we've still got the human on top directing us to where we need to go.

I will say I tried very hard, but I could not get ChatGPT to generate a correctly proportioned horse body for this data scientist, one of AI's great weaknesses today.

Coding with LLMs

And so I'm going to talk about three areas that I think are pretty familiar to you as data scientists, where I'm using LLMs, and you probably are too. So coding, super obvious; writing, a little bit less obvious; and then this kind of general catch-all term that I've called data sciencing, which is not a great category system, but you'll just see a bunch of kind of interesting things.

So I'm going to talk the least about coding, because I think this is kind of the most obvious place. Like, everyone is talking about how to use LLMs to program better, so I want to focus a little less on this, but I do want to show you my personal favorite use case for LLM-powered code complete. And that is this case where I want to create a new section in my document, and an LLM fills in the dashes for me. There's just something about this I find so appealing: I'm using, like, literally billions of dollars of technology to complete dashes on a line, and it doesn't even get the right number of dashes. But still super useful, and I love it.

On the more practical side, this is a really cool application from one of the folks on my team, George. He was like, let me just try: I'm going to sketch out an app. I'm not going to show it live now, because the wifi is a little bit flaky, but I asked Claude Sonnet to, like, turn this sketch into a Shiny app, and it just did it. And to me, this is the coolest thing: you can take a new technology, for better or worse, something you don't know a lot about, you can make a sketch, and you can turn that into a working app in a matter of minutes. I think that is so, so, so cool.

I just wanted to use that to say, you know, I write R code; all of the examples I'm going to show you are basically R code, but obviously you can use other things like Python instead. If you are using R a lot, Simon Couch on my team has been working on a series of R evals, you know, formal evaluations of which models do a good job at generating R code. Currently, Claude 3.7 Sonnet is kind of my go-to, but the latest evaluation he did shows that OpenAI's o3 and o4-mini are just starting to edge ahead a little bit.

The last kind of coding thing, and I think this has been my biggest LLM-powered win, is this thing which I've tried to draw a picture of: the leaky token bucket algorithm. Now, how many people have heard of the leaky token bucket? Okay, well, you're lucky, because I had never heard of this before. I was implementing a new feature in one of my packages, and I just started brainstorming with Claude, like, hey, this is a new feature I want to add, how should I tackle it? And it mentioned this idea of using a leaky token bucket, and I was like, I've never heard of that before, but it turned out to be exactly the thing that I'd been looking for, maybe not my whole life, but certainly for a few years.

And just finding that idea. I think, to me, the weakness of Google was always: if you knew the name of something, you could find out information about it, but if you didn't know the name of something, it was terribly, terribly difficult to find. So now, this idea that you can have a conversation with an LLM, find the name of something, and once you've got the name of that thing, it gives you a tremendous amount of power. That was really useful. Then I could ask it to sketch out the algorithm for me. Also really useful, because the token bucket algorithm is something that I am terrible at implementing. There are a lot of small details that I am just not good at getting right, and I always want to try and just intuit my way to the answer, so that was really, really useful.
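To make the idea concrete, here is a minimal sketch of a token bucket rate limiter, in Python rather than R, since the talk notes Python works just as well. Tokens drip into the bucket at a fixed rate up to a capacity; each request spends one token, and a request is refused when the bucket is empty. All names and structure here are illustrative, not the actual implementation from the package mentioned in the talk.

```python
class TokenBucket:
    def __init__(self, capacity: float, refill_rate: float):
        self.capacity = capacity        # maximum tokens the bucket can hold
        self.refill_rate = refill_rate  # tokens added per second
        self.tokens = capacity          # start full
        self.last = 0.0                 # timestamp of the last refill

    def _refill(self, now: float) -> None:
        # Add tokens for the elapsed time, clamped at capacity.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last = now

    def allow(self, now: float) -> bool:
        # Spend one token if available; otherwise refuse the request.
        self._refill(now)
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

# Burst of 2 allowed up front; third call is refused; after 1.5 s a token
# has dripped back in, so the fourth call succeeds.
bucket = TokenBucket(capacity=2, refill_rate=1.0)
print([bucket.allow(t) for t in [0.0, 0.0, 0.0, 1.5]])  # → [True, True, False, True]
```

Passing `now` explicitly (instead of calling a clock) keeps the sketch deterministic and easy to test, which is also a reasonable way to structure a real implementation.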

Writing with LLMs

I've found it really useful, instead of letting it go away and immediately start drawing a picture, to say: ask me questions first. Being really explicit about, like, I want to give you more feedback before you get started can be really useful.

Oh, yeah. So, I'll tell you the thing. You know, I'm a long-term R user. The thing I fundamentally do not get about the Python ecosystem is that Python packages do not have cool logos. It's really, really important in the R community to have, like, a cool hex logo. So we do a lot of brainstorming for cute logos. I give it a pretty short prompt and then say, like, ask me some questions, and there's a little bit of back and forth before it comes up with something, and I think that seems useful. And then you always get to some point where you're, like, hey, can you try and put all the letters inside the hex again, and it's, like, no, and you try many times and just get slightly different versions.

Okay. Any other last writing prompt or writing ideas folks want to share? Yeah. Sean?

Something I just remembered that my company now uses it for is summarizing incident reports. So, when something's gone terribly wrong, it's like, right, this is what actually happened, and this is what went wrong, and so on and so forth.

Yeah. Yeah, I guess the other writing thing I've used it for, you know, occasionally you get emails where they're just, like, either, like, so toxic or just, like, push all your buttons in the wrong way that it's very difficult to read the content of the email without getting, like, emotionally triggered. And so, I have also asked ChatGPT to, like, rewrite an email that's been sent to me and be, like, hey, rewrite this in, like, a more neutral tone so I can actually, like, read that, and then that's the email I respond to.

Data sciencing: rectangling unstructured data

It's quite useful. Okay. And so now I wanted to talk about this last, kind of catch-all, data sciencing term. As, you know, an amateur data scientist myself, I guess, one of the things that I'm really excited about LLMs for is taking all of these types of data that I previously had no kind of handle on and turning them into nice, tidy, rectangular data sets. That's unstructured text, that's PDFs, that's images, that's video, that's audio. Now you can pretty easily take whatever those are, narrow down a little bit on what you want, and then extract data into the rectangles that we're all very used to dealing with.

So, I'm going to show you a few demos here, which will hopefully work.

So, the first one is I'm on the conference committee for posit::conf, and one of the things that we do a little differently at posit::conf is when you submit a talk, you don't submit a title and abstract, you make a one-minute video. And as someone who reviews talks, that's the best way to get a sense of: is this an interesting topic and an interesting speaker? That one-minute video gives you so much more information than an abstract. On the downside, it's in this video form, which we can watch as humans, but it's much harder to analyze. So this time, what I did is I used Gemini, and I'm not going to show you that here, but I used Gemini to get a transcript of every single one of the videos. And then what I wanted to do is take all those transcripts and try to figure out: what is this talk about?

There are a few buckets at posit::conf we're particularly interested in, like, what's the balance of talks about R and Python, and we know there are a lot of talks about reporting and Shiny and AI. So, what I'm using here is structured data extraction, which is a pretty common feature of most foundation models these days, where you give it effectively a JSON schema describing the data that you want. So you don't just get blobs of text; you get a nicely structured JSON output. Here, I'm using the ellmer package, which is my package to do this.

So, you can see I've got a talk object. It consists of a summary, which I want to be a two- to four-bullet summary of the talk, and then a bunch of topics: is it about R or Python or teaching or community, and then give me a few keywords. So then I'm going to use Google Gemini here, and I'm going to feed it, basically, let's see if this is going to work, all 300-and-something transcripts, so each of these is a separate request.
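To show roughly what that "talk object" looks like as a schema, here is a hedged sketch in Python (the talk notes Python works in place of R). The field names mirror the talk's description, a summary, per-topic major/minor labels with an explicit "unknown" escape hatch, and keywords, but the schema itself is illustrative and is not ellmer's actual API.

```python
import json

# Hypothetical JSON schema for the "talk object" described in the talk.
talk_schema = {
    "type": "object",
    "properties": {
        "summary": {
            "type": "string",
            "description": "A two- to four-bullet summary of the talk.",
        },
        "topics": {
            "type": "object",
            # For each topic, is it a major or minor feature of the talk?
            # "unknown" gives the model an explicit way to opt out.
            "properties": {
                t: {"type": "string", "enum": ["major", "minor", "unknown"]}
                for t in ["r", "python", "shiny", "ai", "teaching", "community"]
            },
        },
        "keywords": {
            "type": "array",
            "items": {"type": "string"},
            "description": "A few keywords describing the talk.",
        },
    },
    "required": ["summary", "topics", "keywords"],
}

# With structured output enabled, the model's reply parses straight into
# this shape instead of arriving as a blob of free text (values invented):
example_reply = json.dumps({
    "summary": "- Uses LLMs to rectangle video transcripts",
    "topics": {"r": "major", "python": "minor", "shiny": "unknown"},
    "keywords": ["LLM", "structured extraction"],
})
parsed = json.loads(example_reply)
```

The point is that the schema, not post-hoc parsing, is what guarantees you get rectangles back.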

Now we cross our fingers and see if this works. Oh, whoops, it's still running the Shiny app I was using earlier. Let's start that again. So, it's running my R code, and now it's sending all of those requests in parallel. I think this is pretty neat. So it's working through all, I guess, 200-and-something, and now I can find out how many tokens I spent. That was a grand total cost of 12 cents, and then I can turn it into a nice data frame.

If I do that, I get a nice data frame which tells me, for each of these topics, whether it's a major or minor feature. And then I do a little bit more data manipulation and turn it into a plot, and this is just so useful for us as a program committee to get a sense of what people actually submitted.
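The "little bit more data manipulation" step can be sketched very simply: collect each talk's topic labels and tally major versus minor mentions per topic, ready to plot as a bar chart. This is a Python stand-in for the R data frame work in the demo, and the data values are made up for illustration.

```python
from collections import Counter

# One dict of topic -> label per extracted talk (invented examples).
talks = [
    {"r": "major", "python": "minor", "shiny": "major"},
    {"r": "major", "python": "major", "ai": "minor"},
    {"r": "minor", "ai": "major"},
]

# Tally (topic, label) pairs across all talks.
counts = Counter(
    (topic, level) for talk in talks for topic, level in talk.items()
)
print(counts[("r", "major")])  # → 2
```

Each `(topic, level)` count becomes one bar segment in the kind of plot shown in the talk.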

So, you can see, not too surprisingly, we used to be rstudio::conf, so a lot of talks that majorly feature R, you know, lots of Python, lots of Shiny, quite a bit of AI. Sort of interesting. And I think another piece of good general prompting advice is to always give the model some explicit way to opt out if it doesn't know what it's doing, so I gave it this "unknown" option. Every time I run this, the bar chart looks a little bit different, but that's kind of okay in this case.

What I'm mostly interested in is, overall, on average, how are things looking? So this combination of LLMs, which are stochastic and give you slightly different responses all the time, with this more statistical mindset, where I don't actually care too much about the individual answers, I'm looking more at the averages and the trends, just feels like a really, really nice pairing to me. And the ability to churn through all of this text analysis so quickly, with very, very general tools, limited only really by my ability to write good prompts, just feels really, really cool and useful to me.


Chatbots and interactive tools

So, I have a couple of other kind of projects that I wanted to show. Sort of trying to think around, like, you know, like, what might data scientists use AI for? Of course, you can write chatbots. Everyone's most beloved way of interacting.

This is a chatbot we just made to make it easier for people to learn how to use Shiny. The idea of taking a chatbot that you've given some additional information, or some ability to call tools, can be really, really amazing.

So, one of the coolest applications of this that we had internally: we've been running some AI hackathons to try and just generally get people up to speed on what they can do with AI. And then our engineering R&D spend got audited for, like, 2022 or something. And part of that audit meant that everyone had to go back and figure out, like, what the heck was I working on three years ago? So our director of engineering made this chatbot and gave it the ability to do some tool calls to GitHub. So you could ask it, like, hey, what was I working on in 2023? What were the projects? What were the risks associated with them? And, sort of similar to your story about compliance, that sort of boilerplate-y compliance thing, it's important, no one really loves to do it, and having an LLM help you is really, really beneficial.

So, kind of along those lines, we have another tool called Query Chat, which is sort of interesting. As a data scientist, I think, unfortunately, dashboards tend to be our kind of bread and butter. And dashboards are great in terms of: here's the important stuff. But if the person looking at the dashboard has some slightly related question that they also want to answer, their only recourse is basically to email you. And, you know, you don't want people emailing you, asking you for things. So what if you could make some special-purpose little chatbot that you could slap on the side of any dashboard? Someone looking at it is still kind of in the scope of that dashboard, but now they can ask novel questions.

So, this is just some very old tipping data. I could say: please show me only smokers. You can tell how old this data is because it includes whether the people were sitting in the smoking section of a restaurant or not. So it goes away, generates some SQL, you know, not super complicated SQL, and then applies it to the dashboard. So now you've got this customized, filtered view, and your stakeholders can be asking and answering their own questions to some extent. It's a little scary; you have to think about how you can protect them from themselves to some extent. But that feels pretty compelling to me. Empowering other people in your organization to answer their own questions in this fairly scoped environment seems like a big, big win to me.
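A hedged sketch of what that round trip boils down to: the LLM turns "please show me only smokers" into a SQL filter, which is then applied to the dashboard's data. Here we mimic that with Python's built-in sqlite3 and a tiny stand-in for the classic tips data set; the column names and generated query are assumptions for illustration, not Query Chat's actual output.

```python
import sqlite3

# A tiny in-memory stand-in for the tipping data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE tips (total_bill REAL, tip REAL, smoker TEXT)")
conn.executemany(
    "INSERT INTO tips VALUES (?, ?, ?)",
    [(16.99, 1.01, "No"), (10.34, 1.66, "No"), (21.01, 3.50, "Yes")],
)

# The kind of SQL the chatbot might generate -- not super complicated:
generated_sql = "SELECT * FROM tips WHERE smoker = 'Yes'"
rows = conn.execute(generated_sql).fetchall()
print(rows)  # only the smoker rows remain
```

Keeping the model's output confined to a `SELECT` over known tables is one simple way to start "protecting stakeholders from themselves" in a scoped environment like this.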

AI-assisted exploratory data analysis

And the last thing I wanted to show you is sort of tools for doing data science yourself. So, I'm going to load in a bunch of data about the Simpsons from a Tidy Tuesday data set.

I tell you, nothing is more fun than debugging code in front of an audience with one hand. Okay, here we have a data bot. So, it's going to get started, and I'm going to say, like, hey, just take a look at the data that's loaded in my session. So it's going to go away and say, oh, what are all the data sets you've got loaded? I'm going to take a look at the Simpsons data set. And then it's prompted to really engage with you: rather than just going off and doing stuff, it's going to say, hey, what do you want to do? Well, let's look at the trends over time. And so this is generating R code, or Python code if you're so inclined. It's running that code. It's creating plots. It's sending the plots back to the LLM so the model can interpret them. It's creating summary tables and interpreting those. And it's just trying to help you do this whole exploratory data analysis cycle of looking at the data, generating questions, writing code to answer those questions, and iterating again and again and again.

I don't see this as being, like, a replacement for the data scientists. But this is something that I think can really help you get up to speed with a new data set very, very quickly.

Wrapping up

So, we've got just a couple of minutes left, so we won't have any more time to chat. I just want to recap: as a data scientist, a lot of your job is coding, a lot of your job is writing, and the rest of it is data sciencing. Coding, I think, in general, we've got a good handle on how you can use LLMs to code. Writing, we're still learning about. Data sciencing, tons and tons of opportunities.

And I just wanted to finish with a few of the resources that I find most useful for keeping up to date. Simon Willison does a great job of giving you just sort of a vibe check on the latest models. I really enjoy Ethan Mollick's Substack, which is data science-y, but sort of AI in general. I just started following Hilary Gridley, who gives good advice for using LLMs more generally in business. And then, finally, I love Lynn Cherny's newsletter, which is really about weird and whimsical uses of AI and LLMs. Thank you.