Jenny Bryan | Object of type ‘closure’ is not subsettable

Transcript#

This transcript was generated automatically and may contain errors.

Good morning, and welcome back to RStudioConf. It's really great to see you all here again, whether it's in person or on one of our livestreams. But now it's my very great honor to introduce our next keynote speaker, my colleague and friend Jenny Bryan .

Jenny's work has almost certainly touched you in some way, whether it's one of her books like Happy Git With R or What They Forgot To Teach You, or perhaps it's because you're afraid that she'll set your computer on fire because you use setwd. Or maybe you're using one of her packages like Read Excel or Google Sheets to get data out of spreadsheets, whether it's Excel or Google and into R.

But of all the packages, of all the work that Jenny has done, I think my favorite package is the reprex package. Not only because it's such a great tool to help you get help from other people, but I think it's one of the rare packages that has, like, no precedent in any other package. It's in any other programming language. It's something that's genuinely new. So without further ado, I'd like to welcome Jenny Bryan.

So this, I think, is R's most infamous error message. Objective type closure is not subsettable. It was my title as a joke for a while, and then people thought I should actually stick with it.

So I have 20 years of experience triggering this bug, which is why I can now do it in two lines of code. And this is also, I think, commonly how it actually happens, although it's usually never quite this clear.

But you create your main data object, you call it dat, then you promptly lose all memory of having done so, and you ask for the X column of DF, which you haven't made, but DF exists. It's a function that gives you the density of the F distribution. So what you've asked for makes no sense, and R tells you this in this very special way.

And my sort of fantasy message down there is maybe it would be able to somehow read my mind, which is obviously not going to happen. And so this sets the mood for the next hour, where I want to talk about general strategies for coping with confusing and frustrating situations.

So you went into data science, you were probably told that it's going to be glamour and fun like 24-7, and you make very creative concoctions that you present, and people love to consume it. But there's all this drudgery, as there is in any job, where we actually spend a much greater proportion of our time and our mental energy.

And so I've sort of made a habit of talking and teaching about those things so that you feel really cool and have fun, but you get your drudgery done as well.

So we're not using Slido for live questions, although you're welcome to ask them, because I'm going to blog about this talk later, but I am using Slido live for some polls. So if you're willing to get your laptop or your phone out, I'm curious what your current main debugging method is, and if you use multiple, as you probably do, you will have to pick a favorite.

And while I'm letting you take this poll, I'm going to say a few more words about why I think this is so important, the drudgery part. So if we don't give a name to these things and give them dignity, when you lose half your day to doing something like this, it's extremely demotivating, because you feel like you haven't actually gotten any real work done.

And the other risk, especially with debugging, is if you're only reactive and you're always dealing with today's bug, it means that you are constantly putting out fires and you don't probably have the time at that point to develop your debugging skills and be a little bit proactive about it. But you shouldn't be perpetually surprised that there's a new bug. Like really? Again? Today? This is going to happen every day, and it's actually worth giving some thought to how you want to do things.

And the other risk, especially with debugging, is if you're only reactive and you're always dealing with today's bug, it means that you are constantly putting out fires and you don't probably have the time at that point to develop your debugging skills and be a little bit proactive about it.

But a lot of people report that when they finally decide that they're going to post something on GitHub or Stack Overflow or RStudio's community site, 80 to 90% of the time they solve their own problem.

And that won't happen every time. And so when it doesn't happen, it still means that you have this beautiful version of your pain that you can post somewhere in a way that other people are more likely to engage with it.