Malcolm Barrett | You're Already Ready: Zen and the Art of R Package Development

Transcript#

This transcript was generated automatically and may contain errors.

My name is Malcolm Barrett. I'm a clinical research data scientist at Teladoc Health. I'm an epidemiologist and I'm also an R developer. I use R packages every day in my work. They're also fundamental to the way that I organize my code. And the reason for that is that R packages are the fundamental unit of shareable code in R. They solve many of our problems for us. They make our code more robust, easier to share, and much safer over time, whether that's somebody that's using it in the future that we've shared it with, or it's just us in six months, right? Our code is less likely to break in that time because of the robustness of R packages.

And yet many people, when they encounter the idea of writing R package, they think, this is too advanced for me. This is beyond my scope. This is beyond my skill set. Many people also think that they don't have something to offer. They think that if they're not writing something like ggplot2 or dplyr, something that's really incredible, then they don't have anything to offer, anything to share.

The parable of the lost son

I'd like to tell you a story that comes from Buddhism. Once there was a father and a son. And at some point, the father and the son got separated. Here, their lives diverged dramatically. The father became quite wealthy. He developed a huge estate and amassed a great amount of riches. The son, on the other hand, became absolutely destitute. He was embodying poverty.

Later on in their lives, the son actually stumbled upon the father's estate. Now, the son didn't recognize the father. The father had changed so much in his riches that he was unrecognizable. But the father, even though his son was draped in poverty, recognized the son immediately. He tried to bring him into the house to say, look, look, you can have all of this. And the son thought he was crazy. He tried to run out of the house. He thought, this guy is nuts. I don't want anything to do with this. So the father sent his servants to go after the son and hire him instead as an employee of the estate.

The son worked his way up from the very bottom until many years later to the very top of the estate. At this point, he's the right-hand man of his father. On his deathbed, the father finally reveals to the son who he is, that in fact, everything that's around him is already his. And in some versions of the story, at this point, the son says that he actually has already understood that. He has naturally come to understand that what surrounds him is his wealth, his treasure, something that is of himself.

This is a talk about why you already have the skill set from the techniques that you use every day in your data analysis to pursue the path of our package development. In a Zen text called the Sandokai, there's a saying, if you do not see the way, you do not see it even as you walk on it. This is to say that we're actually always already perfect and complete, already ready. So in R, we might say, if you don't see the R package, you do not see it even as you develop it.

So in R, we might say, if you don't see the R package, you do not see it even as you develop it.

The process of writing unit tests is to formalize this iterative process of kicking the tire of our code to making sure that it works the way that we expect.

The use test function will help you create a test in the right spot, set up all the infrastructure that you need from the test that package, which is one of the most popular testing libraries in R, put everything in its right place, open up for this file for you, and set it up so that you can automate your tests. So if I go and I write the informal test that I just did in the command line directly in a test script, I can now run the test function from DevTools, also which has a very convenient key binding in RStudio. It will load the package for me and will run all my tests. And now I can know if I make a change to my code that everything is still okay. I've got a green light still. More importantly, if I don't, I know what's wrong and where it comes from.

Three techniques to get started

There are great many ways that you can use the R package system to extend what you already do in your analysis. But these are three really useful techniques that you can really get off the ground with. The first is using a description file. This lets you provide metadata about your project or your package. It lets you tell us what your dependencies are. And it gives you access to this whole ecosystem, such as loading and testing, that is available to you when you're developing an R package.

Write your code as functions. You're already using functions in your everyday work, so this step is to actually take that, wrap it in your own function. And finally, write down the tests that you're actually already doing and then automate them. Take advantage of the description file of this ecosystem for R packages and automate this process.

What would be the next step in coming home to R packages, taking advantage of this treasure that's already yours? We put together a workshop called My Organization's First R Package that's really focused on developing internal R packages, personal R packages, things for you, your team, your research group, things like that. So I recommend checking out this resource. And I also highly, highly recommend looking into the second edition of the R Packages book. In particular, the first chapter, which is called Whole Game, walks you through the whole process of creating an R package from A to Z. And it's great and will help you get off the ground right away.

So this is my invitation to you. Write an R package, whether it's for your own personal use that you'll never share with anybody else, perhaps for your team, perhaps changing a project into a package, or creating a package to help with a project, or maybe you've got a great idea that you want to develop out into a package that you're actually going to share with lots of people and maybe even submit to CRAN. This is my invitation to you, is to try this out, walk this path, and take advantage of this incredible resource that's already available to you with the skill set that you already have.

Trungpa Rinpoche, a Tibetan Buddhist teacher, had good news and bad news for us in our meditation practice. The first, the bad news, is that you're falling out of an airplane. You don't have anything to hang on to, you don't have a parachute, and things seem pretty bad. But the good news is, actually, there's no ground. There's no way that you can fail at this process. My name is Malcolm Barrett. You can find me on Twitter, GitHub, and my website. Thank you for coming, and good luck.

Malcolm Barrett | You're Already Ready: Zen and the Art of R Package Development | RStudio

Transcript#

The parable of the lost son

You already structure your project

You already write R code

You already declare your dependencies

You already test your code

Three techniques to get started

Featured software#

rstudio

usethis