Tiger Tang | Saving 1,000 hours with RStudio: selling R in your workplace | RStudio (2022)
There are many benefits to using R and no shortage of packages that help you solve technical difficulties, but you may still get stuck selling it to decision-makers or implementing it at work. Tiger's recommendation is to start a project that focuses on automating work with R and gets everyone involved. Once the value of R has been established, selecting RStudio Workbench and RStudio Connect for streamlining tasks will not be a difficult choice. Several years ago, Tiger's organization moved away from SAS in favor of R for modeling projects, but there wasn't much initiative taken company-wide to move everything to the new tool. To help change that, he started a work automation project using R that has saved 12K+ hours of manual work. In this talk, he shares the key parts of the project, lessons learned, and a structure you can follow if you would like to do something similar in your organization.

Talk materials are available at https://tigertang.org/rst_conf_2022_talk/

Session: Take a sad process and make it better: project and process makeovers
Transcript
This transcript was generated automatically and may contain errors.
Good afternoon. So, many of you probably have seen or tried to help a workplace adopt R.
So this is the typical case I see. Your company hired an R trainer, who went over a lot of the fancy things possible in R with the team. After the training, everyone finds R very cool. Tidyverse, amazing. R Markdown, awesome. Shiny, fantastic. But then after a few days, most people just went back to their go-to tools. I'd rather not call them out, but something that's not R.
Several years ago, my company moved to R, and we faced the same situation. To help change that, I initiated a work automation project with R, and now we have 20-plus team members who consider R their go-to tool, 60-plus stakeholders who use our product on a daily basis, and this has saved over 12,000 hours and counting. In the next 15 minutes, I'm going to talk about the key parts of the project, lessons learned, and the structure you could follow if you would like to implement it.
Why R adoption stalls after training
Now, let's revisit the situation. We want to move things to R, but why isn't there much movement? Well, we all have a learning curve as beginners. Some of us get through it quicker. Most of us don't. The other scenario I can think of is that after the training, we are not asked to handle the right tasks. Some of those tasks might not even be covered in the training. They may look similar, though.
Even if you were able to avoid the first two, you may still end up in the last situation, where you have a grand plan of what to do with your project using R, but have to go back to the old ways because your deadline is the same. Now, unfortunately, before realizing any of these, we assumed everything would work after the training. When it didn't, we just thought we needed to work harder on the next one. In fact, in one of those follow-up trainings I provided, I even remade the Lion King song into "Can You Use the R Tonight," summarizing the R functions we talked about, but it still did not help.
It turns out what we need is to create a soft landing that starts small and helps the team build confidence, identifies the tasks for them, and plans the effort for the transition. My soft landing idea was to build a work automation project using R. Next, I'm going to talk about the six key parts of the project, and in the end, you will see why it was perfect for this situation.
What's possible with R automation
So to build an automation project, first, let's look into what's possible. At that time, I was a member of a data team, and for most data teams on this planet, you cannot escape from reports. So let's take a look at a typical report. We can describe it step by step, like traveling on a map: first you open your map, maybe a SQL database, a data lake, and so on. Then you update your parameters and run. Then you wait. Export the data, clean the results, analyze them, update the format, draft an email, and send.
But then if you look again, it almost looks like all of these steps can be divided into three portions, where we just use different groups of applications. For some of those reports, I would start with Oracle, move on to Excel, and then end with Outlook. For other reports, I would just use other applications. But regardless of the differences among those applications, we all seem to start from getting the data, then do the data wrangling, analysis, and visualization, and end with communication. So if you have a report with any parts that fit any of those three portions, we can replace that portion with R code with the help of R packages. And the good thing is, there are so many of these R packages.
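Sketched in R, the three portions of a typical report might look like this. This is a minimal, hypothetical example: the talk doesn't prescribe specific packages, so the DBI/RSQLite in-memory database here stands in for whatever your real source is (Oracle, a data lake), and the made-up "sales" table stands in for your real data.

```r
library(DBI)

# Portion 1: get the data. Here an in-memory SQLite database is a
# stand-in for the real connection you would open (Oracle, etc.).
con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "sales", data.frame(
  region = c("East", "East", "West"),
  amount = c(100, 250, 175)
))
sales <- dbGetQuery(con, "SELECT region, amount FROM sales")
dbDisconnect(con)

# Portion 2: wrangle and analyze. Base R is used here for
# self-containment; dplyr works just as well.
summary_tbl <- aggregate(amount ~ region, data = sales, FUN = sum)

# Portion 3: communicate. In practice you might build and send an
# email with a package such as blastula; here we just compose the body.
email_body <- paste0(
  "Totals by region:\n",
  paste(summary_tbl$region, summary_tbl$amount, sep = ": ", collapse = "\n")
)
cat(email_body)
```

Once each portion is R code, the whole report becomes one script you can run end to end instead of hopping between three applications.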
Now, this makes sense for the reports that take some effort. But how about the processes whose pain point is running the task itself and sharing the result on a daily basis? How about the stakeholder requests where all that's needed is running a SQL query and sharing the data? Well, they still fit in the scope of this automation project. We just need a little more help.
When defining automation, there are three types. The first type is perfect for the reports that still need some human involvement, and those are best done with R code. The second type is for processes that don't necessarily need human input but must happen at a certain time, and those are best handled with R plus RStudio Connect. Lastly, also my favorite, is a combination of the previous two. In this case, the human input comes from a stakeholder, who can kick off whatever process gets them their answer, with the help of Shiny.
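That third type might look something like this minimal Shiny sketch. Everything here is illustrative, not from the talk: the region input, the tiny lookup function, and the data are hypothetical stand-ins for a stakeholder-facing app you would deploy to RStudio Connect.

```r
library(shiny)

# A pure helper that does the actual work. In a real app this is where
# the SQL query a stakeholder would otherwise ask you to run would go.
lookup_total <- function(region, sales) {
  sum(sales$amount[sales$region == region])
}

sales <- data.frame(region = c("East", "West"), amount = c(350, 175))

ui <- fluidPage(
  selectInput("region", "Region", choices = unique(sales$region)),
  actionButton("go", "Run"),
  textOutput("result")
)

server <- function(input, output, session) {
  # The stakeholder's click is the "human input" that kicks off the process.
  result <- eventReactive(input$go, lookup_total(input$region, sales))
  output$result <- renderText(paste("Total:", result()))
}

# shinyApp(ui, server)  # uncomment to run locally or deploy to Connect
```

The point of the design is that the stakeholder serves themselves: nobody on the data team has to be in the loop for each request.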
Selling the project to decision makers
Now, if we look at these definitions, none of those are new. But the current R ecosystem was able to offer a brand new interpretation. So after knowing what can be done in the project and what the current R ecosystem can help you with, time to sell it to decision makers.
And whenever you're selling something, you want to talk about its benefits. So what are the benefits here? Reproducibility, the first word popping into my head. Then less human error, if we code everything correctly. And if we achieve the first two, we would be able to save some time.
So I got a few decision makers in the room and presented them my plan of automating all the tasks identified in step one, along with the benefits. Let me replicate their interest level using a gauge here. When I first talked about reproducibility, I gave them an example: if I'm out of office, someone else can run all the tasks I'm handling. Their interest level wasn't very high. Maybe because it sounded like travel insurance, which is not urgent at the moment; you would only miss it after things happen. Then I moved on to less human error, but no additional interest, maybe because they were thinking it is all theoretical, and code is written by humans, too. Lastly, hours saved. More interest! They even asked me if I had any numbers. But overall, I did not get the go-ahead.
This is when I realized I was so ready and so prepared to sell this whole thing to our users, who are more concerned about the day-to-day workflow, but not to decision makers, who have no programming experience and are more concerned about the return on investment as well as business urgencies.
So I waited a few weeks, just until everybody forgot about what I said, updated my strategy, and this is what I presented. I started with hours saved. In fact, I told them if we go through with this, we might be able to save 1,000 hours per year. They're interested. Then I moved on to less human error, if we code everything correctly. They're still intrigued. Lastly, I sold reproducibility like a free add-on travel insurance. Who wouldn't love that? Overall, we got the go-ahead. But if you look back, it is still the same number of benefits, just in a slightly different order. I guess sometimes it makes sense for us to start with the one that does not require too much context to understand. And now you see why I named the talk saving 1,000 hours with RStudio, not bringing reproducibility with RStudio. If that were the talk name, I don't know if half of you would show up, or even if I would show up.
I guess sometimes it makes sense for us to start with the one that does not require too much context to understand.
Gathering requirements and ranking tasks
So after getting the decision makers' buy-in, time to gather what we need to do the actual automation. At a high level, there are two things: a document of the current process, so that we know what needs to be done, and information to compare all the tasks, so that we know which one to start with. To get the documentation, we will need to understand the current processes. Well, you may say, wouldn't that just need tools and steps? Mostly, yes. But we will also need to understand the business reasons, so that we can accommodate any changes that may end up making the process better. We will also need to understand the occurrence, whether it happens daily, weekly, or ad hoc, so that we can choose the best platform or tool for it. Lastly, we tend to forget that communication is part of the automation, and I would always recommend saving a few communication examples in your document.
On top of that, we also want to know the current effort: the overall time for each run before the automation, and the manual versus processing time breakdown, identifying the tough items along the way. Ideally, the automation should always work if we code things right. But oftentimes it is not ideal, which is why it is critical for us to know how often we should update the process and the document, so that we know when it will be obsolete. And oftentimes, we're not the original report owner, so we need to know when to stop and call for additional help.
Now, all of this will get us a detailed document of the requirements. Of course, it's always a great opportunity to practice our R Markdown. But other than the document itself, we now know the complexity of potentially automating each process, the impact, as well as the stability, which allows you to rank all your tasks in a table that looks like this. So, if you're wondering which one to start with, you may not want to start with task number one, which is complicated, has small impact, and may require you to update the code every other week. You may want to consider starting with task number three, which is not overly complicated, earns you some recognition, and needs fewer code updates. Then, maybe, you can move on to the harder ones.
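One simple way to keep such a ranking is a small table in R itself. This base-R sketch is purely illustrative: the task names, the three scores, and the naive priority formula are all made up, not from the talk.

```r
# Hypothetical ranking of candidate tasks: lower complexity, higher
# impact, and higher stability make a task a better starting point.
tasks <- data.frame(
  task       = c("Task 1", "Task 2", "Task 3"),
  complexity = c(5, 3, 2),   # 1 = easy, 5 = hard
  impact     = c(1, 3, 4),   # e.g., hours saved per run
  stability  = c(1, 2, 5)    # 5 = rarely needs code updates
)

# One naive priority score: favor impact and stability, penalize complexity.
tasks$score <- tasks$impact + tasks$stability - tasks$complexity

# Best starting candidates first
tasks[order(-tasks$score), ]
```

However you weight the columns, the point is the same as in the talk: pick a first task that is winnable and stable, not the hardest one.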
Doing the actual automation
After knowing what is needed and identifying the task to start with, time to roll up our sleeves. Now, to do the actual automation, there are so many things to cover. In fact, I've been working on an automation book trying to cover all the common scenarios across two dozen chapters. So I will just be brief here with my top three recommendations.
My first recommendation is to always start with components. Say you have a process that separately involves SQL, Excel, and Outlook: you want to code them one by one, because within the same team and organization, different processes will involve similar components, and you can just reuse the code. My second recommendation is that we should definitely write plenty of tests to capture all the possible scenarios. Whether it be dependency tests, user tests, dev tests, or unit tests, we should do all the applicable ones. I know everybody trusts their own code. I do, too. But reality has taught me to trust the tests even more.
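As one concrete, entirely hypothetical example of such a test: suppose one of your reusable components cleans exported currency columns. A unit test for it can be as simple as this, written with base R's stopifnot() for self-containment (the testthat package offers the same idea with nicer reporting).

```r
# A small, hypothetical cleaning component: turn exported currency
# strings like "$1,200.50" into numbers.
clean_amount <- function(x) {
  as.numeric(gsub("[$,]", "", x))
}

# Unit tests: cover normal values, thousands separators, and NAs,
# so a change to the regex can't silently break a downstream report.
stopifnot(clean_amount("$1,200.50") == 1200.50)
stopifnot(clean_amount("300") == 300)
stopifnot(is.na(clean_amount(NA)))
```

Because the same component feeds many automated reports, one cheap test like this protects all of them at once.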
Lastly, be practical and stay on target. I know it feels great to be able to build all solutions within R. But the thing is, not everything needs to be fully automated. At the end of the day, it is not about building something cool with R, but building something impactful with R. I've gotten lost in that so many times. So let me say this again. It is not about building something cool with R, but building something impactful with R.
It is not about building something cool with R, but building something impactful with R.
Keeping the project going
Now, after the coding task is all done, we can hand off the process and move on. But there are still things we need to look out for in order to keep the project going. Overall, there are three deadly situations that could affect your project.
The first situation is that after you handed off your automated processes, someone ran the code. But the result isn't what's expected. Maybe because the script wasn't handled properly, like running it on a new machine or a new environment without the proper setup. Believe it or not, after all these years, it still happens to me.
So, to avoid this situation, what I would recommend is to always have a handoff document that covers the requirements of the job, instructions for running the process, testing the process, and what to do to maintain it. The second scenario is that as you build more and more automated processes, chances are multiple processes may run into issues at the same time. Sometimes someone will come to you needing help to fix it. Sometimes multiple people will show up, which can look pressing. So, my tip for you is this: always discuss the fail-safe in your handoff document. In most cases, a line like this will do the trick. This is almost like when the autopilot stops working: you don't stop flying or driving, you switch back to manual. And this often reduces the urgency and gives us the time to properly fix the issue.
Lastly, everything went well, and you just got a new feature request. Should you jump straight in? My recommendation here, well, two recommendations. The short-term one is to always treat it like a brand new task, so that you can start from gathering requirements and best determine when to work on it. The longer-term solution is that this is also a perfect opportunity to train individual team members to gradually take on the task and accumulate more knowledge.
Sharing progress
Now, after you're comfortable dealing with those three situations, we just need to give the project time to make progress. And just like any other project, from time to time, you want to share updates. To do that, I often find it helpful to start with the key stats: for example, hours saved, which is something you can easily extract from the requirements document you built in the previous steps; the number of process documents now available because of this project; as well as all the training that happened throughout the project, whether the official sessions or the one-on-one sessions you had with your team members. On top of that, you also want to share any success stories and learnings that happened along the way, and maybe give kudos to the team members you collaborated with. And lastly, you don't want to forget to talk about the hurdles you could use additional help on. Other than that, you also want to determine a good cadence for sharing progress. For me, it was once a month. But you should always set up your own, and know that you can share progress whenever you have critical updates.
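The headline hours-saved stat is just arithmetic over the figures already sitting in the requirements document. A hypothetical example, with every number made up for illustration:

```r
# Hypothetical figures pulled from one task's requirements document:
manual_minutes_per_run <- 45   # manual effort per run before automation
runs_per_year          <- 250  # a weekday-daily report

# Hours saved per year for this one task
hours_saved_per_year <- manual_minutes_per_run * runs_per_year / 60
hours_saved_per_year
```

Summing this across all automated tasks gives you the kind of number, like the 1,000 hours in the talk title, that decision makers respond to.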
A structure you can follow
Now, that wraps up the key parts of the project. If you would like to start something similar at your workplace, this is a structure you could follow. First, start by identifying the tasks in your workplace. Then, build your proposal with the benefits that would matter to your decision makers and workplace. After that, build a requirements document, possibly with R Markdown, and identify the right task to start with. Then, code by component, write plenty of tests, and stay on target, while at the same time trying to stay away from the three deadly situations. And don't forget to share progress from time to time.
Now, I would like to ask everyone to think back to the original situation. As we built more and more automated processes, we completely moved on from the original question of why the R adoption rate isn't what's expected after the R training, to now having most team members own several of these R processes, where they were at least involved in building the requirements, testing the process, and execution. Rather than expecting everyone to connect the R functionalities to the business needs through the R training, we connect the business needs to the R functionalities through this project. And we just happen to save thousands of hours.
Rather than expecting everyone to connect the R functionalities to the business needs through the R training, we connect the business needs to the R functionalities through this project.
Now, if you're a decision maker in this room, this might be a good project to make some impact and accumulate some R knowledge. If you're a team member in this room, bring this back and start by identifying the tasks. If you're one of the two team members in this room, text your boss now and say you have an idea. Thank you.