Kate Hertweck | R training and documentation for different levels of expertise

Transcript#

This transcript was generated automatically and may contain errors.

Welcome, my name is Kate Hertweck, I'm the bioinformatics training manager at Fred Hutchinson Cancer Research Center in Seattle, Washington. My career focus is helping scientists learn the coding and data skills they need to apply cutting edge computational methods and accelerate the rate of science.

I'm excited to tell you about one of the best parts of my job, helping researchers unlock the mindset and tools that help them make the most of their data and get more work done in less time.

Here's how the general process of teaching someone R usually happens. I work for a biomedical research center with a huge breadth of research topics, but everyone wants to make figures for publication. My community loves R, especially Tidyverse , because it makes sense to them to be able to achieve those goals.

In my introductory R class, I start by teaching some general skills in data manipulation and filtering. My participants learn to create elegant, effective data visualizations that help them explore their data.

After a few hours bent over their keyboards, I can see the slowly dawning realization in their eyes. With the power unlocked by coding, I will surely obtain a statistically significant result supporting my insightful hypothesis, yielding fame and glory, but more importantly, a paper in Nature or Science, as well as future grant funding.

In case you're not a scientist and this doesn't resonate with you, let's think about it from a layperson's, well, lay dog's perspective. For scientists, unlocking the power of coding is like when we're dogs getting a one-way ticket to the dog park, complete with all of our favorite toys and lots of interesting things to sniff.

That moment when the dog realizes the dog park is near is like that moment when an important concept about data or computational work clicks for someone learning R, and they see a world of opportunity open up for them.

Unfortunately, all too often, when the time comes to apply their skills to their own data, they're confronted with a console that looks more like this. These red error messages are sometimes scary, often confusing, and always frustrating. The prevalent feeling they experience is disappointment, tinged with a touch of, how am I supposed to achieve fame and glory when I can't even get R to acknowledge my data exists?

That crushing defeat in the face of such promise is also something a dog can relate to. It's the equivalent of jumping in the car, thinking we're heading to the dog park. We can almost feel the breeze, taste the rubber ball in our mouth, and smell the grass, instead ending up at the vet. This is exactly the opposite of what we wanted. It's disappointing and demoralizing, so much so that we don't want to get in the car again, even if we might end up at the dog park eventually someday.

From learning to applying

So how do we jump over this barrier, restoring a belief that we can achieve those lofty goals? When I accepted my current position to develop and implement a training program in reproducible computational methods two and a half years ago, I thought the most important part of my job would be time spent in a classroom. I had spent the previous four years teaching computational skills to students at the university level, and after an entire career in academia, I thought that the highest impact I could have at a research center would involve teaching formally structured short courses.

During my career, I've had the privilege of personally teaching hundreds of people to code in R. Especially in my last few years focusing on adult learners outside of formal university courses, I've learned a lot about what it takes to help people succeed in learning R, and that success is not solely reliant on formal short courses.

Now I define success as whether people continue to work with R after leaving my classroom, or at least if they gain some general literacy in data and computing skills so they can work more effectively with computational staff.

Now I define success as whether people continue to work with R after leaving my classroom, or at least if they gain some general literacy in data and computing skills so they can work more effectively with computational staff.

Given the breadth of research topics members of my community pursue, I am faced with the monumental task of helping people apply R skills to many different types of data and research questions. The question I then continually ask myself as I prepare to deliver training materials is this, what prevents people from using these tools after they've learned them?

My job as a trainer is to provide guidance that fills in those gaps, which allows learners to keep moving in the same direction, but without a leash, so that they have guidance, but more importantly, autonomy.

So while I'm developing support materials, I consider identifying what knowledge is assumed in the materials I'm developing, as well as collecting curated examples of code that apply methods commonly desired in my community. In fact, one of the most effective things we can do is share high-quality training materials with the people for whom they are most suited.

The ecosystem of tools available in the R community really reflects the interests of the people comprising that community. With so many tools to maintain and develop, is it worth developing and maintaining support materials, too? Well, we share our code so other people can benefit from our hard work, and to increase the impact of our efforts. If sharing our code results in a multiplier effect for our effort, then making it easier for people to use the code multiplies it yet again.

At a time when disparities in the world feel more stark than ever, access to information makes a huge difference in promoting equity for developing these skills. These values are why we support open science projects, and that means considering how even our small actions can continue to uphold these values. In many cases, we don't even have to create these materials ourselves. Knowing what resources exist and helping raise their visibility can have a huge impact, too.

Advice for every level

Regardless of what level of expertise you think best describes you, please consider the following. For those of us beginning to code in R, it's likely we'll encounter a piece of documentation we don't understand. That's okay, and it's not our fault. That information probably isn't written to be accessible for people with our type of expertise. There's almost certainly a better resource out there, or materials that can help us bridge that gap.

For those of us who are practitioners, worrying that, even if our code works, it just isn't good enough, it's okay. Every expert coder has been where we are now. There's also no requirement that we keep learning more advanced skills, but the community is here to help us, and it'll probably be very satisfying to learn.

Finally, those of us who are experts developing R packages, unsure how to make the information accessible to a broad audience, it's okay. This is a great opportunity for us to collaborate with someone with a different type of expertise. These relationships can help us share our tools more broadly while supporting other community members in advancing their own skills.

For all of us struggling to communicate with people who possess different knowledge and skills, remember this is something everyone confronts. Given that we're all continually learning things, and yes, also forgetting things periodically, our own knowledge and that of those around us is constantly changing too, making for a sometimes rapidly moving target. One of the best things we can do for each other is understand that we can't expect a resource to be one size fits all, and that helping someone else learn is a way of supporting the entire community, as well as yourself in the future.

One of the best things we can do for each other is understand that we can't expect a resource to be one size fits all, and that helping someone else learn is a way of supporting the entire community, as well as yourself in the future.

I learned about the basic framework of levels of expertise from the Carpentries. This nonprofit group teaches reproducible data and computing skills, including R, to communities worldwide and have a number of resources related to training diverse audiences and technical skills. My current work at Fred Hutch builds on methods and materials from the Carpentries. I've adapted their workshop content to directly apply to the research community I serve. We otherwise apply the concepts I've discussed here to training and community development.

I'm also proud to be advising MetaDocencia, a group focused on a different type of targeted community development, supporting educational practices for Spanish-speaking communities, and this includes teaching technical skills. Each of these groups facilitate training for diverse audiences and demonstrate how even small changes can mean a lot for individual learners in the R community.

If you'd like more information about the ideas I've mentioned here, check out the resources I've included in RStudio's collection of conference materials. If you'd like to talk about computational training and customizing R resources, follow me on Twitter. I can also recommend my Twitter account if you'd like to see pictures of my favorite coworker, Loki. He wouldn't consider himself to be even a novice R coder, but he does listen very intently to all of my thoughts about training and community building.

Thanks for watching, and for helping support a bigger, more inclusive global R community.

Kate Hertweck | R training and documentation for different levels of expertise | RStudio

Transcript#

From learning to applying

Levels of expertise

Meet Ash, Avery, and Quinn

Prioritizing support and documentation

Guidance without a leash

Advice for every level

Featured software#

rstudio