
Rika Gorn | From Zero to Hero: Best practices for setting up RStudio Team in the Cloud | RStudio
Learn best practices for setting up the entire RStudio Team infrastructure - Server Pro, Connect, and Package Manager - from the perspective of a data scientist and for a data science audience, especially those who have never worked with servers, AWS, or Bash. This talk will also be applicable to data scientists looking to start an engineering project outside of RStudio. I started out as a complete novice, and throughout my learning experience I noticed a distinct lack of resources for non-engineers. This talk will focus on best practices for AWS architecture and CloudFormation, key security issues such as SSL and HTTPS, server configurations, deployment errors, and most importantly, resources that are understandable for data scientists just getting into the data engineering or DevOps space. About Rika: Rika Gorn is the Manager of Business Intelligence at Spring Health - a mental healthcare tech start-up that provides comprehensive mental healthcare benefits. Previously, she worked on quality assurance for a mobile mental health team at Coordinated Behavioral Care, data analytics at Covenant House International, strategic management and evaluation at TCC Group, and program analysis at the Vera Institute of Justice. Rika received her Bachelor's in Political Science from Hunter College and her Master's in Public Administration at the NYU Wagner School of Public Service. Rika is also a proud board member of R-Ladies NYC.
image: thumbnail.jpg
Transcript
This transcript was generated automatically and may contain errors.
Hi, my name is Rika Gorn, and thank you for coming to my talk, From Zero to Hero, Best Practices for Setting Up RStudio in the Cloud.
So last year, I was given the keys to an incredible treasure. And when I say keys, I mean literally keys. I was handed three product keys so that my data science team at Spring Health, where I work, could start using RStudio Server Pro, RStudio Connect, and RStudio Package Manager.
Now my name is Rika Gorn, and I'm a data scientist. I'm not an engineer. I work at an incredible organization called Spring Health. We're a mental health care tech startup that provides a comprehensive mental health care solution to employers all over the world. Over the last year, we've been growing and scaling a ton. And side note, we are hiring. And RStudio Team would be a huge win for a very quickly growing data science team.
So what did this mean for me and my team? Well, RStudio Server Pro would allow us to run R in a secure, remote environment, and we wouldn't have to rely on our local computer for computationally expensive jobs. RStudio Connect would allow us to publish all of our data products, including Shiny apps, R Markdown, dashboards, Plumber APIs, and quickly source them to other departments. And Package Manager would allow us to centralize and manage how our team uses internal and external packages.
So needless to say, this was an incredible win for my team, and we were super excited to get started. But because of how quickly everything was moving, while I was given the support of a fantastic engineer, the bulk of setting up and configuring RStudio Team would fall on me.
Now, I'm very comfortable coding in an R IDE or working with R Markdown documents, but I didn't know anything about servers or setting up infrastructure in the cloud, and I was pretty sure that you couldn't code up a server in an R Markdown document. So what did I do?
Well, of course, first I went to Google. Now, RStudio has a ton of great guides for administering their products, but since these are geared toward engineers or sysadmins, I didn't even know where to put all the code the guides were talking about. And so this was the beginning of my journey into the world of data engineering.
As I started learning, I realized that there was a huge disconnect between resources available for engineers and resources for data scientists to learn engineering tasks. I also learned that data scientists, especially in smaller organizations or startups where engineering support can be scarce, desperately need access to engineering skills and resources if they want to learn to quickly deploy their own data products.
A roadmap for data scientists learning engineering
So today what I want to do is share with you my own roadmap for how to start the process of learning data engineering. If you're looking to set up RStudio Team, this talk should be very helpful for you. But even if your first engineering project involves a different server setup, most of this talk should apply to you as well.
So my roadmap is framed around three different parts. People: how you can use people around you and in your organization to help you set up your first engineering project. Learning: the most important things for you to learn as a data scientist as you go on your data engineering journey. And implementation: what to do when you're actually in the thick of things, in the weeds of your project.
People: partnering with an engineer
Okay, so you're about to start your first data engineering project. What now? So even if you're going at this alone or as part of a small group, it's super helpful to have an engineer in your corner. It's important to understand that your relationship shouldn't be combative.
I think as a data scientist, very often you want all the data possible at your fingertips, whereas engineers, a lot of times they have particular security concerns and don't give you the data that you need as quickly as you may want it. So it's important to start a good relationship with your engineer and to learn from them what their security concerns may be for your server and to understand where their concerns are coming from.
It's also important, before you start your project, to really understand the value of your project. Why are you doing this? Who will benefit? Who will use this new server, this new product? Are there people in your organization that you need to train or bring over to your side? It's important to make sure that there's a pot of gold at the end of your rainbow, because what you're going to be doing is taking time away from your regular data science tasks and focusing on a new skill set in data engineering. So it's important that, before you start, you know the work you're doing will very clearly bring value to you and to your team.
Learning: Bash, architecture, and security
So, technology and learning, specifically for engineering projects. My biggest weakness when I started was that I wasn't comfortable using Bash. For me, it was this kind of scary, black, blinking command line, and I found a lot of it very unintuitive: one-letter commands and arguments that were cryptic compared to R.
So here we have on the left-hand side some R code, and you can see it reads very easily, almost like a sentence. You can see what's being grouped, what's being counted, whereas the Bash on the right-hand side can be kind of confusing. There are one-letter commands, and it's unclear what's happening. So it's important to start learning a little bit about this. You don't have to learn everything, but start. Learn how to move around your directories. Learn how to copy and move files. Learn how to log into your server using the SSH protocol. And make sure that you actually have access to the correct files, and if you don't, learn how to change the permissions on your files using Bash.
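To make those first steps concrete, here's a minimal Bash sketch you can run in a scratch directory. The paths, filenames, and the example hostname are placeholders chosen for illustration, not anything from the talk:

```shell
# Practice the basics in a throwaway directory.
mkdir -p /tmp/rstudio-demo          # make a working directory
cd /tmp/rstudio-demo                # move into it
echo "port=8787" > rserver.conf     # create a small example config file
cp rserver.conf rserver.conf.bak    # copy it before you edit anything
ls -l                               # list files with their permissions
chmod 600 rserver.conf.bak          # restrict the copy to the owner only

# Logging into a remote server over SSH (placeholder host and key,
# so this line is left commented out):
# ssh -i key.pem ubuntu@203.0.113.10
```

`chmod 600` means read and write for the owner and nothing for anyone else, which is also what SSH expects for private key files.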
Another piece of learning that's important: draw out your actual server architecture. And when I say draw it out, I mean actually take out a piece of paper and draw it out. It can even help to look at AWS architecture diagrams on the internet and try to decipher what they mean. AWS has a million and a half services, and as a data scientist entering that world, a lot of it can be super daunting to understand, so learning what all the various terms for the different services and products mean can be really helpful. That's why drawing out your diagram, even if it's super simple, can help you organize your work before you start. Sharing that diagram with your engineer can also help you prevent errors in your system before you even start.
So here's an example of a simplified AWS architecture diagram. When you're first drawing it out, it's important to think about which parts can actually talk to each other, what is public, what is private, and which parts of the server point to which other parts. So here we have the VPC, or Virtual Private Cloud. This is your virtual network, and it's hosted on AWS, but you can also host servers on Azure or other cloud providers. Here is where we have the meat of what makes RStudio Team, RStudio Team: we have three EC2 instances, or three smaller servers, that have all the data that make up RStudio Server Pro, Connect, and Package Manager. These EC2 instances live in a private subnet, which creates an added layer of security so that random people can't just access your server.
So here we have a bunch of extra layers of security. We have a load balancer, which helps route the correct requests to your servers. We also have a bastion or jump box, which adds yet another security layer so that you don't jump directly into the RStudio Server. You have to go through an added jump box in order to get into your server. Now once again, this is a simplified model, but it starts to show you what AWS products you as the data engineer have to start setting up in the cloud.
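As a sketch of how that jump box works in practice, OpenSSH's ProxyJump option lets you hop through the bastion automatically. The IPs and usernames below are placeholders, and in real use this entry would live in ~/.ssh/config rather than /tmp:

```shell
# Write an example SSH client config with a ProxyJump entry.
mkdir -p /tmp/sshdemo
cat > /tmp/sshdemo/config <<'EOF'
Host bastion
    HostName 203.0.113.5        # public IP of the jump box
    User ec2-user

Host rsp-server
    HostName 10.0.1.25          # private IP inside the private subnet
    User ec2-user
    ProxyJump bastion           # route the connection through the bastion
EOF

# One-off equivalent without a config file (not run here):
# ssh -J ec2-user@203.0.113.5 ec2-user@10.0.1.25
echo "wrote $(wc -l < /tmp/sshdemo/config) config lines"
```

With an entry like that in your real ~/.ssh/config, `ssh rsp-server` reaches the private instance via the bastion in one command.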
Now, one last learning topic that's critical before you start your project: ports and security. This is a huge topic, but it's helpful to know just a little bit about how computers network, a little bit about security management, and the most common internet protocols in use. This is one of those topics that engineers seem to know a lot about, but when I was learning, I found there weren't many resources available specifically for data scientists.
So ports are communication endpoints that allow computers to talk to each other in various secure ways. We have port 22 for SSH, 25 for SMTP, 80 for HTTP, and 443 for HTTPS, protocols you may have heard of. It's important to learn just a little bit about what ports are available and which ones you have to turn on and off on your server.
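Here's a small cheat sheet of those ports, printed from a Bash lookup table. The RStudio Server port is its documented default, 8787; the others are standard assignments:

```shell
# Common ports and what they carry; adjust for your own server's needs.
declare -A ports=(
  [22]="SSH - secure remote login"
  [25]="SMTP - outbound mail"
  [80]="HTTP - unencrypted web traffic"
  [443]="HTTPS - encrypted web traffic"
  [8787]="RStudio Server default"
)
for p in 22 25 80 443 8787; do
  printf "port %-5s %s\n" "$p" "${ports[$p]}"
done | tee /tmp/ports.txt
```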
SSL and TLS are protocols that make secure internet communication possible by encrypting internet traffic. I like to think of this as a handshake between different computers. Setting up these protocols requires learning a little bit about certificates and configurations, so it's important to learn what protocol you'll be using when setting up your project.
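One way to get hands-on with certificates is to generate a throwaway self-signed one with openssl and inspect it. This is for practice only; a real deployment needs a certificate from a trusted authority, and the hostname here is a placeholder:

```shell
# Create a self-signed certificate and key (testing only, no passphrase).
openssl req -x509 -newkey rsa:2048 -nodes \
  -keyout /tmp/server.key -out /tmp/server.crt \
  -days 365 -subj "/CN=example.internal"

# Inspect the subject and validity dates of what you just created.
openssl x509 -in /tmp/server.crt -noout -subject -dates
```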
Implementation: getting started
Okay, so you've done your learning and now it's time for implementation. My main piece of advice is to get started. Obviously, do your research, but don't fall into the trap of analysis paralysis. As a data scientist, I kind of wanted to know everything available before I started, but what happened is that I did two weeks of research and then realized I had nothing to show for it other than theoretical knowledge.
It's helpful here to take a lesson from your engineering teams and use a more agile approach where you get started quickly, you may fail quickly, but then you can get started again just as quickly.
All right, so you've officially started your project, what do you do now? Use your engineer as a guide and to check in on your work. They can tell you when you've done something horribly wrong, like when I accidentally opened all of my instances to the entire world, and they can also help you to finish tasks that you may not have the appropriate security clearances for. So for example, if you want to point your server at a particular domain name, they can help you to do that and they could also help you to estimate costs for your server.
Learning: it's important to start looking at the new data formats and files that you're going to be using in your server. To set up RStudio Team specifically, RStudio provides a CloudFormation template for push-button deployment. Now, when I started looking at this, I was very excited, because it looked like something familiar from the R world: it looked basically like a YAML file. So it's important to get to know these files. Don't just deploy them: look at them, read through them, understand their defaults. When I started, I did not look at the defaults of my CloudFormation file, and then I had to destroy my entire stack and start from scratch because I hadn't changed a really important default. So get to know your files, get to know your file formats, read through them, understand their defaults.
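As a sketch of what "read the defaults" looks like in practice, here's a tiny parameter block in the same YAML style (this is my own illustrative fragment, not RStudio's actual template), plus a one-liner to list every default before deploying:

```shell
# Save an illustrative CloudFormation-style fragment locally.
cat > /tmp/template.yaml <<'EOF'
Parameters:
  InstanceType:
    Type: String
    Default: t3.large        # size (and cost) of each EC2 instance
  KeyPairName:
    Type: AWS::EC2::KeyPair::KeyName
  SSHCidr:
    Type: String
    Default: 0.0.0.0/0       # WARNING: open to the whole internet; change it
EOF

# List every default in the template before you deploy it.
grep -n "Default:" /tmp/template.yaml
```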
So implementation: you are going to be making mistakes, so make sure that you set up guardrails and fail-safes for those potential mistakes. First, take snapshots of your instances. This is something that you can do in AWS: after you configure your server, you can take a snapshot (a machine image) of it, so if you ever mess anything up and need to go back, you don't have to start over from scratch.
As you set up your server, there are going to be lots of passwords, PEM keys, PPK keys, and certificates. Save all of these in a secure way, for example in a password manager like 1Password. Don't just save them on your local computer. And lastly, write good documentation - documentation that an engineer can follow, that another data scientist can follow - because while you're the admin at the moment, you may not always be the admin for this project. So it's important to write good documentation as you go along.
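One concrete guardrail for keys that do end up on disk is keeping them owner-only. A small sketch using a dummy file in /tmp (SSH itself refuses to use private keys with loose permissions):

```shell
# Create a dummy "key", give it deliberately loose permissions,
# find the problem, then fix it.
mkdir -p /tmp/keys
echo "demo" > /tmp/keys/server.pem
chmod 644 /tmp/keys/server.pem           # group/world readable: too loose

find /tmp/keys -name "*.pem" -perm /044  # lists keys readable by others
chmod 600 /tmp/keys/server.pem           # owner-only, as a key should be
find /tmp/keys -name "*.pem" -perm /044  # now prints nothing
```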
Configuring your server and training your team
All right. So your server is set up, whether it's for RStudio Team or for another project. Now it's time for the fun part, which is to configure all of your settings and defaults.
So when you're configuring, and after you've set up the main parts of your server, you need to get buy-in from the data scientists on your team, because they're going to be the ones testing your deployment. It was helpful for me, at least, to have a testing log to write down which deployments were working and which were failing. This was also helpful later, when I was actually writing my documentation.
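The testing log doesn't need to be fancy. The columns below are just my own suggestion for a minimal append-only CSV; the product and artifact names are made up:

```shell
# Append one row per deployment attempt, then review the whole log.
log=/tmp/deploy-log.csv
echo "date,product,artifact,result,notes" > "$log"
echo "$(date -I),Connect,sales-dashboard,OK," >> "$log"
echo "$(date -I),Connect,plumber-api,FAIL,missing env var" >> "$log"
cat "$log"
```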
Understand who in the company needs to be trained alongside your data scientists, but don't bring everyone in until you've figured out most of the bugs. You don't want to be doing demos of your product while it's still buggy, and you don't want to discourage your users and the consumers of your product before they can get started and get excited about your new project. So set up training and testing well before you introduce it to folks at your company, and make sure there's a little bit of time before you actually start demoing.
All right, so once you've set up your server, you're now officially the root user with sudo access. It's important that you use this power for good and not for evil. So learn about different types of authorization: Google Auth, multi-factor authentication, API keys, basic authorization. As you configure your product, you're going to need to give folks in your company access to it, and this has to be done in a secure manner. Your engineer can definitely help you here, but knowing a little bit about the differences between authorization protocols can be super duper helpful. And once again, you may be the root user now, but you may not be the root user later.
Implementation. This is the fun part, for me especially. Get inside your server and play around. Look at the important main files. For RStudio Server Pro, RStudio Connect, and RStudio Package Manager, I found that five configuration files were super duper important in making sure that deployment was actually working. When I first started, I had a lot of deployment errors, and I realized it was because I hadn't changed some of the defaults in these files. So get into your server, understand what the paths are like, understand what the important files are, understand what their defaults are, and how and if and when they should be changed.
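As a hedged starting point, the paths below are the standard configuration locations from RStudio's admin guides of that era, and my guess at files worth reading first; verify them against your installed versions:

```shell
# Likely configuration files to read before changing anything.
files=(
  /etc/rstudio/rserver.conf                  # Server Pro: ports, SSL, auth
  /etc/rstudio/rsession.conf                 # Server Pro: session defaults
  /etc/rstudio/launcher.conf                 # Server Pro: job launcher, if used
  /etc/rstudio-connect/rstudio-connect.gcfg  # Connect: server, email, auth
  /etc/rstudio-pm/rstudio-pm.gcfg            # Package Manager: repos, storage
)
for f in "${files[@]}"; do
  if [ -f "$f" ]; then echo "found   $f"; else echo "missing $f"; fi
done | tee /tmp/rstudio-configs.txt
```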
Summary
All right. So this is my entire system for setting up a data engineering project as a data scientist. So before you start, partner with an engineer and understand their security concerns as well as the larger value that your project brings to the entire team. Learn a little bit about Bash, about ports and network administration, and draw out your AWS infrastructure diagram on a piece of paper. And when you're ready to implement, get started as soon as you can.
Throughout your project, use your engineer as a guide. Get comfortable with different data formats and set up guardrails in case of failure. Once you've set up your project, take the time to train your team, to test your project, and to shake out any little bugs and issues that may come up. Learn a little bit about user authorization and root access. And then, most importantly, get inside your server, play around, and start changing your defaults to your needs. And finally, remember to have fun.
Thank you so much for taking the time to listen to this talk. I look forward to answering your questions and connecting over the rest of this conference. Thank you so much to everyone at RStudio and everyone at Spring Health who helped me with this talk.
