Jeff Allen | RStudio Connect Past, present, and future | RStudio (2019)

Transcript

This transcript was generated automatically and may contain errors.

All right. Well, thank you for being here. I'm excited to talk to you today about RStudio Connect. I want to talk to you a little bit about where the product's been. I'm going to unveil a few things that we've been working on in recent months today. And then I want to talk to you about where we're going in the future.

But I'm going to do this through a vehicle that's a little bit unorthodox. So just between us, and this doesn't leave this room, I've been working on a little side hustle called Hats4Cats.me. This is outside of my role at RStudio. And basically, Hats4Cats is all about addressing the booming market of cat owners who need headwear for their pets.

And so we've raised a round of funding and hired a data science team, and I got a front-row seat to our experience using R there and what that looked like. And so I thought I'd share that with you today and see if any of that resonates with your own journey.

So things started off really great. Everyone was in R. We had two data scientists originally. They were both using R. And they both installed R just recently when they got started. They were using the latest versions of R packages. And everything was just fine.

There were only a handful of artifacts that they produced. So they had a couple R Markdown reports. We had one Shiny app. Everybody knew where everything was. And so there wasn't really any ambiguity. And we were all using the latest versions of everything. So everything just worked out fine. But pretty quickly, things went downhill.

So after about four or five months, we had enough content now that people started getting confused about where does this report live? Where does this Shiny app live? And our data scientists were spending all their days not actually doing data science, but answering these requests about can you send me this report? Can you rerun this report to get this week's data? Or can you run this report just for my product?

And so they spent all their time doing these menial tasks instead of doing the stuff that we really want them to do, which is unlocking value by tapping into and exploring the data that we have. Secondly, it was all irreproducible. So we started having problems where one analyst would send the other analyst some code, and they weren't able to run it successfully. Or they would go to rerun something that they'd run six months ago. And all of a sudden, nothing works because some package changed. And so they'd spend all day fixing this thing that should have taken five minutes.

And the content is spread all over. So we have dozens of reports now. They're on our network share. They're on Slack. They're trickling around in email. And nobody knows where anything is, which creates a huge problem. And then lastly, we had some issues with security. On a couple of occasions, we had some sensitive financial reports that ended up on a network share that was accessible to the whole company, which nobody wants.

And so now I have one R Markdown document that I'm maintaining, one bit of source code that I've worked on. And yet, all these people are able to kind of get the customized, tailored views that they're interested in, that they can even define themselves without me having to be involved at all.