Sean Lopp | R & Python: Going Steady | RStudio

Transcript

This transcript was generated automatically and may contain errors.

Hi, my name is Sean. I'm a product manager at RStudio. As a product manager, I get to talk to data science teams, both small and large, all over the world. And unfortunately, in interacting with these teams, we often hear that they're confronted by a false choice to either pick R or Python.

This choice might occur when they're writing a job description to hire a new data scientist. It might occur when they're trying to decide what to learn as individuals who want to upskill themselves, or it might occur when they're interacting with IT, asking for resources to support a project.

So why do teams face this false choice? Do we really need to pick between one language or another? Well, the answer is no. And today we're going to talk about why. We're going to look at some of the common myths that lead to this choice, how to debunk those myths, and ultimately how data science teams can be most effective when they choose to use R and Python together.

The screwdriver analogy

Now, to set some context, imagine you're a craftsman, someone who is handy and builds things. As a craftsman, you'd probably be familiar with screwdrivers. I mean, who hasn't used a screwdriver to put something together? And you might be aware that screwdrivers come in a variety of different shapes and forms. You can have a flathead screwdriver, a star bit, a Phillips head, and each of these screwdrivers is designed to serve a specific purpose.

Now imagine as a craftsman, if you were told that you have to choose for the rest of your career between using one type of screwdriver or another, you'd probably look at that person and say, that's crazy. There's no way I can practically make this choice. For some projects, I'm going to need a flathead. For other projects, I'll need a Phillips. It doesn't make sense for me to pick one screwdriver to use for the rest of my career.

Instead, what you might do as a craftsman is to say, I want to opt in to using a smarter tool, a tool that's going to be more powerful and allow me to take advantage of all these different bits that are out there. Specifically, as a craftsman, you might be interested in something like a drill, a tool that regardless of what bit you're going to need for a project, allows you to work faster and to accomplish more and allows you to work in an easier fashion.

So craftsmen have this drill. What about data science teams? Well, I believe as data scientists, we should refuse that same false choice between R and Python and other languages, just as the craftsman refuses to pick one type of screwdriver. Instead, we should work alongside folks in IT and the leaders of our teams to build something like a drill, something that, regardless of what language we use, is going to give us the power to accomplish our projects faster and easier.

Debunking myth one: more languages means more work

But first, I want to address some of these common objections that you'll hear. People that say, no, no, no, there's no such thing as a drill for data science. We have to pick a single language or a single screwdriver. So where do these objections come from? What's the biggest objection that data science teams face?

Well, the first one is this belief that if we are to support more than one language, we'll end up doing a lot of duplicative work. So for example, if I were a team that wanted to use R and Python, IT might be worried that now I have two times the amount of work to do. Instead of supporting one language, I now have to support two. That means twice the number of installs, twice the number of support tickets, twice the money spent on IT resources.

And luckily, this line of thinking simply isn't accurate. The reason for that is because regardless of what language you use, the core things that IT needs to support are the same. Things like computation, logs, authentication, security, data access. These provide a common core, that drill, that we can invest in regardless of which drill bit, which language, we end up using.

So I want to show you some of those tools that we've worked on at RStudio to make that type of underlying investment, that core of data science infrastructure, accessible.

Said another way, one of the big reasons organizations invest in data science in the first place is because black box solutions like Tableau or Power BI simply don't meet the needs of the rich, complex real world problems that they're facing. So why would it make any sense for those same data science teams that have recognized they need the flexibility and power of code to achieve outcomes to then sabotage themselves by limiting themselves to only one language?

And so today, my final plea is to pick the people that will make your data science team effective and then supply them with what they need. Don't make people subservient to tools. It should be the other way around. Allow data science teams to pick whatever language or tools can be most effective. And to help you do that at RStudio, we've invested in building out drills so that regardless of what drill bit you need, you can effectively build something faster and easier. Thank you so much for your time, and I look forward to your questions.
