
Shirbi Ish-Shalom | Using R to Up Your Experimentation Game | RStudio
Have you ever cut an A/B test short? Maybe because of traffic constraints, your antsy boss, or early successful results. In reality, cutting your test short can be catastrophic, making your business decision no better than a coin flip. Learn some R-driven tips & tricks to get meaningful results quickly with a statistically rigorous methodology called sequential testing, an A/B testing enhancement my team employs at Intuit. Key Takeaways: 1) What sequential testing is and how to use it. 2) How to learn (and fail!) quickly by taking big metric swings. 3) How I used R to share my learnings & make them useful for anyone (even non-data scientists!) at my company. About Shirbi: Shirbi Ish-Shalom is a human person
Transcript
This transcript was generated automatically and may contain errors.
Have you ever heard the phrase, "The results were significant early, I guess I can call it now," or "My boss is getting really antsy and I think she wants the test results soon. They seem directionally positive," or "We really need to balance statistical rigor with the needs of the business"?
The same flurry of phrases would always overwhelm the data scientists in my meetings, bad fluorescent overhead lighting illuminating the battleground between statistical rigor and the omnipresent needs of the business. As a former data scientist turned product manager and self-declared data nerd, I see both sides of the story. I wanted to help bridge the gap between my business and data peers by understanding experimental methods that allow our organization to run tests more quickly and maintain statistical rigor.
Today, I'll walk you through three easy-to-follow steps, motivated by my own experiences at Intuit, so that you can apply them to your own use cases and run experiments more quickly with statistical rigor. First, we'll talk about how to take big swings so you can learn and fail quickly. Second, we'll talk about what sequential testing is and how to use it so that you can end tests early confidently. And third, we'll talk about how to use R so that you can share your learnings with your organization, even non-data scientists.
The danger of reading tests early
In reality, businesses do need to move quickly to stay innovative. However, the results of reading a test early could be catastrophic, leading to your business decision being no better than a coin flip.
Here is data for a simulated AA test, a test where we have two cohorts but they have exactly the same experience. We know that since these two experiences are exactly the same, there should be no statistically significant difference between the two. However, as you can see over a 30-day window, the p-value actually crosses the significance threshold a number of times, all due to random chance.
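This effect is easy to reproduce. The sketch below is not the speaker's actual simulation; it is a minimal R version of the same idea, with an assumed daily traffic of 500 samples per cohort and an assumed true conversion rate of 7% for both cohorts.

```r
# Simulate an A/A test: two cohorts drawn from the SAME conversion rate,
# with the p-value recomputed each day as samples accumulate.
set.seed(42)
days    <- 30
daily_n <- 500     # assumed daily traffic per cohort
p_true  <- 0.07    # same true rate in both cohorts (no real difference)

a <- rbinom(days, daily_n, p_true)
b <- rbinom(days, daily_n, p_true)

p_values <- sapply(seq_len(days), function(d) {
  prop.test(c(sum(a[1:d]), sum(b[1:d])),
            c(d * daily_n, d * daily_n))$p.value
})

# Even with identical cohorts, the running p-value can dip below 0.05
# on some days purely by chance -- the noise an early call would read.
p_values < 0.05
```

Plotting `p_values` against day typically shows the curve wandering across the 0.05 line, which is exactly the pattern in the chart above.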
If you were to call your test early, you could be reading this same noise. And instead of making an informed, data-backed business decision, you could simply be rendering your business decision no better than if you had flipped a coin.
Taking big swings: lessons from a first experiment
The first big experiment I ever ran, I fell prey to this exact fallacy. We were testing a new version of a recommendation model, and we wanted to test our recommendation model out in the wild against an older iteration. This was the perfect candidate for an A-B test. However, to move our baseline metric by 3%, we needed almost a million samples, and the test would take us three months to run. Guess what? No surprise, we cheated.
For the recommendation experiment, we used three-day connection rate as a baseline metric. This suffered from being a bottom-of-the-funnel metric, because not only did the users have to see the recommendation and click on it, but then they had to connect and also hold on to that connection for at least three days for us to register that as a win. That led to our metric being only 2.88%, which is fairly low. To make matters worse, we were taking only a tiny swing, only aiming to move our baseline metric by 3%. With such a low minimum detectable effect of 3%, we needed 12 weeks or three months to be able to fully bake our test.
If you think about it, we were looking to move a baseline metric of 2.88% by only 3%, which would have led to an absolute lift of less than a tenth of a percent. If we had taken a bigger swing of 10%, 7.5%, or even just 5% as our minimum detectable effect, we could have reduced our test time from the 12 weeks we needed to just two weeks.
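The sample size math behind this can be checked directly with base R's `power.prop.test`. This is a sketch of the calculation, not the speaker's exact one; it assumes the 2.88% baseline, 80% power, and a 5% significance level, and shows how sample size collapses as the relative MDE grows.

```r
# Sample size per cohort for a 2.88% baseline at several relative MDEs,
# at 80% power and a 5% significance level.
baseline <- 0.0288
mdes     <- c(0.03, 0.05, 0.075, 0.10)   # 3%, 5%, 7.5%, 10% relative lifts

n_per_mde <- sapply(mdes, function(mde) {
  ceiling(power.prop.test(p1 = baseline,
                          p2 = baseline * (1 + mde),
                          sig.level = 0.05,
                          power = 0.80)$n)
})
names(n_per_mde) <- paste0(mdes * 100, "%")
n_per_mde   # sample size shrinks roughly with 1 / MDE^2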
Applying big swings to a second experiment
A year later, we ran a second experiment where we wanted to learn from the missteps of our first. In this second experiment, we wanted to test automating sales tax: we wanted to show users the sales tax rate specific to their region in their first-time-use experience. This time, we wanted to apply the principle of taking big swings to our test. We used a metric much closer to the user action, engagement clicks, with a relatively high baseline of about 7%. More importantly, we declared that we were going to take a big swing and said our minimum detectable effect had to be 100%, meaning that our baseline conversion rate of 7% had to double to 14%. This time, our projected timeline came down to only two weeks.
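Running the same `power.prop.test` calculation on these inputs (a sketch, assuming exactly a 7% baseline doubling to 14%) lands in the same ballpark as the figures quoted later in the talk:

```r
# Sample size for the sales tax test: 7% baseline, 100% relative MDE
# (doubling to 14%), 80% power, 5% significance level.
plan <- power.prop.test(p1 = 0.07, p2 = 0.14,
                        sig.level = 0.05, power = 0.80)
n_per_cohort <- ceiling(plan$n)   # roughly 300 samples per cohort
n_per_cohort
```

Around 300 samples per cohort versus the near-million for the recommendation test — the big swing alone is what makes a two-week timeline possible.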
The key idea, taking big swings to learn and fail quickly, is one of the biggest levers you can pull to speed up your experiments. But we wanted to do even better. On top of our experimental philosophy of taking big swings, we were able to layer on an A-B testing enhancement, sequential testing, which allowed us to take our two-week test time and actually cut it in half to just one week.
What is sequential testing?
But what is sequential testing? Sequential testing is kind of like doing your taxes. It allows you to not overspend and waste time waiting for results when you could have already called it. But it also ensures that you don't underspend time and call a test that's not statistically rigorous. Sequential testing allows you to spend exactly the right amount of time, such that when you call the test, you neither overspent nor underspent time to be able to call a winner.
The way that it does that is by building in a checkpoint, or multiple checkpoints, where you can check the results of your test and peek earlier than a standard fixed-horizon test. Sequential testing is also ideal for software situations, because software gives us the unique ability to monitor results in real time, which means that at any moment we peek, we can go ahead and analyze the results in that moment.
Additionally, our experimental philosophy of taking big swings really lends itself to sequential testing, because sequential testing has the unique ability to call winners early. For our sales tax test, we applied two checkpoints: one at the 50% mark, where half of the total samples had come in, and one at the 100% mark, where the full sample set had come in at approximately two weeks. At each checkpoint, you can see that we have a z-score acceptance criterion necessary to be able to successfully determine a winner. The acceptance criteria are much more stringent at the earlier checkpoints to account for the smaller sample size you're using to determine whether there's a winner.
Note that the z-score acceptance criterion is greater than the 1.96 critical value we normally see during fixed-horizon testing. It's there because we're paying a small price to be able to peek earlier. Namely, the price is that our experiment is slightly less powerful in a statistical sense, so we may miss the detection of a true effect or win. In this case, we're using something called the O'Brien-Fleming boundary, which we've chosen because it minimizes that payment at the end of the test: we get as close to the 1.96 critical value as we can with that last peek. As an aside, if you want to dig deeper, there are many other boundaries you can use which may suit your needs better.
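For intuition, the O'Brien-Fleming boundaries for equally spaced looks can be approximated as z_k = C · sqrt(K / k). This sketch is not how production software derives the bounds — packages such as gsDesign solve for the constant numerically — and the value 1.977 for K = 2 looks at overall two-sided alpha = 0.05 is an assumed constant taken from standard group-sequential tables.

```r
# Approximate O'Brien-Fleming z-score boundaries for K equally spaced
# looks: z_k = C * sqrt(K / k), with C chosen so the overall two-sided
# type I error stays at 5%.
K <- 2
C <- 1.977                       # assumed constant from standard OBF tables
z_bounds <- C * sqrt(K / seq_len(K))
round(z_bounds, 2)               # stricter bound at the 50% look,
                                 # near-1.96 bound at the final look
```

Note how the final-look boundary sits just above 1.96, which is exactly the "small price" described above.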
Building an RStudio dashboard to share the methodology
I wanted to make building a sequential test plan as simple as possible to abstract away the statistical complexities so that anyone at my company could use it. To do that, I designed and built an RStudio Flex dashboard to make creating a test plan as simple as filling out just a few data points.
Here you can see my flex dashboard for my sales tax test, pre-filled. We included the baseline conversion rate of our primary metric, 7.03%; the power we wanted, 80% here, which is an industry standard; and a statistical significance level of 5%, also an industry standard. And here we included our big-swing requirement of 100%. Finally, we included the optional value of our weekly traffic to help translate our sample size requirement into the amount of time we would need.
Up to this point, these are all values you would include in a normal A-B test plan sample size calculation. The only additional value we needed to include was the number of peeks we wanted to take during our test. Since our normal A-B test already had an estimated time of two weeks, we only wanted to add one checkpoint, to account for normal weekly behavioral fluctuations. Therefore, here we set the number of peeks to just two.
At the top, we have three key numbers available: the number of peeks, or two in this case, that we've set; the total number of samples we would need; and finally, the total amount of time it would take given the number of samples and our average weekly traffic. The latter two numbers are based on a normal A-B testing sample size calculation. Here, our table shows us the breakdown of samples needed, number of days, and the lower and upper z-score thresholds for each checkpoint, given all of our inputted values. Finally, our threshold graph illustrates how our z-score threshold changes with each checkpoint.
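Under the hood, a back-end for a dashboard like this can be sketched in a few lines of base R. This is an illustrative reconstruction, not the dashboard's actual code: the column names, the 1.977 O'Brien-Fleming constant, and the weekly traffic figure of 300 samples per cohort are all assumptions.

```r
# Sketch of a sequential test-plan calculation from the dashboard inputs.
baseline       <- 0.0703   # baseline conversion rate of the primary metric
mde            <- 1.00     # 100% relative minimum detectable effect
power          <- 0.80
alpha          <- 0.05
weekly_traffic <- 300      # assumed weekly samples per cohort
peeks          <- 2

# Total samples per cohort from a standard fixed-horizon calculation.
n_total <- ceiling(power.prop.test(p1 = baseline,
                                   p2 = baseline * (1 + mde),
                                   sig.level = alpha, power = power)$n)

# One row per checkpoint: cumulative samples, days, and z thresholds
# from the approximate O'Brien-Fleming scaling C * sqrt(K / k).
k <- seq_len(peeks)
plan <- data.frame(
  checkpoint = k,
  samples    = ceiling(n_total * k / peeks),
  days       = ceiling(7 * n_total * k / (peeks * weekly_traffic)),
  upper_z    = round(1.977 * sqrt(peeks / k), 2),
  lower_z    = round(-1.977 * sqrt(peeks / k), 2)
)
plan
```

Wiring inputs like these into a flexdashboard (or Shiny) form is what lets a non-data scientist generate the same table without touching the statistics.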
Ultimately, by using this dashboard and sharing out my test plan, I was able to go from an earlier test needing one million samples to one needing only 294 samples, taking half the time of a normal fixed-horizon A-B test. Instead of taking three months to bake, we only needed to wait one week. Instead of needing the full bake time, we had multiple checkpoints to spend exactly the right amount of time to guarantee statistical rigor without wasting any. And finally, most important of all, we removed the temptation to cut corners, and could instead guarantee statistically rigorous results at shorter time points.
Three steps to run better experiments
So how did we get there? As a reminder, there were three simple steps you needed to take to be able to achieve these goals for yourself. One, remember to take big swings. Don't take a minimum detectable effect that's too small. That's a huge lever to be able to make your experiment shorter right off the bat. Two, use sequential testing. Now that you know how, it can be simple to just implement an overlay on top of your own A-B tests. And three, use an R Shiny Flex dashboard or your own tool of choice to be able to demonstrate how to do this to everyone in your organization and share the goodness that is sequential testing so that everyone, even non-data scientists, can run shorter, more statistically rigorous tests.
Now, with just those three tips and tricks in your tool belt, you can handle early significant results, an antsy boss, and business constraints, because you have all the tools you need to run your tests faster and reliably, so that everyone in your organization can feel confident in your results.
