Using paired programming to have fun & sell your solutions (Kris Fabick & Kristin Carr)

Transcript

This transcript was generated automatically and may contain errors.

All right, great, thanks for coming to our talk today. We're gonna talk about self-sufficient deployment and how you can use paired programming to have fun.

Okay, I know we just had a good lunch. We'll have a slow start. Let's start with our imagination a little bit. All right, so you hear the crunch of the gravel as you pull into a makeshift lot in the foothills of the Appalachian Mountains.

You look out and you see the billboard. There it is, the course map. 25 obstacles over 12 miles. Seven of those obstacles are mystery obstacles. And the course is gonna take place over 1,600 feet of elevation change. This is exactly where Kris and I, two data scientists from Nissan, found ourselves one crisp fall morning. We came ready to take on an intense obstacle course race, but what we actually got was a real-world lesson in paired programming.

How? There's actually a lot more in common than you'd think. So the 25 obstacles over 12 miles took a lot of preparation. We call this stability in our data science projects. The mystery obstacles are arguably more important: that's the adaptability to the problems or obstacles that come up. The elevation change, that's the backdrop of every race, of every problem. It's the difficulty, and you need to come with the right mindset.

So sometimes that's just getting started, like we did with our wristbands. So at the end of the day, we found that in projects at work, as well as in obstacle course races outside of work, we worked better together than alone. I'm gonna hand it over to Kris to talk about our Spartan races at work.

The case study: self-sufficient deployment at Nissan

So I'm gonna tell you about a case study that we did at work at Nissan. But the details of the case study aren't important. We're just using it to show you that we actually really did use paired programming in a real project. So like Kristin said, we work at Nissan, and we both work in supply chain. So sometimes we have to bring in parts, we have to bring in shipping containers, and we were approached with a problem of predicting a thing called monthly container fill ratio.

And container fill ratio is just the KPI we use in supply chain, and it's the simple ratio of the space the package takes up divided by the full space in the container.
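As a rough illustration of the ratio just described, here is a minimal sketch. It is not the speakers' code: the function name, the choice of cubic meters, and the sample values are all assumptions made up for this example.

```python
def container_fill_ratio(package_volumes_m3, container_capacity_m3):
    """Fraction of a container's capacity taken up by the packages inside it.

    Hypothetical helper: the name and the units are assumptions,
    not something taken from the talk.
    """
    if container_capacity_m3 <= 0:
        raise ValueError("container capacity must be positive")
    # Space the packages take up, divided by the full space in the container.
    return sum(package_volumes_m3) / container_capacity_m3


# Made-up example values, not Nissan data.
ratio = container_fill_ratio([10.0, 20.0, 30.0], 80.0)
print(f"{ratio:.0%}")  # prints "75%"
```

The monthly figure the team set out to predict would be built on values like this one; how they aggregated it is not described in the talk.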

So we needed to deploy this self-sufficiently. We are a business team of data scientists. We are not an IS team. We were not gonna hand this off to an IS team. We're deploying this solution to some other people in supply chain, and we needed to be able to do that self-sufficiently. If you've ever tried to deploy a project self-sufficiently, you may have found that there are these two competing project components that you have to figure out.

There's reactivity. You need to be able to respond quickly enough to solve the problem before the problem becomes obsolete. And you also need to have stability. So you kind of need to go slow enough so that when you deploy it, it can last over time.

Let's take a look at this first problem. Solutions need to be reactive. So when we're developing projects, we need to not develop like this. We don't wanna wait until the very end to have a working solution. We wanna make sure that we're actually having little tiny working solutions as we go. I think everybody agrees this is the right way to develop. I'm gonna argue this is also a deployment type.

So each part of this deployment, we wanna have something working from the beginning, right? So we have a skateboard and it can actually work. It may not be your full project. You're gonna add a component to it and then it's gonna be working and you're gonna keep going like that.

In our case study, right, that simple machine learning problem that we were gonna predict, this may have looked like in the first stage we have developed a few models and then we go back to the business and say, hey, do you want the model that's most interpretable or the model that is the most accurate? And then we can go forward and now we know that piece and we can add some handlebars to it and go forward with the project. So you may be starting to see how working with a second programmer is starting to benefit your projects.

The second problem that deployments often have is that solutions need to be stable over time. So I'm currently teaching my daughter how to ride a bike. If you've ever done this, you understand the importance of support to develop that stability.

At the end of the process of teaching her how to ride a bike, the goal is that she can ride by herself, right? I know how to ride a bike, she can ride a bike, either one of us can ride the bike.

Stability requires sufficient support or else everyone is angry crying. She was very happy she could be in my presentation and she did not mind me sharing this unflattering photo of her.

This is true not only in learning to ride a bike, it's also true in data science projects, right? So for our project to be deployed, this simple container fill ratio (CFR) prediction, we needed to be able to walk away from it and go on vacation or come to posit::conf and speak to you. And so we needed stability in that deployment so that we could do that.

So the biggest thing to take away is pairing is a mindset. It's not a technique.

The next thing is if I could do every project with my work bestie, of course I would, but friction will find you no matter what, and that's not bad. We've heard in other talks that that is what helps creativity and makes better solutions. So embrace it, roll with it. I mean, of course, be respectful, but it's a good thing.

And finally, slow down to speed up. So you've heard a lot about refactoring. This is also like when you're working with your teammate and you're talking through problems, take that time so you both fully understand it and move forward with the progress.

So the second is know your strengths and adapt for your weaknesses. So for our particular project, our skills were very isolated, and in fact, vetiver was outside both of our circles of knowledge. After the project, we were both experts on everything. So now you see where that reactivity, that stability, that paradox of delivering a solid project comes in.

And then finally, the final lesson is create this checkpoint. So you've made this progress. How do you save it? And so refactoring is really important. This is your code-level checkpoint. Make sure that your code is good. Take out all the leaks so that you don't have that catastrophic damage way down the line.

So let's take that code and let's expand it to our team, our pair programming team, right? So let's document this together. We're gonna create standards. For us, this looked like a readme. We created a standard readme, and that's all those quick facts that you need to know about the project, all of the things that you've discussed documented in one place. So if I'm on vacation, the other person can work on it.

And in fact, let's expand that to our team. So now that we have a readme, not just the two of us that worked on the project, but our whole team can support this. So now you see where those iterative approaches in development can happen quickly, and they're also reliable.

All right, so have we convinced you? If not, here are some more resources. So the slides that we went through today are here in our QR code. At the end of those slides is the actual raw code, in case that is what you came here for, to look at the actual app that we built or the model we built behind it using tidymodels and vetiver. We also have links to the article about pair programming. So the review article we talked about earlier, as well as some other articles that we used to help bolster our argument for pair programming. The readme template that you saw briefly, it's there on this site as well. In the technical details, we've kind of called out a few different steps that we've broken down, in case any of these are of interest to you.

So at the end of the day, it might get a little bit messy, but we really encourage you to give pair programming a try for any of your self-sufficient deployment projects. Thanks.

Q&A

We have time for a few questions before our next speaker. So the first question from Anonymous. Why is paired programming only 15% more time? My intuition is it would be 100% more time.

Well, when you're working with someone else, you're able to go quicker, right? Like somebody else is gonna know maybe something that you don't know, and you're able to debug a little bit quicker. So that's an understandable question.

You know, and we didn't do the research. We're just reporting what the research said. But yeah, it's a good question. But we do definitely experience that you can code quicker. It's kind of like having an AI assistant, right? It makes your coding quicker. Having a real-life assistant also makes your coding a little quicker.

Did that hold true for you, the 15% savings?

It definitely did. I mean, I think this whole project was actually, I don't even think it took us 15% more time because we did that unstructured asynchronous approach. We did have to come back and do that time to refactor, but to be honest, the refactoring doesn't take as long as the initial coding, right? So I actually think we were quicker than that. But yeah, I would say it definitely held true for us.
