Resources

David Sluder @ Institute of Nuclear Power Operations | Data Science Hangout

We were recently joined by David Sluder, Data Science Sr. Program Manager at the Institute of Nuclear Power Operations, to talk about what he's learned from helping build out a new data science capability in a nuclear power organization.

Bio: David is a data science senior program manager at the Institute of Nuclear Power Operations, a non-profit company that sets standards for safety and reliability across the US nuclear power industry. A long, winding road led him to his current position, having worked as a database administrator, engineering educator, event organizer, and glass bead maker. He enjoys solving complex technical problems, being a champion for open-source software, and finding opportunities to laugh and connect with others.

To join future Data Science Hangouts, add them to your calendar here: pos.it/dsh (All are welcome! We'd love to see you!)

Nov 21, 2023
58 min

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Happy November, everybody. Welcome to the Data Science Hangout. I actually just shared our 100th Data Science Hangout recording to YouTube yesterday, and it made me realize that we never actually celebrated that. So thank you all for being here and making this space what it is. I can't believe we're already over 100 Data Science Hangouts. If this is your first time joining us, hi, so nice to meet you. I'm Rachel. I lead customer marketing at Posit. This is our open space to chat about data science leadership, the questions you're facing, and what's going on in the world of data across different industries.

We're here every Thursday at the same time, same place, so if you're watching this recording on YouTube later and you want to join us live, you can use the link below to add it to your calendar. Here at the Hangout, we're all dedicated to making this a welcoming environment for everybody. We love to hear from everyone, no matter your years of experience, your title, your industry, or even the languages that you work in. It's totally okay to just listen in here. If you're out walking the dog, or maybe you're at lunch, it's okay to just listen in.

But there are also three ways to jump in and ask questions or add your own perspective. You can raise your hand on Zoom, and I'll keep an eye out. You can put questions in the Zoom chat, and just put a little star next to it if it's something you want me to read out loud. And lastly, we have a Slido link where you can ask questions anonymously too.

And thank you, Curtis, for sharing that there in the chat. Sometimes I forget to say this, but if anybody is hiring, feel free to share any open roles in the chat as well. That's definitely not spammy at all to me. I think it's great to share those jobs in the chat too. But with that, I am so excited to introduce my co-host for today, David Sluder. David is data science senior program manager at the Institute of Nuclear Power Operations. And David, I'd love to have you just introduce yourself a bit and share a little bit about your role, but also something you like to do outside of work too.

Introducing David and INPO

Sure. Thank you, Rachel. And good morning, good afternoon, good evening, depending on where you are. I'm really excited to be here. I've got two disclaimers I have to start off with, though. Number one, I just want to publicly say I'm here to represent my own opinion. I am not here to represent my employer, the Institute of Nuclear Power Operations, or the broader nuclear industry. Of course, I'm going to talk about them to give the conversation some context, but this is purely my opinion. And the second piece is that I really need to make sure everyone knows that anything I talk about today is not just my work. I work as part of a broader team, and the team has worked so hard to build out data science at INPO. I work with a bunch of smart, passionate people, and it's really important that they get a lot of credit here too.

So with that said, my name is David Sluder. I'm a data scientist here at INPO. I've been at INPO for almost 12 years now, only about two years in the data science space and about 10 years in the IT space. It's a really interesting company, and I didn't know anything about INPO until I started to work here, so maybe I'll give you a briefer on what we are and what we do. INPO is an independent nonprofit that is funded by our members, and our members are the nuclear power industry. Within the US, I think we have 54 nuclear power stations, and they all pay us a member-based fee to perform our service. And our service is that we set and assess safety and reliability standards for the nuclear power industry. So we set these standards, we send out evaluation teams to determine how well the different stations are adhering to these standards, we give them a score at the very end of that, and we do that on a regular cadence. Along with that, we also facilitate the sharing of knowledge across stations, which is really interesting in an industry where you have different companies competing with each other but also sharing information with an independent organization, which allows them to share what they've learned so that we're all able to learn from each other's experiences, mistakes, challenges, that sort of thing.

I've been doing data science for about two years now. Before that I worked in IT, but my background, my bachelor's, is in humanistic psychology, which is kind of the most qualitative thing that you could do. So it's been a fun road to figure out how I got here. Outside of work, I like to do a lot of things. I like to read, I like to laugh, and I like to travel. This past weekend I was able to indulge all three. Posit Conf this year was in Chicago, and it was the first time I had been to Chicago with any chance to explore it. So I got back home and told my wife, Katie, hey, we need to go to Chicago. We went to Chicago this past weekend, and we saw our favorite comedian, we went to see some improv, we found kind of a punk bookstore, which was a whole lot of fun, and we got to do a lot of wandering around the city. It was an awesome trip, and at this point I recommend Chicago to anyone, especially the weekend before Halloween, because then you just get to ride around on the L train and watch everyone's costumes.

Moving from IT to data science

Well, David, thanks so much for the intro and sharing a little bit about your background there. I know when we were chatting, you shared some more info on how you had previously worked in IT as well. And so I was just curious to kick off the conversation with that because it's something that comes up quite a bit in this space, but having moved from IT to data science, I know you have probably pretty unique perspectives on both sides. What would you recommend to some of us who might be struggling to communicate across those lines?

That's a good question. A challenge that I hear in the Hangouts, and also in conversations I've had with other people, is that a lot of times you have these challenges and it really just boils down to building a good relationship. That can pan out in a lot of different ways, and it's really dependent on your organization, how it's set up, and the general size of it. The way that we did it was, we built data science not quite from scratch; we had one organization at INPO that did some sort of data science-y work, and then our senior leadership team identified it as a strategic priority for the company to actually build out that capability. So there was a lot of emphasis and a lot of focus on it, and that was right after I had moved into the data science organization. So we knew that data science and IT needed to work together because, I mean, every single project that we work on has an IT component to it.

So some of what we did is we started having meetings between data science leadership and IT leadership on a regular cadence, just to talk about what the roadmaps look like, what projects are on the horizon, what challenges there might be, and to put everything out in the open as much as you can and have those conversations. On a more personal level, I think it's really important for the two groups to learn more about what each group does. So sit down with a network engineer and understand why they might be a little cautious about opening up their infrastructure to all of the packages on a repo, or have a network engineer sit down with a data scientist and understand why that's a challenge for them. It really, to me, just kind of comes down to having conversations and being open and honest and transparent and figuring out a way to move forward together.

It really, to me, just kind of comes down to having conversations and being open and honest and transparent and figuring out a way to move forward together.

That's great. How did you actually first start those conversations?

Well, data science became an emphasis for INPO because the way that we were doing business was changing. We go out and evaluate stations every two years; that's been the regular cadence for about 30 years. But a lot can happen in between those two-year assessments, right? So we understood that we needed to get into more of a continuous monitoring sort of fashion. And we don't want to go out and evaluate more often, because it's a really costly thing to set up an evaluation team and send it to a station. So instead, it meant that we needed to figure out a way to use the data that we have to do some modeling, some analysis, and try to understand what their quote-unquote performance looks like in between those evaluations. So that was where it landed at a strategic level, and then it became our job to actually implement that. And I was real lucky, having recently moved right from IT to data science and being able to bring that perspective to both sides; I think that put us in a really good situation. But I know that's a rare situation to be in, too. So, yeah, just sitting down, putting your chairs in a circle, and saying, what's going on? What can we work on? How do we figure this all out together? That's just kind of the right way to start.

Humanistic psychology and data science

Yeah, thank you. Russ, I see you asked a question in the chat. Do you want to jump in here?

Yeah, thank you. I have a master's in counseling. I love humanistic psychology. I could talk to you about that all day. Does that help you in data science? If so, can you say something about that?

I thought about that in preparation for this talk, and the more that I thought about it, the more threads I could see. A lot of data science, and a lot of working in an organization in general, is working on relationships and communication. And actively listening: that was something I remember taking a class on when I was in college, and I feel like I still try to use those skills to this day, where you're actually sitting there in front of a person, you're hearing what they say, and you're thinking about what they say instead of figuring out what you're just going to say next. So having a real active-listening sort of conversation. And then understanding that people are people, and we can be inconsistent and hard to understand, and really all it takes is maybe a little bit of openness and kindness to give space to the sorts of conversations that need to happen.

The sorts of crucial conversations that we have. "Crucial conversations" is a term that we use at INPO for these sorts of intense conversations that need some sort of resolution. In my opinion, it's about being open to that and understanding that there are going to be some emotional reactions, and that's A-OK because we're all human beings. But it's important to not just react to them and fan the flames. You need to figure out what the source of the issue is and address that.

Domain expertise and nuclear industry metrics

Love that. Thank you. Alan, I see you have a question here. You want to jump in?

Yeah, sure. Hi. Hey, David, I'm really curious. My instinct is that performance metrics in nuclear power are really, really specific, like a super specific domain. If that's the case, I wonder how that domain expertise comes into play in the stuff you do day to day, or if maybe that's the wrong assumption and really the metrics are around efficiency and people metrics, things that aren't really industry-specific. I wonder how specific what you're doing is to that industry and how you operate in that kind of environment.

That's an excellent question. On the industry metrics, we get hundreds of data points from every station every month. And because every station can be a little different, you've got to go through some sort of normalization scheme. So we have these indicators that are a way that we normalize the data, and then that's what we can actually do some modeling on. And I have learned just how little I know about nuclear power by moving to the industry side of things and needing to understand exactly what every label, every acronym means. It'll take a really long time to get to that point, and luckily I don't need all of that to do my day-to-day job. But whenever I'm working on a really important model, I've got to make sure that I'm not making some really bad assumptions about it. So that means I've got to sit down and talk with my manager, director, or some other subject matter experts in some of the functional areas here at INPO, just to make sure that whatever I don't know isn't going to actually bite me sometime.

Introducing data science tools to the enterprise

I know you mentioned data science is relatively new for INPO. How have you been successful in introducing data science and data science tools?

Great question. Introducing a new tool to the enterprise is a really challenging thing. You can't just say, here are some model results, and expect everyone to understand what that means. So one of the changes that we've made, and it's actually part of the governance that we've written, is that anytime a new tool of some sort is delivered, we have to figure out a way to explain it or interpret it, maybe with some other tool or some kind of help function. One of the things that we've learned is that delivering model results that require some subtlety and understanding is hard to do for people that don't use them on a real frequent basis. So we've added a lot more help functions into the tools that we build, the ways that we expose these model results. And then we've also built some other tools, I've got to talk generally here, to help you understand why the model is saying what it's saying. And we've had a lot of success in thinking about things from the end-user perspective and not the data science perspective, to help us figure out exactly what we can do better to make sure that they use the tool correctly and get the insight that they want out of it.

Yeah. So what I alluded to earlier is that the impetus for data science at INPO was trying to understand performance in between the assessments every two years (biennial, biannual, I never remember the word). We get data from the industry every month, and we have a model that helps us gauge that sort of performance in between the assessments, right? The assessments are kind of our gold standard and the way that we know if the model is right or wrong, but it gives us an estimate of their performance in between. And then, from there, there are other things where you can maybe come up with some risk models to see if something has a high probability of happening. And we obviously don't want it to happen, so we want to figure out that risk and try to get ahead of it.

So, data science at INPO: our department is actually called data science, and it's composed of three groups. One is data visualization, one is data analytics, and that's where I live, and one is data quality, or data management. So whenever I build a model, I'm just kind of having the results end up in a database somewhere, and then we rely on our incredibly capable data visualization group to actually build the visualizations that deliver it to an end user. No one ever really gets those raw results out of the database; they get something within a broader context to help make sure that they are interpreting it correctly.

Regulatory environment and data sharing

That exists within the nuclear industry, but it doesn't affect INPO. So there is, I remember from when I worked in IT, a group called NITSL, N-I-T-S-L, the Nuclear Information Technology and Strategic Leadership group, which is kind of a nuclear IT working group. And they have a focus on quality assurance, because there are very strict regulations that go into how you validate that a software system is going to do what you say it's going to do, that it's going to work correctly, and that there's no way it can get it wrong. But that mostly affects the safety systems at a nuclear station, as far as I understand; this is obviously not my wheelhouse. Because we at INPO don't work directly with supporting the safety systems at a nuclear power station, we don't really have to go through that very, very rigorous process.

Sure, yeah. I was just curious, the data that you're producing from the assessment, is that ever shared with other federal agencies?

We do share some of the content from the reports. The one place that we share the report, I believe it's the report, but it's definitely the assessment number, the assessment score, is the company that issues insurance to the nuclear power industry. So our assessment score is directly linked to a nuclear station's insurance rate, which makes it very important that everything is right. We share some of what we call operating experience with some other groups, but I don't think that we share the actual report with them. So it's a very tightly guarded thing that we share with the senior leadership at a station, and we share at least the assessment number with the insurance group. But for the most part, we hold on to the actual report findings, because there are a lot of ways that sharing that can go wrong.

Rolling out Posit Team and learning best practices

David, I feel like this might be a helpful conversation to open up with the group. I know when you and I were chatting at Posit Conf, we were talking a bit about how when you rolled out Posit Team, it would have been helpful to know a bit more about what some other teams were doing with the products, to get ideas and share use cases with each other. I don't know if you want to expand on that a bit and maybe ask the group too.

Yeah. Okay. So this is kind of my question for the group. We recently rolled out Posit Team, and the modeling work that we've done at INPO has been a very particular kind of modeling work, where you build a big model, you deliver some results, and then it gets sent to a visualization. And that's mostly it. But I know that Posit Team and other platforms offer a lot more capabilities than that, beyond just building a model and putting results in a database. And coming relatively new into this space, it's been hard to figure out, what are the different ways to use it? What are the things that I don't know, first? And then what are maybe the best ways to solve certain problems that I do know about? At a real technical level, we have a lot of Microsoft SQL Servers, and trying to use those with Posit has some challenges. So what are the best ways to solve that? I know that Posit has a lot of really good documentation online, but I'd love to hear from people that have actually solved this problem and learn what they've learned in the process, if that makes sense. So where would someone go to learn these best practices or have this sort of conversation outside of a weekly data science hangout?

One of the things we had a lot of success with in a former life of mine was when we deployed reports, and a lot of these reports were meant to monitor model scores over time. This is not helping your Microsoft issue, I'm sorry. But in terms of use cases in general, we would do things like query the back end. So we could run a report every day or every week, perform a query, bring in new data that we haven't seen yet, run that through our models, and get some model scores. And then we could track those model scores compared to yesterday's, last week's, whatever it is. And then we could use blastula to send out alerts, and we could essentially create a custom report based on what we were seeing. And that could be sent out to certain individuals associated with the account in question, that sort of thing. So it's a nice, handy monitoring tool.
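For anyone who wants a starting point for that monitor-and-alert pattern, here is a minimal sketch using blastula. The table, threshold, and email addresses are all hypothetical placeholders, not details from the conversation; it assumes the code runs on a schedule, for example as a scheduled report on Posit Connect, and that SMTP credentials were saved ahead of time with `create_smtp_creds_file()`.

```r
library(blastula)

# Hypothetical scores produced by the model on this run
scores <- data.frame(account = c("A", "B", "C"),
                     score   = c(0.91, 0.42, 0.77))

# Flag anything below an assumed threshold of 0.5
flagged <- scores[scores$score < 0.5, ]

if (nrow(flagged) > 0) {
  email <- compose_email(
    body = md(paste0(
      "The following accounts dropped below threshold:\n\n",
      paste("-", flagged$account, collapse = "\n")
    ))
  )
  smtp_send(
    email,
    from    = "alerts@example.org",   # placeholder addresses
    to      = "team@example.org",
    subject = "Model score alert",
    # credentials file previously created with create_smtp_creds_file()
    credentials = creds_file("smtp_creds")
  )
}
```

Because the email only goes out when something is flagged, the scheduled report stays quiet on normal days, which matches the "alert only on change" behavior described above.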

I could pop back in, but yeah, David, I don't know if you're using dbplyr. Oh yeah. Yeah. Okay, so you know about that. That's something where I'm surprised how many R users at my company don't know of dbplyr, and so they're creating SQL files, or they're doing things in like a 2010 view of the R world. And that's okay, because a lot of them are more like proper statisticians, and DevOps and MLOps isn't really in their wheelhouse. Are you using, well, I assume you're using some kind of git-backed platform like GitHub or GitLab. Something we've recently been exploring with Posit Connect is setting up a lot of CI/CD pipelines to make the whole deployment of content hands-off across different Connect instances. So if you've got different dev, test, or production instances, we're trying to create a process of rolling everything up through CI/CD, and Posit has excellent documentation on this stuff. So if you're not doing that, I'd definitely recommend something like that too. And it plays nicely with dbplyr also.
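For the SQL Server question specifically, a common pattern is DBI plus the odbc package underneath dbplyr, so dplyr verbs get translated to T-SQL and run on the server. A minimal sketch, assuming an installed Microsoft ODBC driver and placeholder server, database, table, and column names:

```r
library(DBI)
library(dplyr)
library(dbplyr)

# Connect through an ODBC driver; server and database names are placeholders,
# and credentials come from environment variables rather than being hard-coded
con <- dbConnect(
  odbc::odbc(),
  Driver   = "ODBC Driver 18 for SQL Server",
  Server   = "sql.example.org",
  Database = "indicators_db",
  UID      = Sys.getenv("SQL_UID"),
  PWD      = Sys.getenv("SQL_PWD")
)

# dbplyr translates this pipeline to T-SQL; nothing is pulled into R until
# collect(), so the filtering and aggregation happen on the server
monthly <- tbl(con, "indicators") |>
  filter(report_month >= "2023-01-01") |>
  group_by(station_id) |>
  summarise(avg_value = mean(value, na.rm = TRUE)) |>
  collect()

dbDisconnect(con)
```

Calling `show_query()` on the pipeline before `collect()` prints the generated T-SQL, which is handy for debugging translation quirks against SQL Server.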

I guess like kind of even at a higher level. So we have this platform, you know, this weekly hangout that we could use to, you know, ask questions if it's the right context, but where do y'all go outside of this to learn about the right ways to do things or people's experiences with different ways to do things?

Yeah, for me personally, I feel like I'm asking questions all the time in public places, internally at least. I mean, I say public, but we have so many Teams channels around data and data quality and data science and analytics and all this stuff, and I'm pretty shameless when it comes to asking a bunch of questions in these things. At the same time, I feel like I'm constantly helping other people, so there's give and take. But I think for me, it's a lot of on-the-job exposure and experience, asking people stuff that maybe I wouldn't be able to ask publicly, and just working together to get through some of these hurdles. I mean, every team has really unique challenges. They might be using R, but they're trying to map to some network drive or something, and it's like, well, we can't really do that, or not as easily, but they want to fix it. Sorry, I feel like I'm hijacking the conversation here, but internal networking, I guess, is how I would summarize my comments.

That really helped a lot for us. And I think Catherine from my company is on here too; I don't know if she has any additional comments.

Yeah, Javier is totally shameless about asking tons of questions and everything. But I do think you're right in asking where you go to get best practices, because internally your team, or your company, has a way that they do things. So getting exposure to the ways that other companies do things, knowing what else is out there, can be really difficult. I don't personally do too much research, because I try not to work outside of work. But that does make it difficult to figure out what else is out there and then bring those ideas back to the conversations, if you're setting up those data science and IT conversations, because you don't know what they are. Yeah, I don't know what I don't know.

Yeah, the challenge for us is that we at INPO are pretty new to this, so there isn't that sort of institutional knowledge that we can reach out and question other people about. At this point, it's mostly talking to other people in the nuclear industry, and we do have a data science working group that we've set up at INPO, across the nuclear industry, and we're starting to get some good traction with that. But even past that, it's good to have conversations with people in different industries, just to understand how they might think about things differently.

NLP use cases in the nuclear industry

I'll tell you the little that I know. I know that some places have set up their own private instances of ChatGPT, because nuclear data is tightly regulated, so you have to be really careful about what infrastructure it lives on. Generally speaking, we're going to want to keep things on premise, in our own domains, rather than use something on Azure or AWS, if we can. So people have set up private instances of ChatGPT to try to understand how it could be used for their organizations. And there has been some success, but as you all probably know, not everything that comes out of ChatGPT is correct, even if it sounds like a good, well-formed sentence.

And then stations have something called a corrective action program, or CAP. Generally speaking, any time there is an event, something goes wrong, whether that's someone tripping on the sidewalk walking into the station or a reactor scram, there is a report written up about it. And for a single-unit station, I think there are probably 600 people working there, which means there are a lot of these reports that get written. At a certain point, you have a whole lot of text data and not enough people, or the right technology, to actually trend it and track it well. So there's a lot of interest right now in understanding how to use this history of CAP records to understand the behaviors that might influence performance. So those are the two big use cases I've heard so far.

Hiring for data science roles

Well, actually, let me shift gears for a second. I know when we were chatting before, you were talking a little bit about how you're hiring for a role, and I thought it might be good to discuss a little of what you've learned recently in trying to fill a data science role too.

It's challenging, as a lot of you probably know. We did actually fill the role. The person hasn't started yet, but everything's been signed, so we're good there. And I feel like I learned a lot about what it takes to hire a data scientist in that process.

We found that we got a lot of success using recruiters versus just having an open job posting, because the market is pretty well flooded. And we wanted someone that had some particular natural language experience. So instead of going through the mountain of resumes, it was a little easier to have people help us find candidates who were looking and who had the right sort of resume. And then from there, going through the technical screen to make sure they had the right understanding we were looking for. And especially when I don't know enough about natural language processing to put it on my own resume, but I'm trying to hire people who have that experience, you've got to do enough research so that you can ask the right questions. And that was pretty interesting. That was a lot of fun, to be honest with you.

But it was surprising to figure out exactly what you needed to look for on a resume to get the right level of experience, because having some spaCy or some Hugging Face experience doesn't necessarily mean that you are right for the role. And the thing that I think surprised me most was when we started looking for people who didn't just have a master's degree in data science or analytics, but who came from what I'm calling more applied roles, where you were getting a master's or PhD in something besides those particular fields but had to use machine learning, data science, or NLP in order to actually achieve that PhD or master's. I found those conversations to be surprisingly useful, because this person is not just someone who went through a master's program in analytics, where for the capstone it was, okay, you get to figure out how to put all of this together and then do it on your own. People with that applied perspective on data science have already been able to do that, and it was really useful to suss that out in the process. So that's something I'd recommend: maybe expanding the pool of applicants past just those people that have the master's degrees in analytics and data science that I think we've probably all seen, or have ourselves.

Building skills and putting models in production

Hey, David, thanks for the time. What are you currently working on to build your skill set? I'm thinking more on the technical side; obviously you're doing a lot on the soft skills. But what are some things, maybe stuff that has historically been in your industry or some other things outside of your industry, that you're working on that's going to advance your skill set?

So I got to learn a lot about Linux, little tongue twister, because I was the one that got to do the Posit install earlier this year. That was interesting, challenging. I know a lot more about infrastructure now than I ever did; I'll file that away for a rainy day. On the data science side of things, I am, where's my book? I'm reading through Max Kuhn's book, Applied Predictive Modeling, and just trying to wrap my head around the different sorts of models and approaches, things like cross-validation and statistical machine learning that we'll need, because we tend to use smaller data sets at INPO. We don't have anything that I would call big data at the moment, so it's a very particular flavor of data science and machine learning. So I'm reading through that, and then trying to figure out different ways to stay up to date with new models and new algorithms, finding the right papers to read to keep up with those sorts of advances. I haven't really found a great place that centralizes all that information yet, so if anybody else has some ideas about where to look, let me know. I think I learned about R Weekly at Posit Conf this year.

An anonymous question from a bit earlier was, what are the biggest challenges you face when putting data science solutions in production?

So, to be honest, we haven't faced a whole lot of that yet, because we are still pretty fresh. My brain goes back to the way that we created a model that estimates performance in between evaluations and delivered those results. That model is a tightly guarded secret; I consider it probably the most critical piece of intellectual property we have at INPO, because if anyone finds out what goes into that model, there's no way to build a better one. So delivering those results meant that we couldn't tell people what goes into the model, which means that when a station's value changes, they, of course, want to know why. That's just a natural thing to ask. And we didn't really give people a great way to explain that at a general level, so every time we would run this model, we'd get this mountain of emails asking why this and that changed. So last year, we built a tool to help them interpret that score and the changes to that score. And then we built this tool that helps give context to the changes. And now it's kind of crickets every time we publish the model results, which is good news for everyone.

And then we built this tool that helps give context to the changes. And now it's kind of crickets every time we publish the model results, which is good news for everyone.

Past that, we're going to be building more tools, models, and APIs, but we haven't done it yet, so I'm not really sure what the challenges are. If anyone has any lessons learned, I'm so ready and willing to hear what y'all have learned as part of pushing out Posit Team or any other sort of data science platform.

Estimating ROI for data science projects

Yeah, sure. Hi, everyone. So basically, I'm trying to relate this to something I'm currently going through at work, where we just set up an analytics team. Before the business is willing to commit any time or resources to help us with a project, we keep getting this ask to estimate the ROI we expect from it. So I'd like to ask you for ideas: how do you approach estimating the ROI from a project before you even have the resources or the buy-in to start looking into it? And a follow-up question: once a project is implemented, how do you look back and estimate the ROI from a project you've implemented already?

So this is a really challenging question for us, because we are a nonprofit with a mission to make sure that the industry performs to the highest standards of excellence that we can set. Coming up with an ROI, it's really hard to quantify that. We have different ways that we do it, but I don't really think they're generally applicable to what you're talking about from a pure project management standpoint. Ideally, I feel like what you would want to do is put together... I was going to try to answer it well, but I don't have a good answer for you. It's just not something that we deal with in the same way that I think you do.

I can share one from another hangout we had, with Natalie O'Shea at BetterUp. Natalie shared a bit about how, when she was going through this process of getting approval for Posit Team, she focused in on one specific problem that the team had. The use case she gave was their consulting organization: their sales team had to make these PowerPoints over and over again, with new data and based on different industries, and actually making a great deck for a presentation was taking hours and hours. So they sent out a poll to all of the sales reps to ask how much time they were taking building these, and they actually put a dollar amount to it; I think it ended up being something like $1.2 million a year in people cost. And then she showed how she could automate that with a Shiny application: the sales team would go to the Shiny app, put in whatever they needed for the customer, and automatically generate the PowerPoint, which was tied directly to their database as well. In using that one example, she was able to put a dollar amount to it. So maybe one tip, when you're building this business case, is to think about one team that you might be able to help, because then they might be the ones joining onto your business case too.
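For anyone curious what that kind of automation looks like mechanically, here is a minimal sketch of the Shiny-to-PowerPoint pattern. It is not Natalie's actual implementation; it assumes a hypothetical parameterized R Markdown file, deck.Rmd, whose output format is powerpoint_presentation and which declares a `customer` parameter.

```r
library(shiny)

ui <- fluidPage(
  textInput("customer", "Customer name"),
  downloadButton("deck", "Generate deck")
)

server <- function(input, output, session) {
  output$deck <- downloadHandler(
    filename = function() paste0(input$customer, "-deck.pptx"),
    content = function(file) {
      # Render the parameterized report straight to the download path;
      # a fresh environment keeps state from leaking between renders
      rmarkdown::render(
        "deck.Rmd",
        output_file = file,
        params = list(customer = input$customer),
        envir = new.env(parent = globalenv())
      )
    }
  )
}

shinyApp(ui, server)
```

The data pull would live inside deck.Rmd itself, so the generated slides always reflect the database at render time, which is what made the manual deck-building disappear in the story above.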

Yeah, thanks for that. Only thing is that I think when it comes to automation of daily routine tasks and activities, it's a bit easier to estimate ROI as opposed to analytics modeling where you're dealing with uncertainty, right? And sometimes you're not even sure if you get anywhere with the time you invest, right? So yeah, it's not an easy task. And I would really appreciate any thoughts or feedback from anyone that's gone through that experience before as well.

Nick, this is Eduardo Castillo, and actually I'm from the nuclear industry; I know Dave, and we're in a working group at INPO. One thing I can offer from a data science perspective is that it's important to go for big targets. One of the big projects we're working on right now is around reducing diving frequencies for cleaning some of our intake structures. That's a very big, $1 million a year project where we had no ongoing data science efforts to improve performance of that evolution. So having a big target, I think, was important, because even if we could cut 5% of that, it becomes a good hard-dollar ROI. And the second piece is looking at things like risk. With an activity like diving, you have a lot of occupational risk, and we're actually able to tie out what the occupational risk is and how it compares to other activities that we do at the plant. So it allows us to capture both the hard savings and also the qualitative soft savings. And again, just starting with a big target, a project that is targeting a big budget, I think is important.

Career advice and looking ahead

So a question that I love to get to ask everybody towards the end is, is there a piece of career advice that stands out to you, whether it's been something that you've received or advice you've given that you'd like to share with us?

I knew you were going to ask this, so I had to think about it in advance, and I had five or six different things that I came up with. The one that I think I want to stick with is: be open to doing things that make you feel uncomfortable. As a human being, but also professionally, that's kind of where the growth happens. If you just do the same thing day in, day out, you're just going to tread water, in a way. But if you lean into the things that make you uncomfortable, in a healthy way, obviously, then that's where you can grow your professional relationships, your technical capabilities, even your mental health, in a way. It's leaning into that and being okay with the discomfort, knowing that you're going to learn something from it.

The one that I think I want to stick with is: be open to doing things that make you feel uncomfortable. As a human being, but also professionally, that's kind of where the growth happens.

Absolutely. Can you share an example with us of where you're doing that?

Being asked to do a data science hangout?

Yeah. Yeah. And the other piece, and I've been a part of them for seven years, is Toastmasters. Working on your communication skills is critical as a data scientist. You can create the coolest model, the next big app, that sort of thing, but if you can't really communicate it well, then you're going to have a hard time getting other people to understand why it's so important. So figuring out ways to hone those skills is really, really important, just as much, in my opinion, as the technical skills.

So this year was focused on building out what we call our data science environment, getting Posit built. And now it's built, it's stable, and we're ready to use it. I'm just excited to figure out all the different ways that we can deliver these really incredible data science products to the industry at large, and also to INPO internally. It's something that we've never really had, and I'm really excited to put it to good use.

Awesome. Well, thank you so much, David, for joining us today. And thank you all for all the great questions and spending your Thursday with us. I'm trying to get better about saying this, but I talk to a lot of people from companies who might not know that other teams within their company are using our tools. So if you are ever just curious and want to chat with us about it, I made a little form for myself so that it was in one place. So feel free to put your name there, and I'm happy to connect you with others within your company too. But thank you again for spending time with us today. I really appreciate it, David.