Novice to data scientist: a pediatric anesthesiologist uses RStudio to help kids access surgical care
When a surgical procedure gets canceled, a child gains no health benefit, families' time off work and pre-op anxiety are in vain, and our not-for-profit children's hospital loses ~$1 per second. To understand cancellation, I needed to analyze thousands of patient records. Despite zero formal training, I learned to tidy and then visualize data, and even to do geocoding and machine learning. Once we identified children at high risk, we could target additional support to their families. Furthermore, we showed that surgery cancellation contributes to health inequality. The RStudio/tidyverse ecosystem allows novices to do sophisticated analytics, and is helping us improve access to health care for the most disadvantaged children in our communities.

Talk by Nick Pratap

Full title: Novice to data scientist: how a pediatric anesthesiologist used RStudio to help disadvantaged kids access surgical care
image: thumbnail.jpg
Transcript
This transcript was generated automatically and may contain errors.
Hi, my name's Nick Pratap. I'm a pediatric anesthesiologist, taking care of kids having procedures done to their hearts. So you might ask, what am I doing standing in front of you all here today at posit::conf? And it's a good question. I'd like to tell you today about my self-guided journey into data science. I'd like to give a shout-out to some influences and sources of help that I've had along the way. I'd also like to offer a little bit of encouragement to anybody who's on a similar path to me, or perhaps to anybody who knows or comes across somebody on that kind of a path.
The problem of surgery cancellation
So the backstory is that in 2011, I came from London to Cincinnati to learn about quality improvement in healthcare at Cincinnati Children's Hospital, which is renowned for that kind of work. They asked me to work on surgery cancellation. I quickly found that about 5% of procedures were cancelled on the day of surgery, and this equated to about 5% of operating time as well, which is important because a lot of the billing is based on time. Through this, we found that the problem of surgery cancellation probably cost about $100,000 a week in potential lost revenue, which is a lot of money. But taking a more human view, it was really difficult and frustrating for families and for staff as well.
I'd just like to take a moment to put surgery cancellation in terms of the Institute of Medicine's dimensions of quality, the framework people often use when they talk about healthcare quality. I didn't know until I came across this paper that about 30% of the global burden of disease needs surgery or other procedures under anaesthesia, either for diagnosis or for treatment. So clearly, if surgery is cancelled, the effectiveness of that procedure is not going to be realised. I touched already on family-centredness, but I'd like to mention a paper from more than 20 years ago from another major children's hospital. They found that many families drive tens or even hundreds of miles to get to their local children's hospital. They often have to pay for accommodation, and they have to get childcare for their other children.
Mostly, they're taking time off work and more often than not, that's unpaid. So when the surgery is cancelled, it's probably not surprising that they're quite emotional and even angry on many occasions. Timeliness, clearly, if the surgery doesn't go ahead and has to be rescheduled, then it's not being provided in a timely fashion. Efficiency, I already touched on that, and the other one that they always talk about is equitability, and at the point when we started this, we really didn't know what the impact was.
Why children's surgery gets cancelled
So why does children's surgery get cancelled? The number one cause is patient illness. So with the kids, it's usually coughs, colds, diarrhoea, vomiting, things that give you a fever, which means it's not safe to go ahead with the procedure. The second cause is no-show. They don't turn up. The third most frequent we call NPO violation. So in order to have safe anaesthesia, you need to have an empty stomach before we begin, and so we give instructions about when they can safely eat and drink and when they have to stop, and if those are not followed, then we can't go ahead with the procedure.
So we did a few across-the-board interventions to try and reduce our surgery cancellation rate. We came up with some new, more colourful instructions for families, with pictures showing what was okay to eat and drink at particular times, and when to show up. We sent out text message reminders. And finally, we set up a process so that if the pre-op nurses, during a phone call a couple of days beforehand, had any inkling that the kid was sick, they could discuss it with our nurse practitioners and try to avoid a day-of-surgery cancellation by rescheduling in advance.
So we tested all these interventions and then rolled them out across the hospital. We were pretty happy that we succeeded in saving over an hour a day, which was about a 17% reduction, and that was good. But I'm a perfectionist, and so I wanted to work out why people were still cancelling. I was also really curious about no-shows, the second most common cause. These were families who didn't arrive and didn't answer the phone when we tried to get hold of them, so we really didn't know what was going on.
Finding data and early influences
A lot of my colleagues had opinions. One of the myths I was told a lot was that dental patients are the worst for not showing up. That was kind of interesting, and actually kind of relevant, because if there was some problem in the dental surgery department, that they really weren't preparing the families properly or giving them the right instructions, we ought to be able to learn from another department that was informing their families better, and therefore be able to help with this situation. I didn't know what to do, but I did feel that we needed data. W. Edwards Deming is the guru of quality improvement. He said, "In God we trust; all others must bring data." So he's the first of my influences.
But what sort of data? Please don't try to read my red slides; they're just to show you the morass of overwhelming information we were trying to deal with. We brainstormed the things in the electronic medical record that could potentially be associated with cancellation, and we came up with a lot, as you can see. We even looked outside the medical record: we wondered about the weather, so we got weather data from the local airport, which keeps good records. I didn't know what to do with all this; it was too much. You may notice my pictures so far have all been in Excel. Microsoft Excel was not going to solve this problem for me.
I had a little bit of experience with computers, back from when I was nine. My father bought me a computer with one kilobyte of memory, and I learned to program in BASIC. From that I have an appreciation of concise and efficient code, but it really was not going to help me too much at this point. Fortunately, I found somebody who was able to help me, or at least his writing. The idea of tidy data was really helpful to me, and I started trying to wrangle some of the data we were getting. This was a long time ago, so I was using plyr, and later on the tidyverse. I didn't use R for Data Science because it hadn't been written then, but I would strongly recommend it to anybody at this point now; it's an amazing book.
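To give a flavour of what that kind of tidy-data wrangling looks like, here is a minimal sketch with invented, hypothetical case-level data (one row per scheduled procedure; all column names and values are made up for illustration, not the hospital's actual schema):

```r
library(dplyr)

# Hypothetical one-row-per-case data
cases <- tibble::tribble(
  ~case_id, ~service,  ~status,
  1,        "dental",  "completed",
  2,        "dental",  "cancelled",
  3,        "cardiac", "completed",
  4,        "cardiac", "completed"
)

# A typical tidy summary: one row per service, with a cancellation rate
cancel_rates <- cases |>
  group_by(service) |>
  summarise(n = n(),
            cancelled = sum(status == "cancelled"),
            rate = cancelled / n)
```

With data in this shape, questions like "which services cancel most?" become one short pipeline rather than an Excel exercise.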
Exploratory data analysis
So, I started to do what I now know would be called exploratory data analysis. This is ggplot; I've graduated a little bit further forward. I wanted to look at cancellations over time. Was there any seasonality to the cancellation rate? It seemed that there was, because we've got some variation there. Why could that be? Well, I mentioned patient illness as the most important cause, so I got some data from the hospital's laboratory databases about the tests that were positive for some of the infections we know kids get, viral-type stuff, and there was also some variation going on, and at a quick look it seemed kind of similar.
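A seasonality plot of this kind can be sketched in ggplot2 roughly as follows. The numbers here are invented (a sine wave around a 5% baseline, purely to make the plot render); the real analysis used the hospital's own monthly data:

```r
library(ggplot2)

# Illustrative monthly cancellation rates (invented data)
monthly <- tibble::tibble(
  month = seq(as.Date("2013-01-01"), by = "month", length.out = 24),
  rate  = 0.05 + 0.01 * sin(2 * pi * (1:24) / 12)
)

ggplot(monthly, aes(month, rate)) +
  geom_line() +
  geom_point() +
  scale_y_continuous(labels = scales::percent) +
  labs(x = NULL,
       y = "Day-of-surgery cancellation rate",
       title = "Cancellations over time (illustrative data)")
```

Overlaying a second series, such as positive viral test counts, on the same time axis is how one would eyeball the "kind of similar" variation described above.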
So, I started to realize I was going to need to model cancellations. Just to go back briefly to quality improvement: we know that when you're able to predict something, you might be able to prevent it. In this case, we thought we could target support to at-risk families if we knew who was at risk, and we did a little bit of psychological research to work out what sort of interventions we could try. We also felt that prediction might help us with mitigation: if we had an idea in advance that particular OR slots might open up because families wouldn't show up, then we could slip in some short-notice emergency cases. I also hoped that I might be able to understand cancellation by developing a model for it.
Machine learning and a key influence
So, at this point, the most important influence, or influencer, I should say, came to me. It was a very hot August Sunday. My daughter was just starting kindergarten, and they had an event where the families could get together and meet each other, and I met Emily's mum, the mother of one of the other kids in my daughter's class. We got chatting, and it turned out that she had done a PhD in physics, but was now taking the knowledge of machine learning she had gained from looking at galaxies and the like into healthcare. I told her about the problem I was working on, and she said, you know, you need to do machine learning. I thought, well, I don't know. She said, you're a smart guy. You can do this.
So, I thought, okay, and I did start to do a bit of reading, and I realized that patient-level cancellation prediction is a machine learning problem. She was right. It was supervised learning, with a binary classification of cancelled or completed. I realized there were some specific challenges. One of them, like I told you, is that with a four-or-five-percent cancellation rate there's some class imbalance. Anyway, at this point I was fortunate to come across the Applied Predictive Modeling book by Max Kuhn, and through that, the caret package. I wish tidymodels had been available in those days, but that really got me started, and I was able to actually do some machine learning.
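As a rough sketch of how caret handles this kind of imbalanced binary classification: the frame `train_df` and its `outcome` column are hypothetical (the talk doesn't show the actual features or name the model; `glmnet` here is just one reasonable choice), but the `trainControl()` options shown, including down-sampling the majority class, are real caret machinery:

```r
library(caret)

# Assumed input: train_df with a factor column `outcome`
# (levels "cancelled"/"completed") plus predictor columns.
ctrl <- trainControl(
  method = "cv", number = 5,
  classProbs = TRUE,
  summaryFunction = twoClassSummary,  # report ROC AUC, sensitivity, specificity
  sampling = "down"                   # down-sample the majority class
)                                     # to address the ~5% class imbalance

fit <- train(
  outcome ~ ., data = train_df,
  method = "glmnet",   # illustrative model choice, not from the talk
  metric = "ROC",
  trControl = ctrl
)
```

Resampled-with-rebalancing cross-validation like this is a standard way to get honest performance estimates when only one case in twenty belongs to the positive class.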
Key findings
So, we wanted to find some actionable insights from this patient-level prediction. The first thing I discovered was that it was actually good enough for risk stratification. We got pretty good models, especially for the second and third causes, no-show and NPO violation. Patient illness was rather more difficult to predict, and perhaps that's unsurprising, because kids do get sick. We found that the strongest predictor was whether the patient had cancelled surgery before. Importantly, we found that kids who came from socioeconomically disadvantaged backgrounds were at the highest risk of cancellation, and so, going back to the Institute of Medicine's dimensions of quality, it turns out that surgery cancellation is an important contributor to disparities in access to surgical care.
In the Cincinnati area, interestingly, this is confounded by race, a point I want to come back to in a moment. What was very important to discover is that cancellation wasn't really the fault of any one particular surgical specialty as such; it was really down to the demographics of their patient populations. This enabled me to go back to the cancellation myth that dental patients are the worst for not showing up. Well, they are, in the sense of their backgrounds, but it's not something that the dental service is responsible for.
Geospatial analysis and busting myths
And this takes me on to another potential myth: it's always patients from northern Kentucky. Now, if you don't know the area of Cincinnati (I didn't when I moved there from London), this is a Google map. You can see the hospital in the middle. Although the city of Cincinnati itself is squarely within Ohio, a good proportion of the greater Cincinnati region, just below the river at the bottom there, is in northern Kentucky, so that's what we're talking about. Now we come to another important influence for me. John Snow, a fellow Brit, was an amazing man who pioneered two things that are now quite close to my heart: first, anesthesia safety, and second, epidemiology. You might have heard the story of how he mapped cholera cases in London, identified that one particular well was at the centre of where the cases were, they removed the handle from the well, and the cholera cases went away.
So, I wanted to do some plotting too. I started plotting cancellation rates by zip code, and you can see the highest rates in red here. So, yes, there is an area of northern Kentucky that's a hot spot, but there are actually hot spots in other parts as well. So that was another myth we were able to bust.
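For readers wondering what such a choropleth looks like in code: the original analysis used the older sp package (as mentioned in the Q&A), but with today's sf it might be sketched like this. Both inputs are assumptions here: `zips`, an sf object of ZIP-code polygons with a `zip` column, and `rates`, a data frame of cancellation rates per ZIP:

```r
library(sf)
library(dplyr)
library(ggplot2)

# Assumed inputs: `zips` (sf polygons, column `zip`) and
# `rates` (data frame with columns `zip` and `rate`)
zip_map <- zips |>
  left_join(rates, by = "zip")

ggplot(zip_map) +
  geom_sf(aes(fill = rate), colour = "grey80", linewidth = 0.1) +
  scale_fill_gradient(low = "white", high = "red",
                      labels = scales::percent) +
  labs(fill = "Cancellation\nrate")
```

The white-to-red gradient reproduces the "highest rates in red" convention from the slide.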
Moving on a little further, I started taking a slightly different approach. I've got the same data, effectively, in the left-hand panel here, but this time I've moved to looking at something more granular than zip codes, because I realised zip codes were actually not a good idea. You can see there is a heavy cluster of cancellation in the middle of the panel. We also wanted to look at some things we'd used before from the medical record: race in the middle panel, and who was paying for the surgery, a marker of socioeconomic disadvantage, on the right-hand side. What was interesting is that there was more similarity between race and cancellations than there was with poverty as such.
Now, I wanted to know more, and I couldn't get any more data out of the medical record. So, like our first speaker today, I used tidycensus to pull a lot of data from the American Community Survey and was able to find out a lot more about the neighbourhoods in which families live. From this, we were able to set up some machine learning models of cancellation rate in a geospatial sense. The areas in red hatching here are where the middle model gives a pretty good prediction, and basically that covers most of the densely populated area of Cincinnati. From this, we were able to find out a whole lot more about the neighbourhoods where patients with high cancellation rates were living.
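A minimal tidycensus sketch of this kind of ACS pull might look as follows. The variable codes are real ACS table codes (B19013_001 is median household income; B17001_002 is the count below the poverty level), but the particular selection, geography, and year are illustrative assumptions, not necessarily what the talk's analysis used. Note that tidycensus requires a free Census API key (`census_api_key()`):

```r
library(tidycensus)

# Assumes a Census API key has been set with census_api_key()
acs <- get_acs(
  geography = "tract",
  state = "OH", county = "Hamilton",   # the county containing Cincinnati
  variables = c(median_income = "B19013_001",  # median household income
                below_poverty = "B17001_002"), # count below poverty level
  year = 2019,
  geometry = TRUE   # also return sf polygons, ready for mapping or
)                   # joining to geocoded patient locations
```

Because `geometry = TRUE` returns sf polygons, the neighbourhood-level variables can be joined straight onto geocoded patient data for the geospatial models described above.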
So, first of all, we did find that the American Community Survey data helped us, adding information that wasn't available in the patient chart. We found that affected families tend to live in dense urban neighbourhoods with low literacy levels. They're often very close to the hospital campus, so offering a ride could be quite cost-effective, because it would only be for a mile or two. We found that African-American families were disproportionately affected, which seemed to fit with what we found before, and we realised that we could potentially improve the cultural competency of our communications with Black families. We also found that our communications with non-native speakers of English were actually not a great problem in the Cincinnati area, so our interpreting service was doing a great job.
Conclusion and reflections
So, just to conclude: on a whim, in I think 2013, I downloaded RStudio because there was no financial barrier to entry. A lot of things have happened since. I was lucky to get a federal R grant, which academics know is very sought after. I became a co-advisor to a medical informatics PhD student. I developed and disseminated an understanding of children's surgery cancellation, which I've given you some highlights of just now, and got a couple of publications out of it. I wanted to take this further and actually reduce cancellation rates, and I submitted a number of applications, but unfortunately they were not successful, and then COVID happened.
For family reasons, I had to move to the East Coast, but by this point I'd developed quite a lot of proficiency in R and in data science techniques, and I was lucky to get a faculty position at the University of Pennsylvania and the Children's Hospital of Philadelphia, where I am continuing research using data science in some different areas now. Finally, some shout-outs and pointers. Along my journey, Google has been my constant guide, and early on Stack Overflow was often where I was directed. R-bloggers and various other blog sites have been incredibly helpful, and I've got here two people, now Posit employees, whose blogs I've followed quite a lot in the past. Increasingly over time, I found more and more online books through Bookdown, and GitHub was also incredibly helpful to me. I hope that will be helpful to somebody else in the future.
We have time for a few questions. What packages did you use for that hotspotting graphic that you showed? That was old, so I think I used sp. Now everything I do geospatially is with sf. Was it difficult getting the data you needed to answer these medical questions? Were there HIPAA concerns? There were definitely HIPAA concerns. For all the geocoding that I showed you, I actually had to set things up myself: I learned a little bit of Ruby just to set up a geocoding package, so I didn't have to send data out to Google.
And then someone said: it looks like a tremendous amount of learning and work. How long did it take for you to get the results that you presented today? That's really about seven years' worth, seven or eight years' worth. I think that's all the time we have for questions. Thank you so much.