Data Science in the Energy Industry | Frank Hull | Data Science Hangout

So, I just say, like, get out of your chair throughout the day and, like, just go talk to people. Tell them what you're doing. Just, like, go and talk to an executive. You're going to feel so awkward and uncomfortable, probably.

And the last one to lead us out here, Zach asked a fantastic question. And I think it's a good one to talk about, like, what you're excited about. It is how do you see data science evolving within the energy industry? Is there stuff that you think is happening that's cool and new? What are you excited about?

Yeah. Yeah. I think it's, like, constantly evolving. Like, I think, like, 15 years ago, you wouldn't need to build those complex models to handle the simulations. And then 15 years from now, I have no idea what might occur in energy that might make it twice as hard, but we'll have twice as much compute. So, we'll see what happens. But, you know, like, also another crazy thing in energy right now is bringing data centers back to the United States to train artificial intelligence, which creates so much demand. And then it's like, okay, well, now I think, like, the next five, 10 years is, like, how do we build out enough supply to serve this demand? So, it's constantly changing. There's always new risks and all these new problems to be solved.

I think Thomas brought up energy efficiency. Like, that's a huge problem in itself. It goes all the way down to a house level and trying to simulate, like, every single appliance in someone's house and, like, understanding how that changes over time. So, that's great.

Yeah. This is a good follow-on to this. If anyone did not attend Sajay Suresh's episode, Sajay is a Senior Director of Applied Science and Data at Microsoft and works on forecasting demand for these data centers, and that's a really complicated and hard thing to do. His episode was fantastic, so go watch that one. And then I wanted to give Rachel a chance to ask a question and also mention really quickly that Frank also does open source package creation, and, I think, kuzco just came out. I love the name kuzco because I love The Emperor's New Groove. I don't know if that's why it's called that, but it's a package for R to help with computer vision, right?

Yeah, yeah, yeah, and I don't know if Isabella's here, but thanks for the mug.

Rachel, what did you want to ask?

Oh, I was just curious, Frank, because you mentioned a few different packages, like tidymodels and targets, and you heard about the orbital package too. Like, how do you keep up to date on new packages? Especially for me working at Posit, it's always really helpful for me to hear, like, how people find out about things.

Yeah, I don't know if everyone wants to know that I stay up all night just, like, surfing all of Posit's websites and keeping track. No, like, you know, like, sometimes now that I'm more familiar with GitHub, sometimes I, like, will peek at one of the packages and see if it has any merges lately, and I can kind of get ahead of, like, a CRAN release, which is kind of fun when you're on to something like that, like mirai with purrr. It was something that my team and I were, like, watching for weeks, and I was, like, is it going to get posted to CRAN this week or next week? Like, when's it going to happen?

You know, so, like, I'm on Bluesky. I'm constantly watching any new releases from, like, Simon. I follow him, I think, everywhere. Simon's on the tidymodels team, but also leads, like, tons of LLM stuff, so.

I just saw Simon's here, too. Simon's in the chat. That's awesome.

Oh, weird. I've never actually talked to Simon face-to-face. Simon, I guess we have to unmute you.

Yeah, so, like, just following a few of the software engineers across Posit is a great resource. Just following the boots on the ground, going straight to the grassroots, and just, like, following the people who are building. I think that's where I get most of my information.

I love it. Say their names, and they appear. You never know who's out there in the Hangout, right? We have over 100 people. Usually, we have about 150 people lately, so somebody might be hanging out and listening. Well, I want to wrap up and let everybody know who is coming next week, because I am so excited to have Jenny Bryan next week. Get your Laptop on Fire merch ready, and come hang out with Jenny and everybody, and we'll talk about all kinds of good things. Get your posit::conf registration in. There is still time to book tickets, to book flights, to book hotel rooms. If you cannot attend in person, come hang out with me on the Discord and attend virtually. I promise it's still going to be amazing and fun and a great place to connect. And if you think that the chat was super valuable today, you can go save it. Click the little three dots in the top right of your chat and go to Save Chat. That will let you keep all of your resources. But when these videos end up on YouTube, I do try to get all those resources in the description for you as well. So if there's something you've missed in the past, you can go find it. Thank you, Frank, for hanging out with us today. Thank you, Rachel and Isabella, for helping behind the scenes. Thank you so much, Frank. Bye, everybody.

Transcript

Understanding the energy industry and model scale

Go-to models and machine learning approaches

Stochastic models vs. machine learning for time series

Full-stack data science and career paths

Convincing regulators and documenting models

Handling missing data in time series

Long-term electricity prices and the future of energy data science

Career advice and what's exciting in energy data science

Featured software

tidymodels