
The changing landscape of data science | Kanchana Padmanabhan | Data Science Hangout
To join future data science hangouts, add it to your calendar here: https://pos.it/dsh - All are welcome! We'd love to see you!

We were recently joined by Kanchana Padmanabhan, Director of Data and AI at Homebase, to chat about data science team structures, the role of math in understanding LLMs, building effective hackathons, and communicating model insights to stakeholders. In this Hangout, we explore the importance of understanding the probabilistic nature of LLMs and how that understanding should influence how data scientists approach their work. We also discuss how to structure a hackathon to encourage learning, collaboration between technical teams and business stakeholders, and a focus on the customer problem.

Resources mentioned in the video and Zoom chat:
List of R Conferences for 2025 → https://rworks.dev/posts/r-conferences-2025/
Posit Conference Call for Talks → https://posit.co/blog/speak-at-posit-conf-2025/
Julia Silge's workflow demo on model cards → https://www.linkedin.com/posts/posit-software_join-us-for-a-live-workflow-demo-on-creating-activity-7287998741557522432-jseQ?utm_source=share&utm_medium=member_desktop
Shiny Assistant Gallery → https://gallery.shinyapps.io/assistant
Data Science Hangout Playlist → https://www.youtube.com/playlist?list=PL9HYL-VRX0oTu3bUoyYknD-vpR7Uq6bsR
Add Posit Team End-to-End Workflows to calendar → https://evt.to/aoimiohuw
Making of a Manager Book → https://www.amazon.com/Making-Manager-What-Everyone-Looks/dp/0735219567

If you didn't join live, one great discussion you missed from the Zoom chat was about how to gain domain knowledge for a new industry, where attendees shared their experiences and advice. Let us know below if you'd like to hear more about this topic!
► Subscribe to Our Channel Here: https://bit.ly/2TzgcOu

Follow Us Here:
Website: https://www.posit.co
Hangout: https://pos.it/dsh
LinkedIn: https://www.linkedin.com/company/posit-software
Bluesky: https://bsky.app/profile/posit.co

Thanks for hanging out with us!
Transcript
This transcript was generated automatically and may contain errors.
Welcome back to Data Science Hangout, everybody. If we haven't met before, I'm Libby. I am a community manager here at Posit, helping to enrich our beautiful, wonderful Data Science Hangout community. I am also a Posit Academy mentor, where I help people learn R and Python to do better work with data in their everyday job. We are so happy to have you joining us today, and if you have not been here before and you're not familiar with our format, it's an open space to hear what's going on in the world of data across all different industries.
This is where we chat about data science leadership, we connect with other people in our spaces who are facing similar things as we are, we learn about other industries, and we get together every Thursday, almost every Thursday, same time, same place here on Zoom. So we hope that you have this added to your calendar. If you are watching this recording sometime in the future and you want to join us live, there's going to be details below in the description box on how to add this to your calendar.
Thank you so much to everybody who has made this the friendly and welcoming space that it is today and has been over the last few years. We are all dedicated to keeping it that way. If you have any feedback about your experience that you'd like to share with us anonymously, good, bad, whatever, or maybe even suggestions for topics we could cover or people we could have on, we are going to share a Google Form in the chat where you can give us your anonymous feedback. You can also find Rachel and me on LinkedIn and let us know there, or leave comments on any of our posts, of course.
We learned, Libby, that we can automatically launch a survey right at the end of the Hangout, too, and then that way we can know which specific Hangout it was associated with as well. So thank you to everybody who shared feedback in that Google Form before, but now you'll see when you exit out of the Zoom, there'll be a pop-up survey there, too.
So we really encourage you to connect with other people in the Hangout. This is a place for us all to get together and the chat is your space to have a party, have fun. We really recommend that you introduce yourself. What do you do? Where are you? What do you like to do for fun? Leave a link to your LinkedIn so other people can find you after our chat goes away.
There are three ways today to jump in and ask questions or share your experience. So we are going to be having a group discussion and that doesn't happen without everybody asking questions. So you can put your question in the chat. If you can't talk today, if you don't have a mic, or maybe you're in a very loud place, you can just put an asterisk somewhere in your question, we'll ask it for you. You can ask anonymously on Slido. You can also raise your hand here on Zoom and we will call on you to jump in, maybe if you have a follow-up or something to the current conversation.
Introducing Kanchana Padmanabhan
Kanchana Padmanabhan, Director of Data and AI at Homebase. Kanchana, could you tell us a little bit about yourself, what you do, maybe what Homebase is, and a little bit about what you like to do outside of work? Sure. Thank you, Libby and Rachel, for having me. I think the participant count hit 121 and my nervousness is slowly increasing. So hi, I'm Kanchana. As Libby said, I'm heading Data and AI at a startup called Homebase. Homebase is a company that builds software for small businesses. We build scheduling, payroll, hiring, timesheets, and a lot of other features.
My background is basically data. I've been in the data space since I graduated many years ago from NC State. And then I've been through different industries. I started in social media when Twitter was still called Twitter, when we used to ingest all the firehose and build products off of that. Then I went to retail for a bit and supply chain, then healthcare, and then I'm here.
I think I said this to Libby and Rachel when we started: they had some questions around data science and how it's evolving and changing. And I basically said that I have a belief system that seems to keep updating over the years I've been in this space. So I may share opinions now, and they really are just opinions, what I think right now, and I may get more information in three months that alters what I believe. But I think that's true for all of us. We're all still learning how the space is changing.
I absolutely love music. I learned to play the keyboard, and I also learned vocals, so I try to practice both. I also have two kids. And I teach part-time, actually at all three universities downtown: the University of Toronto, Queen's University, and TMU. I teach in the data science offerings at the business schools, Rotman and the Smith School of Business.
Data science team structure at Homebase
So we work on both what we call customer-facing, or external, ML features and internal ones. The teams I own are data engineering, data platform, data science, and ML platform. The problems we solve are both for customers and for what I like to call our internal customers. We have ML features in scheduling, for example predictive scheduling: using the data to build an optimization engine that can optimize a schedule under various constraints.
We also work on timesheets: how customers can optimize their timesheets and their hourly billing, and how that works. We also build a lot of risk models. We have a lot of financial products, so there's an entire space of risk modeling that we build. And we build for our internal customers too: recommendation engines for our marketing team so they can target appropriately, propensity models, and more recently LLMs. I guess everybody's using LLMs now. So just a mix of different types of models.
Defining data roles
Kylie, you had a question about the difference between data roles. I have two data teams, data platform and data engineering. They basically work on making sure that all of our data, within our product as well as external vendor data, is collated, brought into one system, and then organized and ready to be used for analytics or reporting.
The platform team takes care of the infrastructure, the ingestion, and also sending data externally. The data engineering team is responsible for all the transformations as the data comes in, the monitoring, the validations. As for data science, my data science team is primarily involved in, I'd like to think, building data-driven solutions. It could be as simple as figuring out a bunch of rules to solve a business problem, or go all the way to predictive models or hooking up an LLM, anything in that spectrum.
And the ML platform team are the ones who provide the tooling: making sure that real-time pipelines can be built in a repeatable way, that batch inference can happen in a repeatable way, and that data scientists can build a prototype, package it up, in a wheel or some other way, and deploy it into production easily. The goal is that the time from prototype to production keeps getting shorter, and to build all the tooling that enables that.
Between analytics and data science, I feel there's always a bit of a gray area, because in the company I'm in right now, and the one I was in previously as well, analytics was very heavily involved with the business. They did a lot of business reporting, looked at metrics with the business, and pulled data to support campaigns, targeting, or any of the business objectives.
How much math do you need for LLMs?
I think it's important. You need enough math to understand that these LLMs are basically probabilistic machines, that they're not sentient and not going to understand what you're saying. I've seen a lot of people not even get there, so if you have that level of understanding, I think it's a great start. I was in a demo just before this, a very interesting company, but they gave the LLM a name and referred to it as "she," said she has knowledge, and I just couldn't get beyond that to actually enjoy the demo.
So I think if you have even one level of understanding of how these transformers work, not all the math, not how they're trained, not even the exact algorithm, but just what they do: it's a weighted average. You're learning the weights, you're producing vectors, these vectors are thrown into an N-dimensional space, and then you're predicting based on what's close to what. With that level of understanding, you at least aren't surprised by the answers the model is producing.
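The intuition described here, attention as a weighted average and prediction as closeness in vector space, can be sketched in a few lines. Everything below (the vocabulary, the vectors, the two-dimensional space) is made up purely for illustration; real transformers use learned weights and far higher dimensions.

```python
# Toy sketch: attention = weighted average; prediction = nearest vector.
import math

def softmax(scores):
    """Turn raw similarity scores into weights that sum to 1."""
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def attend(query, keys, values):
    """Weighted average of `values`, weighted by similarity(query, key)."""
    weights = softmax([dot(query, k) for k in keys])
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]

# Tiny made-up embedding space: predict the token whose vector lies
# closest to the attended context vector.
vocab = {"cat": [1.0, 0.0], "dog": [0.9, 0.1], "car": [0.0, 1.0]}
context = attend(query=[1.0, 0.2],
                 keys=[[1.0, 0.0], [0.0, 1.0]],
                 values=[[0.8, 0.1], [0.1, 0.9]])
prediction = min(vocab, key=lambda w: math.dist(vocab[w], context))
print(prediction)
```

The point is only the shape of the computation: weights from similarities, an averaged context vector, and a prediction based on what's nearby in the space.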
For my own company, I've had to do two things. One is that my team is a mix: what I would call more traditional data scientists, the ones who built predictive models, understood data, did exploratory analysis, and a few people who have done research in LLMs and do understand the deep math behind them. So there's cross-pollination; together they're learning. Some of them come from research and aren't used to working in a product environment or building product features, but they understand the math and the research, and vice versa.
I've also had to set up what I'm calling a center of excellence, because with ChatGPT you technically don't even need a data scientist to run these prompts. Everybody in my company is running things with prompts and building "assistants," as they call them, for different purposes. So the center of excellence is really about teaching how these things work, even getting technical: talking about evaluations, talking about hallucinations and what they could mean. It's become a constant cycle of teaching, providing guardrails, and making sure people are thinking about the right things.
Building a learning-centered hackathon
Yes, so this happened in November. We typically have two hackathons a year, and I kind of hijacked this one for this purpose. There was a growing interest in the company in LLMs, and different people were excited to try things. We had set up an internal community called the AI Builders Community, a Slack channel where people building with this could come and ask questions and we would support them. But the one thing I realized was, as Abigail mentioned, that there was no context for how these models were working.
The other thing I was noticing, and I think it's happened to all of us many times, is that we get so enamored by the technology that we forget what we're solving. At the end of the day, we're solving user problems, for ourselves or for our customers. So you want to center on that even when you're solving with an LLM, because it doesn't matter whether it's an LLM or XGBoost or anything else, as long as it's solving a real problem.
So we did a few things. One, we said part of the hackathon would be learning. One part was obviously technical: we went through how LLMs work, how to evaluate them, and how to think about evaluations. I also had one section on security: what these APIs are, what data you should and shouldn't send in, and how that should work. Another piece was that I invited my head of design and her group to talk about customer centricity: what it means, how to do mind mapping, how to think about a customer problem and center yourselves on it.
One other thing we did was give each team an executive mentor, from VP and CTO level and up, so they could see what the teams were building and guide them on questions like: will this solution be useful? At the end of the hackathon, I also organized office hours with my principal and staff engineers so that people could come and vet their solutions. That way the engineers also get a sense of what's coming downstream, what's going to land two months from now.
LLM costs and long-term strategy
For LLMs, I think there are two parts. One is evaluations: actually putting your evaluation set together, getting your metrics in place, doing qualitative and quantitative validations, knowing where the boundaries of your failures are, where you're going to fail. You have to actually collect manually labeled data. People think labeled data has gone away. It hasn't. You still have to evaluate the model; even though you don't need millions of data points to train it, you still need some set to validate it.
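The point about still needing labeled data can be sketched as a tiny evaluation harness: a small hand-labeled set, a scoring loop, and a record of where the model fails. `run_model` here is a hypothetical stand-in for whatever endpoint or prompt chain is being validated, and the labeled questions are invented for illustration.

```python
# Minimal sketch of evaluating an LLM feature against a hand-labeled set.
def run_model(question):
    # Stand-in: a real implementation would call the model endpoint here.
    canned = {"Is overtime pay owed after 40 hours?": "yes"}
    return canned.get(question, "unknown")

def evaluate(labeled_examples):
    """Return accuracy plus the failing cases, so you see *where* you fail."""
    failures = []
    for question, expected in labeled_examples:
        answer = run_model(question)
        if answer != expected:
            failures.append((question, expected, answer))
    accuracy = 1 - len(failures) / len(labeled_examples)
    return accuracy, failures

labeled = [
    ("Is overtime pay owed after 40 hours?", "yes"),
    ("Can a schedule be published retroactively?", "no"),
]
accuracy, failures = evaluate(labeled)
print(f"accuracy={accuracy:.2f}, failures={len(failures)}")
```

Even a set this small forces the qualitative question the transcript raises: not just "what is the score," but which cases fail and why.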
From a cost perspective, I was definitely nervous when everybody got excited about LLMs. My first instinct was, oh my God, cost is going to go through the roof. And what I realized was that I was the person who needed to solve for it. Yes, these OpenAI endpoints, or Anthropic, or any other endpoints you use, can definitely get expensive as you scale. But what they help you do is evaluate your use case really fast.
And the key thing with AI, and it's always been true with ML but it's true with LLMs too, is that iterations are important: how fast can you ship a version and how fast can you get feedback on it? These LLM endpoints from these companies really enable that; they make it really easy to do.
One of the ways we've built out these kinds of endpoints is through the ML platform team. We build very common patterns; we build an API. So we encapsulate: the product doesn't hit OpenAI directly. There's an encapsulation layer, the product hits it through us, and we hit OpenAI. And we've already started doing experiments on small language models, training on our own data to get more specific models in-house.
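The encapsulation pattern described here, product code calling an internal layer that in turn calls the vendor, can be sketched roughly as below. All names are illustrative, not Homebase's actual API, and the backends are stubs rather than real vendor calls.

```python
# Sketch of an encapsulation layer: product code talks only to the
# gateway, never to a vendor SDK, so the backend can be swapped
# (vendor endpoint today, a small in-house model later) in one place.
from typing import Callable

class CompletionGateway:
    def __init__(self, backend: Callable[[str], str]):
        self._backend = backend

    def complete(self, prompt: str) -> str:
        # Central chokepoint for logging, cost tracking, and guardrails.
        if not prompt.strip():
            raise ValueError("empty prompt")
        return self._backend(prompt)

# Stub backends standing in for a vendor call and an in-house model.
def vendor_backend(prompt: str) -> str:
    return f"[vendor answer to: {prompt}]"

def in_house_backend(prompt: str) -> str:
    return f"[small-model answer to: {prompt}]"

gateway = CompletionGateway(vendor_backend)
print(gateway.complete("Summarize this timesheet."))
gateway = CompletionGateway(in_house_backend)  # the swap is one line
```

The design choice is that the swap to a simpler or in-house model, mentioned later in the conversation, touches only the gateway's constructor, not every product feature.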
And a bigger part of our discussions is always: does this even need an LLM? Can we replace it with a much simpler model? For many use cases, it could very well be the case. An LLM is the short answer, but it doesn't have to be the long-term strategy. We may not even go with an LLM, or six months down the line we may replace it with a bunch of simpler models.
Explainability versus accuracy
For the models that are more risk-based: we have a product called Cashouts, and internally we have models doing risk assessment when people are drawing cashouts. There's obviously a responsibility angle to it. Even if the business may not think about it, as data scientists we have a code of ethics. We have a page written up for data science that says do no harm: make sure you're not biased, and all of that.
So we try to make those models as explainable as possible. And one thing we do is we never reject anybody outright, if you know what I'm saying: everybody gets a minimum of something. The model will not make a decision in the negative on its own. When there's a negative decision, a minimum is still given and there is a review; something else happens beyond that. It's never just a model making the decision.
Advocating for better analytics tools
I've definitely, even this year, had to get buy-in for revamping our entire data architecture, because it was in a place where our pipelines were failing and stakeholders were not getting data on time. So the discussion became: we need to re-architect things, we need to reorganize our data, we want to move more to Databricks. A lot of it really comes down to putting it in the context of business value. I know it sounds cliched, but that's really what helps: talking about how many hours my team is spending on these types of problems.
It's really making the case around time spent: how much does a data engineer cost? How much are we spending on this? How many failures are we having? How many times have we delayed things for marketing? You put the case together with all of that and then say, by the way, here is a proposed solution, this is how easily you can onboard, and this is how long it'll take.
And one thing that's really helped me, especially with new analytics tools, because I am trying some of these new LLM-based tools, is scoping it out as a POC: we can try it, it'll just cost maybe 10,000 or 20,000, whatever it is. You scope out a POC and go from there. If it's useful, and if you have the right people testing it in that phase, that also helps you get buy-in.
Transitioning into management
I always tell my boss I'm one of those happy managers. It's a calling; I enjoy it. It's what I thought I wanted to do. Even in grad school, we were a team of 15 PhD students in my group, 15 men and me alone, and it was just naturally something I picked up: supporting people with their projects, collaborating, making sure things were on track, because research can sometimes get very lonely when you're on your own trying to figure something out.
When I went into work, I found myself in a similar space of always caring about the bigger picture, always caring about what's going on holistically on the team. I was never like, assign me a ticket, I'll do it. I was never that person. I was always like, okay, I'll do my ticket. Also, what's that person doing? What's that person doing? How are we fitting all the pieces together? What output, impact are we creating as a team?
At some point my manager had to decide, and he asked me. He said, hey, I have this position. Do you want to be a manager? Do you want to be an IC? You could be either; it's your choice. I said, yeah, management seems more my calling. Although the one thing I would say is that a lot of people think management will take you away from technical work; I never stay very far from the technical stuff in general.
Communicating model outputs to stakeholders
For me, that starts with the definition of done I have for my team, which is: unless somebody's using it, your model is not done. I think there are two things here. One is understanding that model building is an iterative process, in the sense that you slowly gather requirements. No end user is going to come and tell you, build me this thing. It's always going to be a vague problem that you're trying to distill down into something technical.
So the first step is: can you distill it down into something technical and then explain to them what outputs they're going to get? What will you get out of this model? How will you use it downstream? Then what I usually recommend to my team is, let's put a quick POC together, one iteration of this model. It can be slightly rudimentary, we can still have metrics and everything, but let's get our baseline model ready to go and get an end-to-end working.
And the caveat, the learning I've had to work through with the team, is to say: hey, these things are iterative. Yes, the first version might not look ideal, but we'll solve for a subset of use cases. This one subset will be solved; a lot of other things you asked for won't be yet, but we can get to them over time.
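The "rudimentary first iteration" idea can be sketched as a majority-class baseline: ship something end-to-end that produces outputs and a metric, then iterate against that floor. The data and label names below are invented purely for illustration.

```python
# Sketch of a first-iteration baseline: always predict the most common
# training label, then measure it, so every later model has a floor to beat.
from collections import Counter

def majority_baseline(train_labels):
    """Return a 'model' that ignores features and predicts the majority label."""
    most_common = Counter(train_labels).most_common(1)[0][0]
    return lambda _features: most_common

# Made-up shift-attendance data.
train_labels = ["no_show", "show", "show", "show", "no_show"]
test = [({"shift": "night"}, "show"),
        ({"shift": "day"}, "show"),
        ({"shift": "night"}, "no_show")]

predict = majority_baseline(train_labels)
correct = sum(predict(features) == label for features, label in test)
print(f"baseline accuracy: {correct / len(test):.2f}")
```

The baseline itself is almost useless as a product, but it exercises the whole pipeline, from data to metric to stakeholder-visible output, which is exactly what the first iteration is for.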
Building domain knowledge
Of course, there are resources available that you can learn from. But I think the way you learn, assuming you're a college grad wanting to start, is you go in and spend a lot of time with the experts who are there. That's what I do with my data science team. My analytics team is very embedded, they're already embedded in different domains, so they're naturally in the meetings and they learn. But even for my data scientists, I make sure they sit in all these meetings with the team, even if it's not specifically a data science problem being discussed.
So I don't know if anybody else has a good answer about learning it from university itself. One of the domains I learned was through research: I started my PhD not knowing very much about computational biology and finished knowing a lot. But yeah, once you're in, definitely be attached at the hip, as I call it, with any and all domain people you can.
Learning from good and bad managers
The story is that my advisor, like all advisors, had a certain pattern. So when I finished grad school, I was a very nervous person, if you can call it that: very nervous about work, about finishing things on time, answering emails the minute they hit my inbox with no delay, being available on the weekend even though I was not on call and didn't need to be.
And then I had my very first manager, Eddie Kim, who literally came to me one Friday and said, OK, you're going to leave your laptop at work. I said, what's going on? And he said, I've seen you replying to things, why are you doing that? You're doing two things wrong. One, why are you doing it at all? And two, you're setting a bad example for the rest of the team: why are you responding and setting the expectation that everybody else needs to, when it's not necessary?
And he made me do it. And I suddenly realized, oh my God, this is a way to live. I can do this. This is a way people can be managed, and I don't have to be on my toes the whole time. It really switched something in me. It's not that I would have gone and imposed that on someone else; I don't think I was even mentally there. He turned my life around. He flipped a switch for me, where I went, oh, I can breathe. It's okay.
And have I had bad managers after that? A hundred percent. I've had people who expected a certain type of behavior, but I have generally fought against it. At the very least, I've always tried to protect my team: make sure my team stayed sane and healthy, manage my own relationship with my manager, but keep my team protected and safe.
Tech stack
The one rule of thumb I try to follow is to keep my stack very boring and very standard, because I've been in a place earlier where it was not, and we had Hadoop failures all over the place. Right now our tech stack is Databricks for almost all things ML and data. We have Redshift for our warehousing, though we're trying to move away from it a bit. We use Airflow for all of our pipelines. We have dbt, which gives analytics a bit of self-serve, because they can build their models and their metrics more easily in SQL and don't have to do it in Python. We have Looker for dashboarding, would not recommend it. Python and SQL are the fundamentals, of course. And because of Databricks, we use Spark for a lot of our processing.
Well, thank you everybody. And as housekeeping, I will remind everybody that Rachel put in the chat, Julia Silge's workflow demo on the 29th of this month is going to be on model cards for transparent, responsible reporting. Might be a great follow-up to this conversation if anybody is interested. Thank you so much for joining us. We hope you have a wonderful day. We will see you next week, same time, same place. And thank you so much, Kanchana, for joining us. This conversation was amazing.

