SAS to R, data harmonization, & a career in pharma | Dony Unardi | Data Science Hangout

Transcript

This transcript was generated automatically and may contain errors.

Hey there, welcome to the Posit Data Science Hangout. I'm Libby Heeren, and this is a recording of our weekly community call that happens every Thursday at 12 p.m. U.S. Eastern Time. If you are not joining us live, you're missing out on the amazing chat that's going on. So find the link in the description where you can add our call to your calendar and come hang out with the most supportive, friendly, and funny data community you'll ever experience.

Can't wait to see you there. With that, I would love to introduce our featured leader today, Dony Unardi, Principal Data Scientist at Genentech. Dony, it is so nice to have you here today. Can you tell us a little bit about yourself, what you do, and something you like to do for fun?

Yeah, for sure. First of all, thank you so much for having me here, talking to you and all the wonderful people here. We have such great participants, great numbers. So my name is Dony. I've been at Genentech for eight years, no, nine years, going on a decade now. I've been leading the Teal framework for about three years. And the reason Libby was saying that I'm a morning person by choice is that I have a global team, so my standup starts at 7 a.m. every day. It used to be a lot earlier, but I said, nah, let's just make it 7. I can't do it. Otherwise, I'll be like a zombie in the meeting.

My background is in computer science. Prior to joining Genentech, well, maybe prior to Teal, I worked closely with study data. So I was a data manager. I also switched to a role where I did a lot of R development, trying to harmonize data for a molecule. So whether it's recent study data or older study data, my job was to harmonize it and make it ready for analysis. I also have a background as a web programmer. I used to do freelance web programming, and I was a SAS programmer as well. So it's been such an interesting career. I was sharing with Libby that at some point I thought I wanted to do web programming, and I was about to make a career switch from data to the web, to the front end. I would say I'm so glad I didn't, because data turns out to be such an amazing career path with a lot of things to learn. And with a language like R, we can make a product really sophisticated. I apply so many different object-oriented techniques in the product. So it's such a versatile and flexible tool for making great products.

Yeah. The funniest thing that Dony said to me was, yeah, I was doing web development and yeah, I was considering making a switch, but I didn't think data was going to go anywhere. I didn't think anything was going to come of this data thing.

Yeah. I was really almost at what felt like a career ceiling for me. And I was really deep into web too. I was doing Angular. I was learning Vue.js. I was doing a LAMP stack, all these different stacks. I was trying to learn them. I did a little freelancing, helping my wife. She used to have a logistics business, so I built the front and back end for her business. And it was fun, but I'm glad to be where I am right now. Definitely, R and data science are such interesting and fun topics.

Introducing the Teal framework

It is. And speaking of topics, I would love to give everybody a rundown of things that I think would be fun to talk with Dony about, which can help inform some of your questions. So one is a little more background, which I will ask Dony for in just a second, around the Teal framework. What is that? How does it work? Why is it important? Another one is the switch that happened from SAS programmer and web developer to data, and what that skill set and background gave to him and his ability to do stuff. Also, being a technical leader. We've had a lot of conversations lately in the community about being a technical leader without being a people leader as well.

I know that Dony also does technical interviews. So he has people who are on this team with him. And while he does not do the behavioral part of the interview process, he does do technical interviews. So if you have questions about that, that might be interesting. And then I also loved the technical topics: working on Teal, and working with systems in R that I've never worked with, like R6, S3, and S4. All of that feels so above my head as far as R programming goes, but if you have questions about that, I know Dony loves talking about them.

He's also an R champion coach. He onboards study teams and really helps with R adoption inside of Genentech. Those are great topics. I also loved something that Dony said: hey, not everybody in AI and ML is doing the actual model building, because he has always been more on the data prep, data cleaning, and QC side, which is quality control, validation, things like that. So if you have questions about that, or if you're like, hey, I'd love to have a career in data, but maybe I don't want to be a modeler, Dony can talk about those topics too.

Hello. Sorry, I was in front of the stove because I am cooking my post-not-run meal. Emphasize "not run," because I do Strava, but I go at a leisurely pace. But regarding the question: what is Teal? I'm unfortunately somewhat ignorant about it. Why use it? What does it do? What's so great about it? Is it just a really good color?

Fantastic question, Noor. Thank you. All right, Dony, take it away. Tell us about Teal.

It does have a good color. But anyway, hopefully it's more than that. So Teal is essentially a Shiny app, but we built a framework around Shiny to add a lot of features out of the box. So it's using Shiny components and Shiny techniques to make a Shiny app, but with what we like to call Teal flavors. So what are these flavors? What we're trying to achieve is to accelerate how people create Shiny apps. While you can still make a traditional Shiny app, we want to provide a more functional-programming way where, with a small amount of code, you can make your Shiny app.

Another thing that we're trying to achieve with Teal is to add components that are important in a clinical trial setting, but that we know are important in other settings as well. One of our key features that we always try to market is that it has code reproducibility. What that means is this: you create a Teal app using the prebuilt modules that we have, you run your analysis with these modules, and you see something that you like. Now, Teal was positioned as an exploratory tool. So in order for you to run the analysis in a more validated environment, Teal will provide you the code that can reproduce what you see in your Teal app or visualization. That's very important for us to achieve because in pharma, this is something that we always want to do. We want to be able to reproduce our analysis.

Another thing that we wanted to promote out of the box was our exploratory features. That means subsetting, filtering, and so on. So by just running very simple Teal code to make a Teal app, the filtering comes out of the box. We encapsulate this, and this is where the object-oriented part comes in. We provide some API for you to control how the filters behave, or to define predefined filters, but the intricacy of it, the details, we just encapsulate. So by default, when you make an app, you will have filtering ability. And then we kept improving on this. We have predefined modules. We have line plots, bar plots, graphs that we know are quite standard. We provide maybe 50-plus modules that people can look through. And the idea is that you don't need to know how to build a custom Teal module. We already know of standard modules that could be beneficial for your analysis, so why don't you just take one, plug it into your code, and run it?

There are a lot more things that I can think of. We have a website about this. We also have Shinylive sessions on our website, in all of the documentation, where we try to show the capability of Teal. We also have what we call the Teal Gallery. You can Google it, teal.gallery. It's a website that shows you a couple of finished Teal apps using synthetic data. So these are all fake data. And this is what I like about our open source Teal. Right now, like I said, it's very flexible. Everything is just a Shiny module. And we're looking into several other categories that can be beneficial to other departments. Right now, I know Teal is being used very heavily in our study trials, but we also want it to be used for other purposes, like medical data review and so on. So maybe this is a long answer to your question, but Teal is a product that lets you easily make a Shiny app with the analysis that you need.

I think Pharmaverse is a great one. And these are all data scientists, and we are all very friendly. I mean, I know tech can be competitive, but I would say data scientists are some of the warmest, kindest people that I've met so far.

That sounds amazing. There were so many wonderful questions that we couldn't get to. There were questions about, you know, career limitations for growth. We had questions about clinical trials. We had questions about the influence of AI in the research space. Everybody asked such amazing things, but we only have this one hour with Dony. So Dony, thank you so much for hanging out with us. This was so much fun.

Yeah. Thanks for having me, Libby. And thanks, everybody, for your questions. I really enjoyed it. I hope you had a great time. And yeah, feel free to connect with me if you have any questions.

Thank you. Go find Dony on LinkedIn, Dony Unardi. Also, I realized as we're talking about announcements, I was trying not to spill the beans because I didn't know if it was out yet, but Isabella has informed me that it is out. So if you are registered for the Posit Conference, in-person, virtual, doesn't matter, the Discord server is open. Yay! So there's a link there. There's a little button. You can go join the Discord server. You better find me, tag me, say hello. I will go right now and make a Posit Data Science Hangout channel for us. We can all pile in there and really get talking. My recommendation to you: make your name your real name. If you have a weird Discord server name, I'm not going to know who you are. Make it your real name so I know who you are, okay?

If you want to save the chat, because it's full of amazing resources, go click the three dots in the upper right-hand corner of the chat. If you are not able to save the chat and that distresses you, fill out the Zoom survey that comes after this. Put your email on it. Rachel and I can help you get the chat. Find us on LinkedIn. Find me on Bluesky, where I spend most of my time. And we will see you next week, when we have Lisa Elkin. She's a Senior Principal Computational Toxicologist at Pfizer. Lisa is a delight. Please come hang out with Rachel and me and Lisa. All right. Have fun on the Discord server. I will see you there, and I'll see you next week, everybody. Bye. Bye, Dony.

Featured software

Shiny