Resources

Sep Dadsetan - CONNECTing with our clients

Leveraging Posit Connect, our company transforms client engagement by providing direct support, extensive documentation (built with Quarto), and no-code applications for exploring and analyzing real-world oncology data. This strategy gives our subject matter experts the flexibility to deliver client value, provide client assistance, enhance self-service learning, and lower the technical barrier to data insights. Our commitment to client success and innovation is evidenced by our use of Posit Connect, which provides tools for a competitive edge and a data-driven culture.

Talk by Sep Dadsetan

Slides: https://drive.google.com/file/d/1let_qEC94x3GS5E_hjLkqp0F4GrqLkwe/view?usp=sharing

Oct 31, 2024
19 min

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

I'm really excited to be speaking to you today. It's my first time speaking at posit::conf. I've been a longtime participant, and I'm also really humbled to be speaking on behalf of ConcertAI.

ConcertAI is a real-world data company. We collect and analyze patient-level oncology data from a variety of sources, such as electronic medical records, and then generate real-world evidence to support drug development and offerings like that. But don't worry, the talk is not about oncology or real-world data. The hope is that it's agnostic enough that there's something you can take away from it.

Gaps in the data science process

So before we dive in, I want to cover a couple of things that I think are gaps in the data science process. Hopefully that will be informative for where we're going.

So I'd love a show of hands: how many people have performed an analysis for a stakeholder, delivered that analysis, and then had the stakeholder come back with revisions of some sort? Yeah, exactly. Perfectly common, right?

But there are some simple bottlenecks within this process. One is that we can't really anticipate what questions are going to come about from that particular delivery, so there will be additional questions. That elongates the time it takes to deliver the project, and it becomes difficult to generate a timeline. So timelines slip, and that can have knock-on effects on broader project milestones. It's more costly because it costs more time.

That's the first piece. The second piece is how we package that information. It could be a Word document, a Jupyter notebook, a variety of things. This seems innocuous, but there's actually a lot of variability within a company, and even between companies, in how that information is delivered. Often we converge on the lowest common denominator, say emailing a Word document to somebody, and it goes over the wall in that regard.

Now, data science is a science. It's wholly expected that there will be questions: we answer questions, and we get more questions. There's nothing unique about that, but there are some improvements we can make. For example, your team creates a document, you've done all this analysis, you email or file-share it, and your collaborators review it and provide feedback: "It's garbage, do it on this dataset," or "I love it, just change this color." They come back with questions, and we go around this cycle.

But especially if you approach it with this lowest common denominator of emailing, there's a lot you lose. Are the right people viewing your document? Are they even viewing the right version of it? Over time, as projects grow, people join or leave, and anything you've emailed is effectively lost. It becomes work for your team, or another team, to make sure all of this is managed appropriately. So all the time you spent collecting the data and performing the analysis can get stale and get lost.

Addressing the business impacts

So how do we address these business impacts? For reducing some of the back and forth, self-service is actually an option. It's not the only option, but it's a pretty good one. If I build the product in a way that lets the user change, say, the parameters of their output, I'm going to diminish some of the requests that would otherwise come back my way or my team's way, so you get a little bit of resource back. We just have to engineer these things carefully.

And thankfully we're at posit::conf, so we're very familiar with R Markdown and Quarto; we've seen a variety of talks of that nature, so this is perfectly feasible. However, providing that interactivity requires the ability to host the content, because it's a web-based product. You can't just email it. You could, but it's not recommended. So web-based development is the easiest way to add interactivity to a document.

It's not the only way, but for all of our sakes it's probably the easiest, given the nature of the tools that are available. But here's where we have that other problem. You either have to go knocking on IT's door to set up some server, or you have to have server skills or cloud-ops skills yourself, and not everybody has those. There's already so much knowledge we have to carry for our day-to-day work.

And so we went about looking at the ecosystem of solutions out there, this whole build versus buy. There are actually a bunch of solutions; these are just a handful of them. But we had some requirements that any solution needed to meet. We're an oncology data company with patient data, so privacy is really important. We have both R and Python users, and we need to bolster those skills, so we needed the solution to let us be multilingual and continue that.

We wanted support for a variety of content: not just publishing Quarto documents, but other things too. Cost, which is obviously subjective, was an important consideration we wanted to take into account. It should be configurable: can we turn things on and off? Maybe there are features we don't want enabled, or things we want to add. And it should be scalable: it shouldn't fall over if two people are viewing something, and it should also scale in terms of expansion.

These are a couple of the things we wanted, and since we're at posit::conf, I'm not surprising anybody with this: for us, Posit Connect was able to resolve a lot of those things. But not everyone is necessarily familiar with Posit Connect. So what is it? Posit Connect is part of Posit's enterprise solutions. It's basically a publishing platform that lets subject matter experts craft a variety of solutions in different ways and publish them to a place where others can leverage them.

In our case, where we have both non-technical and technical users, as well as different use cases within those verticals, the flexibility of publishing a variety of content types is really helpful, because we can't always anticipate what the use case or technical need might be. It enables faster deployment when we need it, real-time decision-making, things of that nature, and that drives a faster iteration cycle.

Client-facing Posit Connect deployment

Now, Posit Connect is often used internally at institutions, and it's a very handy tool that way. But we want to enable our clients to make decisions, so the question is: what would a client-facing Posit Connect deployment look like? We had plenty of experience running Posit Connect internally, with great documentation and great support for that, but we wanted this external-facing situation. It's not entirely straightforward, and I'm hoping to guide you through some of it today.

So for our deployment and configuration, with the help of Atorus Research as well as our CloudOps team, we began sketching out what this might look like, and we were able to leverage infrastructure as code. For those who may not be familiar, infrastructure as code effectively uses a YAML file, a human-readable but also programmatic way of defining resources. We specify those conditions in our YAML file, which we can then put in version control (very important to use version control).

Then we use a CI/CD (continuous integration, continuous deployment) tool, in this case Jenkins, to set up the parameters, along with Terraform and Ansible, both very handy infrastructure-as-code tools. For those who may not be familiar, Terraform is helpful for provisioning consistent resources in the cloud, whether that's AWS or Google, and Ansible is helpful for configuring the software that goes onto those resources.

And so with that, from a YAML file we can very quickly deploy an EC2 instance that has all of the software, the networking, the security considerations, the monitoring, the database connections, the service accounts, everything in a reproducible pipeline. We can do this multiple times, and each instance is identical to the next.
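As a rough sketch of the idea, an instance definition in such a YAML file might look something like the fragment below. Every key and value here is hypothetical, invented for illustration; it is not ConcertAI's actual configuration, just the shape of a spec that a Terraform/Ansible pipeline could consume.

```yaml
# Hypothetical spec consumed by the provisioning pipeline (illustrative only)
connect_instances:
  - name: client-facing-01
    instance_type: t3.xlarge        # EC2 size Terraform would provision
    connect_version: "2024.06.0"    # Posit Connect release Ansible would install
    git_backed_only: true           # disallow direct publishing to this server
    monitoring: enabled
    database: postgres              # Connect metadata store
```

Because the spec lives in version control, redeploying from the same file yields an identical instance every time.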

And honestly, this is probably one of the coolest things. It wasn't something I had a lot of experience with, but being able to make some quick configuration changes, deploy the pipeline, and have control over this fleet of servers is a really cool feeling, so I thought I'd share that.


So that's great. This infrastructure is really important; it's the underlying architecture for how everything else gets done. What does this mean operationally?

Well, operationally, this is what a typical internal Connect instance looks like. The purple outlines represent ConcertAI-focused internal resources. We have our data, in this case Redshift, but pick your data source of choice, it doesn't really matter. We're using Posit Workbench, of course, because we've purchased the Posit Team bundle. Our users can work in Python or RStudio, whatever they like, build whatever tools they need to share internally, and then publish to our internal Connect server.

But as I showed you previously, we now have these external, client-facing instances. So how do we do that? It's kind of similar. You still perform your work on the left-hand side, but now we enforce that GitHub, version control, be used. The distinguishing feature of these external instances is that we use Git-backed deployment on the Connect servers. That means you can't publish directly to a particular server; you have to deploy through version control. And this is actually really, really cool.

And I think a nice tidbit to take away is that, one, it enforces development practices. Not everybody, admittedly, and I've seen this at multiple institutions, is familiar with or uses version control. Two, and this is a nice cherry on top, it enforces governance: content and code review are now required, because this is going to client-facing instances and we want the quality checks in place. And three, to appease the IT gods, it's a nice security checkpoint: material being published can go through either automated or manual security review. This has been very, very helpful.

The added benefit is that because this content is version controlled, it can be deployed to multiple instances, assuming it's the same content, and we can go even further and make programmatic changes to create different variants of content for an individual client. So it's a really nice way for us to have a system internally, and for the clients to engage with it.

Outputs and showcases

So now that I've shown the architecture and a little bit of how it comes together operationally, I want to showcase some of the outputs that have come of it. As noted in the keynote earlier, documentation with Quarto, for example, isn't necessarily the sexiest thing, but in my opinion it's actually one of the coolest aspects of this platform.

Big shout-out to Conrad Svitek on our team, who has almost single-handedly put this together. Our documentation process involves 30 to 40 people from different groups, like epidemiology and informatics, and a variety of other teams. It's a big effort: a lot of expertise is required for real-world data, because it involves clinical data, genomic data, claims data, and you have experts coming to the table from all these different groups. Putting documentation together seems like a simple thing, but it actually requires a lot of process.

What Conrad has been able to do is create a process behind the scenes that takes advantage of Git and Git-flow techniques, allowing these different teams to participate and bring their voices to the table. We can also build profiles, so that if a client subscribes to a particular dataset, their documentation represents their data, out of all the data we have available, and it's all driven by an automated pipeline. It has saved weeks of work, greatly improved our ability to determine who changed what and when, and given us much better provenance.
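The profile idea can be sketched in a few lines: given the full catalogue of dataset documentation and a client's subscriptions, emit only the chapters that client's build should include. The dataset names, file names, and function here are all invented for illustration; in practice something like Quarto's project profiles could drive the selection at render time.

```python
# Hypothetical mapping from dataset products to Quarto documentation chapters.
FULL_CATALOGUE = {
    "clinical": ["intake.qmd", "clinical-variables.qmd"],
    "genomic": ["genomic-panels.qmd", "biomarkers.qmd"],
    "claims": ["claims-coverage.qmd"],
}

def chapters_for(subscriptions: list[str]) -> list[str]:
    """Chapters a client's documentation profile should render, in catalogue order."""
    return [ch for dataset, chapters in FULL_CATALOGUE.items()
            if dataset in subscriptions
            for ch in chapters]
```

Because the selection is data-driven, adding a new client profile is a configuration change rather than a copy of the documentation.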

This is all built on top of that infrastructure and benefits the clients. Clients come back to us saying, this is great: it's a single source, there's no worry about which version I'm looking at, just as we were talking about earlier. That was part of our hypothesis. They can go to that resource, find it, use the search functionality, all the things you'd expect from the web. So that's been a really cool win.

The second showcase is a data browser. One of the teams that works specifically on our genomics product wanted a way to surface some of the counts they find in the dataset, so that clients and others can change some of the parameters and pull out what they want. The reason I like to highlight this one is that one of our original hypotheses was that if we build something like this, if we provide that infrastructure, we can reduce the time it takes subject matter experts to get to an output.

So now we have subject matter experts working on this, developing with technologies such as Quarto. In a matter of about two weeks, without any prior experience with Quarto, R Markdown, or even JavaScript, they were able to build this, and it's multilingual in the sense that parts of it are built with R and parts with Python. I thought that was a really cool highlight to make. Shout-out to Prita Ghosh on our team, who built this out.
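To make the data-browser idea concrete, here is a toy version of parameterized counts. The records, field names, and function are invented for illustration (this is not ConcertAI data): the user picks filter parameters and gets the matching cohort size back, which is the essence of what such a browser exposes.

```python
# Toy patient-level records; in the real product these would come from the warehouse.
RECORDS = [
    {"biomarker": "EGFR", "stage": "III"},
    {"biomarker": "EGFR", "stage": "IV"},
    {"biomarker": "KRAS", "stage": "IV"},
]

def cohort_count(records, **filters):
    """Count records matching every supplied field=value filter."""
    return sum(all(r.get(k) == v for k, v in filters.items()) for r in records)
```

In the hosted browser, the `**filters` would come from UI controls (dropdowns, sliders) rather than function arguments, but the logic is the same.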

The third one is a favorite of mine, because Connect allows hosting of APIs. We can now provide a utility, a stepping stone, for individuals: infrastructure that lets other people interact with the data programmatically, so they can go build additional capabilities. It gives us a lot of flexibility. Similar to how an R package or a Python library encapsulates functions, we can build this in a language-agnostic manner for people to consume the information. It's an extremely helpful utility, now enabled by the platform. Shout-out to Tyler Lifke and Conrad, who helped build this.
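Connect can host APIs written with frameworks like Plumber (R) or Flask and FastAPI (Python). As a framework-free sketch of the endpoint logic (the route, data, and function names are hypothetical), a counts endpoint might look like this; on Connect, this function would sit behind an HTTP route so that any client language can call it.

```python
import json

# Hypothetical precomputed counts; in practice computed from the dataset.
COUNTS = {"EGFR": 2, "KRAS": 1}

def handle_counts(query: dict) -> str:
    """Return a JSON body for a hypothetical GET /counts?biomarker=... endpoint."""
    biomarker = query.get("biomarker")
    if biomarker is None:
        return json.dumps({"counts": COUNTS})       # no filter: return everything
    return json.dumps({"biomarker": biomarker, "count": COUNTS.get(biomarker, 0)})
```

This is what "language-agnostic" buys you: an R script, a Python notebook, or a plain `curl` call all consume the same JSON over HTTP.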

Now, the fourth one is kind of experimental, and I don't have a screenshot because it's still early. Connect, being this hosting platform, obviously supports a lot of the things I mentioned earlier, but we wanted to push the boundaries and see what Connect can actually do. We have a desire to build web applications, but we didn't necessarily want to build them in Shiny: Shiny has certain limitations that were preventing us from going forward. So can you build a web app in a more traditional framework, say React, and publish that to Connect? I've seen examples of other companies doing it.

We don't have JavaScript developers in-house to do that, but we came across a framework called Reflex, a pure-Python framework that is basically a wrapper around Next.js, which is a React framework. With a little bit of help from Posit Solutions Engineering, the team was able to come up with an MVP and actually deploy it on Connect so people could engage with that content, which I thought was amazing. It really expands what's possible. Is it a little unstable, in the sense that it isn't officially supported? Perhaps, but I wanted to showcase it.

Feedback and benefits

So, some of the feedback. Drawbacks: configuration is a little hard to wrestle with in the beginning, managing content across all of these systems was a little tricky (solvable, but still), and not all content types are supported. So it's not all puppies and rainbows, as I like to say, but we were able to get there, and there are a lot of benefits.

Deployment and server management, because we use infrastructure as code, are fantastic. We've excited our internal team members, because they can very quickly see outputs from what they're building; they get excited, and it elevates the company as a whole. We've improved the way our clients consume information, improved the speed at which we can deliver and make updates, and gotten better business insights: we can see activity, whether people are even looking at the documentation, something you couldn't see if you just emailed it. And it has opened up possibilities, as I mentioned on the last slide.


So a big thank-you to the team at ConcertAI. There are a lot of people, but these are the core people who helped out on the project. Eli from Atorus has been a fantastic resource. And obviously, thanks to Posit for accepting my proposal to speak here. Thank you.