Building the Future of Data Apps: LLMs Meet Shiny

Transcript

This transcript was generated automatically and may contain errors.

Right, welcome. Thank you so much. We're just going to give everyone a minute here to get on the bridge and we're going to kick things off with an awesome session today. I've got a few words that I'll mention and then we'll pass it over to Garrick for the session. Thank you so much for coming to the Gen AI Day by R/Pharma. My name is Phil. I help to run the non-profit with many people here that are behind the scenes helping to make today possible.

Open Source in Pharma is the group that helps make this event and other events a part of the R/Pharma open source community. Today, we've got a really exciting program and some very technical sessions for everybody to learn about some of the new tools. It's amazing that this event started out last year with just a couple of people: Anna, Victoria, Harvey, Jared, Eric, and myself. We were just chatting during one of the hangouts and we said, why don't we get a few people together for a Gen AI Day, just to learn about how pharmas are actually using these tools?

We didn't really want to learn too much about general applications in the industry, but rather the actual technical details of how people are implementing Gen AI. And so last year, we hosted this single-day event. We were expecting probably around 50 to 100 people. We almost had a thousand people register. Again this year, same boat. It's just amazing to see the community come out to learn about the intersection of pharma, open source, and Gen AI, and we're really excited to mix real applications by pharmaceutical companies with some technical sessions.

So I'll now introduce Garrick Aden-Buie, who's here today. He's an engineer at Posit, works on the Shiny team, and is doing some really amazing work with Shiny, LLMs, and Gen AI. He's going to take us through a technical deep dive today and give everybody a behind-the-scenes, under-the-hood look into the world of open source R and LLMs. So with that, I will pass it over to you, Garrick, and we'll kick off the session for today.

Awesome. Well, thanks, Phil. It's a pleasure to be here, and I'm really excited to be talking about LLMs, Generative AI, and Shiny, and all of that together, because those are basically all of my favorite things these days.

So you have another tool as a programmer, which is that you can set a system prompt. It's essentially like a message from the user, except it's from the developer, the person who is doing the programming with the LLM. It's shown to the model in the same way that a user message is, but it is not usually shown to an end user.

The other nice part about using ellmer is that you have access to different parameters that are available through the API. This one is an API parameter that many models support. It's called temperature, and you can kind of think of it like creativity: a number closer to one means the model will pick more random words, and you end up getting an answer that feels more creative. A value close to zero means the model will pick mostly the most likely words, and you tend to get something that's a little bit more dry. So given that this is a sort of creative task of turning things into riddles, I'm going to give it a temperature of one to dial up the creativity. And we'll see what happens.
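As a rough sketch of what that setup might look like in code: the prompt wording below is hypothetical (the talk's exact prompt is not in the transcript), and the sketch uses gpt-4.1-nano, which comes up later in the talk, rather than the GPT-5 nano model from the demo. It assumes a recent ellmer release where params() is available and an OPENAI_API_KEY environment variable is set.

```r
# Hypothetical system prompt; the exact wording from the talk is not shown.
system_prompt <- paste(
  "You are a trickster who only speaks in riddles.",
  "When the user gives you a single word, reply with a short riddle",
  "whose answer is that word."
)

# Only call the API when ellmer is installed and a key is configured.
can_chat <- requireNamespace("ellmer", quietly = TRUE) &&
  nzchar(Sys.getenv("OPENAI_API_KEY"))

if (can_chat) {
  chat <- ellmer::chat_openai(
    system_prompt = system_prompt,
    model = "gpt-4.1-nano",
    # Temperature near 1 samples more freely; near 0 favors likely words.
    params = ellmer::params(temperature = 1)
  )
  chat$chat("elephant")
}
```

For a task like writing code, the same call with a lower temperature would be the natural variation.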

So I want to clear my console really quickly. And you can see I'm now talking to GPT-5 nano. We haven't had any turns, but there is a system prompt that is telling the model that it's a trickster, right? So I'm going to give it just one word and it will make me a riddle, I hope. One thing that I've noticed with GPT-5, the newer model, is that it takes a little while to actually start streaming tokens back to you. So usually there's a little bit of a delay with GPT-5 that I don't typically see in GPT-4.1. But here you go: "Here's a silly riddle spun from your prompt. I'm gray and grand, the savanna's own parade." And the answer, obviously, is an elephant.

Okay, this is way more useful than just making riddles. This is very useful when you are trying to guide the conversation. As a programmer, one of the best ways that you can use a system prompt is to tell the model how you want it to behave. For example, I can say it's an expert programmer and statistician, and I want it to explain things clearly and concisely. Most importantly, though, I want it to use the latest features of R, including the pipe operator. I definitely prefer the tidyverse style and packages. And please write nicely formatted code, use a functional style, and prefer things like vectorization. So basically I can say, hey, look, this is how I want you to write code for me if I ask you to write code. And I can pick the model that I want. And I can dial down the creativity this time, because this is not as much of a creative task.

Building a Shiny chat app

Okay, so let's see how we can use ellmer in a Shiny app and create a whole interface and experience around an LLM. So I have the Shiny extension installed. You can see if I go to my extensions, I have installed Shiny. And what's cool about that is it gives me a little snippet where I can just type shinyapp and hit tab and I get the snippet. And then I'm going to change from page_fluid to page_fillable. I like that better here. And cool, now I have the skeleton of an app.

I'm going to use the shinychat package. So I'm going to add in shinychat here. You can grab shinychat from our GitHub repo. It's at posit-dev/shinychat/pkg-r, and you can use pak to install that. And then we have a nice module for creating a chat interface. Basically you call shinychat's chat_mod_ui() and you give it an ID, and we put that in our UI. And then in our server, we're going to call chat_mod_server(). Let me load shinychat so I get good completions here.

And chat_mod_server() is going to take an ID, and then we need a client object, so the connection to a model through ellmer. So I'm going to come back up to my imports, and I'm going to add library(ellmer). And I'm going to create a client. Here I'll do chat_openai(). And let's stick with 4.1 nano because it streams a little faster. And yeah, so everything is connected now. I have this ellmer chat client, and I have my UI and my server. And if I go up here to the run button, I can run my Shiny app. And we will see. Ideally. Yeah, here we go. We've got a little Shiny app. And I'll ask for an elephant fact.

Nice. Okay, so what is this? Let's see, there's one important thing about how I have set this up. You'll notice that I am creating the client to the LLM provider here, inside of the server function. And that means that every new Shiny session, every new user that comes to look at this app, gets their own fresh chat session. We saw before that ellmer's chat objects are stateful, so they record the history of the conversation that you've had with them before. So if I accidentally create the client outside of the server function, I'm basically using one client for everybody. And that means that everybody's conversations are going to get interleaved, and that would not be great. You probably don't want to do that. So make sure you create your client inside of your server function. And then I'm passing it to chat_mod_server().
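Put together, the app described above might look roughly like the following sketch. It assumes a shinychat version that exports chat_mod_ui() and chat_mod_server(), an OPENAI_API_KEY environment variable, and the model name "gpt-4.1-nano"; the module ID "chat" is arbitrary.

```r
# Install once, e.g.: pak::pak("posit-dev/shinychat/pkg-r")
library(shiny)
library(bslib)
library(ellmer)
library(shinychat)

ui <- page_fillable(
  chat_mod_ui("chat")
)

server <- function(input, output, session) {
  # Created inside the server function, so every session gets its own
  # client and therefore its own conversation history.
  client <- chat_openai(model = "gpt-4.1-nano")
  chat_mod_server("chat", client)
}

app <- shinyApp(ui, server)

# Launch only in an interactive session; sourcing the file just builds `app`.
if (interactive()) {
  runApp(app)
}
```

Moving the chat_openai() call above the server function would share a single stateful client, and its history, across all users.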

LLM limitations and tool calling

Neat. Okay, so it'd be really fun to do more things with LLMs. But first, I want to take a small detour to talk about some of the things that LLMs aren't great at. So I'm going to create another chat instance with GPT-4.1 nano. Let's make sure. Okay, I'm talking to 4.1 through OpenAI. And I'm going to ask it something like, hey, what has Posit been up to lately? And the model says something interesting. It says, "As of October 2023, Posit, which was formerly known as RStudio, has continued..." This is pretty bland. We've definitely been up to something more specific than just generally developing the tidyverse and related packages.

So I'll ask the model to read this URL and figure out what we're doing. And right away, the model says, hey, I can't do that for you. I'm unable to access external websites directly, including that blog link that you gave me. In fact, if I even ask it, hey, what is today's date? Okay, apparently, today is April 27, 2024. So first of all, it's not 2024. And second of all, it's kind of disappointing that this model can't go read the website. The reason that this happens is that training is a very expensive operation. Models are trained on lots and lots of data, and then training only happens once every once in a while, basically prior to every new model release. So by itself, an LLM, a large language model, basically just writes text. That's what it can do. It can write text, but it can't go out in the world and do things for you.

Right? It also isn't really a database, either. So I'm going to make a little chat app, and I'm going to ask the model to figure out all the zip codes in the US that start with 18, that start with one eight, and have a population of more than 20,000. And what's interesting is that the model does give me an answer. But the knowledge that the model stores is not like a database. A large language model is not going to go pull from a knowledge store and pick out facts. It writes words that seem to be plausible and realistic. Right? So this is generally not going to be a good way of figuring out which zip codes start with one eight and have a population of more than 20,000. But what I can do is ask the model to write some R code, using the tidyverse and tidycensus, to find these zip codes. And so now we've shifted the problem into something that models are actually really good at. So I've just asked the model to write some R code for me.
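The code the model wrote is not captured in the transcript, so the following is only a sketch of what such a script might look like. It assumes ZIP Code Tabulation Areas (ZCTAs) as a stand-in for zip codes, ACS variable B01003_001 for total population, the year 2022, and a CENSUS_API_KEY environment variable. The helper filter_zctas() and the toy data are hypothetical.

```r
library(dplyr)

# Keep areas whose code starts with `prefix` and whose population estimate
# exceeds `min_pop`. Expects the GEOID and estimate columns that
# tidycensus::get_acs() returns.
filter_zctas <- function(zctas, prefix = "18", min_pop = 20000) {
  zctas |>
    filter(startsWith(GEOID, prefix), estimate > min_pop) |>
    arrange(desc(estimate))
}

# Toy data with made-up population numbers, just to exercise the filter.
toy <- data.frame(
  GEOID = c("18015", "18017", "19104", "18940", "18000"),
  estimate = c(33000, 38000, 50000, 29000, 5000)
)
filter_zctas(toy)

# The real query needs tidycensus and a Census API key.
if (requireNamespace("tidycensus", quietly = TRUE) &&
    nzchar(Sys.getenv("CENSUS_API_KEY"))) {
  zctas <- tidycensus::get_acs(
    geography = "zcta",
    variables = "B01003_001", # total population (assumed variable choice)
    year = 2022
  )
  filter_zctas(zctas)
}
```

Unlike the model's direct answer, every row this returns comes from the Census data rather than from plausible-sounding text.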

And even GPT-4.1 nano, a relatively small model, is going to be able to write some code that is going to get me pretty close to what I want. I think we're getting close to the end. Okay, so now it's going to explain all that to me. I'm going to try to just come up and grab this code here. And I'm going to create a new R file and paste in that code. And I'm going to take a look. I'm going to get rid of all this stuff, because I have a Census API key ready. What else am I going to do? I'm going to start a new R console. This is one of the really cool things about Positron: I can have multiple R consoles at the same time. So I have this one that's running my Shiny app, and now I'm going to go into a fresh R console and try some stuff. And we're going to YOLO this. We're just going to run some code. And there's no tidyverse. That is disappointing, but I can fix it.

Okay. So, alright, let's go back to OpenAI. So, tools. Sometimes they're called functions, or tool calling, or function calling. But the goal of a tool is to bring real-time or up-to-date information to an LLM. We've seen that they don't have access to real-time information or knowledge of what is going on in the world, so tools let you bring that information to a model. What else is really cool is that you can write tools that let the model go out and interact with the world, too.
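As a small illustration of the idea, here is a sketch of a tool that addresses the wrong-date problem from earlier. The function get_current_date() is a hypothetical example, not something shown in the talk. The registration step assumes ellmer 0.3.0 or later, where tool() takes description, arguments, and name; earlier versions used a different signature.

```r
# A plain R function that the model can ask us to run on its behalf.
get_current_date <- function() {
  format(Sys.Date(), "%Y-%m-%d")
}

# Only call the API when ellmer is installed and a key is configured.
can_chat <- requireNamespace("ellmer", quietly = TRUE) &&
  nzchar(Sys.getenv("OPENAI_API_KEY"))

if (can_chat) {
  chat <- ellmer::chat_openai(model = "gpt-4.1-nano")

  # Describe the function so the model knows when and how to request it.
  chat$register_tool(
    ellmer::tool(
      get_current_date,
      name = "get_current_date",
      description = "Returns today's date in YYYY-MM-DD format."
    )
  )

  chat$chat("What is today's date?")
}
```

The model never runs the function itself. It asks for the tool by name, the function runs in the R session, and the result is sent back as part of the conversation.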

To understand how this works, I'm going to take one little step back. And we're going to imagine, you know, imagine you're talking to ChatGPT. So you send a message. You type in a message at ChatGPT. It looks a little bit like a text message. That message is sent to a server. The server then goes and talks to the LLM. The LLM gives an answer, writes more words. That response comes back through the server and makes its way all the way back to you where ChatGPT shows it to you in the UI. Cool. Except we're not writing ChatGPT.

We just saw that we can write R code and have shinychat be the thing that is replacing ChatGPT. So we're kind of here, as the developer. We're writing some R code. We just saw how you can write a little bit of R code to connect ellmer to a chat provider and create a whole chat interface with it. And that's something that you can deploy and give to your users, right?

So we're going to focus on this spot here, on the person writing the R code. If you are a person who is writing R code, then you're writing R code on a server where you can also run R code. And if you can run R code, you can do things like write code that pulls in data. Or you could talk to a weather API and get up-to-date information from it. Or you can write R code that sends emails. And you can do all of this in the exact same place where you are currently connecting to an LLM. So you're in charge, essentially, of writing these tools and then connecting them to the LLM. But the tools run in the same place that your R code runs, not inside the model.

Okay. To walk through this, we're going to try this question. We're going to see how tools work by asking an LLM: is today a good day to go to the pool? So let's see, I'm going to hop back. I need to run this app really quickly. You're not seeing it, but it is starting up in the background. And as soon as it does, I will be able to do this.

And here we go. I'm going to make it really big. Okay. So I have a little Shiny app. And I have told the model that I can look up the weather for it, and if it wants to know the weather, it should write this special message: get weather, with the zip code. Right? Okay. So we're going to see what happens. Is today a good day to go to the pool? And I live in Woodstock, Georgia. And it writes back, I'm going to need the weather, please. It says get weather for this zip code, which happens to be the correct zip code for Woodstock, Georgia. So I can hop over to the National Weather Service and look for the forecast today, and I can just copy this text here, go back to the chat, and give the model the answer. And it will say back, okay, looks like there's a 40% chance of showers after two. So maybe, yeah, maybe go earlier in the day if you can.
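The manual round trip above is essentially what a tool-calling loop automates. The base R sketch below shows the dispatch step, with several assumptions: the special message is taken to have the form get_weather(<zip>), which is a guess at the format used in the demo; the zip code 30188 is only illustrative; and lookup_forecast() is a hypothetical stand-in for a real weather lookup.

```r
# Pull a five-digit zip code out of a reply such as "get_weather(30188)".
# Returns NA when the reply contains no tool request.
parse_weather_request <- function(reply) {
  m <- regmatches(reply, regexec("get_weather\\(([0-9]{5})\\)", reply))[[1]]
  if (length(m) == 2) m[2] else NA_character_
}

# Hypothetical stand-in for a real forecast lookup (made-up forecast text).
lookup_forecast <- function(zip) {
  paste0("Forecast for ", zip, ": 40% chance of showers after 2pm.")
}

# Decide what to do with a model reply: either show it to the user, or run
# the requested tool and produce a result to send back to the model.
handle_reply <- function(reply) {
  zip <- parse_weather_request(reply)
  if (is.na(zip)) {
    return(list(type = "answer", text = reply))
  }
  list(type = "tool_result", text = lookup_forecast(zip))
}

handle_reply("I'm going to need the weather, please. get_weather(30188)")
handle_reply("Maybe go earlier in the day if you can.")
```

In the demo a person plays the part of handle_reply() by copying the forecast into the chat; with real tool calling, the chat client does this step automatically.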