Kelly O'Briant | Build Your Ideal Showcase of Data Products | RStudio Connect 1.9.0

Transcript#

This transcript was generated automatically and may contain errors.

To kick things off here, at the very end of last month, we released RStudio Connect 1.9 with new features for content curation to build your ideal showcase of data products. And so today, Kelly is going to go through a demo of the new functionality. And we'll be here to answer any and all questions that you have.

So again, please feel free to use the chat window to ask questions. Or I'll put the Slido link in there shortly if there's anything you want to ask anonymously as well. And with that, I will turn it over to you, Kelly.

Awesome. Thanks, Rachel. I'm going to share my screen here. All right. Hopefully that's showing up OK for you.

I can see it. It's not in the presentation mode yet, but I can see the slide. I'm going to be hopping through the browser and in and out of my RStudio IDE. So I'm just going to keep it like this. It doesn't bother everybody. No, that's good. I'm going to present like this. It gives a casual feel. So feel free to ask as many questions as you want.

I also want to give a call out to Katie Maciela, who's on the call as well, who's one of our CS reps. She took some slides that I built and built some better slides. And then I took her slides and turned them into these slides. So Katie, thank you for your slides. I will just keep passing them back and forth and improving upon them forever. And it will be great.

So welcome. I'm here today to talk about RStudio Connect and the release we did last month, which was 1.9.0. It introduces some new functionality and a new R package called Connect Widgets. That is the big headline for this release and R package, actually, not anything having to do with Connect code at all. There's a little bit of functionality that went along with that. I'll talk about some quality of life improvements, we'll call them, to go along with this package as part of the 1.9.0 release.

But largely, this is about an external R package we produced to add some functionality that I think was really missing in the product itself. This is the Connect Widgets official HEX logo, but we haven't officially made stickers of it yet. But Edgar Lees put together this HEX logo himself for us. And so I wanted to give him a special call on his call.

But at the end of the day, by using the R package ecosystem, Python libraries, by writing things out in code you're making those patterns reproducible for yourself at least in theory that's more reproducible for yourself at least.

So when we designed a solution to make content discovery a little bit easier to make these app stores or content galleries a reality in Connect, we wanted to make that solution as self-service as possible and rely on existing familiar Connect content types rather than produce something that would magically pull together your content items for you and make the end solution more rigid and less reproducible that we wanted. So we wanted it to be code-based. We wanted it to, again, like I said, to be built upon familiar known Connect content types like R Markdown and Shiny. We wanted the end result to be still polished, have presentable defaults out of the box. So not need or require you to do a lot of heavy-handed CSS or work on the design front to get something that looks really great.

We didn't want you to need to read the entire Connect server API documentation book in order to figure this out. We wanted to do some light hand-holding when it comes to working with our APIs and we wanted it to be as customizable as possible and really easy to put your own branding on. So we produced an R package called Connect Widgets that provides HTML widgets or helper functions to organize and curate your RStudio Connect content within an R Markdown document or Shiny application. That's a lot of words, so I'm gonna demo it just to show you sort of what all of this means.

Connect Widgets demo

Let me flip over here to RStudio. Okay, so the Connect Widgets package is available on CRAN. You can install that package's Connect Widgets. And it's already loaded, yay. So I'm not gonna update it, but it does give me great joy just to type out and install that package's Connect Widgets because it was a long time getting it onto CRAN and by a strange manager, Jill is sitting there smiling about. So we're really happy to have it on CRAN and it's awesome to be able to just type that out.

So once you have the package installed and loaded, five minutes, Connect Widgets, you can come over to open a new R Markdown document. That's the easiest way to get started with Connect Widgets even though you can use these components in R Markdown or Shiny applications because if you're using R Markdown, you can start from a template page. So we have a Connect Widgets template available along with the package template. And it will open up into a template right here that you can just knit right away and get started with.

And so the first thing you do is pull out the title and then this will take you through some intro code chunks to get started with Connect Widgets. And you'll note that it does also add Decliner because one of the things that you can do with Connect Widgets is pull all of the content that you have access to on your existing Connect server down locally and then use basic Decliner that you're used to using to filter that down into the subset of content items that you actually want to showcase on this page.

So the setup is to connect to the Connect server that you'd like access to. And the way to do that that we recommend is setting some environment variables. So behind the scenes, I've already set up a .environ file to set my Connect server environment variable and the Connect API key environment variable. So both of those are already set. They're set to that sales server I was demoing before. So once I connect, it's going to try to pull down all the content items and then create a variable called sample content, which will just slice 50 of those content items out for me.

All right, so the first thing we get is not actually part of the package itself. It's sort of a joke that my friend David put into the template. As you can see here, a random unsplash forest image is calling random forest. So we can yell at him about that joke.

But each of the components that make up Connect widgets are expressed here on this template page that you can pull out and push different content items onto. So we're working with this sample content group and the first component type is called a card. So this is just a random piece of content that we've pulled off of the sales demo server that Nick produced on August 17th. He didn't produce, he didn't add any metadata like a description to this content. He just gave it a basic title and then that was it. It's an API type. We can tell that from the default image. And then if we wanted to visit this piece of content, we could click through the open content button. So that's a card that represents singular content items.

You can put multiple cards on a page. You don't need to use the card content type. You can pull it out and you can use something like a grid. So if you want to represent more content in less space, you could choose a grid instead. So this represents a larger number of content. There's pagination. All of these thumbnail images are the default ones that come with the package. If you haven't set thumbnail images for your content, they will appear as these defaults in gray in each of the grid and card types.

Now, this is a special case. I'm not able to pull down the thumbnail images to my local development environment off of the sales server. Most of our sales server content does have thumbnail images associated with it. So you'll see this difference between development and pushing something up to your actual Connect server. If I were to publish this content item back to our sales server, and I'll have that running in the background as I keep talking about the different component types here, but we'll see what it looks like to differentiate between local development. I don't see any of these thumbnail images and pushing it to production where the thumbnail images come into play.

So that's the grid and card. The last content component is a table. So this is a set table of metadata. You get a name, a type, an owner, and last updated, and you get the mini content thumbnail images alongside. Again, it's paginated. And finally, there are components for searching and filtering. So this shows against basically the same table type. If you were to add searching and filtering to this, so I can search for content that Edgar has produced. And the type, let's see, let's look at all, oh, no, we don't have any of, let's see, all of Edgar's pins. It looks like he's produced a lot of pins. And you can also filter by tags. You can look at all Edgar's access to care pins. Searching, similarly, if I were to look at, let's see, maybe that's his model. I just get this content that is named model. So the search just searches the main field here. You can reset, go back. And the searching and filtering component can be applied to the table or to the grid.

Theming and customization

So that's all there is for the basics of Connect Widget and its components. Definitely want to talk about theming, but let's see if I can get this first iteration of it published Connect first before we make some changes here.

Kelly, I can ask a few questions that came in on the Slido as well while it's publishing. One of the upvoted ones is, do we actually push the output of the R Markdown knitted or the R Markdown doc itself? So for example, do I have to set things up to re-knit regularly to keep the list up to date?

Yep, so if you wanted the sample content to regenerate, or if you had this content coming off of a tag, then you would want to publish with the source code, and then you would want to schedule this to re-update. So I could run it, you know, I could run it every 30 minutes to re-update if I wanted to. I don't recommend running every minute. That seems heavy handed. If you're down to that level, then you'd want to probably put these component types into a Shiny application and make them interactive. And that way, at least everyone coming to a fresh session would see the content of it that they should at the beginning of their session.

Okay, so now you can see what this looks like when I publish it. So this is another interesting piece of, I don't know, a piece of the puzzle here is that some of these content items I've pulled down, I don't actually have access to. And that's because on this server, I am an administrator. So I can use Connect Widgets to pull down actually the full set of content items that exist on the server. If you're a publisher, you only get to pull down the content items that you own and that you're a collaborator on. So that's something to keep in mind as you use Connect Widgets.

As an administrator, I need to be sort of more aware of this and more careful because I can produce a content item like this one where even I don't have access to view the thumbnails for each of these content types. So for this one, this is something that I know I own and it has a nice thumbnail image. I can visit it. For this one, let's see. I'm sure that Nick won't mind, but I wanted to show you, this is a feature that's been added in the last six months to Connect where we've put in a request for access workflow.

So as actually an administrator, I don't have the ability to send access requests. I can just add myself to the content, but that action will be audited. So I can add myself as a viewer or a collaborator to the piece of content. If I were logged in here as a publisher, I would see these same options. I could request access as a collaborator or as a viewer, but it would send Nick an email letting him know that I had requested and asking him to give me those permissions. Similarly, for a viewer role, you only have the ability to request view access since you don't have publisher rights and it doesn't make sense for you to be a collaborator.

But now you can see that we still have a couple content items that have the default image. We have a couple of content items that have been updated to their existing thumbnails. And this is a table that looks the same. So the table and the search and filter table, they look the same between development and their production state on Connect. But you'll see those differences for all of the content types, card and grid, that allow you to associate a thumbnail image on something.

So that is Connect widgets. I did wanna show the theming elements too, since this document that I produced is pretty boring at this point. Let's quickly add a theme here. So I'm warning you, I'm not a great designer, but let's see, we can use the Bootswatch themes, which are really nice. And then often what I'll do in addition to this is add a background and foreground color as well, so that some of the colors can like fill out the background and foreground. I find that just adding the Bootswatch alone doesn't always like satisfy my need for enough green on the screen.

And that's what you'll see I've done for some of the images that I mocked up for the blog post and for the logo that Rachel used to put together. But this will show you just what you can do, how these things translate into different component items when you just add a Bootswatch theme. Beyond that, you can really go to town. If you want, you can add as much CSS as you want to this, but this just shows you what Minty will do. So it changes the font, it changes the color of the buttons, it changes the color of the highlight text a little bit, and the pagination. Again, all the Bootswatch themes will take you in different directions, but translate into component items themselves. So you can turn up the knobs on that, like I said, by adding background and foreground colors to add more different accent colors to your content.

Filtering content and building a curated showcase

So that is the theme. And the other thing that I wanted to show was how to take the sample content and turn it into something that might be actually useful to you. I'm assuming that most folks on the call, if you are at all interested in this feature set, are not going to want to pull down a random sample of content off of your server and then put that onto a page. So if you're interested in interacting with Connect Widgets to pull down content that's meaningful to you, there are two helper functions you're provided that you should probably know about. One is by tag.

So by tag, let's see. I have one picked out for Python. Is it just called Python? Let's look. So this is all the Python content we have. Yeah, so it's just called Python. So let's see what happens when I pull down everything with the Python tag. All right, so now sample content has 47. We've got 47 different pieces of Python content here. And that's a lot, so I'm gonna filter that down even further by, what's my name on the server? I think it's just Kelly. Kelly. All right, so now I'm down to nine. I've only contributed nine pieces of Python content to the sales. That's okay. We'll use that.

So now this will take the same sample content variable, but now it contains Python content that I've produced, and it'll put that into the card grading table. So let's make that, let's see, what was it? Okay, so here's all my Python content. That's great. I don't need the table twice. I probably don't need the table at all. So I'll just keep the grid and the card, and then change this to Kelly, Python.

And in the meantime, I'll talk through the rest of my slides. So it gives you a bit of a sense for what Connect Widgets is capable of, I hope. It gives you a sense for what you can build with Connect Widgets. In the backend, remember that you need to provide an API key and server adjusts environment variables because it is using the RStudio Connect server API to do those calls out to your server to pull the content items down and give you that information about the metadata associated with each of them. And then you can apply different groups of content to different HTML components and place them on a page or an application.

And I'm hoping that this is useful to folks, not only for the case that I described at the top of the deck that talks about like a project that just blows up in complexity, but also as potentially a content hub or knowledge repository for a whole group of sort of related content items that belong to a single group or an objective or an OKR or a team as a single entry point for stakeholders. So if you wanted to create a really polished showcase for a stakeholder group that just shows them the content items that you want them to see and doesn't point them to the overwhelming leadership information that is the Connect dashboard, you could use that for this, or just as a general purpose presentation layer for any curated list of content items.

So if you wanted to create a really polished showcase for a stakeholder group that just shows them the content items that you want them to see and doesn't point them to the overwhelming leadership information that is the Connect dashboard, you could use that for this, or just as a general purpose presentation layer for any curated list of content items.

This is another interpretation of something you can do with Connect, which is for a more complex project. I'm sure some of you have seen this demo to you by our solutions engineers or sales team, but this is an end-to-end data science workflow for visualizing and predicting pipe share data.

Oh, oh, the thing that I published is now ready. Here's my Python demo content. Oh, it looks great, very cool, very cute. Okay, sorry about that brief interruption.

So the actual demo assets that build up this set of analyses for the pipe share data is built upon a number of our Markdown documents that do data import, cleaning, and model training, and produce output in the form of several PIMS data sets that get updated on a regular basis. Those then get pulled into API prediction services built on Parmer and an application for doing an assessment of model quality over time. And then the final output is two different prod and development client apps that can be rotated between each other.

So all of these components make sort of this ecosystem that you can move on to this complex data science project. And you can use Connect Widgets like we've done here to sort out all of these different content items and make sure that everybody who's collaborating on them knows where to find them, knows what all the pieces are, knows that when something breaks, you can refer to this diagram and see what else downstream might be affected. And use this as not only a landing spot for everything that's been deployed to Connect for this particular project, but also link back to the GitHub repository itself where we manage all the code that goes into producing all of this content. So everything is linked together. Everything can be shared as a singular showcase. And we're hopefully reducing the amount of complexity that there is to track over time.

Supporting features: request for access and Git-backed publishing

As I mentioned, there are a couple of supporting features that I demoed that went into some early releases of Connect that have built into this 1.9.0 release. One was the request for access permissions that I showed. Another is more streamlined publishing. So even though you still have to set the Connect server and Connect API key on the local side, local development, these will be, by default at least, automatically enabled for content that you publish to Connect. That's why you'd see me be able to publish directly these things that I've been building locally and they are just working on Connect. I don't have to go in and set my API key or server address again, once that content item is published. So it doesn't publish in a broken state anymore. That was very nice.

If you are a particularly security-conscious customer, your server admin does have the ability to turn this particular feature off, as well as the request for access permissions. So those are always options for you. If these are offensive to you in some way, the product will continue to work as it did before.

I see there was a question where someone asked, is there a way to pick up changes from Git when you make new commits to some of the included content or how does updating one piece of content work this way?

Yeah, absolutely. So if I go into Connect, I could, let's see. Well, yeah, let's change this. I think we have time. Let's do this quickly. So I'm going to create a new GitHub repository in my own personal GitHub for this content. Hopefully you have figured that out internally.

So in this step, I won't show where I'm creating a repository in my GitHub. You'll do that for your own organizational repository. Okay, so now I have a new repository. The last piece that I need to do is to generate a manifest for this content item. So I will do, let's see, rsConnect .

Right, manifest, and now I'm not in the right content. Let's see, set as working directory. Now I'm at the manifest, so that it goes into the template example directory. And then once that's written, it takes a couple of seconds, but it's trying to capture all the information about the packages that I used and the version of R that I'm running and the content itself, including this banner image. And it's writing up a manifest of all of that information for me so that I can commit it to a GitHub repository and then I can link that Git repository to a content item on Connect. I will have to produce a new content item. I can't link the existing one that I already push-button published to Git. I have to start over from a fresh content item. But since I've only published once so far, it shouldn't be an issue for me.

Is there an upcoming feature so that publishers can create tags? That's a great question. I'm so glad you asked. We don't have that slated for an upcoming release, but it is something that we have heard a lot. So I am quite aware that it's a hole that's necessary to complete this publisher enable

Kelly O'Briant | Build Your Ideal Showcase of Data Products | RStudio Connect 1.9.0

Transcript#

Publisher actions and content management on Connect

Demo: managing a single content item

The content discovery problem

Connect Widgets demo

Theming and customization

Filtering content and building a curated showcase

Supporting features: request for access and Git-backed publishing

Featured software#

rstudio