Workflow Demo Live Q&A - April 24th

Transcript#

This transcript was generated automatically and may contain errors.

Everybody, thanks so much for joining us over here. We'll get started in just a second here. All right. Hey, Julia. Thank you so much. Can you hear me now? I can. Yes. Great.

Okay, perfect. Well, we are just waiting for a few people to come over to the Q&A room. So let's give people two minutes or so here.

Just wanted to remind everybody, if you want to ask questions, you can type them into the YouTube chat here. But you can also use the Slido link, which is shown on the screen. There's a little poll in there that we would love to have you answer just to learn a little bit more about how you're deploying models today. Okay, we're up to 70 people in the Q&A room. So I think we can get started and jump in. Thank you so much, Julia, for the awesome demo and for joining us today.

And I did just want to remind people again, I know I said this in the beginning, but registration is open for PositConf 2024. And so I just want to remind everybody about that. If you enjoyed today's demo, Julia is actually leading a workshop at PositConf as well. And I was thinking, Julia, before we jump in, maybe you could tell us a little bit about the workshop at Conf and what that will cover.

Vetiver uses what we call an input data prototype. Basically, let me talk about the shape and the characteristics of the input data for the model. Let me record that as metadata, as part of the model, and then we will check it whenever we make a prediction and check that was coming in, like, complies with the input data prototype.

I know some people have to drop at the top of the hour. So I did just want to say here, thank you so much for joining us today. And thank you to anybody watching the recording later as well. As a reminder, we have these workflow demos the last Wednesday of every month. And so if you do want to join us live for the next one, you can add these to your calendar with the link I shared here in the chat. My colleague Isabella is going to lead next month's demo on Quarto dashboards as well. So I just want to give everybody a preview of what's to come next month.

Q&A: PyTorch models with Vetiver

One other question I see that came in in YouTube from Hamed is how easily can we deploy PyTorch models with Vetiver? My own experience in Python is that simpler models are pretty seamless, but I started hitting problems with PyTorch.

So PyTorch is one of the models that the types of models that the Python Vetiver package supports. And so we expect this to work quite well. So far, people that we know about using it have had good experiences with it. We certainly are interested in hearing on specific kinds of problems as you run into them. But I would say we expect it to work. Give it a go. And we would love to hear about any problems you run into.

Q&A: Free vs paid Posit products

Somebody had asked a question a bit earlier about the difference between the paid and unpaid products. So I bet a lot of you are users of the RStudio IDE. And that is a that is not only free, but also open source products. Like you can actually see the code that goes into making RStudio the IDE. There's a desktop app that you like download and use that is, like, free for all use cases, including commercial and all that kind of thing.

Now, there are paid professional products that we that we make that are a little more designed for the enterprise use case. So if we want to specifically talk about RStudio, like we have a product that's called Posit Workbench. And what Posit Workbench is is a product that provides you with multiple different choices of IDEs for your data science work. So you can choose between, like, JupyterLab and RStudio and different IDEs that people like to use for data science work. And so the reason why companies pay for RStudio Workbench is for the, like, enterprise type features that people need. Things like can we use this with single sign on and authentication? Can we use this with our certain security needs that we have around where people do their model development? So we see people choosing to for the pro products for a variety of reasons.

No, I think you've covered that really well. I would add that Posit Connect that you saw today is also one of the paid products. And so Julia showed today deploying a model to Posit Connect, but a lot of our other customers use Connect to deploy dash apps or Shiny applications, Flask APIs, a variety of different data products built in R and Python.

And I guess just to wrap it up, I guess, so the package manager has a there's a there's Posit public package manager, which provides these really convenient binaries that you can use, like when you make Docker containers. But Posit, but package manager also is a paid product that, for example, an enterprise would use internally. And so the use cases for that would be we have internal packages, we need to make really easy for everyone inside of our organization to use. Or we have compliance needs about people using specific versions of packages that have been vetted and put on a, you know, like on a on a allow list, like these are the packages people are allowed to use. And so if you hook up your, you know, your ID to the package manager, you can get those kind of guarantees that you need.

Q&A: Deploying to Posit Connect via Docker

One of the questions we missed from earlier was about Posit Connect specifically, and it was, is there a way to deploy to Posit Connect via Dockerization or containerization?

So it kind of works that way. Now. So when you when I showed you in the demo, that ever deploy RS Connect, and it went and it made a little bundle and put it like the that is a standalone environment that is separated from everything else. So a lot of the reasons you might choose containerization already work with Connect. Now, if you're saying I literally have a Docker container, and I want it to go to Connect, that is a feature request that I believe, you know, Connect is like the Connect team is talking about. So if you're asking, is there a way to do something like it where it you get the benefits of it, that's that is how Connect works already. If the question is, I literally want to take my Docker container, then not as of today, but it's something that I know the Connect team talks about.

Q&A: Logging model inputs and outputs

Sergio asked, is it possible to create a log file of a deployed model where we can find the input data, the output of the recipe and the prediction? This may be useful for the monitoring.

So the answer is, yes, it is possible. We don't do this by default right now. So you don't default get the logging. But since we are talking about FastAPI and Plumber, there are already tools out there that allow you to kind of like add a line of code, and then start producing those kinds of logs. And then say you deployed a model to Connect. We'll say an R model to Connect. It's Vetiver. It's Plumber. You add one of these packages that has like default plus extensible logging for Plumber APIs. And then it starts going into the logs of the Plumber API on Connect. And you see that there. So we didn't want to build these things from scratch because it's a little bit more about general software engineering practices. But we wanted, like with everything, with how we think about Vetiver, we want to make it very straightforward to integrate into these other kinds of tools, depending on what your needs are.

Q&A: Vetiver development lifecycle

Jeff asked, where is Vetiver in its development life cycle today?

Yeah, this is a great question. I think we may still have, we still are calling it experimental, which I think is a little bit maybe not accurate. Maybe we should move that the badge says experimental on it. And I think we should probably bump that up to something like stable. Because the core functionality is quite stable now.

When we have been looking at new features for Vetiver lately, we have been talking about Vetiver on platform. So like Vetiver on SageMaker. We've talked about, so that one exists today. You can use Vetiver on SageMaker. And then when we talk about like what else would we do for further development, the things that we talked about that don't exist as of today would be like Vetiver for MLflow. Like we mentioned that earlier. That's especially relevant to people who use Databricks. We've talked about Vetiver on the Google Cloud, the Vertex AI is what it's called. Like Google Cloud's kind of MLE system thing. Like how can we go there? So those are the things we talk about now in terms of development lifecycle. Where do people want to take their models?

So right now we have options like Connect, of course, very straightforward. Docker, which you can take anywhere. And then how can we make the process easier to take to different kinds of platforms? So today SageMaker works. Where else would we go with it? So I think I would say core functionality, core decisions that we made about what is Vetiver, quite stable. And our forward facing work is like where might we want to take this?

So I think I would say core functionality, core decisions that we made about what is Vetiver, quite stable. And our forward facing work is like where might we want to take this?

Thank you so much. Thank you all so much for joining us too. And I just wanted to add here, your feedback on these workflow demos has been so helpful. And so I would love to learn what other things you would like to see. What could we be doing to make these more helpful for you as well? So if you have maybe 20 seconds to do this for me after today's session, I would greatly appreciate your feedback. I just put a Google form into the chat here. This is really helpful for me as I figure out what do we prioritize here in terms of community events as well.

But thank you so much, Julia, for the awesome demo. I thought this was amazing. And I love that you had a blog post like ready to go right when it launched. Thank you again. Yeah, thank you for having me so much. I saw somebody's comment in the Slido. So I wanted to echo this as well. Julia, thank you for all that you do to educate us all in the content that you put out on your own YouTube too. In the workflow demo, I linked to Julia's own personal YouTube there. And I highly recommend that you go check it out. And again, if you want to join us for another one of these, we have them the last Wednesday of every month. And so Isabella is going to be leading one on Quarto dashboards next month. But have a great rest of the day, everybody. Thank you so much.

Workflow Demo Live Q&A - April 24th

Transcript#

PositConf workshop preview

Q&A: MLflow vs Vetiver

Q&A: Posit Connect and model metrics

Q&A: Vetiver API internals

Q&A: Storing predictions and best practices

Q&A: Model monitoring and data checkpoints

Q&A: PyTorch models with Vetiver

Q&A: Free vs paid Posit products

Q&A: Deploying to Posit Connect via Docker

Q&A: Logging model inputs and outputs

Q&A: Vetiver development lifecycle