
Lift Off! Building REST APIs that Fly (Joe Kirincic, RESTORE-Skills) | posit::conf(2025)
Speaker(s): Joe Kirincic

Abstract: Picture the scene: you've successfully deployed your ML model as a plumber API into production. Your company loves it! One team uses the API's predictions as an input to their own ML model. Another team displays the predictions in an internal Shiny app. But once adoption reaches a certain point, your API's performance starts to degrade. What can you do to help your service maintain high performance in the face of high demand? In this talk, we'll show some strategies for taking your API performance to the next level. Using two R packages, {yyjsonr} and {mirai}, we can augment our API with faster JSON processing and better responsiveness through asynchronous computing, allowing our services to do great things at scale at no additional cost.
Transcript
This transcript was generated automatically and may contain errors.
One caveat up front: this talk will be focused on R, but know that the strategies we're going to be talking about are abstract and could apply to any programming language, Python included. So what I'd like for you all to do is picture the scene. You've just launched your first machine learning model into production as a plumber API, and your team loves it. Word gets out over a few weeks, and suddenly other teams want to start consuming your API as well. Maybe they use it as part of a feature in their Shiny application. Maybe they use its predictions as a feature in another downstream predictive model. The point is that your new service, this wonderful service called Appy, has now created value across various segments of your organization. You've done the equivalent of a data science grand slam, and all is good.
That is, until it isn't. You show up to work one day, and the Slack messages start pouring in. It's like, hey man, I was trying to prototype a new feature in Shiny, and I'm not getting any responses back from your API. What's with that? Or another team may message you and say, hey, we're getting cascading failures in our ETL pipeline because we can't get your predictions for our machine learning model or something like that. So we're starting to struggle at this point. And you and your manager, you go through the logs and you see that your app is still up, it's still running. The issue is that it's struggling to handle all of this new traffic that it's receiving.
So this brings us to an important question, which is: how can we make our plumber APIs more performant? Some of you may be in this talk having never made a REST API before, not even sure what REST is. And that's awesome. We're happy to have you here. A quick recap of what REST is: it's just a URL, but instead of returning a web page, it returns some data that looks like this, which is called JSON. And I want you to just take my word for it that JSON powers a lot of the modern web today.
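The slide itself isn't reproduced in the transcript, but a JSON response from a REST API looks something like this (the field names and values here are made up for illustration, not from the talk):

```json
{
  "model": "appy",
  "prediction": 0.87,
  "inputs": { "feature_1": 1.5, "feature_2": "blue" }
}
```

Just key/value pairs and nested structures as plain text, which is what makes it easy to send between services.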
Two strategies for API performance
API performance is a huge topic that spans all sorts of ideas, and we won't be able to cover all of them in today's talk. So instead, we're going to focus on two strategies: minimizing serialization costs and maximizing responsiveness with async programming. We're covering these in particular because they will improve your REST API performance, they require relatively minimal code changes, and there are great R packages for implementing them.
Minimizing serialization costs
So when I say that, what does that mean? The idea here is that serialization cost is the time it takes to take an R object in memory and turn it into some format that you can send over the wire, like JSON. You can optimize the business logic of your API endpoints, but at the end of the day, there's going to be this serialization cost that you pay to turn the R object that will be the API response into JSON.
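As a concrete illustration (my own example, not the speaker's), here's that round trip in miniature, using jsonlite since that's plumber's default:

```r
# An R object in memory...
obj <- list(user = "ada", scores = c(1, 2, 3))

# ...serialized into a JSON string that can be sent over the wire.
# auto_unbox = TRUE turns length-one vectors into scalars.
json <- jsonlite::toJSON(obj, auto_unbox = TRUE)
as.character(json)
#> [1] "{\"user\":\"ada\",\"scores\":[1,2,3]}"
```

Every response your API returns pays this conversion cost, which is why the choice of serializer matters at scale.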
So there's an opportunity here to start to zero in on how your objects are being serialized and see if there's room to improve performance. To think about why this matters, I want you all to consider a scenario. I'm throwing a party, and I'm going to invite 100 people. 100, 200, it doesn't matter. And I have two options: I can send the invites as handwritten postcards, or I can send them through e-mail. To send all of these invites by postcard, I have to go get the postcards, sit there and handwrite 100 of them, put them in the mailbox, and then wait for them to reach their intended recipients. Versus with e-mail, I can write my message essentially once, send it to everybody on a distribution list, and, because of the Internet, delivery is essentially instantaneous.
We want our serialization cost to be closer to e-mail than to postcards. Because if we do that, we're going to be able to return our responses much faster. If we can return results faster, we're going to be able to handle more requests, things of that sort.
So that sounds like a decent deal, a good idea. How can we do that? I have here a simple example of a plumber API that uses what's called a serializer function to swap out the defaults and get a performance boost. The first step is to find a package that serializes JSON faster. Out of the box, plumber uses jsonlite, which is a robust, battle-tested package. It's great. But in this example, I'm using {yyjsonr}, which uses a very fast C library under the hood to read and write JSON in record time.
Once you've found your package of choice, you're going to write a little serializer function, and it's going to be used as a special function that you give to plumber so it knows how to turn your R objects into whatever format you're serializing to.
With that serializer function in hand, it's very simple from there. You use the @serializer tag to say that you want your new serializer on an endpoint, and the endpoint gets an immediate performance boost. There's no need to change the underlying business logic of your API endpoint; you get the lift essentially for free.
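A minimal sketch of what those two steps might look like (this is my reconstruction, not the speaker's exact slide code; the serializer name and endpoint are assumptions):

```r
library(plumber)
library(yyjsonr)

# Step 1: wrap {yyjsonr}'s fast JSON writer in a plumber serializer.
# serializer_content_type() handles setting the Content-Type header;
# we only supply the function that turns an R object into JSON text.
serializer_yyjsonr <- function() {
  plumber::serializer_content_type("application/json", function(val) {
    yyjsonr::write_json_str(val)
  })
}

# Step 2: register it under a name plumber's @serializer tag can use.
plumber::register_serializer("yyjsonr", serializer_yyjsonr)

#* A toy endpoint: same business logic as before, faster serialization.
#* @serializer yyjsonr
#* @get /hello
function() {
  list(message = "hello", n = 1:3)
}
```

The only change at the endpoint level is the `@serializer yyjsonr` annotation; the function body is untouched.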
Now, in this tiny app, you're thinking this isn't a huge deal. But imagine an API with 100 endpoints: this starts to become great from a maintenance standpoint as well, because I can just change the serializer on all 100 endpoints and get an improvement across every one of them. So we go back to our little web service, Appy, and we swap out his serializer, and he gets a nice speed boost. He gets a sick set of roller skates. Whereas he was trudging along before, now he's gliding along a little quicker.
Maximizing responsiveness with async programming
But we're not done yet. We can improve Appy even more by using the second strategy, which is maximizing responsiveness with async programming. Now, to understand this strategy, we need to take a quick sidebar to understand how R and plumber work out of the box. When you have our API here on the left and incoming requests on the right, how does plumber process them? plumber is an R package, and R is single-threaded, meaning it can process one instruction at a time. So when those three API requests come in, your plumber API is going to munch through each of them sequentially.
Now, for a lot of REST services, this synchronous execution model is perfectly fine, especially if your API has low to moderate traffic and the endpoints are relatively snappy. There are plenty of synchronous web services out there today.
But when your traffic gets higher, and your endpoints vary in execution time, this model can start to get hairy pretty quickly. To understand why, imagine that of these three requests, one is slower than two and three. Say request one takes ten seconds, and two and three take two seconds apiece. The first request takes ten seconds, but now request two doesn't just take two seconds; it takes twelve, because it has to wait for request one to complete. And what's worse, request three waits a grand total of fourteen seconds, because it has to wait for the other two requests to finish. You can imagine that beyond three requests, if you're getting hit with a thousand, you're going to start to dog-pile your R process, and it's not going to be a good time.
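That queueing arithmetic can be sketched in a couple of lines of base R (the numbers are the ones from the example above):

```r
# One single-threaded process: each request's completion time is the
# sum of every service time ahead of it in the queue, plus its own.
service_times <- c(req1 = 10, req2 = 2, req3 = 2)   # seconds
completion    <- cumsum(service_times)
completion
#> req1 req2 req3
#>   10   12   14
```

Request three's two seconds of actual work turns into fourteen seconds of perceived latency purely because of the line in front of it.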
So we need another execution model, and that's where async programming comes in. I like to think of async programming as starting computations without waiting for them to complete. But, like, intelligently. It's computational multitasking.
To make it a little more concrete, imagine two ways of going about your morning. Way one: you brew a pot of coffee, pour yourself a cup, drink it, then get your bread, put it in the toaster, and when it's done, you enjoy your toast. Versus the way I think a lot of us naturally do it: you wake up, start a pot of coffee, throw your bread in the toaster, and while each of those finishes, maybe you sip on your coffee while the toast is going. You're ultimately doing those two tasks simultaneously instead of waiting for each one to finish before starting the next.
And so what we want is an execution model that's closer to way two, and that's what async programming gets us.
So how does this work for something like Appy? Instead of having just one R process running our API, we're going to spin up a certain number of child processes. Our main R process then functions essentially as a relay: as requests come in, our API directs them to one of these workers, where the requests actually get processed.
And in doing this, we end up with this nice effect where request one still takes ten seconds, but requests two and three now both take two seconds to complete, because they're sent to dedicated workers and there's no longer a line, essentially.
Implementing async with mirai
So that sounds like a good deal. How do we go about implementing this in code? You may have heard of something called mirai by now. It's a great package for doing asynchronous programming, and it's the one we're going to use in this example. But you may be wondering, why not use the future package? future has been around for a while. It's another durable, battle-tested package that works in a variety of contexts, and it has been used for asynchronous plumber work up until today. I'm choosing mirai in this example for reasons that are beyond the scope of this talk, but I'm happy to talk to folks about it later.
So how do we go about doing this? We have this basic example, another hello-world example. First, we're going to use a function from mirai called daemons(), which in this example spins up four workers, four R processes that are going to be lurking in the background, waiting to do stuff. Then the business logic of our API endpoint gets bundled up and passed into another function called mirai(). What mirai() does is take the R expression you've passed it, the business logic, and run that code in a child R process.
And the important thing is that when it runs, mirai() immediately returns something called a promise. The way to think about a promise is that it's a placeholder for the eventual result of a computation. I'll hand-wave a bit here, but this notion of a promise is what allows our API process to remain responsive to other incoming requests.
Because once we get our promise back, the main R thread is free to do other things, like intercept another inbound request. And the nice thing about mirai is that once the promise is ready, when there's a result able to be fetched, it informs the main R process directly, and R just sends that result back to the client that requested it.
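Putting those pieces together, a minimal sketch might look like this (the endpoint name and timing are my assumptions, not the speaker's actual slide code; it relies on plumber understanding promise-like return values from endpoints):

```r
library(plumber)
library(mirai)

# Spin up four background R processes ("daemons") that wait for work.
# This runs once, when the API starts.
mirai::daemons(4)

#* A deliberately slow endpoint. The expensive work is bundled into
#* mirai(), which runs it in a child process and immediately returns
#* a promise, so the main R process stays free to take new requests.
#* @get /predict
function() {
  mirai::mirai({
    Sys.sleep(10)                 # stand-in for a slow model call
    list(prediction = 0.87)       # illustrative value, not real output
  })
}
```

While that ten-second request is off in a worker, the other three daemons and the main process are still available, which is exactly the "no longer a line" effect described above.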
So we go back to Appy, and we instrument some of his code with async programming, and he gets another speed boost. He gets this sick jet pack. So now he's not just rolling along the street; he's flying with, like, a Mach 8 engine or something real fast. And the end of Appy's story is that these two strategies were sufficient for the web service to scale to meet the demands that the other teams were bringing to it, and all is right with the world again.
Recap and takeaways
Now, with that, what have we learned from Appy's story? Just to recap, we have the two strategies we talked about: one, minimize serialization costs, really trying to drive down the time it takes to turn our R objects into something that can be sent over the wire, and two, keep our main application responsive by using asynchronous endpoints, or async programming in general.
And then there's a third point that I think underlies both of these, which is that R is much more powerful than you think it is. We have a lot of sophisticated tooling today that allows us to make very powerful Shiny apps or, in this case, plumber APIs. A lot of the time, folks on Hacker News will tell you that R is great for prototyping but not production. They'll tell you to take your API and rewrite it in Rust or something like that. Myself and other people at this conference are here to tell you that you don't have to do that. A lot of solid API performance is about solid design, not programming languages. So I encourage you all to take some of the strategies I've shown you today, use them in your own projects, and see how they can take your own web services to new heights.
Q&A
So what was the learning curve to mirai for someone who hasn't used async programming before? And is future simpler for a newbie to learn the concept of async? I do think that there is a learning curve with async. You do have to kind of wrestle with it for a little bit. I think that you can go either way between mirai and future. Both of them have just great APIs for, like, working with them. They're very user-friendly. So you can't go wrong with either one.
Okay. And is there a lot of overhead cost in using mirai, startup, et cetera? Yeah, that's a great question. The overhead can come from the fact that, now that we have these other processes, you have to send the data, your requests, over to those processes, and there's some cost to funneling data between processes on the server. In my experience, running some local testing before the conference, I actually noticed that with the more recent versions of mirai, the overhead cost is next to nothing, which is really cool. So I wouldn't worry too much about it.
Amazing. We have one more question here. Does mirai run on a single thread? If so, does this affect the user experience and UI responsiveness? So the important thing is that with R, everything is always single-threaded. The idea is that instead of doing multi-threading, we're doing what's called multi-process. If you spin up this API with several workers and you open Activity Monitor or Windows Task Manager or whatever your process monitor is, you'll see that, along with your API process, there are other R sessions running in the background. So it really shouldn't impact the user experience in the way that multi-threaded code could, but...
Great. And I think we have one more question here. Why are faster serializers not the default for plumber? I feel like I might as well just kick that over to Thomas at this point. He can answer that better than me. The only thing I'll say is that choosing default dependencies for packages is a really complex thing, because certain packages may be really fast, but they may have only one maintainer who doesn't really work on the project anymore. So you've got to choose your dependencies wisely there. Great. Thank you so much.

