Magic with WebAssembly and webR - posit::conf(2023)

Transcript#

This transcript was generated automatically and may contain errors.

Hello, my name is George Stagg , and I'm the lead developer of webR.

So today, I want to talk about what I think is particularly magical about some of the things that webR can do. Before I start, I do want to say the first piece of magic is this wonderful logo, which I didn't design. I gave Dawley the hex logo for webR and the description of this talk, and this is what it came up with. And if that's not magic, I don't know what is. That is just incredible.

So as hopefully you picked up from Joe's talk, webR is a system that allows you to execute R code directly in a web browser without a supporting R server. But webR can do more than this. It's not just about adding text R code into a box and then getting text output out of a box. WebR lets you interact with the running R session in a very particular way, and it allows you to manipulate the R environment and reach in there and tweak things. It's this kind of stuff I want to talk about today. This is the kind of thing that I find particularly magical.

When I showed this for the first time in a meeting, the response I got from the others in the meeting was just mind-blown. So I thought, well, there's no chance I can't share something that good with posit.conf. I have to.

How WebAssembly makes webR possible

So the way this all works is WebAssembly. The technical fancy way of saying what it is is a portable binary format, and it's designed so that it can run anywhere. In particular, this is in browsers, but it can also run server-side. And what I mean by that, for those who are interested, you can run webR in places like the cloud and server-side and edge nodes and anywhere where there's a WebAssembly engine. In the future, WebR will be able to run better in those than currently, but certainly the support is there through WebAssembly for those kind of computing environments.

So WebR is a version of the R interpreter built for WebAssembly, and that's what allows you to run that R code, as Joe explained. So why WebR? Why would you be interested in WebR? In addition to all the shinylive stuff, which is going to be a great boon to education, there are already other tools that can do similar things. This is a quota extension by James Balamuta that can actually inject runnable code blocks directly into a quota output. This is going to be amazing for educational material or even package documentation.

Imagine going to a website, a Pakistan website, and being able to play with a package without having to install it. In certain situations where the network is locked down and that package may not even be available to install. We saw interactive presentations, and as Joe mentioned, there are people currently looking into running portable R applications and reproducibility by using shinylive. This works because WebAssembly provides very strong security features and reproducibility features that aren't provided by a normal CPU architecture.

Magic trick: mind reading

So today I'm going to show off some WebR magic. I'll explain why I think it's useful. I'll explain a little bit about how it works, depending on the time, and I will be using JavaScript. Probably by the end of this, there will be quite a few people who say, gosh, I'm glad I don't have to use JavaScript. But don't worry, I'm not going to assume any particular knowledge, and I don't want you to think too much about the syntax of the code. I want you to think more about what the code is doing to an R environment.

So let's start. I call this magic trick mind reading. So on the left-hand side of the screen there is an R session, and on the right-hand side of the screen is a linked JavaScript session. And right at the top, in the top left-hand corner, I've set a value in R equal to 1729, and then in JavaScript, I've run a piece of code that's going to reach into that R environment that's currently running and pull out the value of that variable.

And you can see straightaway that it's returned the word promise, which is not great, but it's how it works. JavaScript's way of dealing with asynchronous programming is through this promises paradigm, but luckily there's a keyword so that if you put the word await before what you want to do, JavaScript will wait for the result to come back, wait for that promise to resolve, and just give you what the result is.

What you get back is a WebR object that looks kind of strange, it's got this sort of proxy word on it, but what you can think of that as is basically like a black box that's linked to a certain R object. So what's happened here is that little API call for WebR has reached into the environment and grabbed that foo object and returned a reference to it. And from that, you can do work on it. There's a whole set of WebR APIs that allows you to run methods on these R objects, and in this case, the method I've used is a method to JS, and that's taking that object and converting it into a JavaScript object. So you can see the value of that object shown on the screen there is 1729.

And when you think about it, this is actually quite special, because if you were just working with strings, you'd have to take that number back from the standard output of a normal R program. Well, it would be even worse if you had something like a vector, okay? If you had something like a vector, you'd then have to take your string, you'd have to split it on commas, you'd have to get rid of all the white space, you'd have to parse all the values. It's a whole micro to get those values out of R if you're working with putting strings in and getting strings out. But here, you just get the numbers directly from WebAssembly's memory in a format that JavaScript understands and in a format that JavaScript can use.

But here, you just get the numbers directly from WebAssembly's memory in a format that JavaScript understands and in a format that JavaScript can use.

Magic trick: conjuring variables

The next example I'm going to call conjuring variables. So here, there's an example where in the R session, there's an object foo, but it doesn't exist. It's not there. The environment is empty. Here you can see that we run a piece of code that puts a vector of JavaScript values into R's memory, and then associates that with an object in that environment. So what I've done is I've created a new object and given it a name. And then, when you type the name in, the object appears. It appears out of nowhere, like I've pulled it out of a hat.

I think this is awesome, because there's only a few situations I can think of where that kind of thing can happen, where you can have an object that's not there, and then all of a sudden it's there. So I think that's great. But it's not the best trick. The best trick's coming up. This is my favorite thing.

You can also do a complex JavaScript object. So you can see here on the right-hand side of the screen, this is a nested JavaScript object, which is recursively, automatically converted into a nested R list.

Magic trick: invoking R functions from JavaScript

OK, here. I got excited for this. So imagine you've got an R function. Here I'm just creating some random normal numbers. It's scaled. It doesn't really matter what the function is. The point is that you can put some arguments in, and you get some numbers back. We're going to do the same trick we did before. We're going to get a reference to that function object. We're not running the function. We're grabbing a reference to that actual function that lives in R's memory. And you can see you've got another one of these strange proxy objects that you can work with.

But this is really cool. You can invoke it, just like a normal JavaScript function. Those arguments are automatically converted into R objects, and the result is automatically converted back into JavaScript. This to me is really magical, because you don't even have to know that that's an R function. As far as you're concerned, it's just a JavaScript function that returns a promise. That means you can use it with native JavaScript frameworks that are just assuming that you're going to give it a JavaScript function for something like a callback. It doesn't need to know how to work with WebR. You just give it a function, and it invokes it.

This to me is really magical, because you don't even have to know that that's an R function. As far as you're concerned, it's just a JavaScript function that returns a promise.

So these examples, they're relatively simple, right? But they do demonstrate something which I think could be a new and useful workflow when using WebR. The three examples are about moving data into the R environment, getting data out of the R environment, and, of course, running R functions.

Now moving data into the R environment, that could be something more complicated. You could be doing a database connection. You could be doing a user data upload of something like a CSV. You could be getting data from a REST API. These are things that the browser can do inside JavaScript. One-side WASM, but you can do it in JavaScript. Running R functions, I just ran a small function there that generated some random numbers. But imagine, this could be some kind of complex data manipulation using dplyr . It could be a really sophisticated modeling pipeline with something like tidy models. And getting that data back into JavaScript, that means you don't have to live in the R world if you don't want to. You could take that data from WebR, create a dashboard in JavaScript. You could offer it as a file download. You could even use interactive visualizations in completely different frameworks, such as D3 or Observable.

And this isn't even that complicated. That amount of code is enough to take WebR and integrate it with Observable.js. And this visualization here is a web-native observable plot, but the data has been taken and computed from WebR. And I think that is one of the best things about WebR, the fact that it can integrate so well into pre-existing frameworks that already exist for data visualization and data science on the web.

Even the fact that you can get to something like that is incredible to me, considering the security benefits that you're getting by sandboxing your code and making sure it can't destroy your file system, or the fact that you can get such reproducible guarantees about the order of operations of numerical and floating point issues, for example.

Thank you. Can we have one more hand for George?

Magic with WebAssembly and webR - posit::conf(2023)

Transcript#

How WebAssembly makes webR possible

Magic trick: mind reading

Magic trick: conjuring variables

Magic trick: invoking R functions from JavaScript

How it works under the hood

Service workers and shinylive

Q&A

Featured software#

shinylive