Rich Iannone || New features in {gt} 0.6.0! || RStudio

00:00 Introduction 00:18 sub_missing() 03:51 Markdown formatting in sub_missing() 04:51 sub_zero() 07:34 sub_small_vals() 13:08 sub_large_vals() 16:25 final thoughts A new version of the R package {gt} has been released! We are now at version `0.6.0` and there are now even more features that'll make your display/summary tables look and work much, much better. Let's run through some of the bigger changes and see the benefits they can bring! New functions for substituting cell data We now have four new functions that allow you to make precise substitutions of cell values with perhaps something more meaningful. They all begin with `sub_` and that's short for substitution! sub_missing() (formerly known as fmt_missing()) Here's something that's both old and new. The sub_missing() function (for replacing NAs with... something) is new, but it's essentially replacing a function that is old (fmt_missing()). The missing_text replacement of "---" is actually an em dash (the longest of the dash family). This can be downgraded to an en dash with "--" or we can go further with "-", giving us a hyphen replacement. Or, you can use another piece of text. If you're using and loving fmt_missing(), it's okay! You'll probably receive a warning about it when you upgrade to {gt} 0.6.0 though. Best to just substitute fmt_missing() with sub_missing() anyway! sub_zero() The sub_zero() function allows for substituting zero values in the table body. sub_small_vals() Next up is the sub_small_vals() function. Ever have really, really small values and really just want to say they are small? With sub_small_vals() we can reformat smaller numbers using the default threshold of 0.01. Small and negative values can also be handled but they are handled specially by the sign parameter. Setting that to "-" will format only the small, negative values. You don't have to settle with the default threshold value or the default replacement pattern (in small_pattern). This can be changed and the "x" in small_pattern (which uses the threshold value) can even be omitted. sub_large_vals() Okay, there's one more substitution function to cover, and this one's for all the large values in your table: sub_large_vals(). With this you can substitute what you might consider as too large values in the table body. Large negative values can also be handled but they are handled specially by the sign parameter. Setting that to "-" will format only the large values that are negative. You don't have to settle with the default threshold value or the default replacement pattern (in large_pattern). This can be changed and the "x" in large_pattern (which uses the threshold value) can even be omitted. Final thoughts We are always trying to improve the gt package with a mix of big features (some examples: improving rendering, adding new families of functions) and numerous tiny features (like improving existing functions, clarifying documentation, etc.). It's hoped that the things delivered in gt 0.6.0 lead to improvements in how you create and present summary tables in R. If there are features you *really* want, always feel free to: File an issue: https://github.com/rstudio/gt/issues) Talk about your ideas on the Discussions page: https://github.com/rstudio/gt/discussions Learn more about the gt package here: https://gt.rstudio.com/ Got questions? The RStudio Community site is a great place to get assistance: https://community.rstudio.com/ Content: Rich Iannone (@riannone) Motion Design & editing: Jesse Mostipak Music: Nu Fornacis by Blue Dot Sessions https://app.sessions.blue/browse/track/98983

image: thumbnail.jpg

Transcript#

This transcript was generated automatically and may contain errors.

largely based missing values for something else. And the setup is, you know, we use that with the data data here just means like the GT table data. So you always start with GT. And then you use this function, and that's data and columns. Basically, you can focus this function over any columns you want. But by default, it's everything. So it'll just go through every single call. And no big deal. If like, there's no missing values in some of those columns, it'll just, you know, skip over those. So it's kind of nice, you can just like, almost like it's almost like paint by numbers, or I'm trying to find the right analogy, but I can't seem to do that. It'll just find things which apply. And if they don't apply, no big deal. And you can even run it if you don't have missing values at all. It'll still work. No change, but you can feel safe doing that in case you have some like, table inputs that may or may not have missing values. You don't know that. So that's kind of cool. They can do that.

So let me show you this. We have a built-in dataset in GT called Xybil. It's a little hard to fit. Although the idea is that it does fit. It's only eight rows. And a small table, it's just for like, messing around with tables in GT. So I'm gonna make this even smaller. Get rid of a few columns, get rid of row and group, and then put that into GT with this GT function. So if you know nothing about GT, this is how you sort of way we can use submissing. So in this case, we can use column names. But for this one, I'm just using like, indices. So 1 and 2 are num and char. And 4 to 7 are the rest of the columns here. And we're gonna replace in columns 1 and 2, we have any values. There's one here and one there. We're gonna replace NAs with the text missing. And just to be a bit different, in columns 4 to 7, we'll replace the NAs with nothing. Two ways of saying it's missing.

So you can totally do that. It's great. So I'm gonna run this. And immediately you'll see the effect. Missing, missing. And a whole lot of nothing here in these columns. So kind of cool.

By default, I'm just gonna change things here a bit. I'm gonna take out this missing text. It's not the default value. So if I do this and run it again, the default is an em dash. And it's hard to sort of like, write that to represent what an em dash is. Here the default in GT is three dashes. It magically takes that and converts it into an em dash. There's no way in Markdown to say I want an em dash. You just can't do it. This actually becomes like a horizontal rule, which is not what you want in a cell, probably. Maybe you do, but I don't think you do.

So with that said, you can even use 2 to get like a smaller dash. In this case, that's an em dash. And maybe you can see it, maybe you can't. It's a slightly smaller dash. And of course, you can go back to this, which is a hyphen. So there you go. So the dash becomes smaller and smaller with less of these hyphens used. So, yeah. So that's submissing. And this is actually nothing really new. We used to have this function. It used to be called front missing. It's just been renamed to submissing because I want to move things into a separate family and have a bunch of other functions which do the same sort of thing. You can still use the former, but it gives a warning, which you may not want to see each and every single time you want to make a table. So just migrate your missings over to submissing if you have some code that you're just going to run in the future.

Markdown formatting in sub_missing()

Let's find out. It does, yeah. You have to wrap it with md. And so, let's actually do this. Let's do just go on. Not there. Okay. Let's do bold just so we can really see it. So I'm going to run this. Yes. It does take markdown, which is great. Easy way to make, you don't have to style it after the fact. You can just use that. So let's do emo. So let's put inside this md thing. I'm pretty sure that is super important because that just becomes like HTML. It worked. It really worked. Okay. That's great. This is fun. I love like, you know, I didn't know for sure that would work. Will it work without md? Probably not, right? But I don't know. Let's give it a whirl. Let's see. Whoa. Oh, yeah, yeah, yeah. It just gives you a different flag. You don't need that md. Okay. That's cool. That's cool. That is very cool.

sub_zero()

So here's some really new stuff. Like it's new both in like, like totally new. It's like, it's a 0.6. Sub zero. Cool function. You can either think about like a blue ninja or a refrigerator. It's totally up to your imagination what that might invoke in your mind. But basically what it really is, is if you give it a zero value, like it's truly zero, not rounded to zero, but like really, really zero, then you can replace that zero with something. And it might be kind of fun to do that. Who knows? We'll just find out. Okay. So basically the same idea as submissing. Instead of missing text, it's zero text. And the default is nil. I don't know why I chose that. I think I saw it somewhere and I thought, that's kind of cool. You never see that. I'm going to put that in. Instead of just, you know, using this, that's even kind of like too boring for me because just replacing the thing you have with the same thing. Okay. For this, I'm going to actually make a new table using the table function from dplyr and the table package.

I can make a small table which has a bunch of values, but also some zeros. And these are like perfect zeros. They're not 0.0001 or whatever. They're just like zeros. Okay. So with this table, the zero values will be given replacement text. Okay. And in this case, this is kind of cool, right? Because you can like just use this without anything. And the default will just work. I mean, you may not like the default value, but you know, it's pretty easy to use. You have to admit. Okay. So let's try that. Let's run this and see if it works. It totally works. Okay. That's awesome. And the cool thing is like if you use format number on the same column, this will still work. Like it won't like cancel out of things like other formatters do. That's kind of like a different thing with these sub functions. You can run it within the same columns as the formatters. And it won't interfere. It won't do anything weird. It won't cancel out the formatting. It's kind of nice.

You can run it within the same columns as the formatters. And it won't interfere. It won't do anything weird. It won't cancel out the formatting. It's kind of nice.

Okay. So let's try with a different zero text. Again, markdown will totally work. Like zero should totally work. Yeah. That's great. I wonder if there's emoji for that. Let's give it a shot. if only to like check our knowledge of this package. Yeah. Yeah. Sub zero. Really simple. It's meant to be simple. Why would you use that? I don't know. I've seen it or like sometimes like rounding down to zero is different. Like it's been a true zero. It's almost like a truly missing value is different than like something that's not entered. Who knows? There's a difference there. It just hits different. These zeros, whether they're like really zeros or not.

Like, one case, especially, is like the, not the sub zero one, but the sub small values. You may have like values which are below a certain threshold, could be like measurement stuff, like below detection rate. You can have all sorts of like specialized words, depending on your area of focus, like your research subject, which denote that these values are small, but not zero.

Rich Iannone || New features in {gt} 0.6.0! || RStudio

Transcript#

Markdown formatting in sub_missing()

sub_zero()

sub_small_vals()

sub_large_vals()

Final thoughts

Featured software#

gt

rstudio