Resources

Mine Çetinkaya-Rundel | Feedback at scale | RStudio

As enrollments in statistics and data science courses grow and as these courses become more computational, educators are faced with an interesting challenge -- providing timely and meaningful feedback, particularly with online delivery of courses. The simplest solution is using assignments that are easier to auto-grade, e.g. multiple-choice questions or simplistic coding exercises, but it is impossible to assess mastery of the entire data science cycle using only these types of exercises. In this talk I will discuss writing effective learnr exercises, providing useful and motivating feedback with gradethis, distributing them at scale online and as an R package, and collecting student data for formative assessment with learnrhash.

About Mine: Mine Çetinkaya-Rundel is a Professional Educator and Data Scientist at RStudio, as well as a Senior Lecturer in the School of Mathematics at the University of Edinburgh (on leave from the Department of Statistical Science at Duke University). Mine's work focuses on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education, as well as pedagogical approaches for enhancing retention of women and under-represented minorities in STEM. Mine works on integrating computation into the undergraduate statistics curriculum, using reproducible research methodologies and analysis of real and complex datasets. She also organizes ASA DataFest and works on the OpenIntro project. She is also the creator and maintainer of datasciencebox.org, and she teaches the popular Statistics with R MOOC on Coursera.

image: thumbnail.jpg

Transcript

This transcript was generated automatically and may contain errors.

Hello, I'm Mine Çetinkaya-Rundel. I'm a data scientist and professional educator at RStudio, as well as faculty at the University of Edinburgh and Duke University. If you're also an educator teaching data science, you might find yourself in the following scenario, particularly this year.

Your department chair says, "Your data science course is going spectacularly. We'd love to expand it," and you say a big "Yay!" But then they say, "Oh, and you're going to need to do it all online," and you say, "Yay?" And they follow up with, "And we can't provide any additional support; you get the same number of TAs as before." And you start thinking to yourself, am I supposed to say yay?

As enrollments in statistics and data science courses grow, and as these courses become more and more computational, educators are faced with an interesting challenge: providing timely and meaningful feedback, particularly with online delivery of courses. I'm sure we all agree that feedback should be meaningful, but traditionally, meaningful, helpful, constructive feedback requires human effort and can take a significant amount of time for large courses, especially if they're under-resourced, which, let's face it, they tend to be. Timeliness of feedback is just as important as meaningfulness. The more time that passes between when a student turns in an assignment and when they get feedback on it, the lower the utility of that feedback.

For certain assignments, like an open-ended project, this trade-off is absolutely reasonable, because it's really difficult, if not impossible, to replace human feedback with something else in such an assignment. But for others, there are alternatives.

Introducing learnr tutorials

One such option is a learnr tutorial, and if you've ever used learnr before, this tutorial probably looks a little different from what you're used to. That is just to say that with a little bit of CSS and theming, you can make your learnr tutorials look however you like. I like starting my learnr tutorials with a bit of narrative, usually a little longer than this, about the data and the analysis we're going to work through, and introducing the students to the packages. I like using the progressive reveal option, so they have to deliberately click through the material. I also like providing some ready-to-run code at the beginning, so they don't need to start working on exercises right away, but they do need to start interacting with the document.
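The options mentioned here all live in the tutorial's YAML header. A minimal sketch, assuming file names of my own choosing for the title and stylesheet; `progressive` and `allow_skip` are learnr's own options for progressive reveal:

```yaml
---
title: "Airbnb listings in Edinburgh"
output:
  learnr::tutorial:
    progressive: true      # sections unlock as students click through
    allow_skip: true       # but students aren't forced to finish each exercise
    css: "css/custom.css"  # hypothetical stylesheet for custom theming
runtime: shiny_prerendered
---
```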

Here they're working on a dataset on Airbnb listings in Edinburgh, and the first thing we're asking them to do is to take a look at the variable names. It's not necessarily meaningful interactivity, but it introduces them a little bit to the structure of a learnr tutorial and how to interact with it, and it potentially also helps build a little anticipation around what the result is going to look like when they hit Run Code, as opposed to providing them with a static document where the code output is already there. But where learnr tutorials really shine is in these coding exercises, where the students can try out some code, submit their answer, and you can provide custom feedback to them.
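In the tutorial's R Markdown source, that ready-to-run code is simply a pre-filled exercise chunk. A sketch, where the chunk label is my own and `edibnb` is assumed to be the Edinburgh Airbnb dataset used in the talk:

````markdown
```{r view-names, exercise = TRUE}
# Pre-filled, ready-to-run code: students only need to hit "Run Code"
names(edibnb)
```
````

Setting `exercise = TRUE` is what turns an ordinary chunk into an interactive code box with Run Code and Submit Answer buttons.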

So the first exercise here asks how many Airbnb listings are included in this dataset. Let's imagine the student decided to look at the number of columns as opposed to the number of rows. The feedback they're going to get sounds just like the kind of thing I would say if I were working through the exercises with them and trying to nudge them in the right direction: "Did you calculate the number of columns instead of the number of rows?" So you can write very precise feedback messages based on the types of mistakes you anticipate students making.

And obviously, it's not possible to anticipate all possible mistakes. So what if a student tries out some code that I didn't anticipate? For that, I like providing some canned feedback that's still constructive and still nudges them in the right direction. It says something like, "Each observation is represented in one row. Can you remember which function we used to calculate the number of rows?" And finally, let's get this question correct. I'm going to use the nrow() function to do that. Let's run our code and submit our answer. You'll see that we've got the green banner, which is great, but it doesn't just say "Correct, well done." It says a little bit more. So I like using the space for the correct-answer message to give them a little more information about what's to come next.
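Both kinds of feedback can be sketched with gradethis in the exercise's `-check` chunk. This is only a sketch: the chunk labels and messages are mine, `edibnb` is assumed to be the dataset from the talk, and `pass_if_equal()`, `fail_if_equal()`, and `fail()` are gradethis helpers that compare the student's result against values you specify (by default, against the `-solution` chunk):

````markdown
```{r listings, exercise = TRUE}

```

```{r listings-solution}
nrow(edibnb)
```

```{r listings-check}
grade_this({
  # Anticipated mistake: counting columns instead of rows
  fail_if_equal(ncol(edibnb),
    "Did you calculate the number of columns instead of the number of rows?")
  # Correct answer: the student's result matches the -solution chunk
  pass_if_equal(message = "Correct! Next, let's think about what each row represents.")
  # Canned catch-all for mistakes we didn't anticipate
  fail("Each observation is represented in one row. Can you remember which function we used to calculate the number of rows?")
})
```
````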

I also like following my coding exercises with some multiple choice questions, something that allows the students to take the code output they saw and put it in the context of the dataset they're working with, which is so important for teaching statistics and data science. So what does each row in the edibnb dataset represent? That's an individual Airbnb listing. And again, we can use the feedback to give them a little more information or something to think about.

If you would like to see the code for this learnr tutorial, both in terms of creating the exercises and also the theme and the look, you can test drive it on RStudio Cloud or you can view the code on GitHub. And I'll be providing links to all of this at the end of the talk. For now, I'd like to talk a little bit about writing effective exercises.

Writing effective exercises

So let's start with an exercise in mind: the three most expensive neighborhoods in terms of mean nightly price are New Town, Old Town, and West End. Calculate the median number of reviews in these neighborhoods and arrange them in descending order. One option would be to provide no scaffolding whatsoever, an entirely empty canvas for students to work with. The thing about this is that we've really not given them any direction on how to get started. Instead, what we've given them is this giant button that says Solution. And it's so tempting to simply go, all right, let me take a look at the solution, copy it to my clipboard, paste it, run my code, and submit my answer. And lovely job, I got it right. But did I learn anything from this experience? Not really.
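For reference, one possible solution to this exercise as a dplyr pipeline. The column names `neighbourhood` and `number_of_reviews` are assumptions about the dataset, and here I stand in a tiny mock tibble for the real data so the sketch is self-contained:

```r
library(dplyr)

# Mock stand-in for the Edinburgh Airbnb data (column names are assumptions)
edibnb <- tibble::tribble(
  ~neighbourhood, ~number_of_reviews,
  "New Town",     12,
  "New Town",     30,
  "Old Town",     45,
  "Old Town",      5,
  "West End",     20,
  "Leith",       100
)

edibnb %>%
  filter(neighbourhood %in% c("New Town", "Old Town", "West End")) %>%
  group_by(neighbourhood) %>%
  summarise(med_reviews = median(number_of_reviews, na.rm = TRUE)) %>%
  arrange(desc(med_reviews))
# With this mock data: Old Town (25), New Town (21), West End (20)
```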

This is not to say your students are all going to peek at the solution code. But one thing to keep in mind about providing no scaffolding whatsoever is that there are better venues for such exercises. Assignments where the students already have to work in the RStudio IDE, either in an R Markdown document or in an R script, are a better venue for a more open-ended question with no scaffolding.

Here we've given them no scaffolding, and we've also done something else: we're doing strict code checking. So let's say that your student, with no scaffolding provided to them, actually decided to move the filter. They group by and summarize first, and then they filter. If they run their code, they're going to get the exact same result, but their code is not marked as correct. That's because with the grade_this_code() function, what's happening under the hood is that the gradethis package compares the code your student wrote against the solution code you provided and looks for a one-to-one match. So it says, "I expected you to call summarize where you called filter. Give it another try." Sure, you might say, no, this is exactly how I want my students to do things, but I think that's not entirely fair, especially because they actually got the right answer. So strict code checking with no scaffolding whatsoever can lead to situations where students get the right answer but are not necessarily marked correct, if you will.
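Under the hood, strict checking is just a one-liner in the exercise's `-check` chunk; the chunk label here is my own:

````markdown
```{r top-neighbourhoods-check}
# Strict check: compares the student's code, piece by piece, against the
# -solution chunk, so an equivalent pipeline written in a different order fails
grade_this_code()
```
````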


Another option would be to provide a little bit of scaffolding, though here we're still doing the strict check. We've given them a really nice structure for how to write their code, and obviously we've made the exercise a lot easier for them, but that might be okay if this is one of the first few times they're seeing these multi-line dplyr chains. These learnr tutorials are a great place to provide additional opportunities for drill-type exercises students can work through when they are first introduced to a new topic. And you can also provide hints for them. Let's take a look at these hints. I like writing them in a progressive way, so students can scroll through and get a chance to think about what their answer should look like at each stage. And sure, they could see the solution at the end if you want them to, but at least it makes them think along the way as well.
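A sketch of what that scaffolding and the progressive hints might look like in the source. The chunk labels, the blanks, and the column names are illustrative; learnr picks up chunks suffixed `-hint-1`, `-hint-2`, and so on as progressive hints for the exercise chunk with the matching label:

````markdown
```{r top-neighbourhoods, exercise = TRUE}
edibnb %>%
  filter(___) %>%
  group_by(___) %>%
  summarise(___) %>%
  arrange(___)
```

```{r top-neighbourhoods-hint-1}
# Start by keeping only the three neighborhoods of interest
edibnb %>%
  filter(neighbourhood %in% c("New Town", "Old Town", "West End"))
```

```{r top-neighbourhoods-hint-2}
# Then group and summarise the number of reviews
edibnb %>%
  filter(neighbourhood %in% c("New Town", "Old Town", "West End")) %>%
  group_by(neighbourhood) %>%
  summarise(med_reviews = median(number_of_reviews, na.rm = TRUE))
```
````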

So let's go ahead and run this code and submit the answer. With this structure there is no worry that they would be putting their filter later on, because we had already provided the scaffolding. However, since we're doing a strict check, what if the student forgot to put things in descending order in the last line? Let's go ahead and run that code. Obviously this is not the correct answer, but the feedback we get tends to be not very informative when we use the grade_this_code() function here. It basically says, "I expected you to call desc() for descending where you wrote median rating." And in fact, here on line five, that statement is correct. But does it actually sound like the kind of thing you would say to your student if you were standing right behind them and trying to nudge them in the right direction? Not really. It doesn't seem like very human-friendly feedback.

Checking results instead of code

So my recommendation overall would be to provide some scaffolding and also some flexibility. This is another approach to scaffolding: perhaps if this tutorial comes later on, when they've already done a bunch with dplyr and you don't want to give them the whole structure, you might at least give them a little bit of scaffolding to say, hey, I want you to use a dplyr pipeline for answering this question. Let's give that a try. Here I am going to forget to put things in descending order and run my code. And let's take a look at the feedback I get. This time I actually get human-friendly feedback: "You've successfully calculated the median number of ratings, but did you forget to arrange them in descending order?"

How did I do this? Here, instead of checking their code, we're actually checking their results. And one nice thing about doing that is, well, let's go back and put the right answer here. What if they made that one change where they do group by and summarize first and then filter? Let's have them run the code and submit their answer. And in fact, this time it is marked as correct, because again, we're checking the results and not the code.

So what does this look like under the hood? It's a bit more work to write the code checking if you're using the grade_this() function, because you actually need to think about the various answers students might get and write some pass_if()- or fail_if()-type statements. Here I have one pass_if() statement that says if their result is this table in this particular order, we're going to say, yep, you did the right thing. But if they have put them in ascending order, then we're going to give them a specific message for that. Or if they have not ordered them at all, in which case the order will be the alphabetical order of the neighborhoods, we're going to give them another message. And you can imagine stringing along as many of these as you think are relevant for the exercises you develop. Then you probably want to write a catch-all statement that's like, "Not quite, take a peek at the hints." If the students are making mistakes that you can't anticipate when you write the tests, you can leave the hints to take care of that.
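Stringing those result checks together might look like this. A sketch, not the talk's exact code: the chunk label and messages are mine, and `solution_ascending` and `solution_unordered` are hypothetical objects you would compute in a setup chunk to hold the anticipated wrong answers:

````markdown
```{r median-reviews-check}
grade_this({
  # Correct: the table in descending order, matching the -solution chunk
  pass_if_equal(message = "You've successfully calculated and sorted the median number of reviews!")
  # Anticipated mistake: sorted ascending instead of descending
  # (solution_ascending is a hypothetical object built in a setup chunk)
  fail_if_equal(solution_ascending,
    "Close! Did you arrange in ascending rather than descending order?")
  # Anticipated mistake: not sorted at all, i.e. alphabetical by neighborhood
  fail_if_equal(solution_unordered,
    "You've calculated the medians, but did you forget to arrange them in descending order?")
  # Catch-all; the hints take care of mistakes we didn't anticipate
  fail("Not quite. Take a peek at the hints!")
})
```
````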

And if you use these tutorials over multiple semesters or multiple workshops, multiple learning or teaching engagements, you're probably going to learn about the various mistakes students make based on their questions. And then you might iteratively build these tests to capture more mistakes that they might make.

Collecting student data

So we've talked about writing these exercises. What about collecting data? learnr and gradethis alone don't give you a ready-made way to collect data from students. Technically, the learnr package makes this possible, but it doesn't itself offer the functionality out of the box. So for that, I use a different package called learnrhash, which hashes the student results and then allows you as the instructor to decode them. A setting where I might use this looks something like this, where I give them a coding exercise first. It says, make a histogram of the prices. So let's go ahead and actually do that. The dataset is edibnb. The variable is price. We're going to have them make a histogram.

And here are the labels x, y, and title. They go ahead and run their code and they get some warnings and submit their answer. It says you got it right. And note that there are a couple of warnings. We'll get to those in a little bit because here our focus is a little bit different. So we follow this on with some multiple choice questions. Which of the following describes the shape of the distribution? It's right skewed and unimodal. And let's go ahead and pick another answer for the second question. This one happens to be incorrect.
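The histogram code the student submits might look like this. The axis labels and title are assumptions, and the warnings mentioned are ggplot2's usual messages about the default binwidth and rows with missing values being dropped:

```r
library(ggplot2)

# Histogram of nightly prices in the Edinburgh Airbnb data
ggplot(edibnb, aes(x = price)) +
  geom_histogram() +
  labs(
    x = "Price per night",
    y = "Number of listings",
    title = "Prices of Edinburgh Airbnb listings"
  )
```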

Okay, so when it comes time to submit their answers with the learnrhash package, students can generate a hash, which is going to look very cryptic to them, which is exactly what it's meant to do. They can select it and copy it. And then I like embedding, directly in the learnr tutorial, something like a Google form or another form your institution might use, where the students can submit their information. If you're using a form your university provides, for example, it could ask them to authenticate first, if you want to make sure that whoever is answering the questions is actually who you think it is. And you can ask them to paste the hash they generated here. I also like using this form to collect some free-form information from the students as well, with a question phrased as, "Write about one or two questions you didn't get right initially, but were able to solve after a few tries. What was difficult about them? What did you ultimately learn?" And let the students write some free-form answers and submit their results.
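Wiring learnrhash into a tutorial is, as I read the package's documentation, a matter of adding its encoder chunks near the end of the document. This is a sketch; check the learnrhash README for the current interface:

````markdown
```{r context = "server"}
# Server-side logic that hashes the student's submitted answers
learnrhash::encoder_logic()
```

```{r encode, echo = FALSE}
# Renders the button and text box where students generate and copy their hash
learnrhash::encoder_ui()
```
````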

On your end, what you're going to see is the hash each student submitted. You'll be able to decode that and take a look at the multiple choice questions and the answers they gave for those, and at the exercises, that is, the coding exercises, and the answers they gave for those as well. For summative feedback, for grading, I usually tend to mark only the multiple choice questions, but you could choose to do both.

These data are collected in a Google spreadsheet, so you can view the spreadsheet itself. I've pre-populated it with some mock student data. What you can do, using the googlesheets4 package, is read that data and calculate the students' scores by matching their answers to a key, and then also take a look at the free-form answers. I like looking at bigrams, for example, to see which concepts were mentioned many times as being difficult for the students, so you might tailor your next lesson accordingly.
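A sketch of that workflow with googlesheets4 and tidytext. The sheet URL and the `reflection` column name are hypothetical, and the hash-decoding step is left to learnrhash's own helpers:

```r
library(googlesheets4)
library(dplyr)
library(tidytext)

# Read the form responses collected in the Google Sheet
# (hypothetical sheet URL)
submissions <- read_sheet("https://docs.google.com/spreadsheets/d/...")

# Decode the hashes with learnrhash and score the answers against a key
# (see the learnrhash documentation for its decoding helpers)

# Bigrams in the free-form reflections: which concepts keep coming up
# as difficult? (reflection is a hypothetical column name)
submissions %>%
  unnest_tokens(bigram, reflection, token = "ngrams", n = 2) %>%
  count(bigram, sort = TRUE)
```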

Distributing tutorials at scale

Lastly, let's talk about distributing these at scale. One option, obviously, is to deploy them to something like shinyapps.io or another Shiny server from the RStudio IDE with push-button deploy, just clicking on Publish. It works out of the box. It's great. But something you might want to be careful about is that if you have a large class and a strict deadline, many students might try to access the tutorial at the same time. So you've got to make sure that the parameters for your deployment are set up properly to handle that. You might want to give it more memory. You might want to create more instances. And at some point, if you have a really large course and you expect students to be doing things at the same time, you may need to think about using one of the paid tiers of shinyapps.io, and that may or may not be feasible for your setting.
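Push-button deploy is equivalent to calling rsconnect from the console; memory and instance settings are then adjusted per application in the shinyapps.io dashboard. The directory name here is an assumption:

```r
# Deploy the tutorial's directory to shinyapps.io (or another Shiny server)
rsconnect::deployApp("tutorials/edibnb")
```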

Another option is to distribute them within a package. Here I'm going to give an example from the dsbox package, which accompanies the Data Science in a Box curriculum. And yes, I am saying that you can make an R package, and you might be new to doing that, so you can use this as a skeleton for doing so. The thing you really want to focus on is that the tutorials go in the inst folder of the package, in a subfolder called tutorials. Inside that is a single folder for each one of the tutorials. This package happens to have eight of them, and inside each is the R Markdown file where you developed your tutorial, plus any accompanying files that go along with it. Chances are you may not want to put this package on CRAN, maybe it's not for wide distribution, but you can ask your students to install it directly from GitHub, or if you're using something like RStudio Cloud, you can pre-install it for them. Within the RStudio IDE, the tutorials will be ready to launch as soon as the students start that RStudio session. I usually tell my students, once you launch the tutorial, just maximize that tutorial pane, and it looks just like interacting with it in the web browser. But now you're not relying on a service like shinyapps.io to host your tutorial, so it really doesn't matter how many students are using it at the same time, as long as each student's RStudio session allows it.
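The layout described above, plus how students would install and launch a tutorial. The GitHub path is my assumption for the Data Science in a Box repository, and the tutorial folder name is illustrative:

```
dsbox/
├── DESCRIPTION
├── R/
└── inst/
    └── tutorials/
        └── edibnb/
            ├── edibnb.Rmd
            └── (accompanying CSS, images, data, ...)
```

```r
# Install the package, tutorials included, directly from GitHub
# (repository path is an assumption)
remotes::install_github("rstudio-education/dsbox")

# Launch a tutorial in the RStudio Tutorial pane or the browser
# (tutorial name is an assumption)
learnr::run_tutorial("edibnb", package = "dsbox")
```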

The barrier to entry for getting started with learnr is pretty low, especially if you're already an R Markdown user. In fact, the tutorials in the dsbox package were co-developed with students with only one semester of introductory data science under their belt, so when I say it has a low entry point, I really mean it. We all know that students get better at coding, or whatever they're learning, with practice. We also probably mostly agree that formative feedback that nudges learners in the right direction while they're working on the exercises, as opposed to summative feedback that comes weeks after they're done, can have positive effects on their learning and enjoyment. Plus, providing individual feedback to students, especially for exercises narrow in scope, can turn into dreadful busywork for us educators.

So while developing meaningful automated feedback can be quite tedious and time-consuming, I think that's a much more intellectually engaging way to spend your time as an instructor. It's also a great opportunity to develop software engineering skills, like writing robust tests, bringing together your pedagogical and technical expertise. You can find materials for this talk, including the source code for the learnr tutorial we worked through, and links to resources to learn more at rstd.io slash global 2021 slash mcr.


Thank you for watching! And remember that comment from your chair about no more TA support? With what learnr, gradethis, and learnrhash can offer, you might get closer to responding, "Bring it on!"