Riva Quiroga | Learning to program in R with a "communicative approach"

Transcript

This transcript was generated automatically and may contain errors.

Hi, my name is Riva and I'm a linguist who uses R. Seven years ago I tried to learn R and I failed. I wasn't sure where to start my R journey, so as someone with a background in the humanities I thought the best option was reading a book. I wanted to make one of those cool plots with a gray background I'd seen around to communicate some results. So I picked a book that promised me that I would learn the most important elements of R.

On page 99 I was still reading about defining classes and methods, trying to understand what an array was and how to multiply its elements, but I hadn't done anything with real data yet. The problem, I think, was my expectations. I was expecting learning resources that helped me say things with data, but all I found were descriptions of rules that didn't make much sense to me as a beginner. It was very difficult for me to see how I could use these rules with the data I had, and how I would jump from these abstract descriptions to, for example, making a plot. It felt like trying to learn a second language only by reading a grammar book and not by interacting with actual human beings.

Programming languages and second language learning

The thing is that learning a programming language has many similarities with learning a second language. But what does this mean? How can this idea be applied to the design of learning resources? Or when choosing them to guide your learning path? And how can you take this into account when developing a package, for example?

First, let's talk about learning. Not everyone learns a second language for the same purposes. The same happens with programming languages. I failed at learning R the first time because I wasn't the expected audience for the book I chose. At that moment, I just wanted to be an R user, not a programmer. Not everyone is learning R to be a linguist or a grammarian of the language. There are surely people who want a deep understanding of how the language works. And those are the people who are contributing to its development.

But there are tons of people who just want to use R to do things, like making a plot, or running a statistical model, or creating reports. I failed the first time I tried to learn R because the book I chose was the equivalent of a grammar book of R: descriptions of how the language works and the names and purposes of all its different parts. And as a beginner, or as someone who just wants to learn the language to communicate your data to others, that is not what is most useful. What you need is something that works more like a textbook.

The communicative approach

Textbooks for learning a second language always start the same way. You learn how to say hello, how to introduce yourself, and some simple sentences to communicate meaning to others. Even books for learning dead languages have this approach. I had to learn a little bit of Latin as an undergrad. And the first thing we learned was how to ask someone who they are and how to respond to that question. I even learned how to insult someone in Latin, a language whose last native speaker died centuries ago.

Textbooks start this way because they are built around people's needs and knowledge, not around language rules. In textbooks, you learn first what you can do with a language, not how the language works. In language teaching, this is called a communicative approach, and is based on the idea that learning a language successfully comes from having to communicate real meaning to real people.

Throughout the different lessons in a textbook, you learn new words and structures, and new contexts in which to use those words and structures. The grammar rules behind those real-life examples are only explained after you learn how to use them, and if and only if it is really necessary for you to know them. In textbooks, scaffolding works as a spiral. You learn something in one lesson, and then come back to that content in the next lesson, but now seeing more ways to use what you learned, how to adapt it to different situations, and how to combine the words and structures you know in new ways.

It's like copying and pasting, and then adapting. You learn how to use a new language structure, like how to ask a question, and then you learn how to adapt it to your own needs, to solve the problems you are interested in. It's this ability to adapt language structures to new situations that makes us flexible users, in both natural languages and programming ones.

For example, we first learn how to create one type of plot using all the default settings, and then we come back to that content to add more complexity to the visualization, like mapping more variables, using different geoms, or making annotations. And we build those new skills on top of the ones we learned previously. This copying, pasting, and adapting is also the strategy we use with examples we find on Stack Overflow or RStudio Community. We bring to our script something that worked in another context, and then try to figure out how to adapt it to our own needs.
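As a rough illustration of that spiral, here is a minimal sketch in R with ggplot2. The `mpg` dataset (bundled with ggplot2), the variables plotted, and the annotation text are all assumptions chosen for this example; none of them come from the talk itself.

```r
library(ggplot2)

# Lesson 1: one type of plot, all default settings.
# (mpg ships with ggplot2; using it here is an assumption for illustration.)
p_default <- ggplot(mpg, aes(x = displ, y = hwy)) +
  geom_point()

# Lesson 2: copy the same code, then adapt it by mapping one more variable
# (car class) to the colour of the points.
p_colour <- ggplot(mpg, aes(x = displ, y = hwy)) +
  geom_point(aes(colour = class))

# Lesson 3: build on lesson 2 with a different geom and an annotation.
# The label text is a made-up example.
p_full <- p_colour +
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE) +
  annotate("text", x = 5.5, y = 40,
           label = "Bigger engines, fewer miles per gallon")
```

Printing any of the three objects (for example `p_full`) draws the plot. Each step reuses the previous one and changes only a small piece, which is the copy, paste, and adapt pattern described above.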

Sometimes we are not even sure why it worked, but after doing it a couple of times, we start inferring what is going on behind the scenes. You don't need to know all the grammar rules to be understood in a language, and you don't need to know the names of all its structures to be a competent user. Even native speakers don't know the names of all the parts. You probably don't know what a paratactic or a hypotactic clause complex is, but you definitely know how to use one.

If you are a native English speaker, you know how to correctly pronounce every case of two O's together in words like moon, book, floor, flood, something very, very difficult for us non-native speakers. But you probably don't know why they are pronounced differently. The adults who raised you didn't write you a grammar book for you to acquire your native language. They didn't teach you the grammar rules. They talked to you and let you talk. They probably read you books. They sang to you, or you sang together. You did progressively more complex things with words.

In the same way, you don't need to know all the rules of a programming language to be able to do things with it. What rules do you need? The ones that can help you achieve the purposes you're seeking. If you need to customize a plot in a very precise way because the default options are not what you need, then learning how a theme works in ggplot is relevant, not before. If you need to work with arrays, then learning what they are and what kind of things you can do with them is relevant, not before.
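To make the theme case concrete, here is a small sketch of the moment when the defaults stop being enough. It assumes ggplot2 and its bundled `mpg` dataset, and the particular adjustments (legend position, grid lines) are hypothetical choices for illustration only.

```r
library(ggplot2)

# A plot with the default theme, which draws a gray panel background.
p_default <- ggplot(mpg, aes(x = displ, y = hwy, colour = class)) +
  geom_point()

# Only once the defaults are not what you need does the theme system matter:
# swap in a built-in complete theme, then adjust individual elements.
p_custom <- p_default +
  theme_minimal() +
  theme(
    legend.position = "bottom",         # move the legend below the plot
    panel.grid.minor = element_blank()  # drop the minor grid lines
  )
```

The first plot can be made without knowing that themes exist; the second requires knowing only `theme_minimal()` and two arguments of `theme()`, learned at the point where they solve a real problem.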

Because the most dangerous thing that can happen to a natural or a programming language is not having a vibrant community of users to keep it alive and evolving.
