Resources

ChalkTalk: Globalizing Data Science Education with AI-generated Videos (Kene David Nwosu)

Speaker(s): Kene David Nwosu

Abstract: We present ChalkTalk, an open-source tool that converts Quarto documents into engaging educational videos with AI-powered voices and avatars. By adding simple text-to-speech (TTS) and text-to-video (TTV) attributes to markdown files, educators can automatically generate multilingual video content while maintaining the reproducibility benefits of Quarto. At The GRAPH Courses, where we've trained over 3,000 learners globally, we are testing out this tool to scale our video content creation. We'll demonstrate its integration with Quarto and present preliminary findings from our A/B testing with students.

GitHub: https://github.com/the-graph-courses/chalktalk_studio

Slides: https://www.dropbox.com/scl/fi/flsb81eznl8ypuwpxi53t/Posit-Conf-Prezi-Chalktalk-Kene-David-Nwosu.pptx?rlkey=hxmeyb9vijp0ucgpjqmod5qwt&e=4&dl=0

posit::conf(2025)

Subscribe to posit::conf updates: https://posit.co/about/subscription-management/


Transcript

This transcript was generated automatically and may contain errors.

All right, hi everyone. I'm Kene. I'm going to be talking about what's up there. It's a bit of a mouthful. ChalkTalk. Automating Video Tutorials with Large Language Models and Text-to-Speech. And subtitle, what I've learned about the art of vibe coding. And I'm the Curriculum Director at The Graph Courses. We're based between Geneva, London, and kind of global.

Okay, I'll start with this picture, which will probably be familiar to many of you as you were traveling over to posit::conf. You're at the security checkpoint at the airport and need to remove your laptop and other large electronics from your bag. For most people, it's a minor inconvenience. For me, it's a slightly more major inconvenience.

Because here are some things I need to remove usually from my bag. Here's my big laptop that has a big graphics processing unit. My extra portable display. Usually I have a spare laptop in case anything goes wrong with my main laptop. And I have a camera and a microphone as well. And sometimes big headphones. And after taking all of that out, my bag looks like this. With all of the charging cables. So I still get the extra search at those checkpoints.

And why do I live like this? Well, because this is my job. Hello and welcome back. Welcome to this lesson on lines, scales, and labels with ggplot2. You now know how to select your variables, how to filter your data entries. In this lesson, you will be learning how to pivot data. R is the programming language you're going to use to write code. These three components, data, aesthetics, and geometry. Superior or equal to 25. It's easy to infer the mean. Okay, I think you get the point.

About The Graph Courses

So I work for an organization called The Graph Courses, and that's our homepage there. We teach data and code skills for health and life sciences. So that's a lot of our Python stuff. We have taught about 4,000 students in our free self-paced courses on our YouTube and our website, and have graduated about 500 students from our boot camps, which run 8 to 12 weeks. And we also do custom trainings for universities and other organizations.

And as part of doing this, we've made hundreds of videos now. And so this means that I have to carry all of that equipment. And I think it is kind of worth it because videos are valuable. This is something we know from research and also just from talking to our students. Here is, for example, a study looking at Gen Z and their learning preferences. And you can see YouTube has a 59% preference versus printed books having 47%. That's from an online poll by Harris and Pearson.

Or you can also look at the effect on learning in higher ed. Where adding videos to existing content gives you 0.88 additional standard deviation of improvement on a range of scores. And that's from meta-analysis of a bunch of RCTs. So videos are valuable. And we know this as well from our experiences with students. But videos are expensive. They're expensive in terms of time spent. In terms of the equipment you need to carry around. And in terms of tears as well.

So I have this mental anguish index of different things that cause me suffering. One is chasing students for late assignments. Another is my R console crashing in the middle of a regression; that's a 10 on the index. And recording and debugging videos is a 50 on that scale, because it's a perfect union of Murphy's Law (everything that can go wrong will go wrong) and Hofstadter's Law (things take longer than you think they will). I've had lots of painful sessions recording videos, and my colleagues will also share that experience.

The case for AI-generated videos

And so starting a few years ago when ChatGPT came around, we've been seeing headlines like this. About how AI may be coming for our jobs. And to this, I've been thinking, well, yes, please. Or more specifically, just this part of my job, this extra tedious painful part of video creation. Can AI make it easier? Or maybe take it away completely?

And in particular, there's been a confluence of trends over the last few years. That have contributed to me thinking this could be possible. One, like we know, is large language models. Now, these are not yet competent teachers. They can't yet write very good lessons. But they could maybe get you started on lesson writing. Get you started on video creation. Help you overcome that blank page problem. And then they could maybe create videos for existing lessons as well.

And then text-to-speech has gotten quite good. So it used to sound quite horrible. But these days, text-to-speech has improved a significant amount. Which I'll show you in a moment. And so the question is, could we plug these two things together? Large language models and text-to-speech models. In order to make a kind of automated video creator.

The problem, though, is that the folks on our team, including me, are mostly data people. Not really app developers. But the solution maybe is another trend. Which is the rise of something called vibe coding. Now, what is vibe coding? It's typified by some quotes like this. The hottest new programming language is English. And how I learned to stop worrying and trust the model. The idea here being that models are getting good enough at writing code. That you often don't actually need to read or understand the code to work with them.

And so maybe, not being app developers, we, or I, could try to start building out this tool that I've dreamed about. An extra detail is that I knew I would have maybe four weeks of free time to actually work on this specific task, in preparation for posit::conf. So the motivating questions for my talk are: Can I vibe code this dream application, an AI video creator, in about four weeks? And maybe more importantly, if I can actually do this, is there still a point to making this app? Is programming education still useful? Does it still make sense to teach students how to code if I can just do this without being a proper web developer?


Live demo of ChalkTalk

So now it's four weeks later, and I'm going to try to do a live demo of what I have so far, and then we can judge what the answer is. So here's the app. It's currently running on localhost. Let me refresh it and clear the cache. And I'm going to open up a new presentation. We've been thinking about making a presentation or a video on the base pipe: we taught the magrittr pipe a long time ago, and we haven't updated with a base pipe video. So maybe we could use this to try to get started on that.

I have a small issue here. I'll close that. And let me zoom in. And let me just say a single sentence. Make two slides about using the base pipe in R. Now, we're not supposed to do live demos at Posit. So hopefully this goes well.

All right. And you can see that was pretty fast. It's using a Qwen model, an open-source model that runs fairly quickly. And we can actually edit some of this text if we wanted. Next, I'm just going to generate the AI voiceover, which uses ElevenLabs text-to-speech models on the back end. And then I can start it. Let's see if it works.

Welcome to our discussion about the base pipe operator in R version 4.1 and above. The base pipe operator was introduced in R version 4.1 as a native way to chain operations. It allows you to pass the result of one function directly as the first argument to the next function. This makes code more readable by avoiding nested function calls like traditional R syntax. Unlike the magrittr pipe, the base pipe is built into R without requiring additional packages. Let's look at some practical examples of using the base pipe operator in R. Here's a simple example that takes a vector of numbers, filters those greater than 5, and calculates their mean. Here's a data frame example that selects rows where age is greater than 30, then extracts the name column. The underscore placeholder can be used.

Okay. So maybe some progress. You may notice or I definitely noticed some hallucinations from the model. For example, it had filter with a capital F. And I don't know what get element name is really doing there. So maybe we could use a different model. But the important thing here is that you can actually edit the text yourself. And you can click in and edit the script as well on the back end that the model has written. So it can give you a kind of initial draft.
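For readers following along, the base pipe behavior the generated voiceover describes can be sketched in R like this (a minimal example of my own, not the demo's actual slide code):

```r
x <- c(2, 4, 6, 8, 10)

# Nested calls read inside-out
round(mean(x[x > 5]), 1)
# [1] 8

# The base pipe (R >= 4.1) passes the left-hand result as the
# first argument of the next function, so steps read left to right
x[x > 5] |> mean() |> round(1)
# [1] 8

# The underscore placeholder (R >= 4.2) targets a named argument
# when the piped value shouldn't go in the first position
mtcars |> lm(mpg ~ wt, data = _) |> coef()
```

Unlike the magrittr pipe, this requires no extra package, which is exactly why a refreshed video on it is worth making.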

And then the other thing I wanted to mention is that it's kind of ugly at the moment. So maybe we could ask the AI to make it more beautiful, make it more pretty. And it will go in and try to do a decent job. Yeah, the heading is a bit messed up, but we could move that around. You could also ask it to translate to a different language. Let's say translate it to Spanish. And again, the model should do an okay job there. It has moved the header back where I don't want it. But overall, we've made some progress, I would say.

Reflecting on the demo

Let's go back to our slide deck. Okay, let me make sure I cover those things; I wanted to make sure I didn't forget those two. Yep, yep, okay, we covered that in the demo. Okay, so going back to the motivating questions. Can I vibe code my dream application in four weeks? I would say not quite, maybe almost, depending on how you define it. There are many things still pending. One is we don't have avatars, so it's just pure visuals. There's no SVG support, so at the moment you can't really have rich diagrams in there. There's no highlighting for the AI to refer to a specific part of the slide deck. There are many bugs that I tried to hide in that demo that we still haven't fixed. And many other things. Maybe most importantly, we need a water gun to keep students awake, if that's going to be the kind of video that they'll be watching when they take our courses.

But I would say that it's already promising in a number of ways. One is I could already imagine using it as a slide creator, if not yet as a video creator; the slide functionality seems to work quite well. And maybe we could do short snippet video lessons, or use this for some package documentation videos. There are also many languages that we're hoping to expand into, and we could start thinking about using this for those expansions. And maybe some of you may find a use for this kind of software. It's open source, on our GitHub at the-graph-courses/chalktalk_studio. So do take a look, do fork it, and see if you can use it for your own purposes.

Lessons from vibe coding

Okay, so that's the first part of the talk, which is about ChalkTalk. I'd hoped that it would be so good and so ready that I'd just teach you how to use it, fully deployed. But that hasn't been the case. So instead, I'll talk to you about what I've learned about the art of vibe coding over those four weeks. I'll split this into two buckets: one is reasons to vibe code more, and the other is caveats to consider.

And here they are. Models improve fast. Iteration is cheap. Apps are safer than stats. And then in terms of caveats, smaller is better. Security, security. And vibe coding is a bad name. So let's jump into some of those.

The first is that models improve fast. This is kind of self-explanatory; I think we really are still in the exponential phase of improvement for some of these coding models. Here is one illustration: SWE-bench, a popular software engineering benchmark that evaluates large language models on their ability to solve open issues from a range of open-source repositories on GitHub. And you can see that over one year, from August of last year to August of this year, the frontier models have gone from maybe 33% performance on this to about 75%. So they're learning software development much faster than I can. What was impossible a few months ago is now routine.

And I have been checking in monthly or every few months to see, like, at what point are the models getting good enough where they can actually build these things that I've been dreaming of for me. And so I'd say to you as well, if you've been put off before, maybe you tried some AI coding tools and you found they were not good enough for your use cases, do retry now.

The second is that iteration is cheap. Generations are quite fast, and models are non-deterministic, so you should retry often. Many times you try out some coding problem, the first solution is bad, and you decide that the model is not capable of that kind of reasoning. But because you can generate many iterations very quickly, I'd recommend that you just retry. In many cases, going back and forth with the model, I would only get to a working solution on the 10th or 11th try.

And to call out Positron Assistant, which by the way is a great way to get into vibe coding: they recently added a way to restore a previous checkpoint. So after the model makes some mistake, you can just go back to your previous checkpoint and start again. And here's a prompt that I often use in my debugging with these AI tools: I have the model identify five possible causes of a specific bug, code up a solution for each of those theoretical causes, and put each in a separate script, and then I go in and test each of those. So you can iterate very quickly this way and get to good outcomes.

And finally, apps are safer than stats. What I mean here is that it's easier to spot hallucinations in software applications than in statistical code. One reason I was initially very skeptical about these AI coding tools is that they can write bad code that you don't spot, and then your analysis is broken and you're in a mess because of the AI tool. Here are two examples of mistakes from a hypothetical model. You would imagine that the one on the left, "I've now made the button red," is much easier to spot than the one on the right, "I've now written the bootstrap code." The mistakes are both pretty blatant, but only one of them is easy to spot. So even if you're not feeling confident enough in the models to write your statistical code, they could maybe be building simple apps for you.
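To make that asymmetry concrete, here is a toy example of my own (not from the talk's slides): a bootstrap where the generated code forgets `replace = TRUE` runs without error and looks plausible, which is exactly the kind of silent statistical bug that's hard to spot.

```r
set.seed(1)
x <- rnorm(50, mean = 10)

# Buggy "bootstrap": sample() defaults to replace = FALSE, so each
# resample is just a permutation of x and every resampled mean is
# numerically identical -- the standard error comes out as ~0
boot_bad <- replicate(1000, mean(sample(x, length(x))))
sd(boot_bad)  # effectively 0: silently wrong, no error raised

# Correct bootstrap: resample with replacement
boot_ok <- replicate(1000, mean(sample(x, length(x), replace = TRUE)))
sd(boot_ok)  # a plausible standard error for the mean
```

A broken button announces itself the moment you load the app; this bug only announces itself if you already know what a bootstrap distribution should look like.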

And so if you've been convinced, here is a sort of vibe coding starter pack of software and tools you could use for this kind of AI-assisted development. One, of course, is Positron, which is bringing in a lot of useful AI tooling at the moment. You could start deploying HTML pages to GitHub Pages. And here are some ideas of things you could build. Interactive learning games: for example, at The Graph Courses we have many multiple-choice questions, and Klaus also talked about having interactive learnr exercises; you could try to vibe code those into more fun games for students. Student work galleries as well, and course websites: if you haven't built these yet because you're a bit scared of web and app development, you should definitely try to get into that. If you want to go deeper, there are also command-line interface tools like Claude Code, Gemini, Codex, and so on. And I'll put links at the end of the slide deck.

Caveats to consider

Okay, so those are some reasons to vibe code more. Now here are some caveats to consider. We'll go through each of them in turn. First is smaller is better. And so early on in this vibe coding process, maybe when I had one to four files, less than 2,000 lines, I could ask very vague questions or vague requests and get immediate correct responses from the models. But as the code base grew, you start to need to refer to specific files and give very detailed specifications about what you want the models to do. And for mature repos, you will run into a high anguish index. So we had said that recording and debugging videos was a 50 on that index. Letting a model make changes unsupervised on your large code base is off the charts. So as you get to large repos, these models really start to flounder.

And as one demonstration of the kind of large repo I wouldn't recommend using models on unsupervised: here's py-shiny, which is about 1,600 files and about a million tokens. That wouldn't even fit into the context window of the largest or best models at the moment.

Second is security, security. One area where hallucinations and mistakes are not obvious, even in application development, is security. You can have very big security holes in your application, and if you vibe coded it, you will not know where they are. This is known in the software development industry: there have been a few cases of folks who had big database holes, and there are questions about whether those were caused by the rise of vibe coding. Here's one example from my development of ChalkTalk: in testing, the models would often just make all our API routes public, meaning anyone could use up all of our credits, and I would need to go in and fix that. So that's one demonstration of the security issue. It still needs deep expertise, and this is one reason we haven't released ChalkTalk as a hosted platform yet, and are not planning to until we've had deeper professional security consulting on it.

And then last, and maybe most importantly, vibe coding is a bad name. It gives the impression that it's a very easy thing: that it doesn't require any focus and you don't need to know any coding. But I'd say it still requires focus even to build fairly simple apps, and some 101-level knowledge of web development will come in super handy.


And then for substantial apps, models need a lot of guidance. My real history is that I've done a lot of Shiny development and some web development massive open online courses, so I've learned a little web stuff. And you can tell from the kinds of prompts I use when communicating with these models: lots of references to CSS and body tags and things of that sort. So again, like Klaus, I've been forced to learn CSS as part of my data science career. And there are lots of last-mile issues as well: the models are bad at finding docs, handling auth, and so on.

Closing thoughts

Okay, so going back to the motivating questions from my talk: can I vibe code my dream application? I would say almost. If so, is there still a point? Is programming education still useful? I would say definitely yes. Because even as we move to a world where most code is AI-generated, you still need to speak the language of code to specify what you want to the models and to understand what you get out of them. And there will always be many last-mile problems that still need humans.

And as a quick analogy to that: should children learn arithmetic, one may wonder? I think most people would say yes. Because even in a world where we have calculators that can do arithmetic millions to billions of times faster than humans, you still need to speak the language of math to specify what you want to these calculators and to understand what you get out of them. And there are many math problems that still need humans with strong foundations; many of these are last-mile problems.

Okay, so closing again, my motivating questions. I can almost build this app. There is still a point of programming education. And unfortunately, I still have to travel with a bunch of recording equipment. And I'll say thank you. Here's some of our team. And here's a bunch of links.