Resources

Daniel Falbel | What's new in TensorFlow for R | RStudio (2020)

TensorFlow is the most popular open-source platform for machine learning, and its ecosystem is evolving incredibly fast. In this talk we explore what's new in TensorFlow 2.0, how to build data pre-processing pipelines using the tfdatasets package, and how to use pre-trained models with tfhub.


Transcript

This transcript was generated automatically and may contain errors.

Hello, I'm Daniel, and today I'm going to talk about what's new in TensorFlow for R.

What is TensorFlow?

First, a quick recap on what TensorFlow is. Well, TensorFlow is an open-source platform for machine learning. It's especially useful for deep learning: it has fast implementations of the most common operations in deep learning, like convolutions, for example, and it's very efficient on CPUs, GPUs, and even TPUs, Google's custom hardware for deep learning. It provides automatic differentiation, which is also very useful for deep learning. And it's really production-ready: inside the TensorFlow ecosystem, there are multiple ways to deploy your models, even to mobile devices and cloud platforms.

Well, TensorFlow is mostly a Python project with a large ecosystem around it. I'm going to talk about TensorFlow for R, which is spread across multiple R packages.

There is the tensorflow R package, which provides basic access to the TensorFlow module, installation functions, and things like that. There is the keras R package, which wraps the tf.keras module; it's a higher-level API and the recommended way to start using TensorFlow. And tfdatasets wraps the tf.data module, which is used to load and preprocess data for deep learning. We also have a lot of other packages in the ecosystem, like tfhub, tfprobability, tfjs, tfautograph, and autokeras, which I'm going to talk about.

TensorFlow 2.0 and eager execution

So, what's new? There are a lot of packages, and things are moving fast. First, we have support for TensorFlow 2.0. Before 2.0, TensorFlow worked by having you define your whole computation graph first and then execute it, a programming model that is usually hard to think about. With 2.0 it's much easier because of eager execution: before, you wrote all the computation and then executed it; now operations run immediately, and tensors work much like normal R arrays.
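For instance, with eager execution a tensor can be created and used much like an R vector. A minimal sketch, assuming the tensorflow R package is installed and configured:

```r
library(tensorflow)

# With eager execution (the default in TF 2.0), operations run immediately:
x <- tf$constant(c(1, 2, 3))
y <- x * 2          # evaluated right away, no session or graph needed
as.numeric(y)       # convert back to a plain R vector
```

There is no separate "build graph, then run session" step anymore; the result of `x * 2` is available as soon as the line executes.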

Other changes in 2.0 include an API cleanup and things like that. There is also a new package by Tomasz Kalinowski called tfautograph, which lets us write conditionals and loops, with if, while, and for statements, just like plain R, and still have them run efficiently in TensorFlow. Before, you needed to write code with functional conditionals; now you can just write an if statement, and it will be converted to TensorFlow code.
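As a sketch of the difference, assuming the tfautograph package, a plain R `if` can be translated for use on symbolic tensors instead of writing functional conditionals by hand:

```r
library(tensorflow)
library(tfautograph)

# autograph() translates ordinary R control flow into TensorFlow ops,
# so the same function works on symbolic tensors inside a graph
double_if_positive <- autograph(function(x) {
  if (x > 0)
    x * 2
  else
    x - 1
})

double_if_positive(tf$constant(3))
```

The function names here follow the tfautograph package; the example itself is illustrative.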

Feature spec interface in tfdatasets

In tfdatasets, which lets you load and preprocess data, we have the new feature spec interface. This is useful for working with tabular data in deep learning models. It has an interface similar to recipes, from the tidymodels ecosystem: you define which transformations you want to apply to your tabular data, and then you fit the specification, which means it will find the vocabularies for categorical variables and the normalizing constants for your numeric variables, for example. Then you can just use the layer_dense_features() function in Keras, and all the transformations happen directly in the TensorFlow graph, so it's much easier to deploy this kind of model.
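A sketch of that workflow; the data frame `train_df` and its columns are hypothetical:

```r
library(tfdatasets)
library(keras)

# Hypothetical tabular data: numeric `age`, categorical `occupation`,
# outcome `income`
spec <- feature_spec(train_df, income ~ .) %>%
  step_numeric_column(age, normalizer_fn = scaler_standard()) %>%
  step_categorical_column_with_vocabulary_list(occupation) %>%
  step_indicator_column(occupation) %>%
  fit()   # computes normalizing constants and vocabularies from the data

# layer_dense_features() applies the fitted transformations
# directly in the TensorFlow graph
model <- keras_model_sequential() %>%
  layer_dense_features(feature_columns = dense_features(spec)) %>%
  layer_dense(units = 1)
```

Because the transformations live in the graph, the exported model can be deployed without shipping the preprocessing code separately.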

Next, we have some minor changes in tfdatasets. If you ever used dataset_map(), you previously needed to write a full function to map over the dataset; now you can use purrr-style lambda functions, which simplifies the code a bit. Also, you previously needed the make_iterator_one_shot() function to create an iterator over a TensorFlow dataset; now you can pass the dataset directly to Keras, and it will just work.
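Something like this, with a purrr-style lambda, as a small sketch:

```r
library(tfdatasets)

ds <- tensor_slices_dataset(1:100) %>%
  dataset_map(~ .x * 2L) %>%   # purrr-style lambda instead of function(x) x * 2L
  dataset_batch(10)

# Previously you needed make_iterator_one_shot(ds) before training;
# now `ds` can be passed directly to keras::fit()
```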

tf.hub and pre-trained models

Another cool new package is tfhub, which lets us use pre-trained models from the tfhub.dev website. People train models on large datasets and publish the pre-trained models there in a reusable format, and you can then use one as a Keras layer. You take the URL for a model, in this case the MobileNet model, and the layer_hub() function just takes that URL. This particular model takes an image and returns a feature vector representing it, so you can plug in a dense layer, for example, to build a classifier on top of the pre-trained model. tfhub.dev also includes models for text and video.
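In code, this looks roughly like the following; the tfhub.dev URL and the classifier head are illustrative:

```r
library(keras)
library(tfhub)

# A MobileNet feature-vector module from tfhub.dev (URL is illustrative)
mobilenet <- "https://tfhub.dev/google/imagenet/mobilenet_v2_100_224/feature_vector/4"

model <- keras_model_sequential() %>%
  layer_hub(handle = mobilenet, input_shape = c(224, 224, 3)) %>%
  layer_dense(units = 10, activation = "softmax")   # classifier on top
```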

tfhub also provides integration with tidymodels recipes. If you are familiar with recipes, you know the step functions, and there's a new one, step_pretrained_text_embedding(), that lets you plug in a pre-trained text model. The example here uses a model pre-trained on the Google News dataset, a very large corpus of news text. It converts raw text to a feature vector that you can use in any machine learning model, for instance by fitting a logistic regression on that feature vector.
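A sketch of such a recipe; the handle URL and the data frame `train_df` are illustrative:

```r
library(recipes)
library(tfhub)

# step_pretrained_text_embedding() maps raw text to a feature vector
# using a pre-trained embedding from tfhub.dev
rec <- recipe(label ~ text, data = train_df) %>%
  step_pretrained_text_embedding(
    text,
    handle = "https://tfhub.dev/google/tf2-preview/gnews-swivel-20dim/1"
  )

# The prepared recipe can then feed any model,
# e.g. a logistic regression on the embedding columns
```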

New Keras text preprocessing layers

Okay, in Keras we have new text pre-processing layers, starting in TensorFlow 2.1. Before, the Keras pre-processing functions were all based on SciPy and NumPy, so to deploy a model that used text pre-processing in Keras, you also needed a Python runtime. Now the pre-processing is built into the TensorFlow graph: you can build models using layer_text_vectorization(), and deployment becomes much easier. It works just like a normal Keras layer. You define the main parameters, and there's a new step called adapt, which will, for example, find all the distinct words in your text data, or the maximum length of each text, and so on. Then you just plug the layer into your Keras model, so the model takes raw strings as input, like a normal Keras model.
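A minimal sketch of that workflow; the parameter values and the character vector `train_texts` are illustrative:

```r
library(keras)

# The vectorization layer runs inside the TensorFlow graph,
# so no Python runtime is needed at deployment time
vectorize <- layer_text_vectorization(
  max_tokens = 10000,
  output_sequence_length = 100
)

# adapt() scans the training text to build the vocabulary
adapt(vectorize, train_texts)

# The model can now take raw strings as input
model <- keras_model_sequential() %>%
  vectorize() %>%
  layer_embedding(input_dim = 10001, output_dim = 16) %>%
  layer_global_average_pooling_1d() %>%
  layer_dense(units = 1, activation = "sigmoid")
```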

tf.probability, AutoKeras, and TFDS

Another cool new package, by Sigrid Keydana, is tfprobability, which provides a lot of statistical and probabilistic functions. It's built on top of TensorFlow, so most things come with CPU, GPU, and even TPU implementations, which is great and fast. I want to show you the distribution layer, which you can plug into your Keras model like a usual Keras layer. Instead of predicting a single value for each observation, you can, for example, predict a normal distribution for each observation, so you can calculate a standard deviation and so on. This opens up a lot of scope for deep learning.
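For example, a regression model whose output layer is a distribution might be sketched like this; the parameterization is illustrative:

```r
library(keras)
library(tensorflow)
library(tfprobability)

# The last layer outputs a normal distribution per observation,
# instead of a single point estimate
model <- keras_model_sequential() %>%
  layer_dense(units = 8, activation = "relu") %>%
  layer_dense(units = 2) %>%   # one output for the mean, one for the scale
  layer_distribution_lambda(function(t)
    tfd_normal(
      loc = t[, 1, drop = FALSE],
      scale = tf$math$softplus(t[, 2, drop = FALSE])  # keep scale positive
    )
  )
```

Training such a model against the distribution's log-likelihood gives you an uncertainty estimate alongside each prediction.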

There is autokeras, a package by Juan Cruz Rodriguez. It interfaces R to the AutoKeras package in Python, which uses AutoML techniques to build machine learning models. Instead of defining all your Keras layers and how they are connected, you can just use model_image_classifier() and say how many different models you want to try, and it will search for a good model for your dataset.
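Roughly like this, assuming the autokeras package and some image data `x_train`, `y_train`:

```r
library(autokeras)

# Search over candidate architectures instead of hand-designing one;
# max_trials is the number of different models to try
clf <- model_image_classifier(max_trials = 10)

clf %>% fit(x_train, y_train, epochs = 5)
```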

There is also tfds, a new and still very experimental package, which lets you load public datasets in the TensorFlow Datasets format. For example, you can load ImageNet with tfds without having to figure out yourself how to download and preprocess, I don't know, 100 gigabytes of images, so it's much easier to just test your machine learning model. It also provides a split API that lets you say directly that you want to split the data into training, validation, and test sets. It's a nice package when you are learning new things or trying out a deep learning model.
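A sketch of the split API; since tfds is experimental, the exact function names may have changed:

```r
library(tfds)

# Load MNIST, splitting the original training set into
# 80% train / 10% validation / 10% test
splits <- tfds_load(
  "mnist",
  split = c("train[:80%]", "train[80%:90%]", "train[90%:]")
)
```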

Model packages and community contributions

We are also providing some model packages: we took some commonly used deep learning models and packaged R wrappers for them. For example, there's the gpt2 package by Javier Luraschi. The GPT-2 model is a deep learning model by OpenAI that takes a prompt string and can complete it with text that really makes sense, which is pretty incredible.

We also implemented some deep learning models using raw Keras layers, so it's easier to learn and you can see advanced modeling code in Keras. There's U-Net, which is an image segmentation model, and DenseNet, a convolutional neural network architecture that was well known maybe two years ago; deep learning moves very fast. There are also community-contributed models, like RBERT by Jonathan Bratt and Jon Harmon, which provides an implementation of BERT, a Google model for text embedding.

We also have the TensorFlow for R blog, where Sigrid writes many, many articles showing the state of the art in TensorFlow and deep learning, with very detailed explanations.

Q&A

So, one of the questions on Slido is: classification and regression problems are very accessible via R's interface to TensorFlow. How about survival problems, such as time-to-event outcomes?

Usually, deep learning is very flexible, so these time-to-event models can be handled just by changing your loss function, and you can cover this with the keras package. There's also a blog post by Sigrid on this. Actually, that one is not deep learning; it uses tfprobability and Monte Carlo methods. What is it called? Anyone know? Censored data something. Yeah, censored data, right? You wrote that blog post. It uses TensorFlow Probability, so not deep learning, but also cool stuff.

What are RStudio's plans to support Torch or PyTorch? I don't know. I have experimented with Torch, like, a year ago with the C++ API, but I'm not sure if we are going to move forward with Torch for now.

Well, someone wants to know what we are working on right now. Yeah, TensorFlow is a large ecosystem; there's always a lot of stuff we want to work on. There's the dopamine library for reinforcement learning, which is something we would like to work on, and there's also still a lot of work to do on tfprobability.

All right, one more. Is the plan to track the Python interface to TensorFlow, or will the R extras mean that Python users won't recognize R-based TensorFlow code? Yeah, the keras R package tries to keep track of all Python changes, for example these new text preprocessing layers, and there will be image preprocessing layers and that kind of thing. We will always try to track the Keras API. But we also like to add the R way of doing things, like the feature spec interface, which doesn't exist in the Python implementation.