Julia Silge's Blog, page 7
December 2, 2020
Tune random forests for #TidyTuesday IKEA prices
This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to get started quickly with tidymodels via
usemodels functions for code scaffolding and generation, using this week’s
#TidyTuesday dataset on IKEA furniture prices. ����
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explo...
October 28, 2020
Tune and interpret decision trees for #TidyTuesday wind turbines
This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to tune, fit, and predict from decision tree models, using this week’s
#TidyTuesday dataset on Canadian wind turbines. ��������
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore data
Our modeling goal is to predict the ...
October 13, 2020
Predicting class membership for the #TidyTuesday Datasaurus Dozen
This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast uses a smaller dataset but lets us try out some important skills in modeling, using this week’s
#TidyTuesday dataset on the
Datasaurus Dozen.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore data
The
Datasaurus Dozen dataset is a coll...
October 6, 2020
Modeling #TidyTuesday NCAA women's basketball tournament seeds
Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to tune and choose hyperparameters using this week’s
#TidyTuesday dataset on NCAA women’s basketball tournaments. ����
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to estimate the relationsh...
September 22, 2020
Handle class imbalance in #TidyTuesday climbing expedition data with tidymodels
Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through a detailed model analysis from beginning to end, with important feature engineering steps and several model types, using this week’s
#TidyTuesday dataset on Himalayan climbing expeditions. ���
Here is the code I used in the video, for those who prefer reading instead of or in addition to v...
September 16, 2020
Introducing our new book, Tidy Modeling with R
Today Max Kuhn and I are pleased to announce our new book,
Tidy Modeling with R!
This book focuses on how to use
tidymodels and has two main goals:
Readers will learn how to use R and tidymodels packages to create robust and reliable models. This is a practical book, full of code examples with real datasets.
The book encourages good methodology and statistical practice. The design of tidymodels packages (from the software itself to documentation, training materials, and this book) is ori...
September 1, 2020
Train and analyze many models for #TidyTuesday crop yields
Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from just getting started to tuning more complex models. Today’s screencast explores how to fluently apply tidy data principles to the task of building many models using with this week’s
#TidyTuesday dataset on crop yields. ����
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to estimate how
crops yield...
August 10, 2020
Build a #TidyTuesday predictive text model for The Last Airbender
Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast admittedly does not result in the best performing model you’ll ever see, but it is really fun and uses this week’s
#TidyTuesday dataset on Avatar: The Last Airbender. ���� ���� ���� ����
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
This wee...
July 27, 2020
Get started with tidymodels and #TidyTuesday Palmer penguins
Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast is good for folks just getting started with tidymodels, using this week’s
#TidyTuesday dataset on penguins. ����
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
This week’s #TidyTuesday dataset is from
palmerpenguins, observations of Antarcti...
July 23, 2020
Supervised Machine Learning for Text Analysis in R
Today,
Emil Hvitfeldt and I led a
useR! 2020 online tutorial on predictive modeling with text using tidy data principles. This tutorial was hosted by
R-Ladies en Argentina; huge thanks to the organizers for their leadership and effort in making this tutorial possible.
Materials for this tutorial are
available on GitHub, with two main resources in the repo:
Slides, which you can
see rendered here and the
source for here
An
R Markdown file to work through
If you start working through the...