Julia Silge's Blog, page 7

December 2, 2020

Tune random forests for #TidyTuesday IKEA prices

This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to get started quickly with tidymodels via
usemodels functions for code scaffolding and generation, using this week’s
#TidyTuesday dataset on IKEA furniture prices. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explo...
 •  0 comments  •  flag
Share on Twitter
Published on December 02, 2020 16:00

October 28, 2020

Tune and interpret decision trees for #TidyTuesday wind turbines

This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to tune, fit, and predict from decision tree models, using this week’s
#TidyTuesday dataset on Canadian wind turbines. ��������








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore data

Our modeling goal is to predict the ...

 •  0 comments  •  flag
Share on Twitter
Published on October 28, 2020 17:00

October 13, 2020

Predicting class membership for the #TidyTuesday Datasaurus Dozen

This is the latest in my series of
screencasts demonstrating how to use the
tidymodels packages, from starting out with first modeling steps to tuning more complex models. Today’s screencast uses a smaller dataset but lets us try out some important skills in modeling, using this week’s
#TidyTuesday dataset on the
Datasaurus Dozen.








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore data

The
Datasaurus Dozen dataset is a coll...

 •  0 comments  •  flag
Share on Twitter
Published on October 13, 2020 17:00

October 6, 2020

Modeling #TidyTuesday NCAA women's basketball tournament seeds

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through how to tune and choose hyperparameters using this week’s
#TidyTuesday dataset on NCAA women’s basketball tournaments. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

Our modeling goal is to estimate the relationsh...

 •  0 comments  •  flag
Share on Twitter
Published on October 06, 2020 17:00

September 22, 2020

Handle class imbalance in #TidyTuesday climbing expedition data with tidymodels

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from starting out with first modeling steps to tuning more complex models. Today’s screencast walks through a detailed model analysis from beginning to end, with important feature engineering steps and several model types, using this week’s
#TidyTuesday dataset on Himalayan climbing expeditions. ���








Here is the code I used in the video, for those who prefer reading instead of or in addition to v...

 •  0 comments  •  flag
Share on Twitter
Published on September 22, 2020 17:00

September 16, 2020

Introducing our new book, Tidy Modeling with R

Today Max Kuhn and I are pleased to announce our new book,
Tidy Modeling with R!



[image error]


This book focuses on how to use
tidymodels and has two main goals:




Readers will learn how to use R and tidymodels packages to create robust and reliable models. This is a practical book, full of code examples with real datasets.




The book encourages good methodology and statistical practice. The design of tidymodels packages (from the software itself to documentation, training materials, and this book) is ori...

 •  0 comments  •  flag
Share on Twitter
Published on September 16, 2020 17:00

September 1, 2020

Train and analyze many models for #TidyTuesday crop yields

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from just getting started to tuning more complex models. Today’s screencast explores how to fluently apply tidy data principles to the task of building many models using with this week’s
#TidyTuesday dataset on crop yields. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

Our modeling goal is to estimate how
crops yield...

 •  0 comments  •  flag
Share on Twitter
Published on September 01, 2020 17:00

August 10, 2020

Build a #TidyTuesday predictive text model for The Last Airbender

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast admittedly does not result in the best performing model you’ll ever see, but it is really fun and uses this week’s
#TidyTuesday dataset on Avatar: The Last Airbender. ���� ���� ���� ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

This wee...

 •  0 comments  •  flag
Share on Twitter
Published on August 10, 2020 17:00

July 27, 2020

Get started with tidymodels and #TidyTuesday Palmer penguins

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast is good for folks just getting started with tidymodels, using this week’s
#TidyTuesday dataset on penguins. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

This week’s #TidyTuesday dataset is from
palmerpenguins, observations of Antarcti...

1 like ·   •  0 comments  •  flag
Share on Twitter
Published on July 27, 2020 17:00

July 23, 2020

Supervised Machine Learning for Text Analysis in R

Today,
Emil Hvitfeldt and I led a
useR! 2020 online tutorial on predictive modeling with text using tidy data principles. This tutorial was hosted by
R-Ladies en Argentina; huge thanks to the organizers for their leadership and effort in making this tutorial possible.


tutorial flyer


Materials for this tutorial are
available on GitHub, with two main resources in the repo:



Slides, which you can
see rendered here and the
source for here
An
R Markdown file to work through

If you start working through the...

 •  0 comments  •  flag
Share on Twitter
Published on July 23, 2020 17:00