Julia Silge's Blog, page 8

July 14, 2020

Bagging with tidymodels and #TidyTuesday astronaut missions

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast focuses on
bagging using this week’s
#TidyTuesday dataset on astronaut missions. �����������








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

Our modeling goal is to use bagging (bootstrap aggregation) to model the duration of
astronaut missi...

 •  0 comments  •  flag
Share on Twitter
Published on July 14, 2020 17:00

June 29, 2020

The Bechdel test and the X-Mansion with tidymodels and #TidyTuesday

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast focuses on using bootstrap resampling with this week’s
#TidyTuesday dataset from the
Claremont Run Project about issues of the comic book series Uncanny X-Men. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Read in the data

Our modeling goal is to use i...

 •  0 comments  •  flag
Share on Twitter
Published on June 29, 2020 17:00

June 16, 2020

Impute missing data for #TidyTuesday voyages of captive Africans with tidymodels

This week’s
#TidyTuesday datasets reflect on Juneteenth, the date when the law abolishing slavery in the United States was finally enforced throughout the American South, and specifically Texas. My own education as a white woman in the United States has been gravely lacking in the history of enslaved people, their experiences, and their impact and legacy; I’m glad to spend some time in this screencast understanding more about the forced transport of African people using the

 •  0 comments  •  flag
Share on Twitter
Published on June 16, 2020 17:00

May 26, 2020

PCA and UMAP with tidymodels and #TidyTuesday cocktail recipes

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast isn’t about predictive modeling, but about unsupervised machine learning using with this week’s
#TidyTuesday dataset on cocktail recipes. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

Our modeling goal is to use unsupervised algorithms ...

 •  0 comments  •  flag
Share on Twitter
Published on May 26, 2020 17:00

May 25, 2020

tidylo is now on CRAN! ����

I am very pleased to announce that
tidylo, a package for weighted log odds using tidy data principles, is
now on CRAN! ���� I would like to send my warmest thanks to my collaborators
Alex Hayes and
Tyler Schnoebelen for their helpful contributions.


You can now install the released version of tidylo from
CRAN with:


install.packages("tidylo")

A log odds ratio is a way of expressing probabilities, and we can weight a log odds ratio so that our implementation does a better job dealing with dif...

 •  0 comments  •  flag
Share on Twitter
Published on May 25, 2020 17:00

May 20, 2020

Tune XGBoost with tidymodels and #TidyTuesday beach volleyball

Lately I’ve been publishing
screencasts demonstrating how to use the
tidymodels framework, starting from just getting started. Today’s screencast explores a more advanced topic in how to tune an XGBoost classification model using with this week’s
#TidyTuesday dataset on beach volleyball. ����








Here is the code I used in the video, for those who prefer reading instead of or in addition to video.


Explore the data

Our modeling goal is to predict whether a beach volleyball team of two won ...

 •  0 comments  •  flag
Share on Twitter
Published on May 20, 2020 17:00

May 14, 2020

Learn tidymodels with my supervised machine learning course

Today I am happy to announce that a new tidymodels-centric version of my free, online, interactive course, Supervised Machine Learning: Case Studies in R, has been published! ����

This is at least the third version of this course I’ve built at this point ���� but I believe it to be the best, in terms of how it communicates machine learning concepts and how useful to your real-world problems the demonstrated code will be. Huge thanks to my RStudio teammates such as Alison Hill and Max...

1 like ·   •  0 comments  •  flag
Share on Twitter
Published on May 14, 2020 17:00

May 12, 2020

Multinomial classification with tidymodels and #TidyTuesday volcano eruptions

Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast demonstrates how to implement multiclass or multinomial classification using with this week’s #TidyTuesday dataset on volcanoes. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to video.

Explore the data

Our modeling goal is to predict the type of...

 •  0 comments  •  flag
Share on Twitter
Published on May 12, 2020 17:00

May 5, 2020

Sentiment analysis with tidymodels and #TidyTuesday Animal Crossing reviews

A lot has been happening in the tidymodels ecosystem lately! There are many possible projects we on the tidymodels team could focus on next; we are interested in gathering community feedback to inform our priorities. If you are interested in sharing your opinion on next steps in tidymodels development, please take this short survey.

Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to tune more complex models....

 •  0 comments  •  flag
Share on Twitter
Published on May 05, 2020 17:00

April 21, 2020

Modeling #TidyTuesday GDPR violations with tidymodels

This is an exciting week for us on the tidymodels team; we launched tidymodels.org, a new central location with resources and documentation for tidymodels packages. There is a TON to explore and learn there! ���� You can check out the official blog post for more details.

Today, I’m publishing here on my blog another screencast demonstrating how to use tidymodels. This is a good video for folks getting started with tidymodels, using this week’s #TidyTuesday dataset on GDPR...

 •  0 comments  •  flag
Share on Twitter
Published on April 21, 2020 17:00