Julia Silge's Blog, page 8
July 14, 2020
Bagging with tidymodels and #TidyTuesday astronaut missions
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast focuses on bagging using this week’s #TidyTuesday dataset on astronaut missions.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to use bagging (bootstrap aggregation) to model the duration of
astronaut missi...
June 29, 2020
The Bechdel test and the X-Mansion with tidymodels and #TidyTuesday
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast focuses on using bootstrap resampling with this week’s #TidyTuesday dataset from the Claremont Run Project about issues of the comic book series Uncanny X-Men.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Read in the data
Our modeling goal is to use i...
June 16, 2020
Impute missing data for #TidyTuesday voyages of captive Africans with tidymodels
This week’s #TidyTuesday datasets reflect on Juneteenth, the date when the law abolishing slavery in the United States was finally enforced throughout the American South, and specifically Texas. My own education as a white woman in the United States has been gravely lacking in the history of enslaved people, their experiences, and their impact and legacy; I’m glad to spend some time in this screencast understanding more about the forced transport of African people using the...
May 26, 2020
PCA and UMAP with tidymodels and #TidyTuesday cocktail recipes
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast isn’t about predictive modeling, but about unsupervised machine learning using this week’s #TidyTuesday dataset on cocktail recipes.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to use unsupervised algorithms ...
May 25, 2020
tidylo is now on CRAN!
I am very pleased to announce that tidylo, a package for weighted log odds using tidy data principles, is now on CRAN! I would like to send my warmest thanks to my collaborators Alex Hayes and Tyler Schnoebelen for their helpful contributions.
You can now install the released version of tidylo from CRAN with:
install.packages("tidylo")
A log odds ratio is a way of expressing probabilities, and we can weight a log odds ratio so that our implementation does a better job dealing with dif...
May 20, 2020
Tune XGBoost with tidymodels and #TidyTuesday beach volleyball
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, starting with the very first steps. Today’s screencast explores a more advanced topic: how to tune an XGBoost classification model using this week’s #TidyTuesday dataset on beach volleyball.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to predict whether a beach volleyball team of two won ...
May 14, 2020
Learn tidymodels with my supervised machine learning course
Today I am happy to announce that a new tidymodels-centric version of my free, online, interactive course, Supervised Machine Learning: Case Studies in R, has been published!
This is at least the third version of this course I’ve built at this point, but I believe it to be the best, in terms of how it communicates machine learning concepts and how useful to your real-world problems the demonstrated code will be. Huge thanks to my RStudio teammates such as Alison Hill and Max...
May 12, 2020
Multinomial classification with tidymodels and #TidyTuesday volcano eruptions
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to evaluate complex models. Today’s screencast demonstrates how to implement multiclass or multinomial classification using this week’s #TidyTuesday dataset on volcanoes.
Here is the code I used in the video, for those who prefer reading instead of or in addition to video.
Explore the data
Our modeling goal is to predict the type of...
May 5, 2020
Sentiment analysis with tidymodels and #TidyTuesday Animal Crossing reviews
A lot has been happening in the tidymodels ecosystem lately! There are many possible projects we on the tidymodels team could focus on next; we are interested in gathering community feedback to inform our priorities. If you are interested in sharing your opinion on next steps in tidymodels development, please take this short survey.
Lately I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to tune more complex models....
April 21, 2020
Modeling #TidyTuesday GDPR violations with tidymodels
This is an exciting week for us on the tidymodels team; we launched tidymodels.org, a new central location with resources and documentation for tidymodels packages. There is a TON to explore and learn there! You can check out the official blog post for more details.
Today, I’m publishing here on my blog another screencast demonstrating how to use tidymodels. This is a good video for folks getting started with tidymodels, using this week’s #TidyTuesday dataset on GDPR...