Julia Silge's Blog, page 4

January 27, 2022

Predict ratings for #TidyTuesday board games

Use custom feature engineering for board game categories, tune an xgboost model with racing methods, and use explainability methods for deeper understanding.
 •  0 comments  •  flag
Share on Twitter
Published on January 27, 2022 16:00

January 20, 2022

Text predictors for #TidyTuesday chocolate ratings

Get started with feature engineering for text data, transforming text to be used in machine learning algorithms.
 •  0 comments  •  flag
Share on Twitter
Published on January 20, 2022 16:00

December 14, 2021

Topic modeling for #TidyTuesday Spice Girls lyrics

This is the latest in my series of screencasts, but instead of being about tidymodels, this screencast focuses on unsupervised modeling for text, specifically topic modeling. Today���s screencast walks through how to build a structural topic model and then how to explore and understand it, with this week���s #TidyTuesday dataset on Spice Girls lyrics. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to video.

Explore data

Our modeling go...

 •  0 comments  •  flag
Share on Twitter
Published on December 14, 2021 16:00

November 26, 2021

Predicting viewership for #TidyTuesday Doctor Who episodes

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast walks through how to handle workflow objects, with this week���s #TidyTuesday dataset on Doctor Who episodes. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to video.

Explore data

Our modeling goal is to predict the UK viewership of Doctor Who episodes (since the ...

 •  0 comments  •  flag
Share on Twitter
Published on November 26, 2021 16:00

November 4, 2021

Spatial resampling for #TidyTuesday and the #30DayMapChallenge

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast walks through how to use spatial resampling for evaluating a model, with this week���s #TidyTuesday dataset on geographic data. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to video.

Explore data

Geographic data is special when it comes to, well, basically everyt...

 •  0 comments  •  flag
Share on Twitter
Published on November 04, 2021 17:00

October 21, 2021

Predict #TidyTuesday giant pumpkin weights with workflowsets

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages. If you are a tidymodels user, either just starting out or someone who has used the packages a lot, we are interested in your feedback on our priorities for 2022. The survey we fielded last year turned out to be very helpful in making decisions, so we would so appreciate your input again!

Today���s screencast is great for someone just starting out with workflowsets, the tidymodels package for hand...

 •  0 comments  •  flag
Share on Twitter
Published on October 21, 2021 17:00

September 28, 2021

Multiclass predictive modeling for #TidyTuesday NBER papers

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast walks through how to build, tune, and evaluate a multiclass predictive model with text features and lasso regularization, with this week���s #TidyTuesday dataset on NBER working papers. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to video.

Explore data

Our model...

 •  0 comments  •  flag
Share on Twitter
Published on September 28, 2021 17:00

September 14, 2021

Dimensionality reduction for #TidyTuesday Billboard Top 100 songs

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast focuses only on data preprocessing, or feature engineering; let���s walk through how to use dimensionality reduction for song features sourced from Spotify (mostly audio), with this week���s #TidyTuesday dataset on Billboard Top 100 songs. ����

Here is the code I used in the video, for those who prefer reading inst...

 •  0 comments  •  flag
Share on Twitter
Published on September 14, 2021 17:00

August 31, 2021

Fit and predict with tidymodels for #TidyTuesday bird baths in Australia

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast is good for folks who are newer to modeling or tidymodels; it focuses on how to use feature engineering together with a model algorithm and how to fit and predict, with this week���s #TidyTuesday dataset on bird baths in Australia. ����

Here is the code I used in the video, for those who prefer reading instead of o...

 •  0 comments  •  flag
Share on Twitter
Published on August 31, 2021 17:00

August 23, 2021

Modeling human/computer interactions on Star Trek from #TidyTuesday with workflowsets

This is the latest in my series of screencasts demonstrating how to use the tidymodels packages, from just getting started to tuning more complex models. Today���s screencast is on a more advanced topic, how to evaluate multiple combinations of feature engineering and modeling approaches via workflowsets, with this week���s #TidyTuesday dataset on Star Trek human/computer interactions. ����

Here is the code I used in the video, for those who prefer reading instead of or in addition to ...

 •  0 comments  •  flag
Share on Twitter
Published on August 23, 2021 17:00