Introducing tidylo

Today I am so pleased to introduce a new package for calculating weighted log odds ratios, tidylo.

Often in data analysis, we want to measure how the usage or frequency of some feature, such as words, differs across some group or set, such as documents. One statistic often used to find these kinds of differences in text data is tf-idf. Another option is to use the log odds ratio, but the log odds ratio alone does not account for sampling variability. We haven't counted every feature the same...
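To make the sampling variability point concrete, here is a minimal sketch of the plain, unweighted log odds ratio. It is not tidylo's weighted version, and the function name `log_odds_ratio` and all of the counts are made up for illustration.

```python
import math


def log_odds_ratio(count_a, total_a, count_b, total_b):
    """Plain log odds ratio of one feature between two sets.

    count_* is how often the feature occurs in each set, total_* is the
    total number of feature occurrences in that set. Hypothetical helper
    for illustration only; it assumes 0 < count < total in both sets.
    """
    odds_a = count_a / (total_a - count_a)
    odds_b = count_b / (total_b - count_b)
    return math.log(odds_a / odds_b)


# A rare word seen 2 times vs. 1 time in two small documents...
small = log_odds_ratio(2, 1_000, 1, 1_000)

# ...and the same proportions backed by 100 times as much data.
large = log_odds_ratio(200, 100_000, 100, 100_000)

print(small, large)
```

Both calls give essentially the same value, even though the second comparison rests on far more evidence than the first. The plain log odds ratio cannot tell these two situations apart.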

Published on July 07, 2019 17:00