Kevin > Recent Status Updates

Showing 61-90 of 779
Kevin
Kevin is on page 79 of 256 of How Charts Lie: Getting Smarter about Visual Information
Choose an aspect ratio that does not exaggerate or minimize the change (in the data).
A general rule-of-thumb*: a 1/x increase or decrease should have a roughly x:1 aspect ratio (width:height)
- *This produces distorted charts when x is relatively large or small

"Chart design is similar to writing"
"Decoding a chart is similar to reading text"
Aug 06, 2021 05:37AM Add a comment
How Charts Lie: Getting Smarter about Visual Information

Kevin
Kevin is on page 79 of 256 of How Charts Lie: Getting Smarter about Visual Information
C2: Charts may lie by being poorly designed. When the mapping between values and symbols is not proportional. There can be visual distortions (scale, proportions).
The choice of baseline plays a large role in how a chart is perceived. The data ultimately govern this choice. E.g. Fahrenheit and Celsius don't have a minimum of zero, so a 0° baseline is often a bad choice.
Choose sensible baselines, not possible ones.
Aug 06, 2021 05:30AM Add a comment
How Charts Lie: Getting Smarter about Visual Information

Kevin
Kevin is on page 217 of 405 of Doing Data Science: Straight Talk from the Frontline
This chapter explores three methods useful for recommendation systems:
1) Linear Regression
2) Singular Value Decomposition
3) Principal Component Analysis

SVD decomposes the observation matrix into 3 matrices so that less important features can be identified and dropped from the model.
PCA identifies "latent features" that can be used to create a simpler model. Both involve lots of computation for large datasets.
Aug 04, 2021 08:22AM Add a comment
Doing Data Science: Straight Talk from the Frontline

Kevin
Kevin is on page 188 of 546 of Python Data Science Handbook: Essential Tools for Working with Data
Vectorized String Operations

Pandas excels at data munging with vectorized operations on strings.
Jul 29, 2021 09:20AM Add a comment
Python Data Science Handbook: Essential Tools for Working with Data

Kevin
Kevin is on page 53 of 256 of How Charts Lie: Getting Smarter about Visual Information
"A chart may lie because:
- It's poorly designed.
- It uses the wrong data.
- It shows an inappropriate amount of data—either too little or too much.
- It conceals or confuses uncertainty.
- It suggests misleading patterns.
- It panders to our expectations or prejudices."
Jul 28, 2021 08:03AM Add a comment
How Charts Lie: Getting Smarter about Visual Information

Kevin
Kevin is on page 53 of 256 of How Charts Lie: Getting Smarter about Visual Information
"To read a chart correctly, we must pay attention to it and never take anything for granted."
"...if you just skim them, you won't understand them, and well-designed charts aren't just informative but also graceful..."
Elements:
- Title, intro, caption, source
- Measurements, units, scales, legends
- Visual encoding methods
- Annotations
- Patterns, trends, relationships (using bird's-eye view)
Jul 28, 2021 08:02AM Add a comment
How Charts Lie: Getting Smarter about Visual Information

Kevin
Kevin is on page 53 of 256 of How Charts Lie: Getting Smarter about Visual Information
C1:How Charts Work
Charts encode information using: height/length, width, position, size/area, angle, color hue, and color shade.
Effective communication between a designer and an audience requires a shared understanding of what the chart is about and how the data is symbolized. Mental models save us time and effort.
Jul 28, 2021 07:57AM Add a comment
How Charts Lie: Getting Smarter about Visual Information

Kevin
Kevin is on page 147 of 546 of Python Data Science Handbook: Essential Tools for Working with Data
Combining Datasets: Concat and Append

Concatenation of Pandas Series and DataFrames works much the same as NumPy arrays, but indices are preserved in Pandas whereas they are not in NumPy.
Jul 20, 2021 02:51PM Add a comment
Python Data Science Handbook: Essential Tools for Working with Data

Kevin
Kevin is on page 141 of 546 of Python Data Science Handbook: Essential Tools for Working with Data
Multi-indexing can be useful for computing tables of aggregated statistics. It is very similar to performing the groupby operation.
Jul 19, 2021 02:48PM Add a comment
Python Data Science Handbook: Essential Tools for Working with Data

Kevin
Kevin is on page 148 of 405 of Doing Data Science: Straight Talk from the Frontline
Time series analysis is tricky and involves making a justifiable assumption about what the time is modeling. This choice that you make for one research question might be different for another question.
Jul 19, 2021 04:49AM Add a comment
Doing Data Science: Straight Talk from the Frontline

Follow Kevin's updates via RSS