Holden Karau

year in books

Holden Karau’s Followers (20)

member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
member photo
Ana  Ulin
777 books | 41 friends

Aly
Aly
585 books | 7 friends

Benjamin
2,226 books | 154 friends

Emily
1,499 books | 63 friends

Matt Healy
169 books | 133 friends

Morgan ...
316 books | 116 friends

Kelly E...
215 books | 206 friends

judytuna
280 books | 317 friends

More friends…

Holden Karau

Goodreads Author


Member Since
March 2014


Average rating: 3.86 · 740 ratings · 74 reviews · 14 distinct worksSimilar authors
Learning Spark: Lightning-F...

by
3.91 avg rating — 567 ratings — published 2013 — 15 editions
Rate this book
Clear rating
High Performance Spark: Bes...

by
3.98 avg rating — 128 ratings6 editions
Rate this book
Clear rating
Fast Data Processing with S...

2.77 avg rating — 30 ratings — published 2013 — 8 editions
Rate this book
Clear rating
Scaling Python with Ray: Ad...

by
4.40 avg rating — 5 ratings
Rate this book
Clear rating
Scaling Python with Ray: Ad...

by
it was amazing 5.00 avg rating — 2 ratings
Rate this book
Clear rating
Spark學習手冊

0.00 avg rating — 0 ratings
Rate this book
Clear rating
高性能Spark(影印版)

0.00 avg rating — 0 ratings
Rate this book
Clear rating
深入理解大数据:大数据处理与编程实践

0.00 avg rating — 0 ratings4 editions
Rate this book
Clear rating
Scaling Python with Dask: F...

by
0.00 avg rating — 0 ratings
Rate this book
Clear rating
Scaling Python with Dask: F...

by
0.00 avg rating — 0 ratings
Rate this book
Clear rating
More books by Holden Karau…
Her Majesty's Roy...
Rate this book
Clear rating

 
The Violinist's T...
Rate this book
Clear rating

 
The Icepick Surge...
Rate this book
Clear rating

 
Quotes by Holden Karau  (?)
Quotes are added by the Goodreads community and are not verified by Goodreads. (Learn more)

“Co-partitioning is related to but distinct from partition co-location. We say that multiple RDDs are co-partitioned if they are partitioned by the same known partitioner.”
Holden Karau, High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

“Note that the hash function you pass will be compared by identity to that of other RDDs. If you want to partition multiple RDDs with the same partitioner, pass the same function object (e.g., a global function) instead of creating a new lambda for each one!”
Holden Karau, Learning Spark: Lightning-Fast Big Data Analysis

“For each input source, Spark Streaming launches receivers, which are tasks running within the application’s executors that collect data from the input source and save it as RDDs. These receive the input data and replicate it (by default) to another executor for fault tolerance. This data is stored in the memory of the executors in the same way as cached RDDs.1”
Holden Karau, Learning Spark: Lightning-Fast Big Data Analysis

“decided early on that my ideal dog would be like Diefenbaker in the TV show Due South, which was popular in the UK where I was living.”
Zazie Todd, Wag: The Science of Making Your Dog Happy

No comments have been added yet.