Open Source Engineer
Netflix
Holden is a transgender Canadian Open Source Engineer at Netflix with a focus on improving OSS data tooling. She is the co-author of Kubeflow for Machine Learning (2020), High Performance Spark (2017) and Learning Spark (2015). She is a committer and PMC member on Apache Spark and a committer on the SystemML and Mahout projects. She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal.
Talks I've Given
-
Validating Big Data Jobs - Stopping Failures before Production (w/ Spark, BEAM, & friends!)
Featuring Holden Karau
As big data jobs move from the proof-of-concept phase into powering real production services, you will need to consider what will happen when everything eventually goes wrong (such as recommending inappropriate products or other decisions taken on bad data).
scala scalax big-data apache-beam apache-spark
-
Keynote: The Magic Behind Spark
Featuring Holden Karau
Apache Spark is one of the most popular general-purpose distributed systems, and has driven a lot of growth in the Scala community. This talk will look at the magic that makes Spark work, peeling back the curtain to reveal the several hundred gnomes that secretly power most distributed systems.
scalax spark keynote scala apache-spark