Synchronized video and slides, along with mp3 and slide downloads, are available at http://bit.ly/1kMUPAe.
Josh Wills discusses using Hadoop technologies to build real-time data analysis models with a focus on strategies for data integration, large-scale machine learning, and experimentation. Filmed at qconsf.com.
Josh Wills is the director of data science at Cloudera. Wills is one of the main contributors to Cloudera’s most recent open source project, Crunch, a Java library that aims to make writing, testing, and running MapReduce pipelines easy, efficient, and even fun. Prior to joining Cloudera, Wills was a software engineer at Google. He holds an M.S.E. in operations research and a B.S. in mathematics.