One of the most active and fastest growing open source mammoth data cluster computing projects is Apache Spark,which was originally developed at U.
C. Berkeley's AMPLab and is now used by internet giants and other companies around the world. Including, as announced most recently, and IBM. In this Q&A with Spark inventor Matei Zaharia on the heels of the recent Spark Summit,we cover the difference between Hadoop MapReduce and Spark; what are the ingredients of a successful open source project; and the epic of how Spark almost helped a friend win a million dollars. MORE
Source: a16z.com