
"Zaharia began building Apache Spark as a doctoral student at UC Berkeley in 2009, a faster alternative to Hadoop MapReduce, which had become the default framework for large-scale distributed data processing but was burdened by slow disk-based I/O between stages."
"Spark moved intermediate computation into memory, cutting processing times for iterative workloads, machine learning training, graph processing, stream analysis, from hours to minutes or seconds."
"The ACM, in its prize citation, credited Zaharia with 'visionary development of distributed data systems and computing infrastructure, which has enabled large-scale machine learning, analytics, and AI at global scale.'"
Matei Zaharia, a professor at Berkeley and co-founder of Databricks, received the 2026 ACM Prize in Computing for his work on Apache Spark. This $250,000 award recognizes his foundational contributions to distributed data systems and AI infrastructure. Zaharia created Spark to improve data processing speeds, significantly outperforming Hadoop MapReduce. His work led to the establishment of Databricks, which achieved a $134 billion valuation in December 2025. Zaharia believes AGI has already arrived in an unrecognized form and advocates for a shift in how AI is benchmarked.
Read at TNW | Insights
Unable to calculate read time
Collection
[
|
...
]