AI, Distributed Systems, and Open-Source Innovation by Matei Zaharia

Prof. Dr. Matei Zaharia is a Romanian-Canadian computer scientist who co-created Apache Spark in 2009 while at UC Berkeley. Spark transformed large-scale data processing, and the work has been cited more than 12,840 times. Zaharia's dissertation on the system earned him the 2014 ACM Doctoral Dissertation Award. In 2013, he co-founded Databricks, which as of January 2025 is valued at approximately $62 billion and reports over $1.3 billion in annual revenue, according to Forbes. Zaharia has also led the development of MLflow, Delta Lake, and Dolly, Databricks’ open-source large language model. He has received the U.S. Presidential Early Career Award for Scientists and Engineers (2019) and the ACM SIGOPS Mark Weiser Award (2023).

As an Associate Professor at UC Berkeley, Zaharia directs research on AI systems, cloud privacy, and LLM-data integration. He is scheduled to lead new research initiatives in these areas in fall 2025. He teaches two courses each semester, at the undergraduate and graduate levels, and mentors more than 30 PhD students. Although his primary expertise is in big data and AI infrastructure rather than cryptocurrency, Zaharia’s contributions to distributed systems and privacy-preserving architectures are foundational to secure blockchain analytics and decentralized data workflows.

His work continues to support scalable analytics and trusted computation, both of which are critical for the advancement of crypto and Web3 ecosystems.

  • Dan Blystone
  • 16.06.2026
Matei Zaharia confirms real-time lakehouse queries deliver under 10ms latency
A recent announcement from Matei Zaharia introduces a significant upgrade for data lakehouse users. According to Zaharia, it is now possible to run real-time queries on lakehouse data with ...
  • Hanna Syniavska
  • 21.05.2026
Matei Zaharia: Team builds advanced agents for analytical questions in biotech and finance
Technology expert Matei Zaharia has highlighted an opportunity to join a team focused on building advanced agents and models for analytical tasks in sectors such as biotechnology and finance. ...
  • Ashutosh Sureka
  • 15.05.2026
Matei Zaharia: Team advances biotech and finance analytics with reliable AI agents
A new initiative led by Matei Zaharia is seeking talent to develop artificial intelligence agents and models aimed at providing reliable answers to challenging analytical questions in sectors such ...
  • Andreas Kristo
  • 13.05.2026
Matei Zaharia says combining GEPA with RL yields improved learning from feedback
A novel approach merging GEPA (Genetic-Pareto), a reflective prompt optimization method, with Reinforcement Learning (RL) is being explored to enhance the quality of machine learning models. According to Matei ...
  • Eugene Komchuk
  • 08.05.2026
Matei Zaharia: Genie boosts Databricks data agent accuracy threefold
Genie, an innovative data agent, has achieved a threefold increase in accuracy for Databricks users compared to generic agents. This advancement, referenced by Matei Zaharia, comes as the research ...
  • Andreas Kristo
  • 25.04.2026
Matei Zaharia notes GPT 5.5 and Codex now manageable on Databricks with enhanced controls
OpenAI's GPT 5.5 and Codex models have been integrated into the Databricks platform, according to a recent post by Matei Zaharia. These tools now support the Unity AI Gateway, which allows ...
  • Iryna Sazhynska
  • 26.03.2026
Matei Zaharia: coSTAR agent pattern enhances user task performance at Databricks
Databricks has introduced the coSTAR pattern for building software agents, according to Matei Zaharia. The approach is designed to improve agents’ ability to address users’ most challenging ...
  • Eugene Komchuk
  • 18.03.2026
Matei Zaharia notes DSPy use case supported by MLflow and Databricks interfaces
A recent statement by Matei Zaharia draws attention to a notable application of DSPy, a framework for programming language models rather than hand-writing prompts. Zaharia mentions that DSPy is now well supported within MLflow and ...
  • Elena Nikulina
  • 06.03.2026
Matei Zaharia says synthetic data and RL help build specialized AI models
Matei Zaharia has outlined a new approach for developing specialized artificial intelligence models, detailed in a recent report. The method involves generating synthetic data using the current ...
  • Hlib Chabaniuk
  • 27.02.2026
Matei Zaharia highlights Databricks Harvard Cornell research showing off-policy RL outperforms on-policy
Matei Zaharia, co-founder and chief technologist of Databricks, draws attention to a recent collaborative effort between Databricks Research, Harvard University, and Cornell University that could ...
  • Ivan Andriyenko
  • 20.02.2026
Matei Zaharia: GEPA enhances coding skills with LLM-guided optimization
Matei Zaharia, a notable figure in the technology sector, has voiced excitement over new work on GEPA. The latest development showcases the use of a large language model (LLM)-guided ...
  • Andrey Mastykin
  • 04.02.2026
Matei Zaharia announces Lakebase GA simplifying database interactions
Matei Zaharia, co-founder of Databricks, has announced that Lakebase is now generally available, promising to simplify interactions with online databases. Lakebase introduces functionalities that ...
  • Parshwa Turakhiya
  • 21.11.2025
Matei Zaharia congratulates Databricks on database team recognition
Databricks has received significant recognition for its innovative work in the database domain. Matei Zaharia, co-founder of Databricks and a professor of computer science at Stanford University, ...
  • Marc Chandler
  • 09.10.2025
Matei Zaharia celebrates collaboration with Omar's Berkeley group
Matei Zaharia acknowledged a collaborative effort with a research group led by Omar at the University of California, Berkeley. Zaharia expressed his gratitude for having had the opportunity to ...
  • Igor Krasulya
  • 25.09.2025
Matei Zaharia: Prompt optimization enhances AI efficiency at lower cost
Matei Zaharia, a prominent figure in the AI community, highlights a groundbreaking development in artificial intelligence: prompt optimization techniques that outperform standard fine-tuning ...
  • Oleg Tkachenko
  • 09.09.2025
Matei Zaharia proposes overhaul of OLTP databases with Lakebase
Matei Zaharia, a distinguished computer scientist and co-founder of Databricks, recently delivered a keynote address at the Very Large Data Bases (VLDB) conference, advocating for a comprehensive ...
  • Ivan Andriyenko
  • 21.08.2025
Matei Zaharia unveils real-time mode for Spark on Databricks
Matei Zaharia, co-founder of Apache Spark, announces the public preview of a new real-time mode for Apache Spark streaming on Databricks. The updated feature aims to provide users with the ...