Databricks to Launch First of Five Free Big Data Courses on Apache Spark
Databricks helps hundreds of organizations use Apache Spark to answer important questions by analyzing data. Apache Spark is an open-source data processing engine for engineers and analysts that includes an optimized general execution runtime and a set of standard libraries for building data pipelines, advanced algorithms, and more. Besides contributing over 75% of Apache Spark's...
Databricks Launches Second MOOC: Scalable Machine Learning
We have been working in collaboration with professors at UC Berkeley and UCLA to produce two freely available Massive Open Online Courses (MOOCs). The first MOOC was released earlier this month and has been a tremendous success, with over 60K students enrolled, many of them active. We are excited to announce that...
Databricks Launches MOOC: Data Science on Apache Spark
For the past several months, we have been working in collaboration with professors from the University of California, Berkeley and the University of California, Los Angeles to produce two freely available Massive Open Online Courses (MOOCs). We are proud to announce that both MOOCs will launch in June on the edX platform! The first course, called...
Databricks to run two massive online courses on Apache Spark
In the age of ‘Big Data,’ with datasets rapidly growing in size and complexity and cloud computing becoming more pervasive, data science techniques are fast becoming core components of large-scale data processing pipelines. Apache Spark offers analysts and engineers a powerful tool for building these pipelines, and learning to build such pipelines will soon be...