Friday, November 24, 2017

Azure Databricks for Apache Spark

Databricks are founders of Apache Spark, open source big data cluster computing framework.

A technical overview of Azure Databricks | Blog | Microsoft Azure

"Azure Databricks, an exciting new service in preview that brings together the best of the Apache Spark analytics platform and Azure cloud."

Foundations of Data Science, textbook & lectures

math-heavy

Foundations of Data Science - Lecture 1 | Microsoft Research | Channel 9


"Modern data often consists of feature vectors with a large number of features. High-dimensional geometry and Linear Algebra (Singular Value Decomposition) are two of the crucial areas which form the mathematical foundations of Data Science. This mini-course covers these areas, providing intuition and rigorous proofs. Connections between Geometry and Probability will be brought out. Text Book: Foundations of Data Science."

book: Foundations of Data Science, June 14, 2017.pdf (450 pages)

by John E. Hopcroft, Avrim Blum and Ravi Kannan