Sunday, October 28, 2018

ELT Data Lake <= ETL Data Warehouse

An informative podcast interview:
Data Warehouse with Christian Kleinerman - Software Engineering Daily

Data lake - Wikipedia

"A data lake is a system or repository of data stored in its natural format,[1] usually object blobs or files. A data lake is usually a single store of all enterprise data including raw copies of source system data and transformed data used for tasks such as reporting, visualization, analytics and machine learning. A data lake can include structured data from relational databases (rows and columns), semi-structured data (CSV, logs, XML, JSON), unstructured data (emails, documents, PDFs) and binary data (images, audio, video). [2]

A data swamp is a deteriorated data lake that is either inaccessible to its intended users or is providing little value."
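To make the ELT vs. ETL contrast in the title concrete, here is a minimal Python sketch. It is only an illustration under my own assumptions: the function names, the sample records, and the use of a temporary directory as the "lake" are all hypothetical and do not come from the podcast or the Wikipedia article.

```python
import json
import tempfile
from pathlib import Path


def clean(rec):
    """Hypothetical transform: keep only the fields a report needs."""
    return {"user": rec["user"].lower(), "amount": float(rec["amount"])}


def etl_load(records, warehouse):
    """ETL: transform first, then load only the cleaned rows."""
    for rec in records:
        warehouse.append(clean(rec))


def elt_load(records, lake_dir):
    """ELT: load raw records unchanged into the lake as JSON files."""
    for i, rec in enumerate(records):
        (lake_dir / f"event_{i}.json").write_text(json.dumps(rec))


def elt_transform(lake_dir):
    """ELT: transform later, reading from the raw copies in the lake."""
    rows = []
    for path in sorted(lake_dir.glob("event_*.json")):
        rows.append(clean(json.loads(path.read_text())))
    return rows


# Hypothetical source-system records, including a field the report ignores.
raw = [
    {"user": "Alice", "amount": "10.5", "note": "first order"},
    {"user": "BOB", "amount": "3", "note": "refund pending"},
]

warehouse = []          # stands in for a warehouse table
etl_load(raw, warehouse)

lake_dir = Path(tempfile.mkdtemp())  # stands in for object storage
elt_load(raw, lake_dir)
lake_rows = elt_transform(lake_dir)
```

Both paths end with the same cleaned rows, but only the lake still holds the raw `note` field, so a later question about it can be answered without going back to the source system.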

What is next? How about a "data ocean" metaphor :) That would be the web of data.
