This resource is no longer available
Data analysis is no longer restrained by having to stick to small samples of data; with the advent of machine learning, neural networks, and deep learning, companies are now able to leverage massive enterprise-wide data sets.
When an enterprise is considering setting up a data lake, streaming new data into that repository, and moving their existing data over as well, there are three dimensions to focus on:
- Data ingestion
- Data layout
- Data governance
Read this white paper to gain a deeper understanding of what a data lake is, how it fits into this picture, and more data lake best practices.