The challenges in developing a data lake are many so we won’t go into the whole list here. The guiding principles are:
- There should be no size limitation as a working assumption
- There are no limits on processing capabilities (this is a function of applied resources)
- Preventing the lake from becoming a “swamp” requires close management of what is included, documenting the source of the data elements, the frequency of the update and other characteristics (read more the in the Data Catalog section).