Loading both Lake and Warehouse - Single Transform Path
Data organization, build-vs-buy decisions, transformation auditing, and
technology choices all depend on your organization's policies, business
needs, and compliance requirements. We are going to look at some business
requirements that might put us on a different path from the parallel-load,
warehouse-first, and lake-first patterns discussed previously.
Discussion
This pattern assumes that all the primary raw and conformed/curated
transformations happen in one data repository with one set of tools. The
raw and conformed/curated zones are then replicated into the other
repository rather than being rebuilt there. Your organization chooses
whether the lake or the warehouse is the home for the transformations
that build those zones.
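
To make the flow concrete, here is a minimal sketch of the idea, not a real implementation. Everything in it is a hypothetical stand-in: `Repository` models a lake or warehouse as an in-memory dictionary of zones, `conform` is a placeholder transform, and conformed/curated is treated as a single zone for brevity. The point it illustrates is that the transform logic runs once, in the home repository, and the other repository only receives copies.

```python
from dataclasses import dataclass, field

# Hypothetical zone names; conformed/curated is collapsed into one zone here.
ZONES = ("raw", "conformed")


@dataclass
class Repository:
    """Illustrative stand-in for a lake or a warehouse: zone name -> rows."""
    name: str
    zones: dict = field(default_factory=lambda: {z: [] for z in ZONES})


def conform(row):
    """Placeholder transform: normalize column names and trim string values."""
    return {
        key.strip().lower(): (value.strip() if isinstance(value, str) else value)
        for key, value in row.items()
    }


def run_single_transform_path(source_rows, home, replica):
    """Transform in the home repository, then replicate the zones as-is."""
    # Step 1: every transformation runs once, in the home repository.
    home.zones["raw"] = [dict(row) for row in source_rows]
    home.zones["conformed"] = [conform(row) for row in home.zones["raw"]]

    # Step 2: the finished zones are copied, not recomputed, into the replica.
    for zone in ZONES:
        replica.zones[zone] = [dict(row) for row in home.zones[zone]]


# Usage: here the lake is chosen as the home for transformations.
lake = Repository("lake")
warehouse = Repository("warehouse")
run_single_transform_path([{" ID ": 1, "Name": " Ada "}], home=lake, replica=warehouse)
```

Swapping the `home` and `replica` arguments models the opposite choice, where the warehouse owns the transformations and the lake receives the copies. In either case there is only one body of transform code to audit.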