Whether the goal is to take data ingestion cycles off the ETL tool and the data warehouse or to facilitate competitive data science and algorithm building in the organization, the data lake, a place for vast, unmodeled data, will be provisioned widely in 2020.
Though it doesn’t have to be complicated, the data lake has a few critical design points, and it does need to follow some principles for success. Build the data lake, but avoid building the data swamp! The tool ecosystem is building up around the data lake, and soon many organizations will have both a robust lake and a data warehouse. We will discuss policy to keep them straight, send data to its best platform, and keep users’ confidence up in their data platforms. The data lake is also converging with the data warehouse, a strong trend that will be covered in the webinar.
Learn more about William’s book, Integrating Hadoop, and read a free sample.
About William McKnight
William McKnight is an internationally recognized authority in information management. His consulting work has included many of the Global 2000 and numerous midmarket companies. His clients have reaped tremendous ROI and turned data into a real corporate asset. Many have gone public with their success stories.
William is the #1 global influencer in data warehousing, #1 in master data management, #3 in data management, #7 in information management and #14 in information architecture.
He is president of McKnight Consulting Group, which provides clients with action plans, architectures, strategies, complete programs and vendor-neutral tool selection to manage information. MCG is #1001 on the 2018 Inc. 5000 list of the fastest-growing companies in the US and #743 on the 2017 list.
He is the author of the books “Integrating Hadoop”, “Information Management: Strategies for Gaining a Competitive Advantage with Data” and “90 Days to Success in Consulting”.