History of the Data Warehouse
In the 1970s and '80s, data began to proliferate and organizations needed an easy way store and access their information. Computer scientist Bill Inmon, who’s considered the father of data warehousing, began to define the concept in the 1970s and is credited as coining the term “data warehouse.” He published Building the Data Warehouse, a book lauded as a fundamental source on data warehousing technology, in 1992. Inmon’s definition of the data warehouse takes a “top-down” approach, where a centralized repository is established first, and then data marts – which contain specific subsets of data – are created within that repository.
Ralph Kimball, another technology expert who published The Data Warehouse Toolkit in the mid '90s, took a slightly different tactic to the data warehousing concept with his “bottom up” approach, where individual data marts are developed first and later integrated together to create a data warehouse.
Data warehousing remains relevant today, yet it’s evolving as the industry changes to accommodate cloud computing and real-time analytics. One emerging data storage tool that's similar to a data warehouse is a data lake, which was brought about by disruptive low-cost technologies such as Apache Hadoop. Data lakes are often used in conjunction with unfettered data streaming in and storing without processing or building schemas.