Discussion about this post

User's avatar
Ryfl's avatar

"Every day there is some new technology we have to learn... by the way, have you heard about Estuary?" No, not something else... :-D

Expand full comment
Peter Andrew Nolan's avatar

1. Hi Ben,

there have only been a couple of major innovations in the data warehousing area over the last 30 years. The biggest two innovations was the move from "only send the data you think you need into the data warehouse" to "send all the data that might one day be important into the data warehouse". And the second big innovation was the ability to support archives in dimensional models.

In the 80s disk was so expensive there was simply no way you could get a budget to replicate all the data in operational systems into a data warehouse. When I started in 1982 disk was USD100K per GB. A 9600baud dedicated modem line was USD40,000 per year and you could put 8 terminals on the end of each 9600 baud modem. The concept of "replicating" data was IN-SANE. It was all we could do to actually process the data we had. I worked on some of the largest systems in the world in the second half of the 80s because I worked on the IBM Internal Billing Systems. I was the guy who got IBM Japan to accept and install our Billing System. It was the largest single image billing system in the world.

So Ralph and the guys at Metaphor hit on the idea of just put the data into a dimensional data warehouse that matches the value drivers of the business so that you can answer 90% of all the most important questions that the business comes up with. I met Bill at the 1993 Metaphor users conference and he shocked all the attendees by proposing that ALL DATA in operational systems should not only be put into the data warehouse but that up to 10 years history should be kept. This was such a radical idea that I didn't even get it. I thought he could not possibly be suggesting this. When we brought him to Australia and he did a full day presentation / workshop in Sydney I really "got" that he was proposing store ALL the data....for at least 10 years.

Obviously dimensional models could not do that so we pretty much had to adopt his TV+SA models to do that. These were VERY hard to sell because of the cost and most companies I tried to sell them to would not buy them.

The first BIG innovation was invented by my mentor, who Ralph left in charge of data models at Metaphor when he left in 1989, and myself in 1996. I had moved to Hitachi and we were selling disk drives essentially. And so we were selling "put all your data into the dimensional data warehouse" to sell disk drives. To that time we used to build the data warehouse as either a dimensional data warehouse like Ralph/Metaphor proposed or we built it as an archive like Bill proposed. At Hitachi Asia Pacific, which is what I ran for DWH, we committed to only dimensional models.

Expand full comment
1 more comment...

No posts