Multiple sources for the same data - which one to extract from?
Here is the situation:
The system that generates the data does not talk to a database directly. It talks to other components (systems) that end up storing the same source data in different ways. I understand that there are probably some architectural problems with this model, but that is what we currently have. The question is: should we build different data marts from these sources, or should we choose the most reliable one and extract from there? The argument for extracting from all of them is that it gives end users a way to cross-check between them.
Thanks
Last edited by arowshan on Fri Oct 21, 2011 11:51 am; edited 1 time in total
arowshan- Posts : 23
Join date : 2011-10-18
Location : Vancouver, Canada
Re: Multiple sources for the same data - which one to extract from?
Extract from all, consolidate into unified dimensional model.
BoxesAndLines- Posts : 1212
Join date : 2009-02-03
Location : USA
Re: Multiple sources for the same data - which one to extract from?
If you are getting almost the same data, let's say coin-in amount, from three different sources (possibly with different grains), how would you consolidate it into the same dimensional model? Is that one fact table? I mean, in Kimball terms those all represent the same business process. Could you elaborate?
arowshan- Posts : 23
Join date : 2011-10-18
Location : Vancouver, Canada
Re: Multiple sources for the same data - which one to extract from?
Generally speaking, best practice is to get the data from the original source and store it at the lowest grain attainable. Not knowing your exact situation or what the other systems do to the data, it's hard to say what is the best course of action. But I would be very wary of trying to get the same data from multiple sources, all of which have manipulated the data in some manner.
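As a rough illustration of why the lowest grain matters: detail rows from the original source can always be rolled up to check a coarser copy held elsewhere, but not the other way around. The sketch below assumes hypothetical hourly coin-in rows from the original source and hypothetical daily totals from a downstream system; every name and number in it is invented for illustration.

```python
from collections import defaultdict
from decimal import Decimal

# Hypothetical lowest-grain rows from the original source:
# (machine_id, date, hour, coin_in)
transaction_rows = [
    ("M1", "2011-10-20", 9, Decimal("120.00")),
    ("M1", "2011-10-20", 10, Decimal("80.50")),
    ("M2", "2011-10-20", 9, Decimal("45.25")),
]

# Hypothetical daily totals as stored by a downstream system.
downstream_daily = {
    ("M1", "2011-10-20"): Decimal("200.50"),
    ("M2", "2011-10-20"): Decimal("40.25"),
}


def roll_up_to_daily(rows):
    """Aggregate hourly rows to the (machine_id, date) grain."""
    totals = defaultdict(Decimal)  # Decimal() starts each total at 0
    for machine_id, date, _hour, coin_in in rows:
        totals[(machine_id, date)] += coin_in
    return dict(totals)


def reconcile(primary, secondary):
    """Return the keys whose amounts differ, with both values."""
    diffs = {}
    for key in primary.keys() | secondary.keys():
        a, b = primary.get(key), secondary.get(key)
        if a != b:
            diffs[key] = (a, b)
    return diffs


daily_from_source = roll_up_to_daily(transaction_rows)
mismatches = reconcile(daily_from_source, downstream_daily)
```

Here `mismatches` would flag only machine M2, where the downstream copy disagrees with the rolled-up detail. A check like this can live in the ETL audit layer, so end users do not need a separate data mart per source just to cross-check.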
Re: Multiple sources for the same data - which one to extract from?
arowshan wrote: If you are getting almost the same data, let's say coin-in amount, from three different sources (possibly with different grains), how would you consolidate it into the same dimensional model? Is that one fact table? I mean, in Kimball terms those all represent the same business process. Could you elaborate?
Like information should be pulled from the highest-quality source. What you generally find in the different sources are different data points that add context to your measures. One dimension could be sourced from one system, another dimension from yet another. In the worst cases, even your measures can come from multiple sources. Often you will find facts that arrive from only one source and are missing from the others. In that case, to get a holistic view, you need to ensure all distinct facts are loaded into a common model.
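A minimal sketch of that idea, assuming the extracts have already been conformed to a shared grain of (machine, date): each measure is taken from the highest-ranked source that supplies it, and measures found in only one source are still loaded. The source names, the ranking, and the figures are hypothetical.

```python
# Hypothetical source names, ordered from highest to lowest quality.
SOURCE_PRIORITY = ["accounting", "floor_system", "player_tracking"]

# Hypothetical extracts keyed by (machine_id, date); each source carries
# only the measures it actually stores.
extracts = {
    "accounting": {("M1", "2011-10-20"): {"coin_in": 200.5}},
    "floor_system": {
        ("M1", "2011-10-20"): {"coin_in": 199.0, "games_played": 310},
        ("M2", "2011-10-20"): {"coin_in": 45.25},
    },
    "player_tracking": {("M1", "2011-10-20"): {"carded_coin_in": 150.0}},
}


def consolidate(extracts, priority):
    """Build one fact row per key, taking each measure from the
    highest-priority source that supplies it and recording which
    source won."""
    fact = {}
    for source in priority:
        for key, measures in extracts.get(source, {}).items():
            row = fact.setdefault(key, {})
            for name, value in measures.items():
                if name not in row:  # a better source has not set it yet
                    row[name] = (value, source)
    return fact


fact_rows = consolidate(extracts, SOURCE_PRIORITY)
```

Keeping the winning source next to each value is one way to preserve lineage, so a disputed number can be traced back without maintaining parallel marts.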
BoxesAndLines- Posts : 1212
Join date : 2009-02-03
Location : USA