In recent discussions with Joseph Dossantos, he pointed out to me that the differences in architecting an MDM solution for Reporting, such as for a Data Warehouse, and for real-time transaction processing, go beyond the choice of batch versus real-time Data Integration. Obviously, although the use of a batch ETL solution may be appropriate for integrating the source and target systems with a Master Data hub, it is insufficient for update and access to Master Data being used in transaction processing. For real-time Data Integration it is better to use an Enterprise Service Bus (ESB) and / or Service Oriented Architecture (SOA).
However, there are other differences in the architectural solution for real-time MDM. The common functions of MDM, such as matching and deduplication, also need to be architected for real-time use. The response to information requests needs to be instantaneous. Master Data for Reporting flows from source to hub to target to report (see Inmon’s Corporate Information Factory) but for transaction processing, all capabilities must be able to happen in any order or simultaneously.
Interesting post.
I am currently working with a client who have decided to go with a SOA enabled integration hub that includes MDM as well.
What are your thoughts on using this hub as a single source for the DW?