Friday, August 30, 2013

Hybrid Option and Data Extraction

Hybrid Option - After examining the legacy systems and the more modern applications in your corporation, you will most likely decide that a single-platform approach is not workable for your data warehouse. This is the conclusion most companies come to. On the other hand, if your company falls in the category where the legacy platform will accommodate your data warehouse, then, by all means, take the approach of a single-platform solution. Again, the single-platform solution, if feasible, is the easier solution.

For the rest of us who are not that fortunate, we have to consider other options. Let us begin with data extraction, the first major operation, and follow the flow of data until it is consolidated into the load images and waiting in the staging area. We will now step through the data flow and examine the platform options.

Data Extraction - In any data warehouse, it is best to perform the data extraction function from each source system on its own computing platform. If your telephone sales data resides in a minicomputer environment, create the extract files for telephone sales on the minicomputer itself. If your mail order application executes on the mainframe using an IMS database, then create the extract files for mail orders on the mainframe platform. It is rarely prudent to copy all the mail order database files to another platform and then do the data extraction.

Initial Reformatting and Merging - After creating the raw data extracts from the various sources, the extracted files from each source are reformatted and merged into a smaller number of extract files. Verification of the extracted data against source system reports and reconciliation of input and output record counts take place in this step. Just as with the extraction step, it is best to do this initial merging of each set of source extracts on the source platform itself.
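The record-count reconciliation mentioned above can be sketched in a few lines of Python. This is only an illustration: the function names `merge_extracts` and `reconcile_counts`, the `source_file` tag, and the in-memory list-of-dicts representation are assumptions made for the example, not part of any particular tool.

```python
def merge_extracts(extracts):
    """Merge several extract files (here: lists of dict records, keyed by a
    hypothetical file name) into one list, tagging each record with its origin."""
    merged = []
    for name, records in extracts.items():
        for record in records:
            merged.append({**record, "source_file": name})
    return merged


def reconcile_counts(extracts, merged):
    """Compare the number of records read with the number written.

    Returns (counts_match, input_count, output_count).
    """
    input_count = sum(len(records) for records in extracts.values())
    output_count = len(merged)
    return input_count == output_count, input_count, output_count


if __name__ == "__main__":
    # Made-up sample data for illustration only.
    sample = {
        "phone_sales_east": [{"id": 1}, {"id": 2}],
        "phone_sales_west": [{"id": 3}],
    }
    ok, n_in, n_out = reconcile_counts(sample, merge_extracts(sample))
    print(f"match={ok} input={n_in} output={n_out}")
```

A mismatch between the two counts signals that records were dropped or duplicated during merging and should be investigated before the files leave the source platform.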

Preliminary Data Cleansing - In this step, you verify the extracted data from each data source for any missing values in individual fields, supply default values, and perform basic edits. This is another step for the computing platform of the source system itself. However, in some data warehouses, this type of data cleansing happens after the data from all sources are reconciled and consolidated. In either case, the features and conditions of data from your source systems dictate when and where this step must be performed for your data warehouse.
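As a minimal sketch of what supplying defaults and performing basic edits might look like, consider the Python below. The field names (`region`, `quantity`), the default values, and the single edit rule are all hypothetical; real rules depend on the source system.

```python
# Hypothetical defaults for fields that may arrive empty from the source.
DEFAULTS = {"region": "UNKNOWN", "quantity": 0}


def cleanse(record, defaults=DEFAULTS):
    """Fill missing or blank fields with defaults, then run a basic edit check.

    Returns (cleaned_record, list_of_error_messages).
    """
    cleaned = dict(record)  # do not modify the caller's record
    for field, default in defaults.items():
        value = cleaned.get(field)
        if value is None or (isinstance(value, str) and value.strip() == ""):
            cleaned[field] = default

    errors = []
    quantity = cleaned.get("quantity")
    # Basic edit: quantity must be a non-negative integer (assumed rule).
    if not isinstance(quantity, int) or quantity < 0:
        errors.append("quantity must be a non-negative integer")
    return cleaned, errors
```

Records that come back with a non-empty error list could be written to a reject file for review rather than passed on to the next step.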

Transformation and Consolidation - This step comprises all the major data transformation and integration functions. Usually, you will use transformation software tools for this purpose. Where is the best place to perform this step? Obviously not on any individual legacy platform. You perform this step on the platform where your staging area resides.
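In practice a transformation tool does this work, but the core idea of consolidation, bringing differently shaped source extracts into one common layout on the staging platform, can be illustrated with a short Python sketch. The source names, field names, and the `FIELD_MAPS` table are invented for the example.

```python
# Hypothetical mapping from each source's field names to a common layout.
FIELD_MAPS = {
    "telephone_sales": {"cust_no": "customer_id", "amt": "amount"},
    "mail_orders": {"customer": "customer_id", "order_total": "amount"},
}


def to_common_layout(source, record):
    """Rename one source record's fields to the common layout and tag its origin."""
    mapping = FIELD_MAPS[source]
    row = {target: record[src] for src, target in mapping.items()}
    row["source_system"] = source
    return row


def consolidate(extracts):
    """Combine extracts from all sources (keyed by source name) into one list
    of rows that share the common layout."""
    rows = []
    for source, records in extracts.items():
        rows.extend(to_common_layout(source, r) for r in records)
    return rows
```

Because this step needs the extracts from every source at once, it fits naturally on the staging platform rather than on any single source system.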
