...

  1. Develop a data extraction strategy that can support extracting from 14 different MVP sites.
  2. Capture and correct as many unexpected data scenarios as possible.
  3. Extract as many observations as possible to model the indicators in the attached spreadsheet, eHealth Indicators for Pentaho Dashboard 2011-10-24.xls.
  4. Develop a load strategy and physical database schema that can support aggregation of the 14 sites' data into a common, centralized data store.
  5. Develop the needed models to allow Pentaho Analysis reports to be created against the loaded star schemas in the warehouse. 
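
To make objectives 4 and 5 more concrete, the following is a minimal sketch of how per-site extracts might land in one shared star schema. It is illustrative only: the table names (dim_site, fact_observation), the columns, the example site and concept names, and the use of SQLite are all assumptions and are not taken from the MVP design.

```python
import sqlite3

# Hypothetical star schema: one site dimension and one observation fact table.
SCHEMA = """
CREATE TABLE dim_site (
    site_key  INTEGER PRIMARY KEY,
    site_name TEXT UNIQUE NOT NULL
);
CREATE TABLE fact_observation (
    site_key      INTEGER NOT NULL REFERENCES dim_site(site_key),
    concept       TEXT NOT NULL,
    value_numeric REAL
);
"""


def create_warehouse():
    """Create an in-memory stand-in for the centralized data store."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def load_site(conn, site_name, observations):
    """Load (concept, value) pairs for one site, tagging each row with its site key."""
    conn.execute(
        "INSERT OR IGNORE INTO dim_site (site_name) VALUES (?)", (site_name,)
    )
    site_key = conn.execute(
        "SELECT site_key FROM dim_site WHERE site_name = ?", (site_name,)
    ).fetchone()[0]
    conn.executemany(
        "INSERT INTO fact_observation VALUES (?, ?, ?)",
        [(site_key, concept, value) for concept, value in observations],
    )
    conn.commit()


def counts_by_site(conn):
    """Return the number of loaded observations per site."""
    query = """
        SELECT s.site_name, COUNT(*)
        FROM fact_observation AS f
        JOIN dim_site AS s ON s.site_key = f.site_key
        GROUP BY s.site_name
    """
    return dict(conn.execute(query))
```

Because every fact row carries a site key, the same analysis model can report across all sites or filter down to a single one.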

MVP Tasks and Status

...

Task #1: Design deployment topology for the MVP ETL and data warehouse initiative.

...

This task is complete. The following images depict the recommended deployment structure for MVP.

[Image: recommended deployment structure for MVP]

A Pentaho Data Integration Carte server will be deployed at each site that has an OpenMRS implementation. The Carte server runs the ETL engine that executes the extraction jobs and transformations needed to pull the desired data out of OpenMRS. The extracted data will be written to CSV text files and compressed for eventual distribution to the central warehouse site (presumably in the United States).
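
As a rough illustration of the output format described above, the sketch below writes rows to a gzip-compressed CSV file and reads them back. It is plain Python and does not use Pentaho Data Integration or Carte; the function names, the field names, and the choice of gzip as the compression format are assumptions made for the example.

```python
import csv
import gzip
import os
import tempfile


def write_compressed_csv(rows, fieldnames, path):
    """Write a list of dict rows to a gzip-compressed CSV file."""
    with gzip.open(path, "wt", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


def read_compressed_csv(path):
    """Read a gzip-compressed CSV file back into a list of dict rows."""
    with gzip.open(path, "rt", newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


if __name__ == "__main__":
    # Hypothetical extract: field names are placeholders, not OpenMRS columns.
    sample = [
        {"obs_id": "1", "concept": "weight_kg", "value": "61.5"},
        {"obs_id": "2", "concept": "height_cm", "value": "170"},
    ]
    with tempfile.TemporaryDirectory() as workdir:
        target = os.path.join(workdir, "extract.csv.gz")
        write_compressed_csv(sample, ["obs_id", "concept", "value"], target)
        print(read_compressed_csv(target))
```

Note that values read back from CSV are always strings, so the load step at the warehouse would need to restore numeric and date types.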

[Image: recommended deployment structure for MVP]

Implement the ETL processes that will extract, de-identify and load OpenMRS data from the source OpenMRS instances in the field to a centralized warehouse.
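
This page does not say how de-identification is performed. One common approach, sketched below purely as an assumption, is to drop direct identifiers and replace the patient identifier with a keyed hash, so that records for the same patient can still be linked in the warehouse. The field names and the DIRECT_IDENTIFIERS set are hypothetical and do not reflect the OpenMRS data model.

```python
import hashlib
import hmac

# Hypothetical list of fields that must never leave the source site.
DIRECT_IDENTIFIERS = {"given_name", "family_name", "address"}


def pseudonymize(value, secret_key):
    """Return a stable keyed hash (HMAC-SHA256) of an identifier."""
    return hmac.new(secret_key, value.encode("utf-8"), hashlib.sha256).hexdigest()


def deidentify(record, secret_key, id_field="patient_id"):
    """Drop direct identifiers and replace the patient ID with a pseudonym."""
    cleaned = {
        key: value
        for key, value in record.items()
        if key not in DIRECT_IDENTIFIERS
    }
    cleaned[id_field] = pseudonymize(str(record[id_field]), secret_key)
    return cleaned
```

A keyed hash is used instead of a plain hash so that pseudonyms cannot be reproduced by someone who knows the identifier format but not the key. Whether this level of protection is enough for a given site is a policy question that the sketch does not address.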

...

Install and configure servers per design, both in the United States and Africa (refer to the design slides attached).

Deploy, schedule, monitor and maintain the warehouse solution.

...

See the OpenMRS Pentaho sprint page for sprint objectives and a summary of the development effort.

Attachments and Resources

Sprint Summary for Columbia University, Sprint Summary for Columbia University.pdf

eHealth Indicators from Columbia University, eHealth Indicators for Pentaho Dashboard 2011-10-24.xls