About this item:

787 Views | 715 Downloads

Subjects:

Research Funding:

This work was supported in part by PHS Grant UL1 RR025008, KL2 RR025009 and TL1 RR025010 from the CTSA program, NIH, NCRR; NHLBI grant R24 HL085343; and M01 RR-00039 from the GCRC program, NIH, NCRR.

A Temporal Abstraction-based Extract, Transform and Load Process for Creating Registry Databases for Research.

Show all authors Show less authors

Tools:

Journal Title:

AMIA Joint Summits on Translational Science Proceedings

Volume:

Volume 2011

Publisher:

, Pages 46-50

Type of Work:

Article | Final Publisher PDF

Abstract:

In the CTSA era there is great interest in aggregating and comparing populations across institutions. These sites likely represent data differently in their clinical data warehouses and other databases. Clinical data warehouses frequently are structured in a generalized way that supports many constituencies. For research, there is a need to transform these heterogeneous data into a shared representation, and to perform categorization and interpretation to optimize the data representation for investigators. We are addressing this need by extending an existing temporal abstraction-based clinical database query system, PROTEMPA. The extended system allows specifying data types of interest in federated databases, extracting the data into a shared representation, transforming it through categorization and interpretation, and loading it into a registry database that can be refreshed. Such a registry's access control, data representation and query tools can be tailored to the needs of research while keeping local databases as the source of truth.

Copyright information:

© 2011 AMIA. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose.

Export to EndNote