Publication Details

Evaluating robustness of a generalized linear model when applied to electronic health record data accessed using an Open API.

Type: article

Author(s): Sharma, Priya; Haaland, Perry; Krishnamurthy, Ashok; Lan, Bo; Schmitt, Patrick L; Sinha, Meghamala; Xu, Hao; Fecho, Karamarie

Pages: 14604582231170892


Publication Date: 2023

Journal: Health informatics journal

Volume: 29

Issue: 2

Doi: 10.1177/14604582231170892

Pmid: 37066514

Abstract: The Integrated Clinical and Environmental Exposures Service (ICEES) provides open regulatory-compliant access to clinical data, including electronic health record data, that have been integrated with environmental exposures data. While ICEES has been validated in the context of an asthma use case and several other use cases, the regulatory constraints on the ICEES open application programming interface (OpenAPI) result in data loss when using the service for multivariate analysis. In this study, we investigated the robustness of the ICEES OpenAPI through a comparative analysis, in which we applied a generalized linear model (GLM) to the OpenAPI data and the constraint-free source data to examine factors predictive of asthma exacerbations. Consistent with previous studies, we found that the main predictors identified by both analyses were sex, prednisone, race, obesity, and airborne particulate exposure. Comparison of GLM model fit revealed that data loss impacts model quality, but only with select interaction terms. We conclude that the ICEES OpenAPI supports multivariate analysis, albeit with potential data loss that users should be aware of.