TY - JOUR
T1 - Facilitating Harmonization of Variables in Framingham, MESA, ARIC, and REGARDS Studies Through a Metadata Repository
AU - Mallya, Pratheek
AU - Stevens, Laura M.
AU - Zhao, Juan
AU - Hong, Chuan
AU - Henao, Ricardo
AU - Economou-Zavlanos, Nicoleta
AU - Wojdyla, Daniel M.
AU - Schibler, Tony
AU - Manchanda, Vihaan
AU - Pencina, Michael J.
AU - Hall, Jennifer L.
N1 - Publisher Copyright:
© 2023 Lippincott Williams and Wilkins. All rights reserved.
PY - 2023/11/1
Y1 - 2023/11/1
N2 - BACKGROUND: High-quality research in cardiovascular prevention, as in other fields, requires inclusion of a broad range of data sets from different sources. Integrating and harmonizing different data sources are essential to increase generalizability, sample size, and representation of understudied populations - strengthening the evidence for the scientific questions being addressed. METHODS: Here, we describe an effort to build an open-access repository and interactive online portal for researchers to access the metadata and code harmonizing data from 4 well-known cohort studies - the REGARDS (Reasons for Geographic and Racial Differences in Stroke) study, FHS (Framingham Heart Study), MESA (Multi-Ethnic Study of Atherosclerosis), and ARIC (Atherosclerosis Risk in Communities) study. We introduce a methodology and a framework used for preprocessing and harmonizing variables from multiple studies. RESULTS: We provide a real-case study and step-by-step guidance to demonstrate the practical utility of our repository and interactive web page. In addition to our successful development of such an open-access repository and interactive web page, this exercise in harmonizing data from multiple cohort studies has revealed several key themes. These themes include the importance of careful preprocessing and harmonization of variables, the value of creating an open-access repository to facilitate collaboration and reproducibility, and the potential for using harmonized data to address important scientific questions and disparities in cardiovascular disease research. CONCLUSIONS: By integrating and harmonizing these large-scale cohort studies, such a repository may improve the statistical power and representation of understudied cohorts, enabling development and validation of risk prediction models, identification and investigation of risk factors, and creating a platform for racial disparities research. REGISTRATION: URL: https://precision.heart.org/duke-ninds.
AB - BACKGROUND: High-quality research in cardiovascular prevention, as in other fields, requires inclusion of a broad range of data sets from different sources. Integrating and harmonizing different data sources are essential to increase generalizability, sample size, and representation of understudied populations - strengthening the evidence for the scientific questions being addressed. METHODS: Here, we describe an effort to build an open-access repository and interactive online portal for researchers to access the metadata and code harmonizing data from 4 well-known cohort studies - the REGARDS (Reasons for Geographic and Racial Differences in Stroke) study, FHS (Framingham Heart Study), MESA (Multi-Ethnic Study of Atherosclerosis), and ARIC (Atherosclerosis Risk in Communities) study. We introduce a methodology and a framework used for preprocessing and harmonizing variables from multiple studies. RESULTS: We provide a real-case study and step-by-step guidance to demonstrate the practical utility of our repository and interactive web page. In addition to our successful development of such an open-access repository and interactive web page, this exercise in harmonizing data from multiple cohort studies has revealed several key themes. These themes include the importance of careful preprocessing and harmonization of variables, the value of creating an open-access repository to facilitate collaboration and reproducibility, and the potential for using harmonized data to address important scientific questions and disparities in cardiovascular disease research. CONCLUSIONS: By integrating and harmonizing these large-scale cohort studies, such a repository may improve the statistical power and representation of understudied cohorts, enabling development and validation of risk prediction models, identification and investigation of risk factors, and creating a platform for racial disparities research. REGISTRATION: URL: https://precision.heart.org/duke-ninds.
KW - atherosclerosis
KW - cardiovascular disease
KW - metadata
KW - sample size
KW - stroke
UR - http://www.scopus.com/inward/record.url?scp=85178542758&partnerID=8YFLogxK
U2 - 10.1161/CIRCOUTCOMES.123.009938
DO - 10.1161/CIRCOUTCOMES.123.009938
M3 - Article
C2 - 37850400
AN - SCOPUS:85178542758
SN - 1941-7713
VL - 16
SP - E009938
JO - Circulation: Cardiovascular Quality and Outcomes
JF - Circulation: Cardiovascular Quality and Outcomes
IS - 11
ER -