The Korea Cohort Consortium: The Future of Pooling Cohort Studies

  • Lee, Sangjun (Department of Preventive Medicine, Seoul National University College of Medicine)
  • Ko, Kwang-Pil (Clinical Preventive Medicine Center, Seoul National University Bundang Hospital)
  • Lee, Jung Eun (Department of Food and Nutrition, Seoul National University)
  • Kim, Inah (Department of Occupational and Environmental Medicine, Hanyang University College of Medicine)
  • Jee, Sun Ha (Department of Epidemiology and Health Promotion, Institute for Health Promotion, Graduate School of Public Health, Yonsei University)
  • Shin, Aesun (Department of Preventive Medicine, Seoul National University College of Medicine)
  • Kweon, Sun-Seog (Department of Preventive Medicine, Chonnam National University Medical School)
  • Shin, Min-Ho (Department of Preventive Medicine, Chonnam National University Medical School)
  • Park, Sangmin (Department of Biomedical Sciences, Seoul National University Graduate School)
  • Ryu, Seungho (Department of Occupational and Environmental Medicine, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine)
  • Yang, Sun Young (Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center)
  • Choi, Seung Ho (Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center)
  • Kim, Jeongseon (Graduate School of Science and Policy, National Cancer Center)
  • Yi, Sang-Wook (Department of Preventive Medicine and Public Health, Catholic Kwandong University College of Medicine)
  • Kang, Daehee (Department of Preventive Medicine, Seoul National University College of Medicine)
  • Yoo, Keun-Young (Veterans Health Service Medical Center)
  • Park, Sue K. (Department of Preventive Medicine, Seoul National University College of Medicine)
  • Received : 2022.07.06
  • Accepted : 2022.08.19
  • Published : 2022.09.30

Abstract

Objectives: We introduced the cohort studies included in the Korea Cohort Consortium (KCC), focusing on large-scale cohort studies established in Korea with prolonged follow-up periods. We also provided projections of the follow-up and estimates of the sample size that would be necessary for big-data analyses based on pooling established cohort studies, including population-based genomic studies.

Methods: We mainly focused on the characteristics of the individual cohort studies in the KCC. We developed "PROFAN", a Shiny application for projecting the follow-up period needed to reach a given number of cases when pooling established cohort studies. As examples, we projected the follow-up periods for 5000 cases of gastric cancer, 2500 cases of prostate and breast cancer, and 500 cases of non-Hodgkin lymphoma. The sample sizes for sequencing-based analyses were also calculated, assuming a 1:1 case-control design.

Results: The KCC consisted of 8 individual cohort studies, of which 3 were community-based and 5 were health screening-based cohorts. The population-based cohort studies were mainly organized by Korean government agencies and research institutes. Based on a cohort of 0.5 million participants, the projected follow-up period needed to reach 5000 cases was at least 10 years. The mean minimum and maximum sample sizes for performing sequencing analyses were 5917 and 72 102, respectively.

Conclusions: We propose establishing a large-scale consortium based on the standardization and harmonization of existing cohort studies, so that the pooled sample is large enough to provide adequate statistical power for analyzing high-risk groups or rare cancer subtypes.
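The follow-up projection described in the Methods can be illustrated with a small calculation. The sketch below is not the PROFAN implementation, which is not described here; it assumes a closed cohort and a constant incidence rate, and the function name `years_to_reach_cases` and the rate of 100 per 100,000 person-years are hypothetical values chosen only for illustration.

```python
import math


def years_to_reach_cases(target_cases, cohort_size, rate_per_100k):
    """Follow-up years until the expected cumulative case count reaches target_cases.

    Simplifying assumptions (not taken from the paper): a closed cohort and a
    constant incidence rate, so the expected number of cases after t years is
    cohort_size * (1 - exp(-rate * t)).
    """
    if not 0 < target_cases < cohort_size:
        raise ValueError("target_cases must be positive and smaller than cohort_size")
    if rate_per_100k <= 0:
        raise ValueError("rate_per_100k must be positive")
    rate = rate_per_100k / 100_000  # cases per person-year
    # Solve cohort_size * (1 - exp(-rate * t)) = target_cases for t.
    return -math.log(1 - target_cases / cohort_size) / rate


# Hypothetical pooled cohort of 0.5 million participants.
pooled_size = 500_000
assumed_rate = 100  # per 100,000 person-years; illustrative, not from the paper
years = years_to_reach_cases(5000, pooled_size, assumed_rate)
print(f"{years:.1f} years of follow-up")
```

Under these assumptions, reaching 5000 cases in a cohort of 0.5 million takes roughly 10 years, the same order of magnitude as the projection reported in the Results. A real projection would also need to account for age structure, loss to follow-up, and changing incidence over time.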

Acknowledgement

This work was supported by the Korean Genome and Epidemiology Study (KoGES; 6635-302), National Institute of Health, Korea Disease Control and Prevention Agency, Republic of Korea, Korea National Health and Nutrition Examination Survey (KNHANES), Korea Disease Control and Prevention Agency, and customized cohort databases provided by the National Health Insurance Service (NHIS-2019-1-495, NHIS-2020-1-164), Occupational Safety and Health Research Institute (OSHRI), and Korea Occupational Safety and Health Agency (KOSHA).
