DOI QR코드

DOI QR Code

Uncover This Tech Term: Foundation Model

  • Kyu-Hwan Jung (Department of Medical Device Management and Research, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University)
  • 투고 : 2023.08.21
  • 심사 : 2023.08.23
  • 발행 : 2023.10.01

초록

키워드

참고문헌

  1. Bommasani R, Hudson DA, Adeli E, Altman R, Arora S, von Arx S, et al. On the opportunities and risks of foundation models. arXiv [Preprint]. 2021 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2108.07258
  2. Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, et al. Language models are few-shot learners. arXiv [Preprint]. 2020 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2005.14165
  3. Jiang LY, Liu XC, Nejatian NP, Nasir-Moin M, Wang D, Abidin A, et al. Health system-scale language models are all-purpose prediction engines. Nature 2023;619:357-362
  4. Harshvardhan GM, Gourisaria MK, Pandey M, Rautaray SS. A comprehensive survey and analysis of generative models in machine learning. Comput Sci Rev 2020;38:100285
  5. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. arXiv [Preprint]. 2017 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.1706.03762
  6. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, et al. An image is worth 16x16 words: transformers for image recognition at scale. arXiv [Preprint]. 2020 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2010.11929
  7. Wei J, Tay Y, Bommasani R, Raffel C, Zoph B, Borgeaud S, et al. Emergent abilities of large language models. arXiv [Preprint]. 2022 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2206.07682
  8. Gu J, Han Z, Chen S, Beirami A, He B, Zhang G, et al. A systematic survey of prompt engineering on vision-language foundation models. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2307.12980
  9. Fei N, Lu Z, Gao Y, Yang G, Huo Y, Wen J, et al. Towards artificial general intelligence via a multimodal foundation model. Nat Commun 2022;13:3094
  10. Kirillov A, Mintun E, Ravi N, Mao H, Rolland C, Gustafson L, et al. Segment anything [accessed on August 21, 2023]. Available at: https://segment-anything.com
  11. Radford A, Kim JW, Hallacy C, Ramesh A, Goh G, Agarwal S, et al. Learning transferable visual models from natural language supervision. arXiv [Preprint]. 2021 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2103.00020
  12. Alayrac JB, Donahue J, Luc P, Miech A, Barr I, Hasson Y, et al. Flamingo: a visual language model for few-shot learning. arXiv [Preprint]. 2022 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2204.14198
  13. Driess D, Xia F, Sajjadi MSM, Lynch C, Chowdhery A, Ichter B, et al. PaLM-E: an embodied multimodal language model. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2303.03378
  14. Bhayana R, Bleakney RR, Krishna S. GPT-4 in radiology: improvements in advanced reasoning. Radiology 2023;307:e230987
  15. Kanjee Z, Crowe B, Rodman A. Accuracy of a generative artificial intelligence model in a complex diagnostic challenge. JAMA 2023;330:78-80
  16. Wang G, Yang G, Du Z, Fan L, Li X. ClinicalGPT: large language models finetuned with diverse medical data and comprehensive evaluation. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2306.09968
  17. Liu Z, Zhong A, Li Y, Yang L, Ju C, Wu Z, et al. Radiology-GPT: a large language model for radiology. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2306.08666
  18. Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature 2023;620:172-180
  19. Singhal K, Tu T, Gottweis J, Sayres R, Wulczyn E, Hou L, et al. Towards expert-level medical question answering with large language models. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2305.09617
  20. Ma J, He Y, Li F, Han L, You C, Wang B. Segment anything in medical images. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2304.12306
  21. Zhou J, He X, Sun L, Xu J, Chen X, Chu Y, et al. SkinGPT-4: an interactive dermatology diagnostic system with visual large language model. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2304.10691
  22. Li C, Wong C, Zhang S, Usuyama N, Liu H, Yang J, et al. LLaVA-Med: training a large language-and-vision assistant for biomedicine in one day. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2306.00890
  23. Tiu E, Talius E, Patel P, Langlotz CP, Ng AY, Rajpurkar P. Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning. Nat Biomed Eng 2022;6:1399-1406
  24. Huang Z, Bianchi F, Yuksekgonul M, Montine TJ, Zou J. A visual-language foundation model for pathology image analysis using medical Twitter. Nat Med 2023 Aug 17. [Epub]. https://doi.org/10.1038/s41591-023-02504-3
  25. Li J, Liu C, Cheng S, Arcucci R, Hong S. Frozen language model helps ECG zero-shot learning. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2303.12311
  26. Zhang K, Yu J, Yan Z, Liu Y, Adhikarla E, Fu S, et al. BiomedGPT: a unified and generalist biomedical generative pre-trained transformer for vision, language, and multimodal tasks. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2305.17100
  27. Wu C, Zhang X, Zhang Y, Wang Y, Xie W. Towards generalist foundation model for radiology. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2308.02463
  28. Tu T, Azizi S, Driess D, Schaekermann M, Amin M, Chang PC, et al. Towards generalist biomedical AI. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2307.14334
  29. Tian S, Jin Q, Yeganova L, Lai PT, Zhu Q, Chen X, et al. Opportunities and challenges for ChatGPT and large language models in biomedicine and health. arXiv [Preprint]. 2023 [cited August 21, 2023]. Available at: https://doi.org/10.48550/arXiv.2306.10070
  30. Gilbert S, Harvey H, Melvin T, Vollebregt E, Wicks P. Large language model AI chatbots require approval as medical devices. Nat Med 2023 Jun 30. [Epub]. https://doi.org/10.1038/s41591-023-02412-6