통합 검색 | Korea Science

딥러닝 기반 비디오 캡셔닝의 연구동향 분석 (Analysis of Research Trends in Deep Learning-Based Video Captioning)

려치;이은주;김영수
- 정보처리학회논문지:소프트웨어 및 데이터공학
- /
- 제13권1호
- /
- pp.35-49
- /
- 2024
컴퓨터 비전과 자연어 처리의 융합의 중요한 결과로서 비디오 캡셔닝은 인공지능 분야의 핵심 연구 방향이다. 이 기술은 비디오 콘텐츠의 자동이해와 언어 표현을 가능하게 함으로써, 컴퓨터가 비디오의 시각적 정보를 텍스트 형태로 변환한다. 본 논문에서는 딥러닝 기반 비디오 캡셔닝의 연구 동향을 초기 분석하여 CNN-RNN 기반 모델, RNN-RNN 기반 모델, Multimodal 기반 모델, 그리고 Transformer 기반 모델이라는 네 가지 주요 범주로 나누어 각각의 비디오 캡셔닝 모델의 개념과 특징 그리고 장단점을 논하였다. 그리고 이 논문은 비디오 캡셔닝 분야에서 일반적으로 자주 사용되는 데이터 집합과 성능 평가방안을 나열하였다. 데이터 세트는 다양한 도메인과 시나리오를 포괄하여 비디오 캡션 모델의 훈련 및 검증을 위한 광범위한 리소스를 제공한다. 모델 성능 평가방안에서는 주요한 평가 지표를 언급하며, 모델의 성능을 다양한 각도에서 평가할 수 있도록 연구자들에게 실질적인 참조를 제공한다. 마지막으로 비디오 캡셔닝에 대한 향후 연구과제로서 실제 응용 프로그램에서의 복잡성을 증가시키는 시간 일관성 유지 및 동적 장면의 정확한 서술과 같이 지속해서 개선해야 할 주요 도전과제와 시간 관계 모델링 및 다중 모달 데이터 통합과 같이 새롭게 연구되어야 하는 과제를 제시하였다.
https://doi.org/10.3745/KTSDE.2024.13.1.35 인용 PDF

Accuracy of posteroanterior cephalogram landmarks and measurements identification using a cascaded convolutional neural network algorithm: A multicenter study

Sung-Hoon Han;Jisup Lim;Jun-Sik Kim;Jin-Hyoung Cho;Mihee Hong;Minji Kim;Su-Jung Kim;Yoon-Ji Kim;Young Ho Kim;Sung-Hoon Lim;Sang Jin Sung;Kyung-Hwa Kang;Seung-Hak Baek;Sung-Kwon Choi;Namkug Kim
- 대한치과교정학회지
- /
- 제54권1호
- /
- pp.48-58
- /
- 2024
Objective: To quantify the effects of midline-related landmark identification on midline deviation measurements in posteroanterior (PA) cephalograms using a cascaded convolutional neural network (CNN). Methods: A total of 2,903 PA cephalogram images obtained from 9 university hospitals were divided into training, internal validation, and test sets (n = 2,150, 376, and 377). As the gold standard, 2 orthodontic professors marked the bilateral landmarks, including the frontozygomatic suture point and latero-orbitale (LO), and the midline landmarks, including the crista galli, anterior nasal spine (ANS), upper dental midpoint (UDM), lower dental midpoint (LDM), and menton (Me). For the test, Examiner-1 and Examiner-2 (3-year and 1-year orthodontic residents) and the Cascaded-CNN models marked the landmarks. After point-to-point errors of landmark identification, the successful detection rate (SDR) and distance and direction of the midline landmark deviation from the midsagittal line (ANS-mid, UDM-mid, LDM-mid, and Me-mid) were measured, and statistical analysis was performed. Results: The cascaded-CNN algorithm showed a clinically acceptable level of point-to-point error (1.26 mm vs. 1.57 mm in Examiner-1 and 1.75 mm in Examiner-2). The average SDR within the 2 mm range was 83.2%, with high accuracy at the LO (right, 96.9%; left, 97.1%), and UDM (96.9%). The absolute measurement errors were less than 1 mm for ANS-mid, UDM-mid, and LDM-mid compared with the gold standard. Conclusions: The cascaded-CNN model may be considered an effective tool for the auto-identification of midline landmarks and quantification of midline deviation in PA cephalograms of adult patients, regardless of variations in the image acquisition method.
https://doi.org/10.4041/kjod23.075 인용 PDF

Deep Learning-Based Computed Tomography Image Standardization to Improve Generalizability of Deep Learning-Based Hepatic Segmentation

Seul Bi Lee;Youngtaek Hong;Yeon Jin Cho;Dawun Jeong;Jina Lee;Soon Ho Yoon;Seunghyun Lee;Young Hun Choi;Jung-Eun Cheon
- Korean Journal of Radiology
- /
- 제24권4호
- /
- pp.294-304
- /
- 2023
Objective: We aimed to investigate whether image standardization using deep learning-based computed tomography (CT) image conversion would improve the performance of deep learning-based automated hepatic segmentation across various reconstruction methods. Materials and Methods: We collected contrast-enhanced dual-energy CT of the abdomen that was obtained using various reconstruction methods, including filtered back projection, iterative reconstruction, optimum contrast, and monoenergetic images with 40, 60, and 80 keV. A deep learning based image conversion algorithm was developed to standardize the CT images using 142 CT examinations (128 for training and 14 for tuning). A separate set of 43 CT examinations from 42 patients (mean age, 10.1 years) was used as the test data. A commercial software program (MEDIP PRO v2.0.0.0, MEDICALIP Co. Ltd.) based on 2D U-NET was used to create liver segmentation masks with liver volume. The original 80 keV images were used as the ground truth. We used the paired t-test to compare the segmentation performance in the Dice similarity coefficient (DSC) and difference ratio of the liver volume relative to the ground truth volume before and after image standardization. The concordance correlation coefficient (CCC) was used to assess the agreement between the segmented liver volume and ground-truth volume. Results: The original CT images showed variable and poor segmentation performances. The standardized images achieved significantly higher DSCs for liver segmentation than the original images (DSC [original, 5.40%-91.27%] vs. [standardized, 93.16%-96.74%], all P < 0.001). The difference ratio of liver volume also decreased significantly after image conversion (original, 9.84%-91.37% vs. standardized, 1.99%-4.41%). In all protocols, CCCs improved after image conversion (original, -0.006-0.964 vs. standardized, 0.990-0.998). Conclusion: Deep learning-based CT image standardization can improve the performance of automated hepatic segmentation using CT images reconstructed using various methods. Deep learning-based CT image conversion may have the potential to improve the generalizability of the segmentation network.
https://doi.org/10.3348/kjr.2022.0588 인용 PDF

A Study on Strategic Development Approaches for Cyber Seniors in the Information Security Industry

Seung Han Yoon;Ah Reum Kang
- 한국컴퓨터정보학회논문지
- /
- 제29권4호
- /
- pp.73-82
- /
- 2024
2017년 UN에서는 전 세계적으로 60세 이상 인구는 모든 젊은 연령층보다 빠르게 증가하고 있으며, 2050년까지 60세 이상 인구는 아프리카를 제외한 전 세계 인구의 최소 25%를 구성할 것으로 예상하였다. 세계는 전반적으로 고령화로 인해 일을 할 수 있는 인구의 증가율이 감소하고 있으며, 청년층은 힘들고 어려운 직업을 선호하지 않고 있다. 이론적으로는 인공지능을 겸비한 AI가 모든 분야에서 사람을 대신할 수 있다고 하지만 윤리적인 판단 등 현실 세계의 정보보호 분야에서는 사람의 판단과 노하우가 절대적으로 필요하다. 이에, 본 논문에서는 IT 종사자 중 50대 이상 퇴직자 또는 전직을 희망하는 사람을 대상으로 재교육을 통해 현업으로 유입시키는 방법을 제안하고자 한다. 연구를 위해 수요 부분의 정부·공공기관 21곳과 공급 부분의 보안관제전문업체 9곳을 대상으로 설문하였으며 설문 결과 공급(78%)와 수요(90%) 모두가 절대적으로 필요하다는 데 의견을 모았다. 향후 이 연구 결과를 토대로 현장에 적용한다면 인구 저출산 100세 시대에 정보보호분야 시니어의 전략적 육성으로 대한민국 정보보호산업의 초석이 될 신규시장을 발굴할 수 있을 것이다.
https://doi.org/10.9708/jksci.2024.29.04.073 인용 PDF HTML

웹페이지 분석을 위한 딥러닝 모델 학습과 구현에 관한 연구 (Research on Training and Implementation of Deep Learning Models for Web Page Analysis)

김정환;조재원;김진산;이한진
- 문화기술의 융합
- /
- 제10권2호
- /
- pp.517-524
- /
- 2024
본 연구는 ChatGPT 서비스의 개시 이후 인공지능 혁명이라 일컬어지는 시대적 배경 속에서, 웹사이트의 제작과 인공지능의 융합을 위해 딥러닝 모델을 학습 및 구현하고자 한다. 딥러닝 모델은 수집한 3,000개의 웹페이지 이미지를 구성요소와 레이아웃 분류체계 기반의 데이터 가공을 통해 학습하였으며, 다음과 같은 세 가지 단계로 구분하여 진행하였다. 첫째, 인공지능 모델에 관한 선행연구를 조사하여 구현하고자 하는 모델에 가장 적합한 알고리즘을 선택하였다. 둘째, 적합한 웹페이지 및 단락 이미지를 수집하고 분류 및 가공하였다. 셋째, 딥러닝 모델을 학습시키고 서빙 인터페이스를 연동해 모델의 실제 결과를 확인하였다. 이렇게 구현된 모델은 실제 웹페이지를 구성하는 복수의 단락을 탐지하고, 단락별 규모, 요소, 특징을 분석하여 분류체계를 기반으로 의미 있는 데이터를 도출할 것이다. 이 과정은 점차 발전하여 웹페이지를 보다 정밀하게 분석할 수 있게 될 것이다. 그리고 정밀 분석기법을 역으로 설계하여, 인공지능이 완벽한 웹페이지를 자동으로 생성할 수 있는 연구의 초석이 될 것으로 기대한다.
https://doi.org/10.17703/JCCT.2024.10.2.517 인용 PDF

IT 기업 사무직 근로자의 대사증후군 예방을 위한 맞춤형 운동프로그램의 효과 (Effect of Individualized Exercise Program for Preventing Metabolic Syndrome among IT Company Office Workers)

배경운;유승현;신다비;하윤철;김홍민;박병찬;김효상;박신애
- 한국산업보건학회지
- /
- 제34권1호
- /
- pp.77-84
- /
- 2024
Objectives: Interventions promoting physical exercise and healthy habits in workplaces have been shown to be effective in reducing risk factors for metabolic syndrome. This study was conducted to examine the effects of an individualized conditioning exercise program of IT company office workers with or at higher risk of metabolic syndrome. Methods: A total of 444 IT company office workers with or at higher risk of metabolic syndrome participated in a 3-month conditioning exercise program. Body composition data using bioelectrical impedance analysis and cardiopulmonary data using cardiopulmonary exercise testing from 53 individuals (mean age: 34.8 ± 7.1 years, sex : 21% female, height : 170.4 ± 6.8 cm, weight : 75.2±12.2 kg, body mass index : 25.8±3.3 kg/m²) who have successfully completed pre-test, intervention, and post-test were analyzed. The 12 weeks intervention encompassed: (1) health counseling (2) supervised exercise(endurance-based, aerobic exercise, or circuit training once a week for 50 minutes at heart rate reserve(HRR) of 77-95%) (3) self-directed exercise and biweekly health screening checks. Results: The results indicated a significant decrease in body weight, body fat mass and body mass index, respectively. Moreover, VO₂peak, AT VO₂ and AT Time significantly improved, respectively. Resting blood pressure(SBP/DBP) showed positive changes but were not statistically significant. We observed the correlation between characteristics of participants and rate of changes in cardiopulmonary outcomes of participants, there are no significant correlation. These results indicate positive changes in body composition and cardiorespiratory fitness parameters following individualized conditioning exercise program. Conclusions: Individualized workplace exercise program for preventing metabolic syndrome can lead to improvements in body composition and cardiorespiratory fitness.
https://doi.org/10.15269/JKSOEH.2024.34.1.77 인용 PDF

Deep learning-based automatic segmentation of the mandibular canal on panoramic radiographs: A multi-device study

Moe Thu Zar Aung;Sang-Heon Lim;Jiyong Han;Su Yang;Ju-Hee Kang;Jo-Eun Kim;Kyung-Hoe Huh;Won-Jin Yi;Min-Suk Heo;Sam-Sun Lee
- Imaging Science in Dentistry
- /
- 제54권1호
- /
- pp.81-91
- /
- 2024
Purpose: The objective of this study was to propose a deep-learning model for the detection of the mandibular canal on dental panoramic radiographs. Materials and Methods: A total of 2,100 panoramic radiographs (PANs) were collected from 3 different machines: RAYSCAN Alpha (n=700, PAN A), OP-100 (n=700, PAN B), and CS8100 (n=700, PAN C). Initially, an oral and maxillofacial radiologist coarsely annotated the mandibular canals. For deep learning analysis, convolutional neural networks (CNNs) utilizing U-Net architecture were employed for automated canal segmentation. Seven independent networks were trained using training sets representing all possible combinations of the 3 groups. These networks were then assessed using a hold-out test dataset. Results: Among the 7 networks evaluated, the network trained with all 3 available groups achieved an average precision of 90.6%, a recall of 87.4%, and a Dice similarity coefficient (DSC) of 88.9%. The 3 networks trained using each of the 3 possible 2-group combinations also demonstrated reliable performance for mandibular canal segmentation, as follows: 1) PAN A and B exhibited a mean DSC of 87.9%, 2) PAN A and C displayed a mean DSC of 87.8%, and 3) PAN B and C demonstrated a mean DSC of 88.4%. Conclusion: This multi-device study indicated that the examined CNN-based deep learning approach can achieve excellent canal segmentation performance, with a DSC exceeding 88%. Furthermore, the study highlighted the importance of considering the characteristics of panoramic radiographs when developing a robust deep-learning network, rather than depending solely on the size of the dataset.
https://doi.org/10.5624/isd.20230245 인용 PDF

이질성 학습을 통한 문서 분류의 정확성 향상 기법 (Improving the Accuracy of Document Classification by Learning Heterogeneity)

윌리엄;현윤진;김남규
- 지능정보연구
- /
- 제24권3호
- /
- pp.21-44
- /
- 2018
최근 인터넷 기술의 발전과 함께 스마트 기기가 대중화됨에 따라 방대한 양의 텍스트 데이터가 쏟아져 나오고 있으며, 이러한 텍스트 데이터는 뉴스, 블로그, 소셜미디어 등 다양한 미디어 매체를 통해 생산 및 유통되고 있다. 이처럼 손쉽게 방대한 양의 정보를 획득할 수 있게 됨에 따라 보다 효율적으로 문서를 관리하기 위한 문서 분류의 필요성이 급증하였다. 문서 분류는 텍스트 문서를 둘 이상의 카테고리 혹은 클래스로 정의하여 분류하는 것을 의미하며, K-근접 이웃(K-Nearest Neighbor), 나이브 베이지안 알고리즘(Naïve Bayes Algorithm), SVM(Support Vector Machine), 의사결정나무(Decision Tree), 인공신경망(Artificial Neural Network) 등 다양한 기술들이 문서 분류에 활용되고 있다. 특히, 문서 분류는 문맥에 사용된 단어 및 문서 분류를 위해 추출된 형질에 따라 분류 모델의 성능이 달라질 뿐만 아니라, 문서 분류기 구축에 사용된 학습데이터의 질에 따라 문서 분류의 성능이 크게 좌우된다. 하지만 현실세계에서 사용되는 대부분의 데이터는 많은 노이즈(Noise)를 포함하고 있으며, 이러한 데이터의 학습을 통해 생성된 분류 모형은 노이즈의 정도에 따라 정확도 측면의 성능이 영향을 받게 된다. 이에 본 연구에서는 노이즈를 인위적으로 삽입하여 문서 분류기의 견고성을 강화하고 이를 통해 분류의 정확도를 향상시킬 수 있는 방안을 제안하고자 한다. 즉, 분류의 대상이 되는 원 문서와 전혀 다른 특징을 갖는 이질적인 데이터소스로부터 추출한 형질을 원 문서에 일종의 노이즈의 형태로 삽입하여 이질성 학습을 수행하고, 도출된 분류 규칙 중 문서 분류기의 정확도 향상에 기여하는 분류 규칙만을 추출하여 적용하는 방식의 규칙 선별 기반의 앙상블 준지도학습을 제안함으로써 문서 분류의 성능을 향상시키고자 한다.
https://doi.org/10.13088/jiis.2018.24.3.021 인용 PDF KSCI

합성곱 신경망(Convolutional Neural Network)을 활용한 지능형 아토피피부염 중증도 진단 모델 개발 (Development of Intelligent Severity of Atopic Dermatitis Diagnosis Model using Convolutional Neural Network)

윤재웅;전재헌;방철환;박영민;김영주;오성민;정준호;이석준;이지현
- 경영과정보연구
- /
- 제36권4호
- /
- pp.33-51
- /
- 2017
제4차 산업혁명의 등장과 경제성장으로 인한 '국민 삶의 질 향상' 요구 증대로 인해 의료서비스의 질과 의료비용에 대한 국민들의 요구수준이 향상되고 있으며, 이로 인해 인공지능이 의료현장에 도입되고 있다. 하지만 인공지능이 의료분야에 활용된 사례를 살펴보면 '삶의 질'에 직접적인 영향을 끼치는 만성피부질환에 활용된 사례는 부족한 실정이며, 만성피부질환 중 대표적 질병인 아토피피부염은 정성적 진단 방법으로 인해 진단의 객관성을 확보할 수 없다는 한계가 존재한다. 본 연구에서는 아토피피부염의 객관적 중증도 평가 방법을 마련하여 아토피피부염 환자의 삶의 질을 향상시키고자 다음과 같은 연구를 수행하였다. 첫째, 가톨릭대학교 의과대학 성모병원의 데이터베이스로부터 아토피피부염 환자의 이미지 데이터를 수집했으며, 수집된 이미지 데이터에 대한 정제 및 라벨링 작업을 수행하여 모델 학습과 검증에 적합한 데이터를 확보했다. 둘째, 지능형 아토피피부염 중증도 진단 모형에 적합한 이미지 인식 알고리즘을 파악하기 위해 다양한 CNN 알고리즘들을 병변별 학습용 데이터로 학습시키고, 검증용 데이터를 활용하여 해당 모델의 이미지 인식 정확도를 측정했다. 실증분석 결과 홍반(Erythema)의 경우 'ResNet V1 101', 긁은 정도(Excoriation)의 경우 'ResNet V2 50'이 90% 이상의 정확도를 기록하였으며, 태선화(Lichenification)의 경우 학습용 데이터 부족의 한계로 인해 두 병변보다 낮은 89%의 정확도를 보였다. 해당 결과를 통해 이미지 인식 알고리즘이 단순한 사물 인식 분야뿐만 아니라 전문적 지식이 요구되는 분야에도 높은 성능을 나타낸다는 것을 실증적으로 입증했으며, 본 연구는 실제 아토피피부염 환자의 이미지 데이터를 활용했다는 측면에서 실제 임상환경에서 활용성이 높을 것으로 사료된다.
PDF

사범대학 재학생의 예비 교사 인증 영역 및 하위 요소에 대한 중요도 인식 분석 (The perception of undergraduates of the college of education on the importance of trainee teacher certification areas and sub-factors)

김태훈;이태호
- 대한공업교육학회지
- /
- 제39권1호
- /
- pp.164-188
- /
- 2014
이 연구의 목적은 사범대학 및 학과별 수준의 인증 시스템에서 제시한 인증 영역 및 요소에 대한 사범대학 재학생이 인식하는 중요도를 조사하여 개선방안을 제시하는 것이다. 이를 위한 구체적인 목표는 첫째, 인증 영역 및 요소에 대한 학과별 중요도 차이를 확인하며, 둘째, 인증 영역 및 요소에 대한 학년별 중요도 차이를 확인하는 것이다. 본 연구의 대상은 A대학교 사범대학 재학생 전체를 모집단으로 하였으며, 10개 학과 758명이 중요도에 대한 설문 조사 대상이다. 설문 조사지는 총 800부를 배포하였으며, 회수된 설문지는 299부로 회수율은 37.3%로 나타났다. 설문 조사 분석 결과를 바탕으로 주요 결론을 제시하면 다음과 같다. 첫째, 학과별로 교직 인성 영역에서는 교직 적성 검사, 사회적 지능계발 프로그램 이수에 대한 중요도의 차이가 나타났으며, 교수 전문성 영역에서는 전공별 교과교육 과목 이수, 전공별 교과내용 과목 이수, 수업 실연 경진대회 참가에서 중요도 인식의 차이가 나타났다. 학생지도 전문성 영역에서는 창의 인성 계발 관련 교육 프로그램 이수, '교육실습'과목 이수에 있어서 학과별로 중요도에 차이가 나타났다. 정보 사회 소통력에서는 제2외국어 최소 점수 획득에 대해서 낮은 중요도를 보였다. 둘째, 학년별로는 1학년 재학생이 교직과정 구성의 타당성, 교육과정의 충실성, 교육과정의 우수 교원 양성에의 타당성, 졸업자의 교원 직무 수행 능력 함양, 사범대학 교육과정의 우수 교원 양성 적절성에 대해서 타 학년에 비해 높게 인식하는 것으로 나타났다. 특히 4학년의 경우 우수 교원 양성을 위한 새로운 제도의 필요성을 가장 높게 인식하는 것으로 나타났다.
PDF KSCI

검색결과 750건 처리시간 0.022초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)