Search | Korea Science

Super Resolution Performance Analysis of GAN according to Feature Extractor (특징 추출기에 따른 SRGAN의 초해상 성능 분석)

Park, Sung-Wook;Kim, Jun-Yeong;Park, Jun;Jung, Se-Hoon;Sim, Chun-Bo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.11a
- /
- pp.501-503
- /
- 2022
초해상이란 해상도가 낮은 영상을 해상도가 높은 영상으로 합성하는 기술이다. 딥러닝은 영상의 해상도를 높이는 초해상 기술에도 응용되며 실현은 2아4년에 발표된 SRCNN(Super Resolution Convolutional Neural Network) 모델로부터 시작됐다. 이후 오토인코더 (Autoencoders) 구조로는 SRCAE(Super Resolution Convolutional Autoencoders), 합성된 영상을 실제 영상과 통계적으로 구분되지 않도록 강제하는 GAN (Generative Adversarial Networks) 구조로는 SRGAN(Super Resolution Generative Adversarial Networks) 모델이 발표됐다. 모두 SRCNN의 성능을 웃도는 모델들이나 그중 가장 높은 성능을 끌어내는 SRGAN 조차 아직 완벽한 성능을 내진 못한다. 본 논문에서는 SRGAN의 성능을 개선하기 위해 사전 훈련된 특징 추출기(Pre-trained Feature Extractor) VGG(Visual Geometry Group)-19 모델을 변경하고, 기존 모델과 성능을 비교한다. 실험 결과, VGG-19 모델보다 윤곽이 뚜렷하고, 실제 영상과 더 가까운 영상을 합성할 수 있는 모델을 발견할 수 있을 것으로 기대된다.
https://doi.org/10.3745/PKIPS.y2022m11a.501 인용 PDF

Sales Prediction of Electronic Appliances using a Convergence Model based on Artificial Neural Network and Genetic Algorithm (인공신경망과 유전자 알고리즘 기반의 융합모델을 이용한 가전제품의 판매예측)

Seo, Kwang-Kyu
- Journal of Digital Convergence
- /
- v.13 no.9
- /
- pp.177-182
- /
- 2015
The brand and product awareness of Korean electronics companies in the North American market has grown significantly and North American consumers has been recognized as an innovative technology products good performance of Korean electronics appliances. The consumer need of energy saving has led to a rise in market share because Korean electronics appliances have the excellence in energy saving aspects. The expansion of smartphones and mobile devices and the development of smart grid technology can affect electronics market. Domestic companies are continuously develop new product to provide consumers convenient with a variety of additional features combined consumer products. This study proposes a convergence model for sales prediction of electronic appliances using sales data of A company from the North American market. We develop the convergence model for sales prediction based on based on artificial neural network and genetic algorithm. In addition, we validate the superiority of the proposed convergence model by comparing the prediction performance of traditional prediction models.
https://doi.org/10.14400/JDC.2015.13.9.177 인용 PDF KSCI

Development of a Business Model of the Robot Industry in the Convergence Age (컨버전스 시대에 로봇산업의 비즈니스 모델 개발)

Seo, Kwang-Kyu;Ahn, Beum-Jun
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.10 no.4
- /
- pp.895-899
- /
- 2009
This paper presents a business models of the robot industry for copying with aging society that facilitates to create new business opportunities in the convergence age. In order to identify the market drivers for both convergence and aging society, the trends of them analyzed. Through constructing and analyzing market value chain, we design a set of the business model of the robot industry focused on u-health robots of a convergence service type integrated ubiquitous, health and robot. In addition, we describe the evolution path of the proposed business model in terms of technology development and market. Finally, we develop a matrix based evaluation framework to measure and assess the effectiveness of the business model.
https://doi.org/10.5762/KAIS.2009.10.4.895 인용 PDF

A study on the adoption-diffusion and use-diffusion of the digital convergence product : Focusing on the camera phone (디지털융합제품의 수용-확산 및 사용-확산에 관한 연구 : 카메라폰을 대상으로)

Yoo, Sang-Jin;Shon, Jung-Im
- Journal of Korea Society of Industrial Information Systems
- /
- v.14 no.4
- /
- pp.101-115
- /
- 2009
This study analyzed the diffusion process of digital convergence products focused on camera phon to compare the traditional adoption-diffusion model with use-diffusion model proposed by Shih and Venkatesh(2004). The results suggested that observability, perceived usefulness, innovativeness, relative advantage have effects on the intention of repurchase in the adoption-diffusion model. On the other hand, the intention of repurchase has been affected by technological sophistication, satisfaction, innovativeness, relative advantage in the use-diffusion model. These results imply that the traditional adoption-diffusion model could be applied to the cases of the digital convergence products. In addition, the results also show that the use-diffusion model is superior to the traditional model in analyzing the diffusion of digital convergence products.
https://doi.org/10.9723/jksiis.2009.14.4.101 인용 PDF

Deep Learning for Automatic Change Detection: Real-Time Image Analysis for Cherry Blossom State Classification (자동 변화 감지를 위한 딥러닝: 벚꽃 상태 분류를 위한 실시간 이미지 분석)

Seung-Bo Park;Min-Jun Kim;Guen-Mi Kim;Jeong-Tae Kim;Da-Ye Kim;Dong-Gyun Ham
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.07a
- /
- pp.493-494
- /
- 2023
본 논문은 벚꽃나무 영상 데이터를 활용하여 벚꽃의 상태(개화, 만개, 낙화)를 실시간으로 분류하는 연구를 소개한다. 이 연구의 목적은, 실시간으로 취득되는 벚꽃나무의 영상 데이터를 사전에 학습된 CNN 기반 이미지 분류 모델을 통해 벚꽃의 상태에 따라 분류하는 것이다. 약 1,000장의 벚꽃나무 이미지를 활용하여 CNN 모델을 학습시키고, 모델이 새로운 이미지에 대해 얼마나 정확하게 벚꽃의 상태를 분류하는지를 평가하였다. 학습데이터는 훈련 데이터와 검증 데이터로 나누었으며, 개화, 만개, 낙화 등의 상태별로 폴더를 구분하여 관리하였다. 또한, ImageNet 데이터셋에서 사전 학습된 ResNet50 가중치를 사용하는 전이학습 방법을 적용하여 학습 과정을 더 효율적으로 수행하고, 모델의 성능을 향상시켰다.
PDF

A Research on Re-examining Discriminator Design Space for Performance Improvement of ESRGAN (ESRGAN의 성능 향상을 위한 판별자 설계 공간 재검토에 관한 연구)

Sung-Wook Park;Jun-Yeong Kim;Jun Park;Se-Hoon Jung;Chun-Bo Sim
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.05a
- /
- pp.513-514
- /
- 2023
초해상은 저해상도의 영상을 고해상도 영상으로 합성하는 기술이다. 이 기술에 딥러닝이 적용되어, 2014년에는 SRCNN(Super Resolution Convolutional Neural Network) 모델이 발표됐다. 이후에는 SRCAE(Super Resolution Convolutional Autoencoders)와 GAN(Generative Adversarial Networks)을 기반으로 한 SRGAN(Super Resolution Generative Adversarial Networks) 등, SRCNN의 성능을 능가하는 모델들이 발표됐다. ESRGAN(Enhanced Super Resolution Generative Adversarial Networks)은 SRGAN 모델의 성능을 개선했지만, 완벽한 성능을 내지 못하는 문제점이 있다. 이에 본 논문에서는 판별자(Discriminator) 구조를 변경하여 ESRGAN의 성능을 개선한다. 실험 결과, 제안하는 모델이 ESRGAN보다 더 높은 성능을 보일 것으로 기대된다.
https://doi.org/10.3745/PKIPS.y2023m05a.513 인용 PDF

Noise removal in still images based on modified diffusion equation (개선된 확산방정식에 의한 정지영상의 CCD 잡음 제거)

이석호;강문기;박규태
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 1997.11a
- /
- pp.87-90
- /
- 1997
본 논문에서는 영상의 잡음(noise) 제거를 위한 새로운 diffusion모델을 제안한다. MOD모델 (mean curvature diffusion model)은 영상의 잡음 제거 때 유발되는 경계선의 blurring을 지양할 수 있는 장점이 있는 반면에 수렴상태(convergence state)를 갖지 못한 단점을 안고 있다. 본 논문에서는 MCD 모델에 min/max switch를 결합시킴으로써 MCD모델이 갖고 있던 문제점을 개선하였다. 제안하는 diffusion 모델은 scheme의 반복적인 적용에 대해서 실질적으로 그 결과가 더 이상 변동하지 않는 수렴상태(convergence state)를 가진 매우 안정적인 시스템이다.
PDF

Multi-Modal based ViT Model for Video Data Emotion Classification (영상 데이터 감정 분류를 위한 멀티 모달 기반의 ViT 모델)

Yerim Kim;Dong-Gyu Lee;Seo-Yeong Ahn;Jee-Hyun Kim
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.01a
- /
- pp.9-12
- /
- 2023
최근 영상 콘텐츠를 통해 영상물의 메시지뿐 아니라 메시지의 형식을 통해 전달된 감정이 시청하는 사람의 심리 상태에 영향을 주고 있다. 이에 따라, 영상 콘텐츠의 감정을 분류하는 연구가 활발히 진행되고 있고 본 논문에서는 대중적인 영상 스트리밍 플랫폼 중 하나인 유튜브 영상을 7가지의 감정 카테고리로 분류하는 여러 개의 영상 데이터 중 각 영상 데이터에서 오디오와 이미지 데이터를 각각 추출하여 학습에 이용하는 멀티 모달 방식 기반의 영상 감정 분류 모델을 제안한다. 사전 학습된 VGG(Visual Geometry Group)모델과 ViT(Vision Transformer) 모델을 오디오 분류 모델과 이미지 분류 모델에 이용하여 학습하고 본 논문에서 제안하는 병합 방법을 이용하여 병합 후 비교하였다. 본 논문에서는 기존 영상 데이터 감정 분류 방식과 다르게 영상 속에서 화자를 인식하지 않고 감정을 분류하여 최고 48%의 정확도를 얻었다.
PDF

Development of Multi-Organ Segmentation Model for Support Abdominal Disease Diagnosis (복부질환 진단 지원을 위한 다중 장기 분할 모델 개발)

Si-Hyeong Noh;Dong-Wook Lim;Chungsub Lee;Tae-Hoon Kim;Chul Park;Chang-Won Jeong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.05a
- /
- pp.546-548
- /
- 2023
인공지능 기술을 도입한 의료분야에서 진단 및 예측을 위한 관련 연구가 활발하게 진행되고 있다. 특히, 인공지능 기술 적용에 가장 많이 활용되고 있는 의료영상을 기반으로 하는 질환에 관한 진단 연구는 매우 복잡한 과정이 필요한 질환의 진단에 큰 영향을 미치고 있다. 복부 장기들의 분할은 환자의 질환 진단 지원 및 복강경등의 수술 지원에 매우 중요한 부분을 차지한다. 본 논문에서는 의료영상을 통해 13가지 복부 장기들을 분할하는 모델을 만들고 그 결과를 보인다. 본 논문에서 제안한 모델을 통해 13가지 복부 장기에 대한 분할로 영상분석을 통해 진단 지원이 가능할 것으로 기대한다.
https://doi.org/10.3745/PKIPS.y2023m05a.546 인용 PDF

Attention based multimodal model for Korean speech recognition post-editing (한국어 음성인식 후처리를 위한 주의집중 기반의 멀티모달 모델)

Jeong, Yeong-Seok;Oh, Byoung-Doo;Heo, Tak-Sung;Choi, Jeong-Myeong;Kim, Yu-Seop
- Annual Conference on Human and Language Technology
- /
- 2020.10a
- /
- pp.145-150
- /
- 2020
최근 음성인식 분야에서 신경망 기반의 종단간 모델이 제안되고 있다. 해당 모델들은 음성을 직접 입력받아 전사된 문장을 생성한다. 음성을 직접 입력받는 모델의 특성상 데이터의 품질이 모델의 성능에 많은 영향을 준다. 본 논문에서는 이러한 종단간 모델의 문제점을 해결하고자 음성인식 결과를 후처리하기 위한 멀티모달 기반 모델을 제안한다. 제안 모델은 음성과 전사된 문장을 입력 받는다. 입력된 각각의 데이터는 Encoder를 통해 자질을 추출하고 주의집중 메커니즘을 통해 Decoder로 추출된 정보를 전달한다. Decoder에서는 전달받은 주의집중 메커니즘의 결과를 바탕으로 후처리된 토큰을 생성한다. 본 논문에서는 후처리 모델의 성능을 평가하기 위해 word error rate를 사용했으며, 실험결과 Google cloud speech to text모델에 비해 word error rate가 8% 감소한 것을 확인했다.
PDF

Search Result 3,719, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)