Search | Korea Science

Listenable Explanation for Heatmap in Acoustic Scene Classification (음향 장면 분류에서 히트맵 청취 분석)

Suh, Sangwon;Park, Sooyoung;Jeong, Youngho;Lee, Taejin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.07a / pp.727-731 / 2020
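Analyzing the causes behind an artificial neural network's predictions is necessary for trusting the model. In computer vision, model interpretation methods have therefore been proposed that visualize, as saliency maps or heatmaps, what content the model based its prediction on. In the audio domain, however, visual interpretation on a spectrogram is not intuitive, and it is hard to understand which sounds the model actually relied on. This study therefore proposes a listenable analysis system for heatmaps and uses it in a heatmap listening experiment on an acoustic scene classification model, to check whether it can provide human-understandable explanations of the neural network's predictions.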

Content-Based Image Retrieval Using Visual Features and Fuzzy Integral (시각 특징과 퍼지 적분을 이용한 내용기반 영상 검색)

Song Young-Jun;Kim Nam;Kim Mi-Hye;Kim Dong-Woo
- The Journal of the Korea Contents Association / v.6 no.5 / pp.20-28 / 2006
This paper proposes extracting visual features for each band in the wavelet domain, using both spatial-frequency and multiresolution features, and combining the visual features with a fuzzy integral. It also uses a color feature representation based on the frequency of identical colors after color quantization, which reduces quantization error, a drawback of the existing color histogram intersection method. The final similarity is found to be expressible as a linear combination of the respective factors (Homogram, color, energy) when the factors are independent of one another. A fuzzy measure is defined over the combination patterns, and the fuzzy integral is taken. Experiments are performed on a database of 1,000 color images. The proposed method outperforms the conventional method in both objective and subjective performance evaluation.
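The remark that the similarity reduces to a linear combination for independent factors can be illustrated with a small sketch. The abstract does not say which fuzzy integral the authors use; the code below assumes a Choquet integral, and the weights and scores are made-up illustrative values, not results from the paper. With an additive fuzzy measure, the Choquet integral equals the plain weighted sum.

```python
def choquet_integral(scores, measure):
    """Choquet integral of per-feature scores with respect to a fuzzy measure.

    scores:  dict mapping feature name -> similarity in [0, 1]
    measure: function mapping a frozenset of feature names -> value in [0, 1]
    """
    ordered = sorted(scores, key=scores.get)  # feature names, ascending by score
    total, previous = 0.0, 0.0
    for i, name in enumerate(ordered):
        # Coalition of all features whose score is at least the current one.
        coalition = frozenset(ordered[i:])
        total += (scores[name] - previous) * measure(coalition)
        previous = scores[name]
    return total


# Hypothetical weights (assumption): an additive measure models independent factors.
weights = {"homogram": 0.5, "color": 0.3, "energy": 0.2}


def additive_measure(subset):
    return sum(weights[name] for name in subset)


# Hypothetical per-feature similarities between a query and a database image.
scores = {"homogram": 0.8, "color": 0.6, "energy": 0.9}

fused = choquet_integral(scores, additive_measure)
weighted_sum = sum(weights[name] * scores[name] for name in scores)
print(fused, weighted_sum)
```

A non-additive measure (one where the value of a coalition differs from the sum of its members) would make the fused score depart from the weighted sum, which is where a fuzzy integral adds something over a linear combination.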

3D Visualization using Face Position and Direction Tracking (얼굴 위치와 방향 추적을 이용한 3차원 시각화)

Kim, Min-Ha;Kim, Ji-Hyun;Kim, Cheol-Ki;Cha, Eui-Young
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2011.10a / pp.173-175 / 2011
In this paper, we present a user interface that shows 3D objects from various angles using the tracked 3D head position and orientation. In the implemented interface, first, when the user's head moves left/right (X-axis) or up/down (Y-axis), the displayed objects are moved toward the user's eyes using the 3D head position. Second, when the user's head rotates about the X-axis (pitch) or the Y-axis (yaw), the displayed objects are rotated by the same amount. Experiments over a variety of user positions and orientations show good accuracy and responsiveness for 3D visualization.

Superpixel Exclusion-Inclusion Multiscale Approach for Explanations of Deep Learning (딥러닝 설명을 위한 슈퍼픽셀 제외·포함 다중스케일 접근법)

Seo, Dasom;Oh, KangHan;Oh, Il-Seok;Yoo, Tae-Woong
- Smart Media Journal / v.8 no.2 / pp.39-45 / 2019
As deep learning has become popular, research that helps explain prediction results has also become important. A superpixel-based multiscale combining technique, which produces visually pleasing results by preserving the shape of the object, has recently been proposed. Based on the principle of prediction difference, this technique computes the saliency map from the difference between the prediction obtained with a superpixel excluded and the original prediction. In this paper, we propose a new technique that both excludes and includes superpixels. Experimental results show a 3.3% improvement in IoU evaluation.
https://doi.org/10.30693/SMJ.2019.8.2.39
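The prediction-difference principle mentioned in the abstract above can be sketched in a few lines. This is a minimal illustration of the exclusion step only, not the authors' multiscale exclusion-inclusion method; the function name `exclusion_saliency`, the toy classifier, and the fixed 2x2 block segmentation standing in for superpixels are all assumptions made for the example.

```python
import numpy as np


def exclusion_saliency(image, segments, predict, baseline=0.0):
    """Assign each segment the drop in score observed when it is masked out."""
    original = predict(image)
    saliency = np.zeros(image.shape, dtype=float)
    for label in np.unique(segments):
        mask = segments == label
        occluded = image.copy()
        occluded[mask] = baseline  # exclude this segment
        saliency[mask] = original - predict(occluded)
    return saliency


# Toy 4x4 grayscale image split into four 2x2 "superpixels" (assumption).
image = np.ones((4, 4))
segments = np.repeat(np.repeat(np.array([[0, 1], [2, 3]]), 2, axis=0), 2, axis=1)


def toy_predict(img):
    # Hypothetical "classifier" whose score depends only on the top-left block.
    return float(img[:2, :2].sum())


saliency = exclusion_saliency(image, segments, toy_predict)
print(saliency)
```

An inclusion counterpart would keep only one segment and mask everything else, scoring the segment by how much of the prediction it restores on its own.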

Variation of facial temperature to 3D visual fatigue evoked (3D 시각피로 유발에 따른 안면 온도 변화)

Hwang, Sung Teac;Park, SangIn;Won, Myoung Ju;Whang, MinCheol
- Science of Emotion and Sensibility / v.16 no.4 / pp.509-516 / 2013
As the visual fatigue induced by 3D visual stimulation has raised safety concerns in the industry, this study aims to quantify visual fatigue by measuring changes in facial temperature. Facial temperature was measured for one minute before and after watching a visual stimulus. Whether visual fatigue occurred was assessed through subjective evaluations and high cognitive tasks. The difference between the changes that occurred after watching a 2D stimulus and a 3D stimulus was computed in order to associate the facial temperature changes with the visual fatigue induced by watching 3D content. The results showed significant differences in the subjective evaluations and in the high cognitive tasks. Also, the ERP latency increased after watching 3D stimuli. There were significant differences in the maximum temperature at the forehead and at the tip of the nose. A previous study showed that 3D visual fatigue activates the sympathetic nervous system. Activation of the sympathetic nervous system is known to increase the heart rate as well as the blood flow into the face through the carotid artery system. When watching 2D or 3D stimuli, sympathetic nervous system activation governs the blood flow, which in turn influences the facial temperature. This study is meaningful in that it is one of the first investigations into the possibility of measuring 3D visual fatigue with thermal images.
https://doi.org/10.14695/KJSOS.2013.16.4.509

A Deep Learning Based Recommender System Using Visual Information (시각 정보를 활용한 딥러닝 기반 추천 시스템)

Moon, Hyunsil;Lim, Jinhyuk;Kim, Doyeon;Cho, Yoonho
- Knowledge Management Research / v.21 no.3 / pp.27-44 / 2020
To address users' information overload, recommender systems infer users' preferences and suggest items that match them. Collaborative filtering (CF), the most successful recommendation algorithm, has continued to improve in performance and has been applied to various business domains. Visual information, such as book covers, can influence consumers' purchase decisions. However, CF-based recommender systems have rarely considered visual information. In this study, we propose VizNCS, a CF-based deep learning model that uses visual information as additional input. VizNCS consists of two phases. In the first phase, we build convolutional neural networks (CNN) to extract visual features from image data. In the second phase, we supply the visual features to the NCF model, which among deep learning-based recommender systems is known to be easy to extend with other information. In performance comparison experiments, VizNCS outperformed the vanilla NCF. We also conducted an additional experiment to see whether the effect of visual information differs by product category. The result enables us to identify which categories were affected and which were not. We expect VizNCS to improve recommender system performance and to extend the recommender system's data sources to visual information.
https://doi.org/10.15813/kmr.2020.21.3.002

A Program for Analyzing the Real-time Processing Performance of the Satellite Operation System (인공 위성 운용시스템의 실시간 처리 성능을 분석하기 위한 프로그램)

하성준;김소연;한경숙
- Proceedings of the Korean Information Science Society Conference / 1998.10a / pp.659-661 / 1998
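An algorithm for analyzing the real-time processing performance of the satellite operation subsystem, one part of the multipurpose satellite control system, has been developed together with a program that implements it. Given parameter values describing the attributes of the events generated in the satellite operation subsystem and of the responses to them, the program computes each event's response time and schedulability and visualizes the scheduled events. Experiments showed that an event's blocking delay, its period, and the priority of its first action strongly affect the response time of that event or of other events.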
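The abstract does not describe the algorithm itself. As a rough illustration of how blocking delay, period, and priority can enter a response-time calculation, here is a sketch that assumes the classic fixed-priority response-time recurrence, with each event's deadline taken equal to its period. The field names (`cost`, `period`, `blocking`) and all numbers are hypothetical, and the authors' algorithm may differ.

```python
import math


def response_time(event, higher_priority):
    """Worst-case response time under the classic fixed-priority recurrence.

    Returns None when the response time exceeds the period (assumed deadline),
    meaning the event is not schedulable under these assumptions.
    """
    base = event["cost"] + event["blocking"]
    r = base
    while r <= event["period"]:
        # Interference from every higher-priority event released within r.
        interference = sum(
            math.ceil(r / h["period"]) * h["cost"] for h in higher_priority
        )
        nxt = base + interference
        if nxt == r:  # fixed point reached
            return r
        r = nxt
    return None


# Hypothetical events, in time units; "high" has the higher priority.
high = {"cost": 1, "period": 4, "blocking": 0}
low = {"cost": 2, "period": 10, "blocking": 1}
low_blocked = {"cost": 2, "period": 10, "blocking": 6}

print(response_time(high, []))             # highest priority: no interference
print(response_time(low, [high]))          # delayed by blocking and by "high"
print(response_time(low_blocked, [high]))  # larger blocking delay
```

In this toy setup, raising the blocking delay of the low-priority event from 1 to 6 is enough to push it past its period, which mirrors the abstract's observation that blocking delay strongly affects response time.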

Emotion Evaluation by Measuring Physiological Signals (생리신호 측정에 의한 감성평가)

황민철;박재희;박수찬;김철중
- Proceedings of the ESK Conference / 1995.04a / pp.35-39 / 1995
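Techniques for evaluating human emotion are recognized as important for product design, and concrete research toward quantitative, objective emotion evaluation is needed. Assuming that physiological signals change with human emotion, this study measured changes in physiological signals (EEG, ECG, GSR, FEMG) in response to the five human senses (hearing, sight, smell, taste, touch). Four stimuli, chosen by sense type to evoke positive and negative emotions, were presented to the subjects, and all physiological signals were measured synchronously. The measured signals were statistically processed to analyze their correlation with subjectively rated emotion, and the characteristics of the signal changes under positive and negative emotion were identified in order to observe correlations usable for emotion evaluation.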

Hand-Gesture Algorithm using Morphological Shape Decomposition Elements (형태론적 형태 분해 요소를 이용한 손짓 인식 알고리즘)

김정훈;윤용인;최종수;김태은
- Proceedings of the Korea Multimedia Society Conference / 2001.06a / pp.103-106 / 2001
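Recently, research on recognizing gestures with computer-vision-based methods, as a means of conveying human intent to computers, has been widely pursued. The most important problem in gesture recognition is real-time processing, that is, simplifying the algorithm and reducing processing time. To address this problem, this study applies mathematical morphology, which is grounded in geometric set theory. The orientations of the primitive shape elements of a hand shape, obtained by morphological shape decomposition, carry important information about the gesture. Based on this property, we propose a morphological gesture recognition algorithm that uses feature vectors derived from the straight lines connecting the center points of the main primitive shape element and the sub primitive shape elements, and we demonstrate its usefulness through experiments.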

Applications of Generative Adversarial Networks (Generative Adversarial Networks의 응용 현황)

Kim, Dong-Wook;Kim, Sesong;Jung, Seung-Won
- Proceedings of the Korea Information Processing Society Conference / 2017.11a / pp.807-809 / 2017
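This article briefly explains generative adversarial networks (GANs) and aids understanding of the GAN architecture through a simple experiment on MNIST (a handwritten digit dataset). It then surveys how GANs are being applied by reviewing a variety of papers. The GAN papers are broadly grouped into image style transfer, 3D object estimation, damaged image restoration, visualization of language, and others.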
https://doi.org/10.3745/PKIPS.y2017m11a.807

Search Result 587, Processing Time 0.026 seconds
