Search | Korea Science

Design of a Sound Classification System for Context-Aware Mobile Computing (상황 인식 모바일 컴퓨팅을 위한 사운드 분류 시스템 설계)

Kim, Joo-Hee;Lee, Seok-Jun;Kim, In-Cheol
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.11a
- /
- pp.1305-1308
- /
- 2013
본 논문에서는 스마트폰 사용자의 실시간 상황 인식을 위한 효과적인 사운드 분류 시스템을 제안한다. 이 시스템에서는 PCM 형태의 사운드 입력 데이터에 대한 전처리를 통해 고요한 사운드와 화이트 노이즈를 학습 및 분류 단계 이전에 미리 여과함으로써, 계산 자원의 불필요한 소모를 막을 수 있다. 또한 에너지 레벨이 낮아 신호의 패턴을 파악하기 어려운 사운드 데이터는 증폭함으로써, 이들에 대한 분류 성능을 향상시킬 수 있다. 또, 제안하는 사운드 분류 시스템에서는 HMM 분류 모델의 효율적인 학습과 적용을 위해 k-평균 군집화를 이용하여 특징 벡터들에 대한 차원 축소와 이산화를 수행하고, 그 결과를 모아 일정한 길이의 시계열 데이터를 구성하였다. 대학 연구동내 다양한 일상생활 상황들에서 수집한 8가지 유형의 사운드 데이터 집합을 이용하여 성능 분석 실험을 수행하였고, 이를 통해 본 논문에서 제안하는 사운드 분류 시스템의 높은 성능을 확인할 수 있었다.
https://doi.org/10.3745/PKIPS.y2013m11a.1305 인용 PDF

Deep Learning-Based Low-Light Imaging Considering Image Signal Processing

Minsu, Kwon
- Journal of the Korea Society of Computer and Information
- /
- v.28 no.2
- /
- pp.19-25
- /
- 2023
In this paper, we propose a method for improving raw images captured in a low light condition based on deep learning considering the image signal processing. In the case of a smart phone camera, compared to a DSLR camera, the size of a lens or sensor is limited, so the noise increases and the reduces the quality of images in low light conditions. Existing deep learning-based low-light image processing methods create unnatural images in some cases since they do not consider the lens shading effect and white balance, which are major factors in the image signal processing. In this paper, pixel distances from the image center and channel average values are used to consider the lens shading effect and white balance with a deep learning model. Experiments with low-light images taken with a smart phone demonstrate that the proposed method achieves a higher peak signal to noise ratio and structural similarity index measure than the existing method by creating high-quality low-light images.
https://doi.org/10.9708/jksci.2023.28.02.019 인용 PDF HTML

Effect of the Multisensory on the Stress-relieving for Vehicle Driver (운전자 스트레스 저감을 위한 다감각 자극의 효과)

Kim, Young-Joo;Kim, Hyejin;Lee, Hyunwoo;Jo, Youngho;Whang, Mincheol
- Science of Emotion and Sensibility
- /
- v.24 no.4
- /
- pp.107-116
- /
- 2021
This study aims to investigate the effect of multisensory stimulation on relieving the stress experienced by drivers. The photoplethysmograms (PPGs) of 30 healthy subjects were measured, and their subjective response to stressful situations and normal driving were evaluated. The subjects underwent nonstimulation and multisensory stimulation in stressful driving situations. Heart rate estimation from the PPG was collected via an ear-type sensor to reduce movement noise. The signals acquired were sampled at 200 Hz using BIOPAC PPG100C. Heart rate variability (HRV) was analyzed to compare the effect of multisensory stimulation on stress situations. In the multisensory stimulation, blue, green, and yellow were used for the visual sensory system; white, pink, and brown noises were used for the auditory sensory system; and lavender, lemon, and rosemary were used for the olfactory sensory system. No difference was observed in the subjective evaluation; however, the HRV results showed an increased HF (%) and decreased LF (%) and LF/HF (%) in the multisensory stimulation (e.g., green, pink noise, and rosemary) when compared to the nonstimulation.
https://doi.org/10.14695/KJSOS.2021.24.4.107 인용 PDF KSCI

Audio Forensic Marking using Psychoacoustic Model II and MDCT (심리음향 모델 II와 MDCT를 이용한 오디오 포렌식 마킹)

Rhee, Kang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.49 no.4
- /
- pp.16-22
- /
- 2012
In this paper, the forensic marking algorithm is proposed using psychoacoustic model II and MDCT for high-quality audio. The proposed forensic marking method, that inserts the user fingerprinting code of the audio content into the selected sub-band, in which audio signal energy is lower than the spectrum masking level. In the range of the one frame which has 2,048 samples for FFT of original audio signal, the audio forensic marking is processed in 3 sub-bands. According to the average attack of the fingerprinting codes, one frame's SNR is measured on 100% trace ratio of the collusion codes. When the lower strength 0.1 of the inserted fingerprinting code, SNR is 38.44dB. And in case, the added strength 0.5 of white gaussian noise, SNR is 19.09dB. As a result, it confirms that the proposed audio forensic marking algorithm is maintained the marking robustness of the fingerprinting code and the audio high-quality.
PDF KSCI

Perceptual Ad-Blocker Design For Adversarial Attack (적대적 공격에 견고한 Perceptual Ad-Blocker 기법)

Kim, Min-jae;Kim, Bo-min;Hur, Junbeom
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.30 no.5
- /
- pp.871-879
- /
- 2020
Perceptual Ad-Blocking is a new advertising blocking technique that detects online advertising by using an artificial intelligence-based advertising image classification model. A recent study has shown that these Perceptual Ad-Blocking models are vulnerable to adversarial attacks using adversarial examples to add noise to images that cause them to be misclassified. In this paper, we prove that existing perceptual Ad-Blocking technique has a weakness for several adversarial example and that Defense-GAN and MagNet who performed well for MNIST dataset and CIFAR-10 dataset are good to advertising dataset. Through this, using Defense-GAN and MagNet techniques, it presents a robust new advertising image classification model for adversarial attacks. According to the results of experiments using various existing adversarial attack techniques, the techniques proposed in this paper were able to secure the accuracy and performance through the robust image classification techniques, and furthermore, they were able to defend a certain level against white-box attacks by attackers who knew the details of defense techniques.
https://doi.org/10.13089/JKIISC.2020.30.5.871 인용 PDF KSCI HTML

Design and Implementation of a Sound Classification System for Context-Aware Mobile Computing (상황 인식 모바일 컴퓨팅을 위한 사운드 분류 시스템의 설계 및 구현)

Kim, Joo-Hee;Lee, Seok-Jun;Kim, In-Cheol
- KIPS Transactions on Software and Data Engineering
- /
- v.3 no.2
- /
- pp.81-86
- /
- 2014
In this paper, we present an effective sound classification system for recognizing the real-time context of a smartphone user. Our system avoids unnecessary consumption of limited computational resource by filtering both silence and white noise out of input sound data in the pre-processing step. It also improves the classification performance on low energy-level sounds by amplifying them as pre-processing. Moreover, for efficient learning and application of HMM classification models, our system executes the dimension reduction and discretization on the feature vectors through k-means clustering. We collected a large amount of 8 different type sound data from daily life in a university research building and then conducted experiments using them. Through these experiments, our system showed high classification performance.
https://doi.org/10.3745/KTSDE.2014.3.2.81 인용 PDF KSCI

Learning efficiency checking system by measuring human motion detection (사람의 움직임 감지를 측정한 학습 능률 확인 시스템)

Kim, Sukhyun;Lee, Jinsung;Yu, Eunsang;Park, Seon-u;Kim, Eung-Tae
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- fall
- /
- pp.290-293
- /
- 2021
In this paper, we implement a learning efficiency verification system to inspire learning motivation and help improve concentration by detecting the situation of the user studying. To this aim, data on learning attitude and concentration are measured by extracting the movement of the user's face or body through a real-time camera. The Jetson board was used to implement the real-time embedded system, and a convolutional neural network (CNN) was implemented for image recognition. After detecting the feature part of the object using a CNN, motion detection is performed. The captured image is shown in a GUI written in PYQT5, and data is collected by sending push messages when each of the actions is obstructed. In addition, each function can be executed on the main screen made with the GUI, and functions such as a statistical graph that calculates the collected data, To do list, and white noise are performed. Through learning efficiency checking system, various functions including data collection and analysis of targets were provided to users.
PDF

An Identification of the Healing Effect of Rain Sound According to the Gender and Personal - Adjusted Rain Sound Making (성별에 따른 빗소리의 힐링 효과 규명 및 개인 맞춤형 빗소리 제작)

Lee, Bum Joo;Cho, Dong Uk;Cho, Sang Hyun;Song, Young Bin;Jeong, Yeon Man
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.41 no.10
- /
- pp.1263-1269
- /
- 2016
Stress has become one of the largest health risk to shorten the life time of health. Accordingly, in order to increase the life time of health, stress relief can be very important. Many social expenses and economic commitment have been inputted for this purpose, but their effectiveness compared to the current situation is not very high. In this paper, we carried out an identification work of rain sound which is similar to the white noise that can stabilize the body and mind of the person by analyzing the variations of 3rd formant frequency bandwidth. Also, for relieving stress, the sounds of rain that is easily accessible at a relatively among the sounds of nature instead of consuming a lot of money and time were selected for solving these problems. In addition, we identified the effectiveness of the stress relief about the sound of rain and research on whether there is a difference between men and women in their 20's or not was performed. Finally, we discussed the personal - adjusted rain making to maximize the effectiveness of stress relief.
https://doi.org/10.7840/kics.2016.41.10.1263 인용 PDF KSCI

Search Result 18, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)