Search | Korea Science

Fast Algorithm for Recognition of Korean Isolated Words (한국어 고립단어인식을 위한 고속 알고리즘)

남명우;박규홍;정상국;노승용
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.1
- /
- pp.50-55
- /
- 2001
This paper presents a korean isolated words recognition algorithm which used new endpoint detection method, auditory model, 2D-DCT and new distance measure. Advantages of the proposed algorithm are simple hardware construction and fast recognition time than conventional algorithms. For comparison with conventional algorithm, we used DTW method. At result, we got similar recognition rate for speaker dependent korean isolated words and better it for speaker independent korean isolated words. And recognition time of proposed algorithm was 200 times faster than DTW algorithm. Proposed algorithm had a good result in noise environments too.
PDF

Analysis of Meridian Response by Sound Stimulus in Body (음향 자극에 의한 인체 경락의 반응분석)

Kim, Yong-Chin;Jeong, Dong-Myong
- Journal of the Institute of Electronics Engineers of Korea SC
- /
- v.38 no.3
- /
- pp.47-54
- /
- 2001
This study is to analyze the impedance response in human body by acoustic stimulation on acupoints and contrast parte; for objectification of the meridian substance. It is to verify meridian pathway and channel theory or bio-energy in body. This paper proposes to make an hypothesis about the underground water theory. The meridian has not tube or pipe line type channel but bio-energy flow along the channel similar to flowing pattern of underground water in body. It was analyzed the current characteristic or impedance response after acoustic stimulation by sound wave of 5 specific tones. The response characteristics of current stimulation are measured by the average current magnitude and variation ratio or meridian. The current variation ratio or Live Meridian(gung) 33.2%, Heart Meridian(sang) 30.7% Kidney Meridian (gak) 33.1%, Spleen Meridian(chi) 33.9%, Lung Meridian (wo) 30.7% are to be compared to contrast parts (non-acupoint and meridian). In experimental results, meridian is discrimination to non-meridian, and 5 vital meridians have a reciprocal relationship with sound wave of 5 specific tones.
PDF

Emotion of People with Visual Disability for Enhancing Web Accessibility (웹 접근성 향상을 위한 시각장애인과 일반인의 감성 비교)

Park, Joo-Hyun;Ryoo, Han-Young
- Science of Emotion and Sensibility
- /
- v.11 no.4
- /
- pp.589-598
- /
- 2008
The purpose of this study was to compare the emotional responses of people with visual disability with those of normal people and to understand their similarity or differences in order to apply the new understandings into the future research on Web Accessibility Guidelines. For this purpose, a Web survey system was developed using 15 auditorial stimuli prepared based on the Media Taxonomy and 11 emotion measuring criteria selected from the literature review. After developing the system, emotional responses of 31 people with visual disability and 53 normal people were collected through the Web. The results of the survey showed that the emotional responses of people with visual disability were similar to those of normal people, although there were some exceptional cases. Therefore, it is clear that emotional needs of people with disability should be taken count of in the Web accessibility discussions and further in-depth studies on the emotional characteristics of people with disability are necessary.
PDF

Automatic Vowel Onset Point Detection Based on Auditory Frequency Response (청각 주파수 응답에 기반한 자동 모음 개시 지점 탐지)

Zang, Xian;Kim, Hag-Tae;Chong, Kil-To
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.13 no.1
- /
- pp.333-342
- /
- 2012
This paper presents a vowel onset point (VOP) detection method based on the human auditory system. This method maps the "perceptual" frequency scale, i.e. Mel scale onto a linear acoustic frequency, and then establishes a series of Triangular Mel-weighted Filter Bank simulate the function of band pass filtering in human ear. This nonlinear critical-band filter bank helps greatly reduce the data dimensionality, and eliminate the effect of harmonic waves to make the formants more prominent in the nonlinear spaced Mel spectrum. The sum of mel spectrum peaks energy is extracted as feature for each frame, and the instinct at which the energy amplitude starts rising sharply is detected as VOP, by convolving with Gabor window. For the single-word database which contains 12 vowels articulated with different kinds of consonants, the experimental results showed a good average detection rate of 72.73%, higher than other vowel detection methods based on short-time energy and zero-crossing rate.
https://doi.org/10.5762/KAIS.2012.13.1.333 인용 PDF KSCI

A Perceptual Audio Coder Based on Temporal-Spectral Structure (시간-주파수 구조에 근거한 지각적 오디오 부호화기)

김기수;서호선;이준용;윤대희
- Journal of Broadcast Engineering
- /
- v.1 no.1
- /
- pp.67-73
- /
- 1996
In general, the high quality audio coding(HQAC) has the structure of the convertional data compression techniques combined with moodels of human perception. The primary auditory characteristic applied to HQAC is the masking effect in the spectral domain. Therefore spectral techniques such as the subband coding or the transform coding are widely used[1][2]. However no effort has yet been made to apply the temporal masking effect and temporal redundancy removing method in HQAC. The audio data compression method proposed in this paper eliminates statistical and perceptual redundancies in both temporal and spectral domain. Transformed audio signal is divided into packets, which consist of 6 frames. A packet contains 1536 samples($256{\times}6$) :nd redundancies in packet reside in both temporal and spectral domain. Both redundancies are elminated at the same time in each packet. The psychoacoustic model has been improved to give more delicate results by taking into account temporal masking as well as fine spectral masking. For quantization, each packet is divided into subblocks designed to have an analogy with the nonlinear critical bands and to reflect the temporal auditory characteristics. Consequently, high quality of reconstructed audio is conserved at low bit-rates.
PDF

A Study of Acoustic Masking Effect from Formant Enhancement in Digital Hearing Aid (디지털 보청기에서의 포먼트 강조에 의한 마스킹 효과 연구)

Jeon, Yu-Yong;Kil, Se-Kee;Yoon, Kwang-Sub;Lee, Sang-Min
- Journal of the Institute of Electronics Engineers of Korea SC
- /
- v.45 no.5
- /
- pp.13-20
- /
- 2008
Although digital hearing aid algorithms have been developed to compensate hearing loss and to help hearing impaired people to communicate with others, digital hearing aid user still complain about difficulty of hearing the speech. The reason could be the quality of speech through digital hearing aid is insufficient to understand the speech caused by feedback, residual noise and etc. And another thing is masking effect among formants that makes sound quality low. In this study, we measured the masking characteristics of normal listeners and hearing impaired listeners having presbyacusis to confirm masking effect in speech itself. The experiment is composed of 5 tests; pure tone test, speech reception threshold (SRT) test, word recognition score (WRS) test, puretone masking test and speech masking test. In speech masking test, there are 25 speeches in each speech set. And log likelihood ratio (LLR) is introduced to evaluate the distortion of each speech objectively. As a result, the speech perception became lower by increasing the quantity of formant enhancement. And each enhanced speech in a speech set has statistically similar LLR, however speech perception is not. It means that acoustic masking effect rather than distortion influences speech perception. In actuality, according to the result of frequency analysis of the speech that people can not answer correctly, level difference between first formant and second formant is about 35dB, and it is similar to result of pure tone masking test(normal hearing subject:36.36dB, hearing impaired subject:32.86dB). Characteristics of masking effect is not similar between normal listeners and hearing impaired listeners. So it is required to check the characteristics of masking effect before wearing a hearing aid and to apply this characteristics to fitting.
PDF KSCI

Dual CNN Structured Sound Event Detection Algorithm Based on Real Life Acoustic Dataset (실생활 음향 데이터 기반 이중 CNN 구조를 특징으로 하는 음향 이벤트 인식 알고리즘)

Suh, Sangwon;Lim, Wootaek;Jeong, Youngho;Lee, Taejin;Kim, Hui Yong
- Journal of Broadcast Engineering
- /
- v.23 no.6
- /
- pp.855-865
- /
- 2018
Sound event detection is one of the research areas to model human auditory cognitive characteristics by recognizing events in an environment with multiple acoustic events and determining the onset and offset time for each event. DCASE, a research group on acoustic scene classification and sound event detection, is proceeding challenges to encourage participation of researchers and to activate sound event detection research. However, the size of the dataset provided by the DCASE Challenge is relatively small compared to ImageNet, which is a representative dataset for visual object recognition, and there are not many open sources for the acoustic dataset. In this study, the sound events that can occur in indoor and outdoor are collected on a larger scale and annotated for dataset construction. Furthermore, to improve the performance of the sound event detection task, we developed a dual CNN structured sound event detection system by adding a supplementary neural network to a convolutional neural network to determine the presence of sound events. Finally, we conducted a comparative experiment with both baseline systems of the DCASE 2016 and 2017.
https://doi.org/10.5909/JBE.2018.23.6.855 인용 PDF KSCI KPUBS HTML

Effect of Visual Factor on Subjective Evaluation of Frictional Fabric Sounds (직물 마찰음의 주관적 평가에 시각적 변수가 미치는 영향)

Han, A-Reum;Yang, Yun-Jeong;Jo, Gil-Su
- Proceedings of the Korean Society for Emotion and Sensibility Conference
- /
- 2009.11a
- /
- pp.62-65
- /
- 2009
본 연구는 동작 속도별 마찰음의 주관적 평가에 있어서 시각적 변수의 영향을 분석하는 것을 목적으로 한다. 현재 유통되고 있는 79 종의 스포츠웨어용 투습발수직물 중 음향 특성으로 계층적 군집분석에 의해 나누어진 3 개의 군집에서 각각 하나씩 추출한 총 3가지 시료를 대상으로 walking, jogging, running의 속도로 마찰시켜 총 9가지의 소리에 대하여 실험하였다. 직물 소리에 대한 주관적 평가 시 시각적 변수의 영향을 분석하기 위하여 두 가지 방법으로 직물 소리에 대한 주관적 반응을 평가하였다. 첫 번째는 기존의 연구에서 주로 이루어진 방법으로, 실험 진행자가 피험자에게 직물소리를 들려주면서 설문을 하게 하였다. 두 번째는 녹음된 직물의 소리와 함께 모니터를 통해 해당 자극물의 마찰 속도에 따라 인체 모델이 움직이는 동작을 보여줌으로써 청각과 시각 자극을 동시에 제시하여 주관적 평가의 자극물로 사용하였다. 주관적 평가는 8개의 형용사 쌍에 대해 의미미분척도로 평가되었고, 두 가지 방법을 비교하기 위하여 '실제 옷을 착용하고 움직일 때 발생하는 직물 소리와 유사하게 들린다.' 와 '옷을 착용하고 움직일 때 발생하는 소리라고 느껴진다.' 두 문항을 추가하여 평가하였다. 그 결과 시각 자극의 유무에 의한 감성평가 결과에는 큰 영향을 미치지는 않았지만, 피험자가 소리만으로 직물이 마찰되는 장면을 의식적으로 상상해야 하는 심리적 부담을 줄여주었고, 주관적 평가 몰입도를 향상시켰다고 사료된다.
PDF

Target Speech Segregation Using Non-parametric Correlation Feature Extraction in CASA System (CASA 시스템의 비모수적 상관 특징 추출을 이용한 목적 음성 분리)

Choi, Tae-Woong;Kim, Soon-Hyub
- The Journal of the Acoustical Society of Korea
- /
- v.32 no.1
- /
- pp.79-85
- /
- 2013
Feature extraction of CASA system uses time continuity and channel similarity and makes correlogram of auditory elements for the use. In case of using feature extraction with cross correlation coefficient for channel similarity, it has much computational complexity in order to display correlation quantitatively. Therefore, this paper suggests feature extraction method using non-parametric correlation coefficient in order to reduce computational complexity when extracting the feature and tests to segregate target speech by CASA system. As a result of measuring SNR (Signal to Noise Ratio) for the performance evaluation of target speech segregation, the proposed method shows a slight improvement of 0.14 dB on average over the conventional method.
https://doi.org/10.7776/ASK.2013.32.1.079 인용 PDF KSCI

Development of multilayer actuators with single crystals for implantable middle ears (압전 단결정 재료를 이용한 인공중이용 적층형 액츄에이터의 개발)

Seon J. H.;Lee S. S.;Roh Y. R.
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.315-318
- /
- 2004
이식형 인공중이에 있어 그 특성은 트랜스듀서의 성능에 따라 크게 좌우된다. 따라서 성능이 우수한 인공중이 제작을 위해서는 트랜스듀서의 주파수 특성 및 구동 성능이 우수해야 하고 인체 내 이식을 위해서는 그 크기가 작아야 한다. 본 연구에서는 인공중이용 소형 트랜스듀서로서 단결정 압전 재료인 PMN-PT를 이용한 적층형 액츄에이터를 제안하였다. 또한 제안된 모델을 두께 0.2mm를 갖는 $1mm{\times}1mm$ 크기의 PMN-PT 시편을 14층으로 쌓아 2.8mm 두께로 제작하였고, 이때 절연층으로 P.R을 사용하였다. 제작된 트랜스듀서의 성능은 Impedance Spectrum, 구동변위 측정 및 구동력의 계산을 통해 평가하였다. 이를 통해 PMN-PT를 재료로 사용한 적층형 액츄에이터의 성능이 기존의 PZT를 재료로 사용한 Bimorph 액츄에이터보다 훨씬 뛰어날 뿐만 아니라 청각 장애가 심한 고도난청자들에게 적용이 가능한 이식형 인공중이용 트랜스듀서로서 충분한 성능을 가지고 있음을 입증하였다.
PDF

Search Result 229, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)