통합 검색 | Korea Science

자동이득 조절에서 해제시간에 따른 어음인지점수 변화 (The Word Recognition Score According to Release Time on Automatic Gain Control)

황세미;전유용;박헌진;송영록;이상민
- 대한의용생체공학회:의공학회지
- /
- 제31권5호
- /
- pp.385-394
- /
- 2010
Automatic gain control(AGC) is used in hearing aids to compensate for the hearing level as to reduced dynamic range. AGC is consisted of the main 4 factors which are compression threshold, compression ratio, attack time, and release time. This study especially focus on each individual need for optimum release time parameters that can be changed within 7 certain range such as 12, 64, 128, 512, 2094, and 4096ms. To estimate the effect of various release time in AGC, twelve normal hearing and twelve hearing impaired listeners are participated. The stimuli are used by one syllable and sentence which have the same acoustic energy respectively. Then, each of score of the word recognition score is checked in quiet and noise conditions. As a result, it is verified that most people have the different best recognition score on specific release time. Also, if hearing aids is set by the optimum release time in each person, it is helpful in speech recognition and discrimination.
https://doi.org/10.9718/JBER.2010.31.5.385 인용 PDF KSCI

A Machine-Learning Based Approach for Extracting Logical Structure of a Styled Document

Kim, Tae-young;Kim, Suntae;Choi, Sangchul;Kim, Jeong-Ah;Choi, Jae-Young;Ko, Jong-Won;Lee, Jee-Huong;Cho, Youngwha
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제11권2호
- /
- pp.1043-1056
- /
- 2017
A styled document is a document that contains diverse decorating functions such as different font, colors, tables and images generally authored in a word processor (e.g., MS-WORD, Open Office). Compared to a plain-text document, a styled document enables a human to easily recognize a logical structure such as section, subsection and contents of a document. However, it is difficult for a computer to recognize the structure if a writer does not explicitly specify a type of an element by using the styling functions of a word processor. It is one of the obstacles to enhance document version management systems because they currently manage the document with a file as a unit, not the document elements as a management unit. This paper proposes a machine learning based approach to analyzing the logical structure of a styled document composing of sections, subsections and contents. We first suggest a feature vector for characterizing document elements from a styled document, composing of eight features such as font size, indentation and period, each of which is a frequently discovered item in a styled document. Then, we trained machine learning classifiers such as Random Forest and Support Vector Machine using the suggested feature vector. The trained classifiers are used to automatically identify logical structure of a styled document. Our experiment obtained 92.78% of precision and 94.02% of recall for analyzing the logical structure of 50 styled documents.
https://doi.org/10.3837/tiis.2017.02.023 인용 PDF KSCI

자동 음성분할 및 레이블링 시스템의 성능향상 (Performance Improvement of Automatic Speech Segmentation and Labeling System)

홍성태;김제우;김형순
- 대한음성학회지:말소리
- /
- 제35_36호
- /
- pp.175-188
- /
- 1998
Database segmented and labeled up to phoneme level plays an important role in phonetic research and speech engineering. However, it usually requires manual segmentation and labeling, which is time-consuming and may also lead to inconsistent consequences. Automatic segmentation and labeling can be introduced to solve these problems. In this paper, we investigate a method to improve the performance of automatic segmentation and labeling system, where Spectral Variation Function(SVF), modification of silence model, and use of energy variations in postprocessing stage are considered. In this paper, SVF is applied in three ways: (1) addition to feature parameters, (2) postprocessing of phoneme boundaries, (3) restricting the Viterbi path so that the resulting phoneme boundaries may be located in frames around SVF peaks. In the postprocessing stage, positions with greatest energy variation during transitional period between silence and other phonemes were used to modify boundaries. In order to evaluate the performance of the system, we used 452 phonetically balanced word(PBW) database for training phoneme models and phonetically balanced sentence(PBS) database for testing. According to our experiments, 83.1% (6.2% improved) and 95.8% (0.9% improved) of phoneme boundaries were within 20ms and 40ms of the manually segmented boundaries, respectively.
PDF

한국어 규칙 동사와 불규칙 동사의 심성 어휘집 접근 과정 (The Lexical Access of Regular and Irregular Korean Verbs in the Mental Lexicon)

박희진;구민모;남기춘
- 인지과학
- /
- 제23권1호
- /
- pp.1-23
- /
- 2012
본 연구는 한국어 동사의 활용된 형태인 굴절 동사의 심성어휘집 접근 과정을 알아보기 위한 연구이다. 이를 위하여 차폐 점화 어휘 판단과제 실험을 실시하여 점화크기를 비교하였다. 한국어 규칙 동사와 불규칙 동사를 다섯 가지로 나누어 실험을 수행하였다. 활용의 종류는 1) 완전규칙 2) 발음변화규칙 3) 철자변화규칙 4) 어간변화 불규칙 5) 어미변화 불규칙으로 1), 2), 3)은 규칙 활용의 범주로 4), 5)는 불규칙활용의 범주이다. 기본형의 동사를 표적자극으로 사용하였고, 점화자극으로 총 세 가지 유형이 사용하였다. 점화자극으로 사용한 자극은 기본형의 어간의 변화가 없는 규칙활용, 기본형의 어간이 철자적, 음운적으로 변화하는 불규칙활용과, 의미 및 형태적으로 관련 없는 통제된 단어이다. 또한 단어재인의 처리에서 형태소 분해 정보처리의 시간대를 살펴보기 위하여 SOA의 간격을 43ms, 72ms, 230ms의 3가지로 나누어 실험하였다. 모든 동사가 모든 SOA에서 규칙활용과 불규칙활용이 통제단어에 비해 빠른 반응시간을 보임으로써 점화효과가 관찰되었다. 그러나 규칙활용과 불규칙활용에서 뚜렷이 점화효과의 차이가 관찰되지 않는다. 이러한 규칙활용과 불규칙활용의 범주의 구분 없이 비슷한 패턴을 보여주는 결과는 한국어가 단순히 규칙과 불규칙의 기준으로 나뉘어서 처리되지 않는다는 것을 시사한다. 또한 모든 SOA에서 촉진효과를 보임으로써 형태소 정보처리가 초기과정부터 일어남을 확인하였다.
PDF

통사적 제약과 화용적 제약이 문장의 표상과 기억접근에 미치는 효과 (The effect of syntatic and pragmatic Constraints on Sentential Representaition and Memory Accessibility)

김성일;이재호
- 인지과학
- /
- 제6권2호
- /
- pp.97-116
- /
- 1995
본 연구는 문장 표상형성 과정에서 통사적 제약과 화용적 제약이 시간경과에 따라 각 구성성분의 표상 및 기억접근에 어떠한 영향을 미치는지를 살펴보고자 실시되었다.통사적 제약과 화용적 제약을 분리시키기 위해 구성성분의 통사적 역할(주어,목적어)과 언급순서(첫째,둘째)를 조작하였고, 문장 구성성분의 표상강도를 기억접근의 용이서을 통해 살펴보기 위해 각 문장을 마디별로 제시한 후 목표단어의 재인 반응시간을 측정하였다. 탐사재인의 지연시간이 255ms인 실험 1에서는 주어가 목적어보다 그리고 먼저 언급된 정보가 나중에 언급된 정보보다 각각 28ms씩 기억접근 시간이 빠르 것으로 나타났으나,지연시간이 1540ms으로 기렁진 실험2에서는 주어와 목적어간의 기억접근 시간의 차이는 없었고 먼저 언급된 정보가 나중에 언급된 정보에 비해 기억접근 시간이 48ms가 빠른 것으로 나타났다.따라서 통사적 제약과 화용적 제약 모두 문장 표상형성 과정의 초기에는 독립적인 효과를 미치나 일정시간이 경과하면서 통사적 제약의 효과는 사라지며 화용적 제약의 효과만 남는다고 할수 있다.본 연구의 이러한 결과는 문장의 기억표상이 중다제약의 수렴적 만족에 으해서 점진적으로 심성모형을 형성하는 과정이라는 이론적 입장을 지지한다.
PDF

Lip-synch application을 위한 한국어 단어의 음소분할 (The segmentation of Korean word for the lip-synch application)

강용성;고한석
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2001년도 제14회 신호처리 합동 학술대회 논문집
- /
- pp.509-512
- /
- 2001
본 논문은 한국어 음성에 대한 한국어 단어의 음소단위 분할을 목적으로 하였다. 대상 단어는 원광대학교 phonetic balanced 452단어 데이터 베이스를 사용하였고 분할 단위는 음성 전문가에 의해 구성된 44개의 음소셋을 사용하였다. 음소를 분할하기 위해 음성을 각각 프레임으로 나눈 후 각 프레임간의 스펙트럼 성분의 유사도를 측정한 후 측정한 유사도를 기준으로 음소의 분할점을 찾았다. 두 프레임 간의 유사도를 결정하기 위해 두 벡터 상호간의 유사성을 결정하는 방법중의 하나인 Lukasiewicz implication을 사용하였다. 본 실험에서는 기존의 프레임간 스펙트럼 성분의 유사도 측정을 이용한 하나의 어절의 유/무성음 분할 방법을 본 실험의 목적인 한국어 단어의 음소 분할 실험에 맞도록 수정하였다. 성능평가를 위해 음성 전문가에 의해 손으로 분할된 데이터와 본 실험을 통해 얻은 데이터와의 비교를 하여 평가를 하였다. 실험결과 전문가가 직접 손으로 분할한 데이터와 비교하여 32ms이내로 분할된 비율이 최고 84.76%를 나타내었다.
PDF

Vincent6 DSP코어를 이용한 G.728 음성 부호화기의 실시간 구현 (Real-time implementation of the G.728 speech codec using the Vincent6 DSP core)

성호상
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
- /
- pp.131-135
- /
- 2000
본 논문에서는 고성능 고정 소수점 DSP (Digital Signal Processor) 코어인 Vincent6 코어 [1]를 이용하여 ITU-T C.728 음성 부호화기를 실시간으로 구현하였다 G.728 은 16 kb/s전송률의 ITU-T표준 음성 부호화기이며, 입력신호는 8 kHz로 샘플링되며 샘플 당 16 bit 로 양자화된 PCM 신호이다. G.728 은 LD-CELP(Low Delay Code Excited Linear Prediction)라고도 하며, 알고리 듬 delay는 0.625ms 이다. Vincent6 DSP core 는 VLIW (Very-Long Instruction Word) 특성을 가지므로 다중 명령 (multiple instruction)을 수행할 수 있다 이를 위해서 G.728 annex G를 이용하여 고정 소숫점 연산으로 코드를 작성한 후, 이를 vincent6 어셈블리 코드로 구현하였다. 최종적으로 구현된 코드는 ITU-T 의 test vector 에 대 해 bit exact 한 결과를 보이며 34 MCPS (Million Cycles Per Second)의 계산량을 가지며 사용 메모리크기는 데이터 메모리가 약 9KByte, 프로그램 메모리가 약 57 KByte 이다.
PDF

A Production and Perception Experiment of Korean Alveolar Fricatives

Yoon, Kyu-Chul
- 음성과학
- /
- 제9권3호
- /
- pp.169-184
- /
- 2002
Korean has two types of voiceless alveolar fricatives: a non-tense fricative /$S^{h}$ and a tense fricative /s'/. Twenty native speakers of Korean produced five pairs of isolated words containing word initial $S^{h}V$ and /s'V/ sequences where V was any one of five (/a, e, i, o, u/) of Korean vowels. Acoustic measures such as duration, fricative noise prominent frequency, energy change of following vowel, and fundamental frequency at vowel onset were examined. Results showed that among the parameters, aspiration noise duration of /s'/ in mid and low vowel contexts was less than 21 ms. In a perception experiment, where only the aspiration noise interval of the /$S^{h}$/ tokens was incrementally reduced, some listeners shifted perception from /$S^{h}$/ to /s'/.
PDF

유니코드 3.0의 CJK 한자 정렬 (A Sorting of Unicode 3.0 CJK Chinese Characters)

윤지헌;변정용
- 한국멀티미디어학회:학술대회논문집
- /
- 한국멀티미디어학회 2000년도 춘계학술발표논문집
- /
- pp.462-465
- /
- 2000
최근 많은 양의 문서가 전자화되어 컴퓨터에 저장되고 인터넷을 통하여 공유가 되고 있고, 그 범위를 고문헌에까지 넓혀가고 있다. 그러나 한자 문화권의 고문헌은 대부분 2만에서 3만여자의 한자로 작섣되어 있어서 한자 입력시 코드문제가 뒤따른다. 하지만 유니코드 3.0에서는 27,786자의 한자를 코드화 하여 놓아서 한자 문화권 나라에 많은 도움을 주고 있다. 하지만 한중일 3개국에서 많이 쓰이는 한자를 대상으로 하여 부수, 획수 순으로 정렬하여 국내 실정에 맞지 않고 그나마 유니코드 한자를 입력할 수 있는 환경도 MS Word 2000 정도로 제한적이다. 본 논문에서는 유니코드 3.0 한자 입력기에서 기본 한자 코드로 상요될 CJK 한자 영역에 배정된 한자를 정렬하는 방안을 제안하고 운영체제 독립적인 한자 입력 시스템에 활용한다.
PDF

HWP 문서와 EBKS 문서간의 변환 기법에 관한 연구 (The Study Conversion of EBKS and HWP Document Standards)

고승규;손원성;최윤철;정병희;이경호;임순범
- 한국멀티미디어학회:학술대회논문집
- /
- 한국멀티미디어학회 2001년도 추계학술발표논문집
- /
- pp.553-558
- /
- 2001
종이책의 디지털 형태인 전자책은 종이책에 비해 인쇄와 유통, 저장 관리가 효율적이고, 인터랙티브한 멀티미디어 정보 표현 등이 가능한 장점을 지니고 있기 때문에 향후 시장이 급성장할 것으로 예측되며, 현재 전자책 시장의 선결 조건인 전자책 문서 표준 및 저작권, 단말기의 해상도 문제 등이 하나씩 해결되어 가고 있다. 그러나 아직까지 전자책의 활성화를 위한 전자책 컨텐츠의 숫자가 많이 부족한 현실이다. 이에 본 연구에서는 전자책 컨텐츠를 확대하기 위하여 기존의 종이책으로 작성된 저작물을 전자책으로 변환하는 기법에 대해 연구한다. 특히 기존 종이책을 저작하는 문서 도구 중에서 가장 대중적이고 사용하기 쉬운 HWP문서와 EBKS 전자책 문서 표준간의 변환 기법에 대해 연구한다. 이 변환 기법은 단순히 HWP 환경에서만 적용가능한 것이 아니라 Quark이나 MS-WORD등의 다른 문서 저작도구에서도 사용가능한 일반적인 방법이다.
PDF

검색결과 74건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)