Search | Korea Science

Audio-Visual Content Analysis Based Clustering for Unsupervised Debate Indexing (비교사 토론 인덱싱을 위한 시청각 콘텐츠 분석 기반 클러스터링)

Keum, Ji-Soo;Lee, Hyon-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.5
- /
- pp.244-251
- /
- 2008
In this research, we propose an unsupervised debate indexing method using audio and visual information. The proposed method combines clustering results of speech by BIC and visual by distance function. The combination of audio-visual information reduces the problem of individual use of speech and visual information. Also, an effective content based analysis is possible. We have performed various experiments to evaluate the proposed method according to use of audio-visual information for five types of debate data. From experimental results, we found that the effect of audio-visual integration outperforms individual use of speech and visual information for debate indexing.
https://doi.org/10.7776/ASK.2008.27.5.244 인용 PDF KSCI

A Study on the Effectiveness of the Lungs Hand Acupuncture Based on Bio Signal Analysis (생체신호분석 기술을 적용한 폐 수지침 요법에 대한 효과성 연구)

Kim, Bong-Hyun;Cho, Dong-Uk
- The KIPS Transactions:PartB
- /
- v.19B no.2
- /
- pp.77-82
- /
- 2012
We carried out study to prove effectiveness as stimulating corresponding points to lung in hand to experiment applied analysis parameters for image and audio signals in this paper. To this end we collected facial image and voice before and after stimulating corresponding points to lung in hand to a male 20s 25 people. In addition, we analyzed change color, voice energy and speaking rate of right cheek area corresponding points to lung to suggest the theory of the Oriental medicine diagnosis based on data collected. As a result, after performing hand acupuncture, L value of right cheek area decreased average 2.33 and a value b value increased 0.76, 0.97 on average. In addition, size of voice energy increased average 0.42, speaking rate decreased average 0.07. In other words, effect of lung function was improved using hand acupuncture corresponding points to lung.
https://doi.org/10.3745/KIPSTB.2012.19B.2.077 인용 PDF KSCI

Chest Girth Prediction Method Using Voice Signals Analysis Technology : Focusing on Men in the 20's (음성신호 분석 기술을 이용한 흉위 예측 기법 : 20대 남성을 대상으로)

Kim, Bong-Hyun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.16 no.9
- /
- pp.2031-2036
- /
- 2012
There is body type that physique classified by apparent characteristics as shape of human body. Chest girth circumference and body type statistically has been look into correlative disposition, character etc. In this paper, we carried out study about prediction of chest girth as voice that interrelationship drew to analyze voice of disposition, character etc. in personal character. With this in mind, we measured intensity, spectrum about laughter by chest girth to classify composition group of subjects and then we would like to extract experiment result to predict chest girth by reciprocal comparison.
https://doi.org/10.6109/jkiice.2012.16.9.2031 인용 PDF KSCI

Big data for Speech and Language Processing (빅데이터 기반 음성언어 처리 기술)

Na, S.H.;Jung, H.Y.;Yang, S.I.;Kim, C.H.;Kim, Y.K.
- Electronics and Telecommunications Trends
- /
- v.28 no.1
- /
- pp.52-61
- /
- 2013
음성언어 처리 분야는 인간의 자연어 발화를 컴퓨터가 자동으로 이해하고 처리하는 알고리즘을 연구하는 분야로, 자동 통번역, Siri와 같은 음성 대화 시스템, 차세대 인터페이스, 질의 응답 시스템 등 다양한 응용군을 포함한다. 특히, 음성언어 처리 기술은, 최근 빅데이터(big data) 시대를 맞이하여, 방대한 음성/텍스트 정보를 처리하기 위한 필수 기술로 각광받고 있다. 한편, 빅데이터는 그 자체가 거대한 말뭉치 데이터로서 음성언어 처리 기술의 성능을 향상시키는 주된 리소스가 된다. 이에 따라, 최근 빅데이터를 이용하여 음성언어 처리 기술의 성능을 개선시키고자 하는 연구가 활발히 진행되고 있는데, 본고에서는 이들 연구의 배경 및 연구 동향들을 소개하기로 한다.
PDF

Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science (Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석)

최홍식;김성수
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.12 no.2
- /
- pp.115-120
- /
- 2001
In this paper, we synthesized vowel /ae/ with voice synthesis program of Dr. Speech Science, and we also synthesized pathologic vowel /ae/ by some parameters such as high frequency gain (HFG), low frequency gain(LFG), pitch flutter(PF) which represents jitter value and flutter of amplitude(FA) which represents shimmer value, and grade ranked as mild, moderate and severe respectively. And then we analysed all pathologic voice by analysis program of Dr. Speech Science. We expect that this synthesized pathologic voices are useful for understanding the parameter such as noise, jitter and shimmer and feedback effect to patient with voice disorder.
PDF

On a Template Extraction of phrase unit by Pitch Searching (피치 검색에 의한 Phrase 단위의 Template 추출에 관한 연구)

Kim JongKuk;Bae MyungJin
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.77-80
- /
- 2004
원화자로부터 목표 화자의 음성으로 변환을 위해서는 음운 및 피치변환이 이루어져야 한다. 원 음성과 목표 음성 신호 사이에 따른 발성길이, 크기 및 피치 등의 운율 특성은 화자의 개인성 및 발성문장의 의도를 나타내는 주요 역할을 한다. 본 논문에서는 음성 변환을 수행하기 위하여 발성된 음성의 강세구(phrase)단위의 피치 검출을 통하여 템플릿을 추출하는 방법을 제안한다. 우선 한국어의 운율구에 대한 정보가 필요한 것인지, 한국어는 어떤 운율 구조를 갖는지에 대하여 알아본다. 마지막으로 어떻게 연속음성으로부터 한국어에 적당한 운율구 단위를 나눌 것인지, 즉 자동 세그멘테이션 및 레이블링에 대하여 분석한다. 또한 논문에서는 한국어 문장음성의 운율구를 강세구와 억양구로 나누고 육안으로 표시한 운율구 단위를 기준으로 이 운율구 단위에 적합한 특징을 추출하여 패턴을 작성한다.
PDF

Statistical Korean Spoken Language Understanding System for Dialog Processing (대화처리를 위한 통계기반 한국어 음성언어이해 시스템)

Roh, Yoon-Hyung;Yang, Seong-II;Kim, Young-Gil
- Annual Conference on Human and Language Technology
- /
- 2012.10a
- /
- pp.215-218
- /
- 2012
본 논문에서는 한국어 대화 처리를 위한 통계기반 음성언어이해 시스템에 대해 기술한다. 음성언어이해시스템은 대화처리에서 음성 인식된 문장으로부터 사용자의 의도를 인식하여 의미표현으로 표현하는 기능을 담당한다. 한국어의 특성을 반영한 실용적인 음성언어이해 시스템을 위해서 강건성과 적용성, 확장성 등이 요구된다. 이를 위해 본 시스템은 음성언어의 특성상 구조분석을 하지 않고, 마이닝 기법을 이용하여 사용자 의도 표현을 생성하는 방식을 취하고 있다. 또한 한국어에서 나타나는 특징들에 대한 처리를 위해 자질 추가 및 점규화 처리 등을 수행하였다. 정보서비스용 대화처리 시스템을 대상으로 개발되고 있고, 차량 정보서비스용 학습 코퍼스를 대상으로 실험을 하여 문장단위 정확률로 약 89%의 성능을 보이고 있다.
PDF

The Recent Trends and Applications of Embedded TTS Technologies (내장형 음성합성 기술 동향 및 사례)

Kim, Jong-Jin;Kim, Jeong-Se;Kim, Sang-Hun;Park, Jun
- Electronics and Telecommunications Trends
- /
- v.23 no.1 s.109
- /
- pp.77-88
- /
- 2008
음성합성 기술은 1990년대 중반 음편접합 방법론이 출현하면서 괄목한 만한 기술적 발전을 이루어, 2000년 전후에는 전화망을 이용한 ARS, VMS, UMS 서비스를 중심으로 폭넓게 사용되면서 일반 사용자들에게 매우 친숙한 서비스를 제공하여 왔다. 그러나 최근 텔레포니 기반의 음성 기술 시장은 기업고객 위주로 그 성장이 더딘 반면, 지능형 로봇, 텔레매틱스, 홈네트워크, 차세대 PC와 같은 전략적 국가 신성장동력 산업분야나 MP3 플레이어, 휴대폰, PMP 단말기, 휴대용 단말기와 같은 임베디드 분야가 음성 기술의 새로운 시장으로 주목을 받고 있다. 임베디드 분야에서 요구하는 음성 기술은 기존 서버급 시스템에서 운영되었던 기술과는 상당히 다른 기술 특성을 가지고 있다. 이에 본 고에서는 음성 기술 중 특히 음성합성 기술에 관한 임베디드 분야의 요구사항을 고찰하고, 이를 해결하기 위한 최근의 기술적 발전 동향 및 응용 사례에 대해서 기술하고자 한다.
https://doi.org/10.22648/ETRI.2008.J.230108 인용 PDF

신성장동력산업용 대어휘 음성인식 기술 동향 및 응용

Gang, Jeom-Ja;Gang, Byeong-Ok;Jeong, Ho-Yeong;Jeong, Hun;Lee, Yun-Geun
- Electronics and Telecommunications Trends
- /
- v.23 no.1 s.109
- /
- pp.65-76
- /
- 2008
신성장동력산업용 음성인식 기술은 지능형 로봇, 텔레매틱스, 홈네트워크, 차세대 PC, 디지털 콘텐츠 검색 등에 음성인식 기술을 적용하기 위한 것이다. 음성인식 기술은 사람이 일상생활 속에서 사용하는 단말기들의 제어나 정보 서비스를 마우스나 키보드를 사용하지 않고, 사람이 갖는 가장 친화적이면서 편리한 의사소통 도구인 목소리를 사용하여 원하는 단말기의 제어나 정보 서비스를 제공 받을 수 있도록 지원하는 기술을 말한다. 본 고에서는 음성인식 기술의 발전과정을 통한 음성인식 기술의 발전 동향에 대해서 설명하고, 신성장동력산업 분야의 인터페이스로 음성인식 기술을 적용한 핵심 요소 기술에 대한 개발 동향과 응용 사례에 대해서 기술한다.
https://doi.org/10.22648/ETRI.2008.J.230107 인용 PDF

A Study on Preprocessing for Elderly Voice Recognition (노인음성인식을 위한 전처리에 관한 연구)

Park, Ji-Woong;Lee, Seoung-Jun;Kwon, Soonil
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.11a
- /
- pp.1646-1648
- /
- 2013
고령화 되어 가는 현대 사회에서 노인들이 일반 성인과 동등한 수준에서 정보를 접근 가능하도록 스마트기기의 손쉬운 인터페이스 방법이 요구된다. 음성 인터페이스는 노인들의 스마트기기 활용도를 높여 줄 수 있지만, 성능이 평균적 성인연령 대의 발성행태에 최적화되어 있어, 노인들이 사용할 경우 음성인식률 저하를 초래한다. 그래서 노인 친화형 음성 인터페이스를 개발하기 위한 일환으로 노인음성에 대한 인식률을 향상시켜 줄 수 있는 전처리 알고리즘을 개발하고자 한다. 이를 위해 노인층과 청년층을 대상으로 음성샘플을 수집하여 분석하였고, 그 결과 노인이 청년에 비해 발성속도가 느리며 이는 스마트기기의 음성인식 기능저하로 이어진다는 것을 확인할 수 있었다.
https://doi.org/10.3745/PKIPS.y2013m11a.1646 인용 PDF

Search Result 3,084, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)