통합 검색 | Korea Science

방송뉴스 인식에서의 잡음 처리 기법에 대한 고찰 (A Study on Noise-Robust Methods for Broadcast News Speech Recognition)

정용주
- 대한음성학회지:말소리
- /
- 제50호
- /
- pp.71-83
- /
- 2004
Recently, broadcast news speech recognition has become one of the most attractive research areas. If we can transcribe automatically the broadcast news and store their contents in the text form instead of the video or audio signal itself, it will be much easier for us to search for the multimedia databases to obtain what we need. However, the desirable speech signal in the broadcast news are usually affected by the interfering signals such as the background noise and/or the music. Also, the speech of the reporter who is speaking over the telephone or with the ill-conditioned microphone is severely distorted by the channel effect. The interfered or distorted speech may be the main reason for the poor performance in the broadcast news speech recognition. In this paper, we investigated some methods to cope with the problems and we could see some performance improvements in the noisy broadcast news speech recognition.
PDF

Korean Broadcast News Transcription Using Morpheme-based Recognition Units

Kwon, Oh-Wook;Alex Waibel
- The Journal of the Acoustical Society of Korea
- /
- 제21권1E호
- /
- pp.3-11
- /
- 2002
Broadcast news transcription is one of the hardest tasks in speech recognition because broadcast speech signals have much variability in speech quality, channel and background conditions. We developed a Korean broadcast news speech recognizer. We used a morpheme-based dictionary and a language model to reduce the out-of·vocabulary (OOV) rate. We concatenated the original morpheme pairs of short length or high frequency in order to reduce insertion and deletion errors due to short morphemes. We used a lexicon with multiple pronunciations to reflect inter-morpheme pronunciation variations without severe modification of the search tree. By using the merged morpheme as recognition units, we achieved the OOV rate of 1.7% comparable to European languages with 64k vocabulary. We implemented a hidden Markov model-based recognizer with vocal tract length normalization and online speaker adaptation by maximum likelihood linear regression. Experimental results showed that the recognizer yielded 21.8% morpheme error rate for anchor speech and 31.6% for mostly noisy reporter speech.
PDF KSCI

방송 메시지 전달 속도의 통시적 비교에 관한 연구: 라디오뉴스 전달 속도 분석을 중심으로 (A Comparative Study of the Diachronic Change in the Transmission Rate of Broadcast Messages)

박경희
- 대한음성학회지:말소리
- /
- 제64호
- /
- pp.15-37
- /
- 2007
The purpose of this paper is to examine the change of the times on the transmission rate of broadcast message. In order to find out the research results, I collected past recorded news tapes and selected 22 radio news out from era of Japanese Imperialism, 1950's, 1960's and contemporary age. Next I measured each announcer's reading rate, and compared change on news-reading rate between present and past approximately 50 years ago. The results of study with such procedures and methods are as follows : the average reporting rate of newscasters in each era is different. From these results, we can easily grasp diachronic change in the transmission rate of broadcast message. Namely, the results show us that present announcers read news faster than the group of past era of Japanese Imperialism by 68%.
PDF

Retrieval of Broadcast News Using Audio Content Analysis

Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea
- /
- 제26권3E호
- /
- pp.74-79
- /
- 2007
In this paper, we report our recent work on a indexing and retrieval system of broadcast news using audio content analysis. Key issues addressed in this work are two major parts of the audio indexing system: anchorperson detection based on audio segmentation, and phone-based spoken document retrieval, developed in the framework of the emerging MPEG-7 standard. Experiments are conducted on a database of Britisch broadcast news videos. We discuss the development of the retrieval system, and the evaluation of each part and the retrieval system.
PDF KSCI

방송 뉴스 인식을 위한 언어 모델 적응 (Language Model Adaptation for Broadcast News Recognition)

김현숙;전형배;김상훈;최준기;윤승
- 대한음성학회지:말소리
- /
- 제51호
- /
- pp.99-115
- /
- 2004
In this parer, we propose LM adaptation for broadcast news recognition. We collect information of recent articles from the internet on real time, make a recent small size LM, and then interpolate recent LM with a existing LM composed of existing large broadcast news corpus. We performed interpolation experiments to get the best type of articles from recent corpus because collected recent corpus is composed of articles which are related with test set, and which are unrelated. When we made an adapted LM using recent LM with similar articles to test set through Tf-Idf method and existing LM, we got the best result that ERR of pseudo-morpheme based recognition performance has 17.2 % improvement and the number of OOV has reduction from 70 to 27.
PDF

언어모델 인터뷰 영향 평가를 통한 텍스트 균형 및 사이즈간의 통계 분석 (Statistical Analysis Between Size and Balance of Text Corpus by Evaluation of the effect of Interview Sentence in Language Modeling)

정의정;이영직
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 2002년도 하계학술발표대회 논문집 제21권 1호
- /
- pp.87-90
- /
- 2002
This paper analyzes statistically the relationship between size and balance of text corpus by evaluation of the effect of interview sentences in language model for Korean broadcast news transcription system. Our Korean broadcast news transcription system's ultimate purpose is to recognize not interview speech, but the anchor's and reporter's speech in broadcast news show. But the gathered text corpus for constructing language model consists of interview sentences a portion of the whole, $15\%$ approximately. The characteristic of interview sentence is different from the anchor's and the reporter's in one thing or another. Therefore it disturbs the anchor and reporter oriented language modeling. In this paper, we evaluate the effect of interview sentences in language model for Korean broadcast news transcription system and analyze statistically the relationship between size and balance of text corpus by making an experiment as the same procedure according to varying the size of corpus.
PDF

CNN을 활용한 방송 뉴스의 감정 분석 (Analysis of Emotions in Broadcast News Using Convolutional Neural Networks)

남영자
- 한국정보통신학회논문지
- /
- 제24권8호
- /
- pp.1064-1070
- /
- 2020
한국의 영상기반 뉴스 미디어는 크게 지상파 방송, 종합편성 방송, 그리고 유튜브 방송과 같은 온라인 미디어로 나뉘어진다. 최근 이들 미디어의 방송 뉴스는 특정 시청자를 목표로 삼아 공정성과 중립성을 기대할 수 없는 주관적, 감정적인 성향의 내용을 송출하는 경향이 있다는 지적을 받고 있다. 이러한 양상은 시청자의 이슈 지각에 부정적인 영향을 미칠 수 있다. 이에 본 연구는 그 결과는 영상기반 미디어 뉴스 유형별로 감정 유형을 드러내는 성향의 차이가 존재하는지, 그리고 만약 차이가 존재한다면, 그 양상은 어떠한지를 살펴보았다. 감정 유형은 '딥러닝' 기법인 Convolutional Neural Network를 사용하여 중립, 행복, 슬픔 그리고 분노와 관련하여 분석하였다. 분석 결과, 전반적으로 뉴스 보도가 감정을 드러내는 성향이 있음을 보여주었다. 본 연구는 방송 뉴스에서 표출되는 감정을 다룬 첫 양적 연구이자 방송 뉴스 감정 분석에서 딥러닝을 사용한 첫 사례이다.
https://doi.org/10.6109/jkiice.2020.24.8.1064 인용 PDF KSCI

Korean LVCSR for Broadcast News Speech

Lee, Gang-Seong
- The Journal of the Acoustical Society of Korea
- /
- 제20권2E호
- /
- pp.3-8
- /
- 2001
In this paper, we will examine a Korean large vocabulary continuous speech recognition (LVCSR) system for broadcast news speech. The combined vowel and implosive unit is included in a phone set together with other short phone units in order to obtain a longer unit acoustic model. The effect of this unit is compared with conventional phone units. The dictionary units for language processing are automatically extracted from eojeols appearing in transcriptions. Triphone models are used for acoustic modeling and a trigram model is used for language modeling. Among three major speaker groups in news broadcasts-anchors, journalists and people (those other than anchors or journalists, who are being interviewed), the speech of anchors and journalists, which has a lot of noise, was used for testing and recognition.
PDF

ETRI 방송뉴스음성인식시스템 소개 (Introduction of ETRI Broadcast News Speech Recognition System)

박준
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2006년도 춘계 학술대회 발표논문집
- /
- pp.89-93
- /
- 2006
This paper presents ETRI broadcast news speech recognition system. There are two major issues on the broadcast news speech recognition: 1) real-time processing and 2) out-of-vocabulary handling. For real-time processing, we devised the dual decoder architecture. The input speech signal is segmented based on the long-pause between utterances, and each decoder processes the speech segment alternatively. One decoder can start to recognize the current speech segment without waiting for the other decoder to recognize the previous speech segment completely. Thus, the processing delay is not accumulated. For out-of-vocabulary handling, we updated both the vocabulary and the language model, based on the recent news articles on the internet. By updating the language model as well as the vocabulary, we can improve the performance up to 17.2% ERR.
PDF

온라인 방송의 뉴스기사 유형에 대한 분석 -네이버 뉴스스탠드의 방송사 홈페이지를 중심으로- (Analysis of the Types of News Stories on the Online Broadcast -Focusing upon the Broadcasting Websites of NAVER Newsstand-)

박광순
- 디지털융복합연구
- /
- 제19권3호
- /
- pp.177-185
- /
- 2021
본 연구는 네이버 뉴스스탠드의 9개 방송사 홈페이지 뉴스기사에 대한 분석을 통해 온라인 방송의 뉴스기사 유형은 어떻게 구성되고 있는가를 파악하기 위해 실시되었다. 분석을 위해 1개 방송 당 30일 분량으로 9개 방송을 대상으로 총 270일간의 샘플을 선정하였다. 분석방법은 방송사 간 차이검정을 위해 일원분산분석(One-way ANOVA) 기법을 이용하였다. 분석은 언어구성에 의한 뉴스기사 유형, 기사내용에 따른 장르 유형 등을 중심으로 이루어졌다. 분석결과 오프라인 방송에서는 모든 프로그램이 비디오기사 유형으로 제작·송신되고 있는 것에 반해 온라인 방송에서는 약 50% 정도가 사진기사와 텍스트기사로 구성되었다. 온라인 신문에서 비디오기사나 컴퓨터 그래픽을 이용한 동영상 중심의 새로운 기사 유형을 제작·공급하고 있으나 온라인 방송에서는 신문의 주요 기사유형인 사진과 텍스트기사를 적극적으로 활용하고 있었다. 이 같은 결과를 통해 온라인 미디어 환경에서의 미디어 간 경계가 더욱 불분명해지고 있으며, 방송기사 유형의 올드화 현상을 파악할 수 있었다.
https://doi.org/10.14400/JDC.2021.19.3.177 인용 PDF KSCI

검색결과 97건 처리시간 0.02초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)