Search | Korea Science

음성과 인상의 관계: 예비 연구

문승재
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.387-389
- /
- 1998
사람의 음성을 들으면, 사람은 물론이지만 처음 듣는 목소리에 대해서도 그 목소리의 주인공에 대한 막연한 이상이 그려지게 된다. 본 논문은 이러한 현상이 얼마나 신빙성이 있는지, 즉, 음성만을 듣고 짐작한 그 음성의 주인공의 모습과 실제의 모습이 과연 얼마나 관계가 있는지를 알아보고자 하는 연구의 초기 단계에 대한 보고이다. 본 연구에서는 이처럼 음성이 전달하는 시각적 정보의 신빙성을 확인하기 위하여 남녀 각 8명의 사진을 찍고, 같은 내용의 짧은 문장을 녹취한 후, 100명 이상의 피실험자들에게 개별적으로 녹음을 듣고 가장 잘 어울릴 듯한 사진을 고르도록 할 것이다. 우선적으로 여성 8명의 녹음을 약간명(48명)의 피실험자에게 들려주어 실험한 결과, 목소리의 주인공을 바로 찾는 경우는 드물었지만, 흥미로운 것은 비록 틀린 경우라도 어떤 특정한 목소리는 어느 특정한 사진과 집중적으로 연결되었다는 것이다. 이 결과를 source-filter theory와 연관시켜 생각해보고, 이를 바탕으로 좀 더 구체적인 앞으로의 연구방향을 제시한다.
PDF

Speech Information conviction IT delivery system develop study (음성 정보 확인 및 전달 시스템 개발 연구)

Park, Jin-Ho;Kim, Bok-Young;Choi, Sung
- Proceedings of the Korea Technology Innovation Society Conference
- /
- 2001.11a
- /
- pp.187-190
- /
- 2001
음성정보 산업은 오래 전부터 정보화 사회의 핵심 분야로서 잠재가치를 인정받아 왔지만, 불행히도 기술개발 및 시장 수요가 뒷받침되지 않아 먼 미래의 산업으로 인식되어 왔다. 그러나 최근 음성처리기술이 가시화 되고 인터넷 이용이 활성화되면서 보이스포탈, 음성 증권정보 등 음성과 인터넷이 결합된 서비스가 급격히 증가하고 있다. 현재 인터넷의 사용이 많아지면서 회원가입을 받는 사이트들이 늘어나게 되었다. 본 연구에서는 이러한 웹사이트의 회원가입에서의 고객의 신분 확인 및 회원의 검증과 청소년 고객의 고의적인 오류자료 입력 방지와 전화로 고객을 응대하는 업무가 집중적으로 많은 콜 센터 등의 응용 프로그램 환경에 있어서 전화 통신의 편리성, 응대 시간 축소, 전화 업무 자동화 기능 등을 가능하게 하기 위한 기술의 개발에 대해 연구하였다.
PDF

The Design and Implementation of Personal Audio Recorder Service (개인 오디오 레코더 서비스 설계 및 구현)

Kim, Do-Hyung;Yun, Min-Hong;Kim, Sun-ja;Lee, Kyung-Hee
- Proceedings of the Korea Information Processing Society Conference
- /
- 2007.11a
- /
- pp.727-728
- /
- 2007
본 논문에서는 음성통화를 위해 CDMA 네트워크와 데이터 통신을 위해 와이브로 네트워크를 동시에 사용하는 임베디드 리눅스 기반 듀얼모드 응용 서비스인 개인 오디오 레코더의 구현에 대해서 기술한다. 개인 오디오 레코더는 듀얼모드 지원 단말에 탑재된 클라이언트에서 음성 녹음을 시작하면, 송신자와 수신자의 CDMA 음성 데이터가 와이브로 네트워크를 통해 인터넷 상의 개인 오디오 레코더 서버로 전달된다. 개인 오디오 레코더 서버는 통화 번호 및 통화 시간에 따라 음성 데이터를 저장하게 된다. 구현된 개인 오디오 레코더는 단말의 저장공간이 부족한 환경에서도 음성통화 내용을 저장할 수 있도록 한다.
https://doi.org/10.3745/PKIPS.y2007m11a.727 인용 PDF

On a Speech Coding Algorithm for Low Cost Implementation of Voice Telegram System (보이스 전보 시스템 구현을 위한 저가형 음성파형 부호화 알고리즘)

나덕수;민소연;배명진
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.2
- /
- pp.101-105
- /
- 2000
A telegram has been used to transmit the emergency news or celebration message. So, it has been very important media in our life. Although the telegram processing is more and more convenient, on the other hand, the telegram service contains only text message. The voice telegram is that delivering user's voice with text message. So, the voice telegram can be delivered sender's emotions and feelings. However, since voice information contains lots of data, large memory size and high cost processor are needed to deliver itself. In this paper, we proposed a new speech waveform coding method that has low complexity and low cost implementation for the voice telegram system. First, we fixed one basic speech waveform per pitch period and measured the waveform similarity between basic and neighbor speech waveform. Second, if the similarity satisfied threshold values, we compress the neighbor speech waveform with pitch and magnitude value per pitch period and if not, we save speech waveform. When the compression is about 45%, we obtained about 4 point in MOS.
PDF

A Study on the Effective Command Delivery of Commanders Using Speech Recognition Technology (국방 분야에서 전장 소음 환경 하에 음성 인식 기술 연구)

Yeong-hoon Kim;Hyun Kwon
- Convergence Security Journal
- /
- v.24 no.2
- /
- pp.161-165
- /
- 2024
Recently, speech recognition models have been advancing, accompanied by the development of various speech processing technologies to obtain high-quality data. In the defense sector, efforts are being made to integrate technologies that effectively remove noise from speech data in noisy battlefield situations and enable efficient speech recognition. This paper proposes a method for effective speech recognition in the midst of diverse noise in a battlefield scenario, allowing commanders to convey orders. The proposed method involves noise removal from noisy speech followed by text conversion using OpenAI's Whisper model. Experimental results show that the proposed method reduces the Character Error Rate (CER) by 6.17% compared to the existing method that does not remove noise. Additionally, potential applications of the proposed method in the defense are discussed.
https://doi.org/10.33778/kcsa.2024.24.2.161 인용 PDF HTML

Performance Analysis for Call Processing in NGN Voice Services (NGN에서 음성서비스의 호 처리 성능해석)

정문조;황찬식
- Journal of the Institute of Electronics Engineers of Korea TC
- /
- v.40 no.11
- /
- pp.42-50
- /
- 2003
In this paper we propose a method of evaluating the performance of a Softswitch that provides call control to voice services in NGN (next generation network). First, we describe the architecture for voice services in NGN and anatomize the call control processes such as call initiation, call re-initiation and call release of a voice connection. kiter that we propose a method of estimating appropriate server capacity of the Softswitch using approximate queuing model. Via numerical experiments we illustrate the implication of the work
PDF KSCI

The Effects of the Presentation Mode of Web Contents on the Children's Information Processing Process (웹 콘텐츠의 정보제시유형이 어린이 뉴스정보처리과정에 미치는 영향)

Choi E-Jung
- The Journal of the Korea Contents Association
- /
- v.5 no.3
- /
- pp.113-122
- /
- 2005
The major purpose of this study is to explore the effect of the presentation undo combined by main four media(moving Image, audio, turf image) of web contents on the children's information processing process. So children were assigned to one of five experimental medium conditions: 'moving Image1 (auditory-visual redundancy)', 'moving Image2 (auditory-visual dissonance)', 'text', 'text-with-image', 'audio'. Results indicated that the moving image was found to be the most effective transmitter of internet news information for children's recall. And the recall advantage of moving image was found to be particularly pronounced for verbal information supplemented with redundant visual.
PDF

LED PANNEL with Automobile Signal Controller and Advertising Board used to Local area Network (LED PANNEL을 사용하여 근거리 무선 통신망을 연결한 자동차 신호 제어기 및 광고판)

Park, Jin Ki;kim, young-kil
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2018.05a
- /
- pp.533-535
- /
- 2018
In the 21st century, in which the accident rate is rapidly increasing in proportion to the development of automobiles, In order to reduce the number of accidents, this paper was written for the convenience of the elderly people with disabilities and the handicapped. When a driver's safety accidents and various signals are transmitted through a smart phone by voice, the voice signal is processed as a video signal through the rear LED pannel of the vehicle, so that an urgent situation or a current state can be clarified It is also possible to use the local area network as a billboard and I would like to propose a study to show the advertising effect and current traffic situation.
PDF

A Comparative Study of the Diachronic Change in the Transmission Rate of Broadcast Messages (방송 메시지 전달 속도의 통시적 비교에 관한 연구: 라디오뉴스 전달 속도 분석을 중심으로)

Park, Kyung-Hee
- MALSORI
- /
- no.64
- /
- pp.15-37
- /
- 2007
The purpose of this paper is to examine the change of the times on the transmission rate of broadcast message. In order to find out the research results, I collected past recorded news tapes and selected 22 radio news out from era of Japanese Imperialism, 1950's, 1960's and contemporary age. Next I measured each announcer's reading rate, and compared change on news-reading rate between present and past approximately 50 years ago. The results of study with such procedures and methods are as follows : the average reporting rate of newscasters in each era is different. From these results, we can easily grasp diachronic change in the transmission rate of broadcast message. Namely, the results show us that present announcers read news faster than the group of past era of Japanese Imperialism by 68%.
PDF

Speech Intelligibility Analysis on the Vibration Sound of the Window Glass of a Conference Room (회의실 유리창 진동음의 명료도 분석)

Kim, Yoon-Ho;Kim, Hee-Dong;Kim, Seock-Hyun
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2006.11a
- /
- pp.150-155
- /
- 2006
Speech intelligibility is investigated on a conference room-window glass coupled system. Using MLS(Maximum Length Sequency) signal as a sound source, acceleration and velocity responses of the window glass are measured by accelerometer and laser doppler vibrometer. MTF(Modulation Transfer Function) is used to identify the speech transmission characteristics of the room and window system. STI(Speech Transmission Index) is calculated by using MTF and speech intelligibility of the room and the window glass is estimated. Speech intelligibilities by the acceleration signal and the velocity signal are compared and the possibility of the wiretapping is investigated. Finally, intelligibility of the conversation sound is examined by the subjective test.
PDF

Search Result 485, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)