• Title/Summary/Keyword: 녹음기능

Search Result 70, Processing Time 0.021 seconds

A Study on Realization of Speech Recognition System based on VoiceXML for Railroad Reservation Service (철도예약서비스를 위한 VoiceXML 기반의 음성인식 구현에 관한 연구)

  • Kim, Beom-Seung;Kim, Soon-Hyob
    • Journal of the Korean Society for Railway
    • /
    • v.14 no.2
    • /
    • pp.130-136
    • /
    • 2011
  • This paper suggests realization method for real-time speech recognition using VoiceXML in telephony environment based on SIP for Railroad Reservation Service. In this method, voice signal incoming through PSTN or Internet is treated as dialog using VoiceXML and the transferred voice signal is processed by Speech Recognition System, and the output is returned to dialog of VoiceXML which is transferred to users. VASR system is constituted of dialog server which processes dialog, APP server for processing voice signal, and Speech Recognition System to process speech recognition. This realizes transfer method to Speech Recognition System in which voice signal is recorded using Record Tag function of VoiceXML to process voice signal in telephony environment and it is played in real time.

Implementation of the High-Quality Audio System with the Separately Processed Musical Instrument Channels (악기별 분리처리를 통한 고음질 오디오 시스템 구현)

  • Kim, Tae-Hoon;Lee, Sang-Hak;Kim, Dae-Kyung;Lee, Sang-Chan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.4
    • /
    • pp.346-353
    • /
    • 2013
  • This paper deals with the implementation of a high-quality audio system for karaoke. For improving the key/tempo changes performance, we separated the audio into many musical instrument channels. By separating musical instrument channels, high-quality key/tempo changes can be achieved and we confirmed this using the cross-correlation distribution and the MOS evaluation. The improved audio system was implemented using the TMS320C6747 DSP with fixed/floating-point operations. The implemented audio system can perform the multi-channel WMA decoding, the MP3 encoding/decoding, the wav playing, the EQ, and the key/tempo changes in real time. The WMA channels used for processing the separated instrument channels. The audio system includs the MP3 encoding/decoding function for playing and recording and the wav channel for the effect sound.

Constructive music creation: the process and effectiveness of sampling in computer-based electronic music production (구성적 음악 창작: 컴퓨터 기반 전자적 음악 프로덕션 상에서 샘플링의 과정과 효과)

  • Han, Jinseung
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.127-134
    • /
    • 2009
  • In spite of controversial debates on aesthetic issues of computer-generated electronic music, rapid advancement of music technologies in the past decade have resulted proliferation of using virtual software synthesizers and samplers in music composition. Computer-based music production platform has become not only a norm among some of contemporary music composers but also vital apparatus for their compositional process. There are two imperative parts of this compositional process involving sampling in computer-based music production, which are commercially available sample libraries that include pre-recorded audio samples, and music production software that processes them. The purpose of this study is to investigate the process and effectiveness of reconstructive compositional process utilizing distinctive features of sampling on computer music production software. This study addresses issues such as: the definition of audio sampling, how sampling is incorporated in compositional process, and what features of music production software are particularly effective in various musical expressions. The result of this study will hopefully accommodate and fulfill the needs of electronic and acoustic musicians' creativeness.

  • PDF

Design and Implementation of Authoring Tools for Multimedia Production (멀티미디어 제작을 위한 저작도구의 설계 및 구현)

  • Yoo Su-Mi;Baik Sung-Wook;Bang Kee-Chun
    • Journal of Digital Contents Society
    • /
    • v.4 no.1
    • /
    • pp.45-55
    • /
    • 2003
  • Due to the rapid development of information & communication technology under high performance computing environments, the multimedia production techniques have been applied to a variety of multimedia fields such as general banner advertisements including texts, images and animations, and the internet-broadcasting dealing with videos and sounds. This paper presents an authoring tool with main functions to setup events objects (image, animation, sound, button, area) and to setup action functions, so that non-experts can easily produce multimedia including images, sounds, animations and so on. The authoring tool implemented in Java can be applied to the CD-ROM title production as well as the web-site construction. We can expect that when this authoring tool is used for in multimedia production, both cost and time will be reduced due to its convenience and powerful functions. We have a future plan to integrate intelligent multimedia presentation techniques with the presented tool for the autonomous multimedia authoring works.

  • PDF

Kiosk for the Visually Impaired using Voice Recognition (음성인식 기능을 이용한 시각장애인용 키오스크)

  • Kim, Dae-Young;Lee, Ah-Hyun;Lee, Gun-Haeng;Kim, Se-Hyun;Lee, Boong-Joo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.5
    • /
    • pp.873-882
    • /
    • 2022
  • In this paper, we studied the voice recognition system kiosk for convenience, thinking that the kiosk widely used in modern society should compensate for the inconvenience of using by the visually impaired. Using ultrasonic sensor and PIR(Passive Infrared), it recognizes the visually impaired within the range of 80cm-40cm, introduces the kiosk through the MP3 module and induces them to come closer. Also, when the visually impaired within 40cm is recognized, the product description and order are guided through the MP3 module. A recording-based data voice recognition system and a kiosk that outputs desired items through servo motors were studied. A kiosk for the convenience of the visually impaired was manufactured through operation and optimization experiments of PIR, ultrasonic, voice recognition, and shock sensor for the manufactured voice recognition kiosk. Finally, it was confirmed that security can be strengthened by using shock sensors and emergency bells to enhance security.

A Study on the Planting Design for the Renewal of Urban Neighborhood Park - In Case of Okgu Neighborhood Park, Siheung, Gyeonggi-do, Korea - (도시근린공원 리뉴얼을 위한 식재디자인 연구 - 경기도 시흥시 옥구공원을 대상으로 -)

  • Lee, Sang-Man;Jeong, Moon-Soon;Han, Bong-Ho;Park, Seok-Cheol
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.47 no.1
    • /
    • pp.88-103
    • /
    • 2019
  • This paper aims to identify planting design for the renewal of Okgu Park, located in Siheung, Gyeonggi-do. I designate planting concept fit spatial functions and also suggest planting designs that are proper for a growth environment. The spatial functions of the research site are divided on the basis of the park facilities, its surroundings, and usage. To understand the planting concept, this paper looks into the distribution of plant species and the precise planting structure. To understand the planting concept and the current usage of shade space in the park, I examine the distribution of plant species and the precise planting structure. There are 48 kinds of plants, with Zoysia japonica area (28.84%), Prunus yedoensis (8.0%), Pinus thunbergii (6.73%) and Zelkova serrata (6.38%) taking up the majority. 27 places were chosen for researching the precise planting structure. The research shows that the average green coverage ratio is 38.14% and the average green capacity coefficient is $0.72m^3/m^2$. The growth defective rate of trees in the shade areas is estimated by averaging the classified growth conditions of individual trees per block of shade areas. Areas with an inferior environment for growth and low spatial usage in Okgu Park are selected as subjects for planting design. After comparing the spatial functions with planting concepts and analyzing the growth of plants, I identify $36,236m^2$ areas with inferior growth condition. I also examine structures and the surrounding areas to find areas that require urgent planting improvement, specifically identifying landscape space and shade space around the fountain and the buffer space nearby the North gate. I rearrange spatial functions in the selected areas to devise a planting design considering the existing vegetation, layer structure, and its usage. I set the planting concept and direction to improve the landscape of the selected areas through implementing a planting design so the park users can be satisfied with each space.

Establishment of the Room Acoustic Criteria for the Korean Traditional Music Halls Using Subjective Listening Tests (청감실험에 의한 국악당의 음향설계조건 설정)

  • Haan, Chan-Hoon;Shin, Jic-Su
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.7
    • /
    • pp.343-352
    • /
    • 2007
  • The present study aims to investigate the design standard for acoustic criteria of Korean traditional music which could be used for the design of Korean traditional music halls. In order to do this, subjective listening tests were undertaken to musicians using auralized sounds which were convolved with the impulse response of traditional instruments recorded in an anechoic chamber. 94 pairs of sound were made which have different value of acoustic parameters including RT, BR, Brilliance, G, C80, ITDG, IACC. A paired comparison method(PCM) was used to analyze the results from the subjective listening tests. The results show that the preference of acoustic criteria for the Korean traditional music is far different from those of western music. As a result, specific range of acoustic criteria were suggested for the appropriate acoustic conditions of Korean traditional music. Also, a guideline of the acoustic design of halls for performing the Korean traditional music was suggested which could be used as a basic reference in the future works.

A Study on a Elevator Emergency Call Device System and Performance Evaluation based on ICT for Efficient Handling in Emergency Situation (위급상황 시 효율적인 대처를 위한 ICT 기반의 엘리베이터 비상통화장치 시스템 및 성능 비교 연구)

  • Jung, Se-Hoon;Park, Sung-Kyun;Park, Hong-Jun;So, Won-Ho;Park, Dong-Kook;Sim, Chun-Bo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.4
    • /
    • pp.449-459
    • /
    • 2015
  • A lot of people were trapped in elevators without power supply when BLACK-OUT situation occurred in 2011. The telephone network of control room connected to the elevators had problem operating poorly. In this paper we propose an ICT based elevator emergency call device prototype system and evaluate the performance of the system. The proposed system quickly responds in emergency situation to guarantee passenger safety. For the goal, firstly the system tries to connect to a control room. If it fails the system attempts to call numbers for emergency contact and a rescue team sequentially. The system is designed to quickly support emergency contact as well. Finally, the information of elevator failure is rapidly transferred to the failure process device by the proposed system.

Topic Continuity in Naturalistic Speech Data by Korean High-Functioning Autistic Children (한국 고기능 자폐 아동의 자연발화에 나타난 주제 지속성)

  • Jee, Min-Jung;Hong, Eun-Mi;Song, Young-Wan;Park, Sun-Eon;Cho, Sook-Whan
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2005.05a
    • /
    • pp.261-266
    • /
    • 2005
  • 본 논문은 고기능 자폐 아동들이 담화 주제어 연속성(topic continuity)을 어떻게 습득하는지에 대해 검토하였다. 연구의 목적을 위하여 세 고기능 자폐 아동(9;11-12:2)의 자연 발화를 관찰 분석하였다. 사전 연구에 의하면, 자폐아동들은 의사소통의 기본적인 규칙을 잘 이해하지 못할 뿐만 아니라 타인의 사고와 기대 등에 민감하지 못하여 담화 주제를 적절히 유지하거나 전환하는(topic shift) 일에 많은 어려움을 겪는다. 본 연구는 한국 자폐아동들이 주제어의 유지와 전환 등, 담화 화용적(discourse-pragmatic) 기능의 발달 양상을 규명하는 것을 주요 목표로 한다. 본 연구의 자료는 세 자폐 아동의 자연 발화 내용으로서 1주-2주에 한번씩 매번 방문 시 120분 동안 녹음하였다. 분석 결과 다음의 몇 가지 습득 양상을 발견하였다. 첫째, 세 자폐 아동들에게서 발견된 주제 유지 빈도는 정상 아동들에 비해 낮았다. 한편. 이 아이들은 가끔 화제를 자신의 담화 주제로 돌려 자신의 주제 중심으로 대화를 지속했다. 이 아동들은 대화 상대자의 주제에 대해서는 민감하지 않지만 자신의 주제를 유지하려는 경향을 보이기도 한다. 둘째, 개별 아동을 검토한 결과, 담화 주제의 지속성이 높은 발화를 하는 아동은 현재 담화 주제에 더 민감하고 반향어를 산출할 때에도 자기 자신의 말 반복과 담화 상대자의 말을 반복하는 빈도가 별로 차이가 나지 않았다. 반면, 담화 주제의 지속성이 낮은 발화를 하는 아동은 이전 담화 주제에 더 민감하고, 반향어는 담화 상대자의 말 보다는 자기 자신의 말을 반복하는 비율이 더 높았다. 본 연구의 결과는 자폐 아동들이 담화 주제를 지속하는 능력이 많이 부족하지만, 담화 주제의 연속성은 다른 발화 유형과 상호 작용을 하면서 발달될 수 있다는 가능성을 보여 주었다. 따라서 본 연구 결과는 앞으로 자폐 아동의 연구가 집단 간의 연구뿐만 아니라 개별 아동의 발화에 쓰인 유형 간의 상호 관계를 주목함으로써 자폐 아동의 개별적 언어 치료에 새로운 시각을 심어 줄 가능성을 시사한다.

  • PDF

Using DSLR Camera for Digital Film Making (영화제작에서 DSLR 카메라의 활용성에 관한 연구)

  • Son, Bo-Wook;Min, Kyung-Won
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.5
    • /
    • pp.81-90
    • /
    • 2012
  • Since the Canon EOS 5D Mark II with which the Full HD video shooting is possible was launched in 2008, the utilization of the DSLR cameras has been increasing in the video production field. In this thesis, the shortages and advantages of the video functions that the 5D Mark II cameras have will be analysed and they will be compared with the RED cameras that are most widely used in the video production field today. Through this, the utilization of the DSLR camera in the film production field will be investigated. The DSLR camera has the advantage of having good clear picture since it uses the image sensor of big size, and of being able to utilize the various lenses of good quality, and is small in size and light in weight compared to the conventional HD cameras. Although, there are some limitations that there are parts to be improved such as the sound recording problems and development of various additional equipments, the excellent usage that the DLSR cameras have is presenting a new possibility for the film production.