• Title/Summary/Keyword: 소리 인식

Search Result 213, Processing Time 0.028 seconds

음성인식 기반 인터렉티브 미디어아트의 연구 - 소리-시각 인터렉티브 설치미술 "Water Music" 을 중심으로-

  • Lee, Myung-Hak;Jiang, Cheng-Ri;Kim, Bong-Hwa;Kim, Kyu-Jung
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.354-359
    • /
    • 2008
  • This Audio-Visual Interactive Installation is composed of a video projection of a video Projection and digital Interface technology combining with the viewer's voice recognition. The Viewer can interact with the computer generated moving images growing on the screen by blowing his/her breathing or making sound. This symbiotic audio and visual installation environment allows the viewers to experience an illusionistic spacephysically as well as psychologically. The main programming technologies used to generate moving water waves which can interact with the viewer in this installation are visual C++ and DirectX SDK For making water waves, full-3D rendering technology and particle system were used.

  • PDF

The cinematic interpretation of pansori and its transformation process (판소리의 영화적 해석과 변모의 과정)

  • Song, So-ra
    • (The) Research of the performance art and culture
    • /
    • no.43
    • /
    • pp.47-78
    • /
    • 2021
  • This study was written to examine the acceptance of pansori in movies based on pansori, and to explore changes in modern society's perception and expectations of pansori. A pansori is getting the love of the upper and lower castes in the late Joseon period, but loses the status at the time of the Japanese colonial rule and Korean War. In response, the country designated pansori as an important intangible cultural asset in 1964 to protect the disappearance of pansori. Until the 1980s, however, pansori did not gain popularity by itself. After the 2000s, Pansori tried to breathe in with the contemporary public due to the socio-cultural demand to globalize our culture. And now Pansori is one of the most popular cultures in the world today, as the pop band Feel the Rhythm of KOREA shows. The changing public perception of pansori and its status in modern society can also be seen in the mass media called movies. This study explored the process of this change with six films based on pansori, from "Seopyeonje" directed by Lim Kwon-taek in 1993 to the film "The Singer" in 2020. First, the films "Seopyeonje" and "Hwimori" were produced in the 1990s. Both of these films show the reality of pansori, which has fallen out of public interest due to the crisis of transmission in the early and mid-20th century. And in the midst of that, he captured the scene of a singer struggling fiercely for the artistic completion of Pansori itself. Next, look at the film "Lineage of the Voice" in 2008 and "DURESORI: The Voice of East" in 2012. These two films depict the growth of children who perform art, featuring contemporary children who play pansori and Korean traditional music. Pansori in these films is no longer an old piece of music, nor is it a sublime art that is completed in harsh training. It is only naturally treated as one of the contemporary arts. Finally, "The Sound of a Flower" in 2015 and "The Singer" in 2020. The two films constructed a story from Pansori's history based on the time background of the film during the late Joseon Dynasty, when Pansori was loved the most by the people. This reflects the atmosphere of the times when traditions are used as the subject of cultural content, and shows the changed public perception of pansori and the status of pansori.

Classification of General Sound with Non-negativity Constraints (비음수 제약을 통한 일반 소리 분류)

  • 조용춘;최승진;방승양
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.10
    • /
    • pp.1412-1417
    • /
    • 2004
  • Sparse coding or independent component analysis (ICA) which is a holistic representation, was successfully applied to elucidate early auditor${\gamma}$ processing and to the task of sound classification. In contrast, parts-based representation is an alternative way o) understanding object recognition in brain. In this thesis we employ the non-negative matrix factorization (NMF) which learns parts-based representation in the task of sound classification. Methods of feature extraction from the spectro-temporal sounds using the NMF in the absence or presence of noise, are explained. Experimental results show that NMF-based features improve the performance of sound classification over ICA-based features.

Common ASR Interface format for increasing usability of cloud-based ASR services. (클라우드 기반 음성인식 서비스 활용도 향상을 위한 음성인식 공통 인터페이스 표준 포맷)

  • Oh, Jung-Sup;Lee, Byung-Hoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.423-425
    • /
    • 2022
  • 음성인식은 컴퓨터가 사람의 언어를 이해하여, 소리로 발화하는 사람의 음성을 인식하여 텍스트로 바꾸는 과정을 의미하며, 최근 활용도가 높아지고 있다. 음성인식 엔진은 얼마나 많은 학습데이터를 기반으로 훈련을 했느냐에 따라서 그 성능이 결정되기 때문에, 자신의 서비스 에 맞는 음성인식 엔진을 적절히 선택할 수 있어야 한다. 음성인식 엔진의 성능이 수시로 변경될 수 있기 때문에 표준 인터페이스를 빠른 개발을 진행할 수 있도록 표준 포맷을 제안하였다.

Reference Channel Input-Based Speech Enhancement for Noise-Robust Recognition in Intelligent TV Applications (지능형 TV의 음성인식을 위한 참조 잡음 기반 음성개선)

  • Jeong, Sangbae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.2
    • /
    • pp.280-286
    • /
    • 2013
  • In this paper, a noise reduction system is proposed for the speech interface in intelligent TV applications. To reduce TV speaker sound which are very serious noises degrading recognition performance, a noise reduction algorithm utilizing the direct TV sound as the reference noise input is implemented. In the proposed algorithm, transfer functions are estimated to compensate for the difference between the direct TV sound and that recorded with the microphone installed on the TV frame. Then, the noise power spectrum in the received signal is calculated to perform Wiener filter-based noise cancellation. Additionally, a postprocessing step is applied to reduce remaining noises. Experimental results show that the proposed algorithm shows 88% recognition rate for isolated Korean words at 5 dB input SNR.

Study on the multi-functional Cradle by Voice Recognitions (다기능성을 가진 음성 인식 요람 연구)

  • Park, Kwang-Sung;Ahn, Sang-jin;Cho, Kyeong-Rok;Choi, Si-On;Park, Yong-Wook
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.12 no.4
    • /
    • pp.701-706
    • /
    • 2017
  • In this study, existing remote control or the cradle manually drives to recognize the voice of the way and through the app the Cradle to work with a motor. In addition, the temperature and humidity sensor was mounted in the cradle, the temperature and humidity of the cradle can be checked through the LCD. Depending on the sound size of the sound sensor, the resulting value was used to indicate a value of a, b, c, and the sum of the results over 1150, the cradle was recognized as the baby's crying, then, notificate and alarm on app.

Effects of Reading Aloud on International Students' English Formulaic Sequences Learning (소리 내어 읽기가 유학생의 영어 정형화 배열 학습에 미치는 영향)

  • Lee, Ji-Hyun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.1
    • /
    • pp.341-348
    • /
    • 2022
  • Formulaic sequences are continuous or discontinuous series of words that are seemingly treated like single units. Formulaic sequences play a key role in language development, and formulaic sequences acquisition determines the success or failure of language development. This study proposes a reading aloud activity as a way for international students to learn formulaic sequences. A class focused on reading aloud was conducted with 41 international students taking a general English course at a university in Seoul. For 15 weeks, video lectures and real-time Zoom classes were conducted in parallel. The animated film Frozen was used as course material. In the video lectures, the teacher interpreted the movie script in easy Korean and read aloud formulaic sequences. Students were tasked with reading the sentences with formulaic sequences aloud, recording themselves reading aloud, and submitting their recordings. During real-time class meetings, students performed the activity of reading aloud the formulaic sequences they had studied in the video lectures. There was a significant increase in the interpretation and sentence writing of formulaic sequences in participants' post-evaluation compared to the pre-evaluation. Through the study's survey, students exhibited positive views in the affective domains.

The Korean's Sound Recognition Impressed in Ancient Sijo (고시조에 표현된 한국인의 소리인식 조사에 관한 연구)

  • Lee, Tai-gang;Jang, Gil-Soo
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.15 no.6 s.99
    • /
    • pp.724-730
    • /
    • 2005
  • Literary works contain various human emotion and historical, cultural background. It is very significant to understand sound recognition and receptions represented in many literary works. This study aims to investigate the sound impression on ancient Korean Sijo( Korean Verse) involved various traditional korean emotion, which were expressed in different situations. Firstly we selected the appropriate Sijo to express sounds, and then classified the sound, analyzed the meaning of recognition to the sound. The number of 297 sounds were classified into 13 categories, and 20 emotional meanings. Especially, 'internal sadness' characterized the korean rooted emotion were more expressed than other meanings and this meaning were symbolized by the sound of wild geese and cuckoos.

Development of Sound Information Visualization Glasses for the Hearing Impaired (청각장애인을 위한 사운드 정보 시각화 안경의 개발)

  • Lee, Gye-hwan;Kim, In-hyun;Lee, Jun-ho;Lee, Jeong-hoon;Hwang, Kwang-il
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.10a
    • /
    • pp.656-659
    • /
    • 2018
  • 통계적으로 일반인보다 청각장애인의 교통 사고율이 높은 것으로 나타나는데, 이는 청각 장애로 대표되는 차량을 포함한 위험 요소를 인식하기 힘든 상태나 조건에서 기인한다. 자동차가 접근하는 등의 소리를 듣지 못한다는 것은 결국 어떠한 위치에 위험요소가 존재하는지 인지하지 못함에 따라 사고로 이어질 가능성이 존재함을 의미하는데 이러한 문제점을 개선함과 동시에 대화중인 사람의 목소리를 시각화하여 정보를 제공함으로써 청각장애인으로 하여금 더 안전하고 쾌적한 삶을 누리게 하는 것이 청각장애인을 위한 사운드 정보 시각화 안경의 개발 목적이다. 위와 같은 배경을 통해 딥 러닝 기술에 기반하여 분류 과정을 거친 소리 정보의 판별을 통해 위험 요소를 인식한 후 시각화 하여 정보를 제공하는 디바이스를 제안한다.

An ambient display for hearing impaired people (청각 장애인을 위한 소리 시각화 시스템)

  • Kim, Dae-Seok;Lee, Tae-Wha;Lee, Dong-Man;Park, Jin-Ah;Hahn, Min-Soo
    • 한국HCI학회:학술대회논문집
    • /
    • 2006.02a
    • /
    • pp.46-51
    • /
    • 2006
  • 청각 장애인은 집에서 발생하는 여러 가지 소리나 가전 제품의 신호를 감지하지 못하므로 생활의 불편을 상당히 느끼고 있다. 이러한 사람들을 위해 소리 정보를 시각 정보로 변경하여, 사용자들의 시야에 보여주는 것을 목적으로 연구를 시작하였다. 본 연구에서는 집이라는 환경에서 사용자의 위치와 오리엔테이션 정보를 습득하여, 사용자에게 필요한 정보를 시야에 들어오는 범위에 방해되거나 불편하지 않게 표시하는 시스템을 제안한다. 프로젝터에 부착된 카메라를 이용하여 사용자를 인식하고, 사용자를 따라다니며 화면을 디스플레이 하는 기존 방법의 단점들을 해결하기 위해 위치 센서로 사람의 위치와 방향을 파악하여 사용자에게 필요한 정보를 사용자가 현재 바라보는 곳에 디스플레이 하는 방법을 제안한다. 3D 모델로 제작된 집의 구조를 이용하여, 프로젝터의 방향과 초점 제어를 사전에 계산하여 보다 정확한 위치에 정보가 디스플레이 되도록 하였다. 본 논문에서 제안하는 방법이 기존의 PDA 나 PC 모니터를 이용해 정보를 제공하는 방법보다 사용자들이 정보를 인지하는 데 걸리는 시간이 좀더 빠르고 이 방법을 선호하기 때문에, 청각 장애인에게 정보를 제공하는 시스템으로 적합하다는 결론을 도출하였다.

  • PDF