Search | Korea Science

Adaptive TCX Windowing Technology for Unified Structure MPEG-D USAC

Lee, Tae-Jin;Beack, Seung-Kwon;Kang, Kyeong-Ok;Kim, Whan-Woo
- ETRI Journal / v.34 no.3 / pp.474-477 / 2012
The MPEG-D unified speech and audio coding (USAC) standardization process was initiated by MPEG to develop an audio codec that provides consistent quality for mixed speech and music content. The current USAC reference model structure consists of frequency domain (FD) and linear prediction domain (LPD) core modules and is controlled using a signal classifier tool. In this letter, we propose an LPD single-mode USAC structure using an adaptive windowing-based transform-coded excitation module. We tested our system using official test items for all mono-evaluation modes. The experimental results show that the objective and subjective performances of the proposed single-mode USAC system are better than those of the FD/LPD dual-mode USAC system.
https://doi.org/10.4218/etrij.12.0211.0404

Effects of Group Piano Playing on International Marriage Immigrant Women's Self-Efficacy; A Case Study (국제결혼 이주여성의 자기효능감 증진을 위한 집단 피아노 연주 활동 사례 연구)

Woo, Hyun Jung
- Journal of Music and Human Behavior / v.7 no.2 / pp.1-22 / 2010
The purpose of this study is to examine the effect of a group piano playing program, offered as part of a psycho-emotional support program, on the self-efficacy of international marriage immigrant women. For this case study, five international marriage immigrant women from "N Multi-culture Supporting Agency" agreed to participate. The program was implemented over ten sessions, held once a week for ten weeks. To examine the program's effects on the participants' self-efficacy, both quantitative and qualitative methods were used in a mixed methods research design. The change in the participants' average self-efficacy score was statistically significant (p < .05). In addition, qualitative analysis of the participants' verbal and behavioral responses showed positive changes in their level of self-efficacy. These results suggest that group piano playing can serve as an effective intervention to promote the self-efficacy of international marriage immigrant women.

Towards Low Complexity Model for Audio Event Detection

Saleem, Muhammad;Shah, Syed Muhammad Shehram;Saba, Erum;Pirzada, Nasrullah;Ahmed, Masood
- International Journal of Computer Science & Network Security / v.22 no.9 / pp.175-182 / 2022
In daily life we encounter many types of information, for example multimedia and text. We rely on different types of information in our routines, such as watching or reading the news, listening to the radio, and watching videos. However, problems can arise when a particular type of information is required. For example, a listener who wants to hear jazz on the radio may find that every channel plays pop music mixed with advertisements, get stuck with pop music, and give up searching for jazz. An automatic audio classification system can solve this problem. Deep learning (DL) models could make life easier through audio classification, but they are expensive and difficult to deploy on edge devices such as the Nano BLE Sense and Raspberry Pi because they require substantial computational power, such as a graphics processing unit (GPU). To address this problem, we propose a low-complexity DL model for audio event detection (AED). We extracted Mel-spectrograms of dimension 128×431×1 from the audio signals and applied normalization. Three data augmentation methods were applied: frequency masking, time masking, and mixup. We designed a convolutional neural network (CNN) with spatial dropout, batch normalization, and separable 2D convolutions, inspired by VGGnet [1]. We also reduced the model size by applying float16 quantization to the trained model. Experiments were conducted on the updated dataset provided by the Detection and Classification of Acoustic Scenes and Events (DCASE) 2020 challenge. Our model achieved a val_loss of 0.33 and an accuracy of 90.34% with a model size of 132.50 KB.
https://doi.org/10.22937/IJCSNS.2022.22.9.26
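The three augmentation methods named in the abstract above (frequency masking, time masking, and mixup) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the mask-width limits, and the representation of a spectrogram as a list of mel-bin rows are all assumptions made for illustration, and the sketch uses only the Python standard library.

```python
import random


def frequency_mask(spec, max_width, rng):
    """Zero a random band of consecutive mel bins (rows). Hypothetical helper."""
    n_mels = len(spec)
    width = rng.randint(0, max_width)          # assumed limit on mask height
    start = rng.randint(0, n_mels - width)
    return [
        [0.0] * len(row) if start <= i < start + width else list(row)
        for i, row in enumerate(spec)
    ]


def time_mask(spec, max_width, rng):
    """Zero a random span of consecutive time frames (columns). Hypothetical helper."""
    n_frames = len(spec[0])
    width = rng.randint(0, max_width)          # assumed limit on mask length
    start = rng.randint(0, n_frames - width)
    return [
        [0.0 if start <= t < start + width else v for t, v in enumerate(row)]
        for row in spec
    ]


def mixup(spec_a, label_a, spec_b, label_b, lam):
    """Blend two examples and their one-hot labels with weight lam in [0, 1]."""
    mixed_spec = [
        [lam * a + (1.0 - lam) * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(spec_a, spec_b)
    ]
    mixed_label = [lam * a + (1.0 - lam) * b for a, b in zip(label_a, label_b)]
    return mixed_spec, mixed_label
```

In practice the mixup weight is commonly drawn from a Beta distribution, for example `lam = rng.betavariate(0.2, 0.2)`; the value 0.2 here is an assumption and is not taken from the abstract.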

Convergence Characteristics of Contemporary Musical Vocal Techniques - Focusing on the Analysis of 'The Girl in 14G' - (현대 뮤지컬 보컬 테크닉의 융합적 특징 - 'The Girl in 14G' 분석을 중심으로 -)

Lee, Eun-Hye
- Journal of Korea Entertainment Industry Association / v.15 no.4 / pp.157-166 / 2021
The purpose of this study is to understand the characteristics of contemporary vocalization and songs so that various vocal methods can be learned and taught to students in musical vocal classes. Musical vocalization methods change and evolve according to the demands of the times. Today, contemporary musicals cannot be limited to any one genre; both the music and the style of a work draw on several coexisting genres. 'The Girl in 14G,' the subject of this study, is a song from an album by Kristin Chenoweth, a famous American musical actress who uses various vocal techniques. Jeanine Tesori composed the song with various vocal techniques, such as Classical, Jazz, Belting, and Mixed Voice, to express New York's representative music genres: Broadway musical, Metropolitan Opera, and East Village jazz. The song is demanding, as one actor must portray three different characters in three musical styles and singing methods. Singing 'The Girl in 14G' requires a great deal of effort and practice to acquire the various vocal techniques, which makes it a good teaching text for students and actors. This study confirmed that the song is a representative piece with a solid musical and dramatic composition and a good example of the convergence characteristics of contemporary musical vocal techniques.
https://doi.org/10.21184/jkeia.2021.6.15.4.157

Allegory in Lady Gaga's Fashion Style (Part 1) (Lady Gaga 패션스타일에 나타난 알레고리 연구(제1보))

Kim, Hyang-Ja;Kwon, Mi-Jeong
- Fashion & Textile Research Journal / v.14 no.4 / pp.519-531 / 2012
This study examines the various expressions of Lady Gaga's fashion style based on Craig Owens's theory of allegory. It analyzes four application elements (Borrow, Site Specificity, Accumulate of Strategy, and Hybridization) and studies the aesthetic values of Lady Gaga, an influential popular culture icon, as classified through the external representation of her fashion style. The results are summarized as follows. First, 'Borrow' draws on the music and fashion style of singers of the 1980s, her elders, and pays visual homage to shock artists. It gave her fans a different viewpoint on a star's fashion, which subsequently resulted in deformation of form, a playful kitsch style, and mixed gender. Second, 'Site specificity' presents an extreme makeover through an intentional and grotesque fashion style to extend physical territory and defenselessness. The results remove stereotypes and reveal deconstructive performances. Third, 'Accumulate of strategy' simultaneously presents voluptuous beauty, futurism, and avant-garde style, showing a countercultural tendency through the random repetition of fashion images and layered coordination. Finally, 'Hybridization' presents multiple fashion styles through collaboration with world-famous designers and cosmetic brands. She expressed a diverse and complex fashion style composed of an art form that combines a high-tech cyborg image. The aesthetic values of Lady Gaga's fashion style are 'ambivalence virtuality', 'transcendental mixed gender', 'plural textuality', and 'unexpected play culture'.
https://doi.org/10.5805/KSCI.2012.14.4.519

A Study of Representation of Jong-no and Bon-jung in Modern Boy and Assassination : Focusing on the Post-colonialism (<모던보이>와 <암살>의 본정과 종로 재현 연구 -탈식민주의를 중심으로-)

Chin, Su-Mee
- The Journal of the Korea Contents Association / v.19 no.7 / pp.234-245 / 2019
In this paper, I examine the representation of post-colonialism, focusing on the spaces in Modern Boy and Assassination. These movies represent Bon-jung and Jong-no as mixed-residence quarters, going beyond the dual city theory, the orthodoxy of geography. This can be interpreted as the birth of a hybrid subject in post-colonialism. The representation of Bon-jung in Modern Boy is centered on the Mitsukoshi Department Store Rooftop Garden, Namsan Music Center, and Myeongdong Cathedral. The representation of Bon-jung in Assassination is centered on Anemone Cafe and Mitsukoshi Department Store. Set against the history of the new building of the Japanese Government General of Korea in Jong-no, Modern Boy uses it as a place of struggle. The representation of Jong-no in Assassination is centered on the mansion of Kang In-kuk, a pro-Japanese collaborator. Modern Boy and Assassination show a post-colonialism that breaks through modern binary oppositions by means of a 'female' national heroine. By describing Bon-jung as both a mixed-residence quarter and the original home of the post-colonial movement, they also show an aspect different from existing Kyung-sung representations.
https://doi.org/10.5392/JKCA.2019.19.07.234

Saseol-sijo singing aspect of current Gagok (현행 가곡의 사설시조 가창 양상)

Kim, Young-Woon
- Sijohaknonchong / v.43 / pp.5-39 / 2015
Sijo (a Korean poetic form) is a representative genre of short poetry among the literary works of Korea in the late Joseon Dynasty. The standard form, Normal-Sijo, has 3 verses, 6 sections, and 12 sounds, and the lyrics of one Normal-Sijo run to about 45 characters. In Saseol-sijo, a type of Sijo, some works exceed 100 characters because the number of lyrics is greatly increased. Some Saseol-sijo works have a 'solemn and elegant feeling', borrowing verses from Chinese poems and using a large Chinese vocabulary, but many works have 'salacious and explicit contents'. As a literary work, Sijo is used as lyrics for vocal music in Gagok (a genre of Korean vocal music for mixed female and male voices) and Sijochang, and in many cases the same Sijo poem is used as lyrics for both Gagok and Sijochang. However, the Gagok pieces that use Saseol-sijo as lyrics are mainly songs with a 'solemn feeling' rather than 'salacious works'. This study confirms that most Gagok pieces handed down to the present that sing Saseol-sijo use lyrics with a 'solemn and elegant feeling', and looks into why Saseol-sijo with 'salacious and explicit contents' are hard to use as Gagok lyrics. The most important of these reasons seems to be the irregularly increased number of lyrics and the need to fit the accompaniment. Gagok is accompanied by a number of instruments, and its fixed melody is recorded and handed down in score. It is therefore almost impossible to perform unless the song melody and accompaniment melody have been fixed in advance for the chosen lyrics. In addition, literary works are usually appreciated privately through private reading, whereas Gagok is performed publicly in an open space for many people. In particular, under the social system and customs of the late Joseon Dynasty, it would have been hard to sing a salacious and explicit song before a gathering of men and women of different social status. This study confirms that the folksy and popular character praised as a literary characteristic of Saseol-sijo cannot easily be found in the Saseol-sijo sung as Gagok.

Interactive Music Player using Augmented Reality (증강현실을 이용한 상호작용 음악 플레이어)

Lee, Jae-Young;Kim, Jae-Shin;Han, Jae-Hyun;Kim, Tae-Yong;Choi, Jong-Soo
- Proceedings of the HCI Society of Korea Conference (한국HCI학회:학술대회논문집) / 2006.02a / pp.1149-1154 / 2006
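Augmented reality (AR) is a technology that inserts virtual objects into real space through a camera; by augmenting information on the computer, it lets users obtain additional information about the environment seen through the camera. Technologies that use virtual reality (VR) and mixed reality (MR) to bring more realistic virtual imagery into everyday life are gaining attention and are being actively researched, and such attempts are being applied in various areas of daily life. In this paper, we implement a music player that the user can control through movement on the camera-captured screen, that is, by inserting and manipulating a target marker. We propose a method that lets the user play the desired music through the movement of a marker seen by the camera rather than through input devices such as a keyboard or mouse. The method finds the movement of the target marker in camera frames captured in real time, augments information on top of the target object, and controls the music through the user's marker movement on screen.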

Enhanced Spectral Hole Substitution for Improving Speech Quality in Low Bit-Rate Audio Coding

Lee, Chang-Heon;Kang, Hong-Goo
- The Journal of the Acoustical Society of Korea / v.29 no.3E / pp.131-139 / 2010
This paper proposes a novel spectral hole substitution technique for low bit-rate audio coding. The spectral holes that frequently occur in relatively weak energy bands due to zero-bit quantization result in severe quality degradation, especially for harmonic signals such as speech vowels. The enhanced aacPlus (EAAC) audio codec artificially adjusts the minimum signal-to-mask ratio (SMR) to reduce the number of spectral holes, but it still produces noisy sound. The proposed method selectively predicts the spectral shapes of hole bands using either intra-band correlation, i.e., harmonically related coefficients nearby, or inter-band correlation, i.e., previous frames. For bands that have low prediction gain, only the energy term is quantized, and the spectral shapes are replaced by pseudo-random values in the decoding stage. To minimize perceptual distortion caused by spectral mismatching, the criterion of the just noticeable level difference (JNLD) and the spectral similarity between original and predicted shapes are adopted for quantizing the energy term. Simulation results show that the proposed method, implemented in the EAAC baseline coder, significantly improves speech quality at low bit-rates while keeping equivalent quality for mixed and music content.
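The decoder-side fallback described above, where a hole band with low prediction gain keeps only its energy and receives a pseudo-random spectral shape, might look roughly like the following. This is a sketch under stated assumptions, not the paper's algorithm: the function name `fill_hole_band`, the uniform noise distribution, and the treatment of the energy as an already-dequantized sum of squares are all hypothetical.

```python
import math
import random


def fill_hole_band(band_width, band_energy, rng):
    """Return pseudo-random coefficients whose total energy equals band_energy.

    band_energy is assumed to be the dequantized sum of squared coefficients
    for the band; the noise distribution (uniform) is an arbitrary choice.
    """
    noise = [rng.uniform(-1.0, 1.0) for _ in range(band_width)]
    noise_energy = sum(v * v for v in noise)
    if noise_energy == 0.0:
        # Degenerate draw: nothing to scale, so leave the band empty.
        return [0.0] * band_width
    scale = math.sqrt(band_energy / noise_energy)
    return [v * scale for v in noise]
```

Scaling the noise to the transmitted energy keeps the band's level close to the original while spending bits only on the energy term, which is the trade-off the abstract describes for low-prediction-gain bands.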

Multi-channel Speech Enhancement Using Blind Source Separation and Cross-channel Wiener Filtering

Jang, Gil-Jin;Choi, Chang-Kyu;Lee, Yong-Beom;Kim, Jeong-Su;Kim, Sang-Ryong
- The Journal of the Acoustical Society of Korea / v.23 no.2E / pp.56-67 / 2004
Despite abundant research on blind source separation (BSS) in many types of simulated environments, its performance is still not satisfactory for application in real environments. The major obstacles appear to be the finite filter length of the assumed mixing model and nonlinear sensor noise. This paper presents a two-step speech enhancement method with multiple microphone inputs. The first step performs a frequency-domain BSS algorithm to produce multiple outputs without any prior knowledge of the mixed source signals. The second step further removes the remaining cross-channel interference by a spectral cancellation approach using a probabilistic source absence/presence detection technique. The desired primary source is detected in every frame of the signal, and the secondary source is estimated in the power spectral domain using the other BSS output as a reference interfering source. The estimated secondary source is then subtracted to reduce the cross-channel interference. Our experimental results show good separation enhancement performance on real recordings of speech and music signals compared to conventional BSS methods.
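The final subtraction step of the second stage can be sketched for a single frame as follows. This is a simplified illustration rather than the paper's method: it omits the probabilistic source absence/presence detection entirely, and the function name, the `over_subtraction` factor, and the spectral `floor` are hypothetical parameters introduced here.

```python
def subtract_interference(primary_power, interference_power,
                          over_subtraction=1.0, floor=0.01):
    """Subtract an interference power estimate from one frame, bin by bin.

    primary_power: power spectrum of the BSS output holding the desired source.
    interference_power: power estimate of the secondary source, assumed to be
        derived from the other BSS output.
    The result is floored at a fraction of the input power so that no bin
    becomes negative.
    """
    cleaned = []
    for p, q in zip(primary_power, interference_power):
        residual = p - over_subtraction * q
        cleaned.append(max(residual, floor * p))
    return cleaned
```

Flooring each bin instead of clipping to zero is a common way to limit the artifacts that plain power subtraction can introduce; whether the paper uses a floor is not stated in the abstract.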

