• Title/Summary/Keyword: 소리 분석

Search Result 595, Processing Time 0.028 seconds

Visualization of Korean Speech Based on the Distance of Acoustic Features (음성특징의 거리에 기반한 한국어 발음의 시각화)

  • Pok, Gou-Chol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.3
    • /
    • pp.197-205
    • /
    • 2020
  • Korean language has the characteristics that the pronunciation of phoneme units such as vowels and consonants are fixed and the pronunciation associated with a notation does not change, so that foreign learners can approach rather easily Korean language. However, when one pronounces words, phrases, or sentences, the pronunciation changes in a manner of a wide variation and complexity at the boundaries of syllables, and the association of notation and pronunciation does not hold any more. Consequently, it is very difficult for foreign learners to study Korean standard pronunciations. Despite these difficulties, it is believed that systematic analysis of pronunciation errors for Korean words is possible according to the advantageous observations that the relationship between Korean notations and pronunciations can be described as a set of firm rules without exceptions unlike other languages including English. In this paper, we propose a visualization framework which shows the differences between standard pronunciations and erratic ones as quantitative measures on the computer screen. Previous researches only show color representation and 3D graphics of speech properties, or an animated view of changing shapes of lips and mouth cavity. Moreover, the features used in the analysis are only point data such as the average of a speech range. In this study, we propose a method which can directly use the time-series data instead of using summary or distorted data. This was realized by using the deep learning-based technique which combines Self-organizing map, variational autoencoder model, and Markov model, and we achieved a superior performance enhancement compared to the method using the point-based data.

Selection of Auditory Icons in Ship Bridge Alarm Management System Using the Sensibility Evaluation (감성평가를 이용한 선교알람관리시스템의 청각아이콘 평가)

  • Oh, Seungbin;Jang, Jun-Hyuk;Park, Jin Hyoung;Kim, Hongtae
    • Journal of Navigation and Port Research
    • /
    • v.37 no.4
    • /
    • pp.401-407
    • /
    • 2013
  • In parallel with the development of ship equipment, bridge systems have been improved, but marine accidents due to human error have not been reduced. Recently, research in nautical bridge equipment has focused on suitable ergonomic designs in order to reduce these errors due to human factors. In a bridge of a ship, there are numerous auditory signals that deliver important information clearly to the sailors. However, only a few studies have been conducted related to the human recognition of these auditory signals. There are three types of auditory signals: voice alarms, abstract sounds, and auditory icons. This study was conducted in order to design more appropriate auditory icons using a sensibility evaluation method. The auditory icons were rated to have five warning situations (engine failure, fire, steering failure, low power, and collision) using the Semantic Differential Method. It is expected that the results of this study will be used as basic data for auditory displays inside bridges and for integrated bridge alarm systems.

Multimodal Emotional State Estimation Model for Implementation of Intelligent Exhibition Services (지능형 전시 서비스 구현을 위한 멀티모달 감정 상태 추정 모형)

  • Lee, Kichun;Choi, So Yun;Kim, Jae Kyeong;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.1-14
    • /
    • 2014
  • Both researchers and practitioners are showing an increased interested in interactive exhibition services. Interactive exhibition services are designed to directly respond to visitor responses in real time, so as to fully engage visitors' interest and enhance their satisfaction. In order to install an effective interactive exhibition service, it is essential to adopt intelligent technologies that enable accurate estimation of a visitor's emotional state from responses to exhibited stimulus. Studies undertaken so far have attempted to estimate the human emotional state, most of them doing so by gauging either facial expressions or audio responses. However, the most recent research suggests that, a multimodal approach that uses people's multiple responses simultaneously may lead to better estimation. Given this context, we propose a new multimodal emotional state estimation model that uses various responses including facial expressions, gestures, and movements measured by the Microsoft Kinect Sensor. In order to effectively handle a large amount of sensory data, we propose to use stratified sampling-based MRA (multiple regression analysis) as our estimation method. To validate the usefulness of the proposed model, we collected 602,599 responses and emotional state data with 274 variables from 15 people. When we applied our model to the data set, we found that our model estimated the levels of valence and arousal in the 10~15% error range. Since our proposed model is simple and stable, we expect that it will be applied not only in intelligent exhibition services, but also in other areas such as e-learning and personalized advertising.

Empirical Mode Decomposition using the Second Derivative (이차 미분을 이용한 경험적 모드분해법)

  • Park, Min-Su;Kim, Donghoh;Oh, Hee-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.2
    • /
    • pp.335-347
    • /
    • 2013
  • There are various types of real world signals. For example, an electrocardiogram(ECG) represents myocardium activities (contraction and relaxation) according to the beating of the heart. ECG can be expressed as the fluctuation of ampere ratings over time. A signal is a composite of various types of signals. An orchestra (which boasts a beautiful melody) consists of a variety of instruments with a unique frequency; subsequently, each sound is combined to form a perfect harmony. Various research on how to to decompose mixed stationary signals have been conducted. In the case of non-stationary signals, there is a limitation to use methodologies for stationary signals. Huang et al. (1998) proposed empirical mode decomposition(EMD) to deal with non-stationarity. EMD provides a data-driven approach to decompose a signal into intrinsic mode functions according to local oscillation through the identification of local extrema. However, due to the repeating process in the construction of envelopes, EMD algorithm is not efficient and not robust to a noise, and its computational complexity tends to increase as the size of a signal grows. In this research, we propose a new method to extract a local oscillation embedded in a signal by utilizing the second derivative.

Cranes(Grus japonensis) Adopted as a Traditional Factor in Landscaping and Gardening Culture (전통조경요소로써 도입된 학(鶴)과 원림문화)

  • Kim, Hai-Gyoung;So, Hyun-Su
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.30 no.3
    • /
    • pp.57-67
    • /
    • 2012
  • This study draws the following conclusions about the nature garden culture with crane that is a dynamic landscaping factor introduced into nature garden by analyzing the literatures on cranes written and landscapes painted in Chosun Dynasty. First, crane symbolizes such Buddhist, Taoist and Confucius media as appearance of Buddha, desire for Taoist hermit world and long life, and a solitary's life of proud loneliness. It has been recorded that crane was raised from the Three Nations Era till the Japanese occupation and Confucius scholars in Chosun Dynasty opened shows concerning crane or formed literature groups enjoying poems of it, and often using crane as their denial of going into government service. Second, in order to introduce crane to nature garden, people caught wild crane and made a fence and some kind of pond for their growth. In addition, crane was strictly managed by appointed slaves and they trained crane for dancing and then tended to allow a crane to play on the yard in terms of abstract meaning or got two cranes free in consideration of their ecology. Third, for more appreciation of crane and the expression of some symbolism matching for it, both plum and pine, which mean a solitary's life and long life respectively, were planted in nature garden. And, Confucius scholars in Chosun Dynasty also enjoyed their refined tastes with appreciation, napping, reading and playing the harp, accompanied by crane. As aforementioned, Confucius scholars in Chosun Dynasty did not only draw the meaning symbolized by cranes and write poems about such symbolic meaning, but also positively introduce crane into nature garden as a dynamic landscaping factor, so that they enjoyed synesthetic senses including looks, motions and sounds of cranes for their refined tastes.

The Effects of Sensory Integration Intervention for the Improvement of Sleep Quality in Adults: A Single-Subject Experimental Design (성인의 수면의 질 향상을 위한 감각통합중재의 효과: 단일사례 연구)

  • Hong, Cho Rong;Yoo, Eun Young;Park, Ji Hyuk;Park, Hae Yean
    • The Journal of Korean Academy of Sensory Integration
    • /
    • v.17 no.2
    • /
    • pp.1-17
    • /
    • 2019
  • Objective : The purpose of this study was to investigate the effects of a sensory integration intervention to improve sleep quality in adults. Methods : The subjects of the study were three sleepless adults with sensory defensiveness. This study applied a multiple baseline design out of a single-subject study, with the process comprising 4 basic lines, 7 basic lines, and 14 basic lines. The intervention was conducted 3 times per week for a total of 12 times. Overall sleep time, sleep latency, and the number of times that participants awoke at night were measured during every session. Sleep quality, sensory integration capabilities (including sensory defensiveness), and sleep satisfaction were measured before and after treatment. Results : A visual analysis of total sleep time, sleep latency, and number of waking times during the night as measured daily after the sensory integration intervention was applied showed no noticeable improvement in any of the subjects. Conclusion : This study is meaningful in that it applied a sensory integration intervention to adults and proposed sensory integration therapy as a new method of intervention to treat sleep disorders.

Comparative Analysis of the Constituents of the Leaves and Roots of Rumex crispus and their Effects on the Differentiation of Human Osteoblast-like MG-63 Cells (소리쟁이 잎과 뿌리 성분 분석 및 사람 조골 유사 MG-63 세포 분화에 미치는 효과 비교)

  • Park, Heajin;Jeong, Jaehoon;Hyun, Hanbit;Kim, Jihye;Kim, Haesung;Oh, Hyun Il;Hwang, Hye Seong;Kim, Ha Hyung
    • YAKHAK HOEJI
    • /
    • v.58 no.5
    • /
    • pp.307-313
    • /
    • 2014
  • Rumex crispus (curled dock), which is a perennial wild plant, has long been used as a laxative, astringent, and medicine to treat blood and skin diseases. We recently reported that the roots of R. crispus are an effective nutraceutical for bone. This study prepared ethanol extracts of the leaves and roots of R. crispus, and analyzed the major constituents using liquid chromatography and mass spectrometry. In addition, their effects on the proliferation and differentiation of human osteoblast-like MG-63 cells, such as cell viability, alkaline phosphatase (ALP) activity, collagen content, and mineralization, were compared. The chromatograms of the chemical constituents of the two extracts exhibited quite different profiles: quercetin and quercitrin were identified as major peaks in the leaf extract, whereas cinnamtannin B1 and procyanidin isomers were the major peaks for the root extract. Neither extract was cytotoxic at concentrations of < $25{\mu}g/ml$. ALP activity and collagen synthesis-which are markers of the early stage of osteogenesis-in MG-63 cells were significantly increased upon the addition of the root extract compared with the addition of the leaf extract. In contrast, the leaf extract had a more stimulatory effect on mineralization-which is marker of the late stage of osteogenesis-in MG-63 cells than did the root extract. In conclusion, extracts of both leaves and roots of R. crispus stimulated the bone-forming activity of osteoblasts; in particular, the root extract was more effective in the early stage of osteoblast differentiation, while the leaf extract was more effective in the late stage. This difference in anabolic activity may be due to differences in the constituents of the leaves and roots. The leaves and roots of R. crispus appear to complement each other as stimulators of bone formation.

Enhanced Sound Signal Based Sound-Event Classification (향상된 음향 신호 기반의 음향 이벤트 분류)

  • Choi, Yongju;Lee, Jonguk;Park, Daihee;Chung, Yongwha
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.5
    • /
    • pp.193-204
    • /
    • 2019
  • The explosion of data due to the improvement of sensor technology and computing performance has become the basis for analyzing the situation in the industrial fields, and various attempts to detect events based on such data are increasing recently. In particular, sound signals collected from sensors are used as important information to classify events in various application fields as an advantage of efficiently collecting field information at a relatively low cost. However, the performance of sound-event classification in the field cannot be guaranteed if noise can not be removed. That is, in order to implement a system that can be practically applied, robust performance should be guaranteed even in various noise conditions. In this study, we propose a system that can classify the sound event after generating the enhanced sound signal based on the deep learning algorithm. Especially, to remove noise from the sound signal itself, the enhanced sound data against the noise is generated using SEGAN applied to the GAN with a VAE technique. Then, an end-to-end based sound-event classification system is designed to classify the sound events using the enhanced sound signal as input data of CNN structure without a data conversion process. The performance of the proposed method was verified experimentally using sound data obtained from the industrial field, and the f1 score of 99.29% (railway industry) and 97.80% (livestock industry) was confirmed.

Pilot Study of Single Session Song-Based Music Therapy for Decreasing ICU Caregiver Anxiety (중환자 보호자의 불안 감소를 위한 단회기 노래중심 음악치료 적용 예비연구)

  • Jung, Yu Sun;Na, Sungwon
    • Journal of Music and Human Behavior
    • /
    • v.16 no.1
    • /
    • pp.25-46
    • /
    • 2019
  • This pilot study involved a single session of song-based music therapy to relieve the anxiety of intensive care unit (ICU) caregivers. Six caregivers of ICU patients participated in the intervention session individually. During the initial stage of the intervention, the participants' current emotional states were identified. Then they listened to familiar songs and playing a tone chime, which was intended to help them relax their body and reduce their psychological resistance. During singing experiences as an essential part of the intervention, the participants discussed the lyrics of songs in an attempt to find the meaning related to them. Also, they sang the songs with a live accompaniment in which their emotional states were reflected with changes in musical elements (e.g., tempo, dynamics, rhythm, or chords). In the final stage, they identified personal application to their everyday lives. To analyze the results, the State-Trait Anxiety Inventory (STAI) and a visual analog scale on emotional states were completed by participants before and after the session, and participants' verbal responses during the session were also recorded. According to the results, STAI anxiety scores significantly declined following the session. Also, they showed significant increases in positive emotions and significant decreases in negative emotions. This suggests that short-term music therapy can be an effective intervention for relieving the psychological distress of ICU caregivers.

Causes of False Negative Bedside Head Impulse Test (나안 두부충동검사에서 위음성의 원인분석)

  • Kim, Dae-Young;Choi, Yoon-Gi;Kyung, Tae-Suk;Hwang, Jun-Ha;Kim, Hyun Ji;Lee, Seung Chul;Kim, Kyu-Sung
    • Korean Journal of Otorhinolaryngology-Head and Neck Surgery
    • /
    • v.60 no.3
    • /
    • pp.107-111
    • /
    • 2017
  • Background and Objectives The bedside head impulse test (bHIT) in bare eyes often overlooks possible vestibular losses by missing the corrective saccade. This is why it is necessary to compare bHIT against video head impulse test (vHIT), which is more accurate in identifying vestibular losses than the bedside test. Subjects and Method A total of 51 vHIT positive ears underwent the study, and out of those, 47 were diagnosed with dizziness. bHIT and vHIT were performed for patients, and the occurrence rate of overt saccade (OS) was calculated. Results Among the 51 vHIT positive ears, 33 (64.7%) were bHIT positive ears and 18 ears (35.3%) were bHIT negative. Patterns of positive vHIT were classified as A: no corrective saccade, B: covert saccade (CS) only, C: OS only, and D: CS with OS (CS+OS), which were 45 out of 51 ears (88%). The occurrence rate of OS was higher in the bHIT positive group than in the bHIT negative group (p=0.05), and higher in the CS negative group (CS-) than in the CS positive group (CS+) (p<0.001). Conclusion Possible causes of false negative results of bHIT are seen as following: the absence of corrective (covert and overt) saccade, the occurrence of CS only, and missing the OS during the bHIT (probably due to low occurrence rate of OS). The occurrence of CS should be considered as an important factor in false negative bHIT when lowering the occurrence rate of OS.