• Title/Summary/Keyword: Auditory Analysis

Search Result 324, Processing Time 0.025 seconds

Separation of Single Channel Mixture Using Time-domain Basis Functions

  • 장길진;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.146-146
    • /
    • 2002
  • We present a new technique for achieving source separation when given only a single channel recording. The main idea is based on exploiting the inherent time structure of sound sources by learning a priori sets of time-domain basis functions that encode the sources in a statistically efficient manner. We derive a learning algorithm using a maximum likelihood approach given the observed single channel data and sets of basis functions. For each time point we infer the source parameters and their contribution factors. This inference is possible due to the prior knowledge of the basis functions and the associated coefficient densities. A flexible model for density estimation allows accurate modeling of the observation, and our experimental results exhibit a high level of separation performance for simulated mixtures as well as real environment recordings employing mixtures of two different sources. We show separation results of two music signals as well as the separation of two voice signals.

On the Signal Analysis of Two Waterfall Sounds in Australia's Broken Falls

  • Tian, Zhixing;Bae, MyungJin
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.287-293
    • /
    • 2020
  • More and more people are paying attention to the psychological pleasure and relaxation that sound hearing brings. In most cases, humans seem to have a special preference for natural sounds. Natural sounds are mainly white noise and pink noise such as wind, rain, waves, waterfall sounds, etc. All of these are often considered to be beneficial to human health, but in reality the same category of natural sounds is no different. It will be very different due to space, time and other factors. Each sound can be unique, so people's hearing experience is also different. This paper quantitatively analyzes the spectrum and brain waves to analyze the feeling of hearing the natural Broken Falls sound. In particular, we aim to objectively analyze the objective feeling of Broken Falls sound falling on the human auditory system through sound spectrum and brain waves.

A Study on Development of Disney Animation's Box-office Prediction AI Model Based on Brain Science (뇌과학 기반의 디즈니 애니메이션 흥행 예측 AI 모형 개발 연구)

  • Lee, Jong-Eun;Yang, Eun-Young
    • Journal of Digital Convergence
    • /
    • v.16 no.9
    • /
    • pp.405-412
    • /
    • 2018
  • When a film company decides whether to invest or not in a scenario is the appropriate time to predict box office success. In response to market demands, AI based scenario analysis service has been launched, yet the algorithm is by no means perfect. The purpose of this study is to present a prediction model of movie scenario's box office hit based on human brain processing mechanism. In order to derive patterns of visual, auditory, and cognitive stimuli on the time spectrum of box office animation hit, this study applied Weber's law and brain mechanism. The results are as follow. First, the frequency of brain stimulation in the biggest box office movies was 1.79 times greater than that in the failure movies. Second, in the box office success, the cognitive stimuli codes are spread evenly, whereas in the failure, concentrated among few intervals. Third, in the box office success movie, cognitive stimuli which have big cognition load appeared alone, whereas visual and auditory stimuli which have little cognitive load appeared simultaneously.

Attenuation of ROS Generation by KCNE1 Genes in Cisplatin-treated Auditory Cells

  • Kim, Eun Sook;Park, Sang-Ho;Park, Raekil
    • Korean Journal of Clinical Laboratory Science
    • /
    • v.45 no.3
    • /
    • pp.114-119
    • /
    • 2013
  • Potassium is essential for the proper functioning of the ears. The inner ear's endolymph differs from all other extracellular fluids (in its positive potential) and in the ionic compositions in the various parts of the endolymphatic space. Ion concentration of the endolymph is 150 mM of potassium, which is comparable to the concentrations in other organs. Cisplatin (cis-diamminedichloroplatinum II: CDDP) is one of the most effective anticancer drugs, widely used against various tumors. However, its clinical use is limited by the onset of severe side effects, including ototoxicity and nephrotoxicity. For ototoxicity, a number of evidences in cytotoxic mechanism of cisplatin, including perturbation of redox status, increase in lipid peroxydation, and formation of DNA adduct, have been suggested. Therefore, in this study, the author investigated the relationship between the potassium ions on cisplatin-induced cytotoxicity in HEI-OC1 cells associated with reactive oxygen species (ROS). KCNE1 gene expression by the concentration of intracellular potassium appeared in the plasma membrane and increased the concentration of intracellular potassium. Cisplatin decreased the viability of HEI-OC1 cells, but the KCNE1 gene increased. Also, the KCNE1 gene significantly suppressed generation of intracellular ROS by cisplatin. Western blot analysis showed that the KCNE1 gene increased phase II detoxification enzymes markers such as superoxide dismutase 1 (SOD1), superoxide dismutase (SOD2), NAD(P)H:quinine oxidoreductases (NQO1), which were associated with the scavenger of ROS. These results suggest that the KCNE1 gene for intracellular potassium concentration ultimately prevents ROS generation from cisplatin and further contributes to protect auditory sensory hair cells from ROS produced by cisplatin.

  • PDF

The Impact of Cognitive Workload on Driving Performance and Visual Attention in Younger and Older Drivers (인지부하가 시각주의와 운전수행도에 미치는 영향에 관한 연령대별 분석)

  • Son, Joonwoo;Park, Myoungouk
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.21 no.4
    • /
    • pp.62-69
    • /
    • 2013
  • Visual demands associated with in-vehicle display usage and text messaging distract a driver's visual attention from the roadway. To minimize eyes-off-the-road demands, voice interaction systems are widely introduced. Under cognitively distracted condition, however, awareness of the operating environment will be degraded although the driver remains oriented to the roadway. It is also know that the risk of inattentive driving varies with age, thus systematic analysis of driving risks is required for the older drivers. This paper aims to understand the age-related driving performance degradation and visual attention changes under auditory cognitive demand which consists of three graded levels of cognitive complexity. In this study, two groups, aged 25-35 and 60-69, engaged in a delayed auditory recall task, so called N-back task, while driving a simulated highway. Comparisons of younger and older drivers' driving performance including mean speed, speed variability and standard deviation of lane position, and gaze dispersion changes, which consist of x-axis and y-axis of visual attention, were conducted. As a result, it was observed that gaze dispersion decreased with each level of demand, demonstrating that these indices can correctly rank order cognitive workload. Moreover, gaze dispersion change patterns were quite consistent in younger and older age groups. Effects were also observed on driving performance measures, but they were subtle, nonlinear, and did not effectively differentiate the levels of cognitive workload.

Speech Segmentation using Weighted Cross-correlation in CASA System (계산적 청각 장면 분석 시스템에서 가중치 상호상관계수를 이용한 음성 분리)

  • Kim, JungHo;Kang, ChulHo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.5
    • /
    • pp.188-194
    • /
    • 2014
  • The feature extraction mechanism of the CASA(Computational Auditory Scene Analysis) system uses time continuity and frequency channel similarity to compose a correlogram of auditory elements. In segmentation, we compose a binary mask by using cross-correlation function, mask 1(speech) has the same periodicity and synchronization. However, when there is delay between autocorrelation signals with the same periodicity, it is determined as a speech, which is considered to be a drawback. In this paper, we proposed an algorithm to improve discrimination of channel similarity using Weighted Cross-correlation in segmentation. We conducted experiments to evaluate the speech segregation performance of the CASA system in background noise(siren, machine, white, car, crowd) environments by changing SNR 5dB and 0dB. In this paper, we compared the proposed algorithm to the conventional algorithm. The performance of the proposed algorithm has been improved as following: improvement of 2.75dB at SNR 5dB and 4.84dB at SNR 0dB for background noise environment.

A Study of Visual Event-Related Potential P300 in Schizophrenia (정신분열병의 시각자극 사건유발전위 P300에 대한 연구)

  • Oh, Dong-Hoon;Nam, Jung-Hyun;Ahn, Dong-Hyun;Kim, Seok-Hyun;Choi, Joon-Ho
    • Korean Journal of Biological Psychiatry
    • /
    • v.11 no.1
    • /
    • pp.40-48
    • /
    • 2004
  • Objective:Event-related potentials(ERPs) are electrical changes recorded at the surface of the scalp in response to stimulus presentation, and their latency and amplitude change according to cognitive processes. Through past studies of the auditory ERP in schizophrenia, the P300 has been reported to be statistically smaller and delayed in schizophrenia than comparison groups. However, studies of the visual ERP have not been systematically examined. The present study was designed to investigate the visual P300 in patients with schizophrenia and normal controls and to compare the pattern of P300 between them. Methods:The subjects were composed of patients(N=22) with schizophrenia by DSM-IV and normal controls(N=22). The visual ERPs were measured by the visual continuous performance test. P300 amplitude and latency measured on 5 scalp electrodes(Fz, Cz, Pz, $T_7$, $T_8$) were compared between patients and controls. Results:The P300 latencies measured on Fz, Cz, Pz, and $T_7$ electrodes were significantly longer in patients than controls(p<0.05). The P300 amplitudes in patients were smaller than controls. However, the difference between them was not statistically significant. Conclusion:Analysis of the visual ERPs showed that the P300 latency is significantly delayed and the P300 amplitude is slightly smaller in patients than controls. These results are similar to established studies of the auditory P300 in schizophrenia.

  • PDF

Relationships between the sensory, cognitive and physical functions of young-old and old-old individuals (전·후기 노인들의 감각기능, 인지기능과 신체기능 간의 관련성)

  • Jeon, So-Youn;Lee, Sok-Goo
    • Korean Journal of Health Education and Promotion
    • /
    • v.33 no.5
    • /
    • pp.23-36
    • /
    • 2016
  • Objectives: This study aims to define the relationships between the sensory, cognitive and physical functions of young-old and old-old individuals. Methods: Participants were 10,451 elderly individuals aged 65 and above, raw data of a 2014 National Survey on Korean Older Persons was used. To investigate the relationships among the sensory, cognitive, and physical functions, a structural equation model was used. Results: The key analysis results are summarized as follows; 5% had poor vision function(young-old 3.5%, old-old 7.1%), 3.8% had poor auditory function(young-old 1.7%, old-old 6.7%), 33.0% had decline in cognitive function(young-old 30.9%, old-old 35.7%), 3.6% were disabled(young-old 1.6%, old-old 6.3%) and cognitive function influences physical function more greatly than does sensory function. Additionally, in the young-old groups, vision among sensory functions, attention among cognitive functions, and IADL among physical functions, turned out to be the most influential. However, in the old-old groups, auditory function among sensory functions, orientation among cognitive functions, and IADL among physical functions, turned out to be the most influential. Conclusions: This study implies that functions in the young-old and old-old individuals must be considered with all three functions-sensory, cognitive, and physical-together at the same time and that this comprehensive approach is necessary in national policy making.

A Study or the Effect of Electrical Stimulation on Tinnitus Treatment based on the Correlation Analysis of ABR and ECochG (ABR과 ECochG의 상관분석을 통한 전기자극이 이명치료에 미치는 영향에 관한 연구)

  • Kim, K.S.;Park, J.W.;Nam, S.H.;Im, J.J.;Choi, E.S.;Jeon, B.H.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.11
    • /
    • pp.87-90
    • /
    • 1997
  • Electrical stimulation has been used or diagnosis and treatment of impairment on the auditory system. Unfortunately, there were no standard methods or theoretical background or choosing stimulus conditions because of the lack of understanding on the current propagation through the auditory pathways. Nine guniea pigs, experimental group(A) and control group(B), were used for the experiment. ABR and ECochG were obtained under our experimental conditions, before tinnitus and 1, 6, 12 hours after tinnitus induction using salicylate. Electrical stimulations were applied to the group A, and the changes on ABR/ECochG's correlation coefficients were observed. Results showed that an electrical stimulation brings ABR waveform back to the normal states well in the group A compare to the group B, which proved the effectiveness of the stimulation. Based on the results of this experiment, establishment of an electrical model which provide the quantitative information regarding diagnosis and treatment of tinnitus could be achievied.

  • PDF

Speech Recognition Performance Improvement using Gamma-tone Feature Extraction Acoustic Model (감마톤 특징 추출 음향 모델을 이용한 음성 인식 성능 향상)

  • Ahn, Chan-Shik;Choi, Ki-Ho
    • Journal of Digital Convergence
    • /
    • v.11 no.7
    • /
    • pp.209-214
    • /
    • 2013
  • Improve the recognition performance of speech recognition systems as a method for recognizing human listening skills were incorporated into the system. In noisy environments by separating the speech signal and noise, select the desired speech signal. but In terms of practical performance of speech recognition systems are factors. According to recognized environmental changes due to noise speech detection is not accurate and learning model does not match. In this paper, to improve the speech recognition feature extraction using gamma tone and learning model using acoustic model was proposed. The proposed method the feature extraction using auditory scene analysis for human auditory perception was reflected In the process of learning models for recognition. For performance evaluation in noisy environments, -10dB, -5dB noise in the signal was performed to remove 3.12dB, 2.04dB SNR improvement in performance was confirmed.