• Title/Summary/Keyword: 음향 분석

Search Result 2,351, Processing Time 0.031 seconds

Real data-based active sonar signal synthesis method (실데이터 기반 능동 소나 신호 합성 방법론)

  • Yunsu Kim;Juho Kim;Jongwon Seok;Jungpyo Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.1
    • /
    • pp.9-18
    • /
    • 2024
  • The importance of active sonar systems is emerging due to the quietness of underwater targets and the increase in ambient noise due to the increase in maritime traffic. However, the low signal-to-noise ratio of the echo signal due to multipath propagation of the signal, various clutter, ambient noise and reverberation makes it difficult to identify underwater targets using active sonar. Attempts have been made to apply data-based methods such as machine learning or deep learning to improve the performance of underwater target recognition systems, but it is difficult to collect enough data for training due to the nature of sonar datasets. Methods based on mathematical modeling have been mainly used to compensate for insufficient active sonar data. However, methodologies based on mathematical modeling have limitations in accurately simulating complex underwater phenomena. Therefore, in this paper, we propose a sonar signal synthesis method based on a deep neural network. In order to apply the neural network model to the field of sonar signal synthesis, the proposed method appropriately corrects the attention-based encoder and decoder to the sonar signal, which is the main module of the Tacotron model mainly used in the field of speech synthesis. It is possible to synthesize a signal more similar to the actual signal by training the proposed model using the dataset collected by arranging a simulated target in an actual marine environment. In order to verify the performance of the proposed method, Perceptual evaluation of audio quality test was conducted and within score difference -2.3 was shown compared to actual signal in a total of four different environments. These results prove that the active sonar signal generated by the proposed method approximates the actual signal.

Accuracy Analysis of Velocity and Water Depth Measurement in the Straight Channel using ADCP (ADCP를 이용한 직선 하천의 유속 및 수심 측정 정확도 분석)

  • Kim, Jongmin;Kim, Dongsu;Son, Geunsoo;Kim, Seojun
    • Journal of Korea Water Resources Association
    • /
    • v.48 no.5
    • /
    • pp.367-377
    • /
    • 2015
  • ADCPs have been highlighted so far for measuring steramflow discharge in terms of their high-order of accuracy, relatively low cost and less field operators driven by their easy in-situ operation. While ADCPs become increasingly dominant in hydrometric area, their actual measurement accuracy for velocity and bathymetry measurement has not been sufficiently validated due to the lack of reliable bench-mark data, and subsequently there are still many uncertain aspects for using ADCPs in the field. This research aimed at analyzing inter-comparison results between ADCP measurements with respect to the detailed ADV measurement in a specified field environment. Overall, 184 ADV points were collected for densely designed grids for the given cross-section that has 6 m of width, 1 m of depth, and 0.7 m/s of averaged mean flow velocity. Concurrently, ADCP fixed-points measurements were conducted for each 0.2m and 0.02m of horizontal and vertical spacing respectively. The inter-comparison results indicated that ADCP matched ADV velocity very accurately for 0.4~0.8 of relative depth (y/h), but noticeable deviation occurred between them in near surface and bottom region. For evaluating the capacity of measuring bathymetry of ADCPs, bottom tracking bathymetry based on oblique beams showed better performance than vertical beam approach, and similar results were shown for fixed and moving-boat method as well. Error analysis for velocity and bathymetry measurements of ADCP can be potentially able to be utilized for the more detailed uncertainty analysis of the ADCP discharge measurement.

An analysis of emotional English utterances using the prosodic distance between emotional and neutral utterances (영어 감정발화와 중립발화 간의 운율거리를 이용한 감정발화 분석)

  • Yi, So-Pae
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.25-32
    • /
    • 2020
  • An analysis of emotional English utterances with 7 emotions (calm, happy, sad, angry, fearful, disgust, surprised) was conducted using the measurement of prosodic distance between 672 emotional and 48 neutral utterances. Applying the technique proposed in the automatic evaluation model of English pronunciation to the present study on emotional utterances, Euclidean distance measurement of 3 prosodic elements such as F0, intensity and duration extracted from emotional and neutral utterances was utilized. This paper, furthermore, extended the analytical methods to include Euclidean distance normalization, z-score and z-score normalization resulting in 4 groups of measurement schemes (sqrF0, sqrINT, sqrDUR; norsqrF0, norsqrINT, norsqrDUR; sqrzF0, sqrzINT, sqrzDUR; norsqrzF0, norsqrzINT, norsqrzDUR). All of the results from perceptual analysis and acoustical analysis of emotional utteances consistently indicated the greater effectiveness of norsqrF0, norsqrINT and norsqrDUR, among 4 groups of measurement schemes, which normalized the Euclidean measurement. The greatest acoustical change of prosodic information influenced by emotion was shown in the values of F0 followed by duration and intensity in descending order according to the effect size based on the estimation of distance between emotional utterances and neutral counterparts. Tukey Post Hoc test revealed 4 homogeneous subsets (calm

Improvement and Promotion Plan for the Screen Baseball Utilization (스크린야구 이용의 개선 및 증진방안)

  • Koo, Soo-Yong;Jeon, Yong-Bae;Choi, Eui-Yul
    • 한국체육학회지인문사회과학편
    • /
    • v.54 no.4
    • /
    • pp.363-372
    • /
    • 2015
  • The purpose of this study was to suggest marketing plan to improve and promote screen baseball utilization, applying Importance-Performance Analysis (IPA) incorporating marketing mix 4Cs; Customer value, Convenience, Cost, and Communication. A convenience sample was made up of 267 users in screen baseball clubs located in metropolitan area. Evidence on validity and reliability of the data was obtained through exploratory factor analysis and internal consistency analysis. Frequency analysis and paired sample t-test for the difference verification of IPA were conducted also in SPSS version 21.0 with .05 of significance level. The main results of the study were as follows. First, there were partially significant differences between importance and satisfaction regarding the sub-categories of Customer value and Convenience. Second, there were all significant differences between importance and satisfaction regarding the sub-categories of Cost and Communication. Third, as the results regarding IPA, quadrant I indicating 'Keep up Good Work' included healthy use of leisure time, improvement of self-achievement, accessibility, communication between service user and provider, etc. Fourth, quadrant II indicating 'Concentrate Here' included diversification of screen baseball program, cost regarding facility use, etc. Fifth, quadrant III indicating 'Low Priority' included interpersonal relationship, subsidiary facilities, cost of food and beverage, etc. Lastly, quadrant IV indicating 'Possible Overkill' included improvement of physical health and life satisfaction and rules and procedures of screen baseball.

Spectral moment analysis of distortion errors in alveolar fricatives in Korean children (치조 마찰음 왜곡 오류 유무에 따른 아동 발화 적률분석 비교)

  • Yunju Han;Do Hyung Kim;Ja Eun Hwang;Dae-Hyun Jang;Jae Won Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.33-40
    • /
    • 2024
  • This study investigated acoustic features in spectral moment analysis, comparing accurate articulations with distortions of alveolar fricatives such as dentalization, palatalization, and lateralization. A retrospective analysis was conducted on speech samples from 61 children (mean age: 5.6±1.5 years, 19 females, 42 males) using the Assessment of Phonology & Articulation for Children (APAC) and Urimal-test of Articulation and Phonology I (U-TAP I). Spectral moment analysis was applied to 169 speech samples. The results revealed that the center of gravity of accurate articulations was higher than that of palatalization, while palatalization was lower than dentalization. The variance of dentalization was higher than that of both accurate articulations and palatalization. The skewness of dentalization was higher than that of accurate articulations, and the skewness of palatalization was higher than that of accurate articulations. The kurtosis of palatalization was higher than that of both accurate articulations and dentalization. No significant differences were observed for the position of fricatives (initial, medial) and tense type (plain, tense) across all variables of spectral moment analysis for each distortion type. This study confirmed distinct patterns in center of gravity, variance, skewness, and kurtosis depending on the type of alveolar fricative distortion. The objective values provided in this study will serve as foundational data for diagnosing alveolar fricative distortions in children with speech sound disorders.

Analysis of Semantic Attributes of Korean Words for Sound Quality Evaluation in Music Listening (음악감상에서의 음질 평가를 위한 한국어 어휘의 의미론적 속성 분석)

  • Lee, Eun Young;Yoo, Ga Eul;Lee, Youngmee
    • Journal of Music and Human Behavior
    • /
    • v.21 no.2
    • /
    • pp.107-134
    • /
    • 2024
  • This study aims to classify the semantic words commonly used to evaluate sound quality and to analyze their differences in reflecting the level of musical stimuli. Participants were thirty-one music majors in their 20s and 30s, with an average of 9.4 years of professional training. Each participant listened to nine pieces of music with variations in texture and instrument type and evaluated them using 18 pairs of semantic words describing sound quality. A factor analysis was conducted to group words influenced by the same latent factor, and a multivariate ANOVA determined the differences in ratings based on texture and instrument type. Radar charts were also drawn based on the identified sets of semantic words. The results showed that four factors were identified, and the word pairs 'soft-hard,' 'dull-sharp,' 'muddy-clean' and 'low-high' showed significant differences based on the level of musical stimuli. The radar charts effectively distinguished the sound quality evaluations for each music. These results indicate that developing Korean semantic words for sound quality evaluation requires a structure different from the previous categories used in Western countries and that linguistic and cultural factors are crucial. This study will provide foundational data for developing a verbal sound quality evaluation framework suited to the Korean context, while reflecting acoustic attributes in music listening.

Characteristics of scenario text reading fluency in middle school students with poor reading skills (중학교 읽기부진 학생의 시나리오 글 읽기 유창성 특성)

  • Jihye Park;Cheoljae Seong
    • Phonetics and Speech Sciences
    • /
    • v.16 no.3
    • /
    • pp.39-48
    • /
    • 2024
  • Reading fluency refers to the ability to read sentences or paragraphs accurately, quickly, and with appropriate prosodic expression. Most reading fluency assessments exclude expressive ability because it is difficult to objectively measure. Therefore, in this study, we examined all elements of reading fluency by analyzing prosodic characteristics of reading scenario texts to maximize expressive reading. The subjects were 30 male students in the first and second grades of middle school (15 normal and 15 poor readers). To analyze the accuracy aspect, error types at the syllable level were analyzed for each group, and related acoustic variables were measured and examined in terms of prosodic aspects. The reading accuracy analysis showed that the poor reading group had a higher error rate than the normal. In terms of error types, the normal group showed the order of 'substitution>omission>correction>insertion>repetition', whereas the poor reading group was in the order of 'correction>substitution>repetition/insertion>omission'. For the speech tempo, the dyslexic students were slower than the typical students for all sentence types. The prosodic variables also showed a high frequency of accentual phrases (AP) and intonation phrases (IP) in sentences along with a wide intensity range.

Speech Recognition Using Linear Discriminant Analysis and Common Vector Extraction (선형 판별분석과 공통벡터 추출방법을 이용한 음성인식)

  • 남명우;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.35-41
    • /
    • 2001
  • This paper describes Linear Discriminant Analysis and common vector extraction for speech recognition. Voice signal contains psychological and physiological properties of the speaker as well as dialect differences, acoustical environment effects, and phase differences. For these reasons, the same word spelled out by different speakers can be very different heard. This property of speech signal make it very difficult to extract common properties in the same speech class (word or phoneme). Linear algebra method like BT (Karhunen-Loeve Transformation) is generally used for common properties extraction In the speech signals, but common vector extraction which is suggested by M. Bilginer et at. is used in this paper. The method of M. Bilginer et al. extracts the optimized common vector from the speech signals used for training. And it has 100% recognition accuracy in the trained data which is used for common vector extraction. In spite of these characteristics, the method has some drawback-we cannot use numbers of speech signal for training and the discriminant information among common vectors is not defined. This paper suggests advanced method which can reduce error rate by maximizing the discriminant information among common vectors. And novel method to normalize the size of common vector also added. The result shows improved performance of algorithm and better recognition accuracy of 2% than conventional method.

  • PDF

Investigation into influence of sound absorption block on interior noise of high speed train in tunnel (터널 내부 도상 블록형 흡음재의 고속철도차량 내부 소음에 미치는 영향에 대한 고찰)

  • Lee, Sang-heon;Cheong, Cheolung;Lee, Song-June;Kim, Jae-Hwan;Son, Dong-Gi;Sim, Gyu-Cheol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.37 no.4
    • /
    • pp.223-231
    • /
    • 2018
  • Recently, due to various environmental problems, blast tracks in tunnel are replaced with concrete tracks, but they have more adverse effects on noise than blast tracks so that additional noise measures are needed. Among these measures, sound-absorbing blocks start to be used due to its easy and quick installation. However, the performance of sound absorption blocks need to be verified under real environmental and operational conditions. In this paper, interior noise levels in KTX train cruising in Dalseong tunnel are measured before and after the installation of sound-absorbing blocks and the measured data are analyzed and compared. Additionally, noise reduction are estimated by modeling the high speed train, the tunnel and absorption blocks. Measurement devices and methods are used according to ISO 3381 and the equivalent sound pressure levels during the cruising time inside the tunnel are computed. In addition to overall SPLs(Sound Pressure Levels), 1/3-octave-band levels are also analyzed to account for the frequency characteristics of sound absorption and equipment noise in a cabin. In addition, to consider the effects of train cruising speeds and environmental conditions on the measurements, the measured data are corrected by using those measured during the train-passing through the tunnels located before and behind the Dalseong tunnel. Analysis of measured results showed that the maximum noise reduction of 6.8 dB (A) can be achieved for the local region where the sound-absorbing blocks are installed. Finally, through the comparison of predicted 1/3-octave band SPLs for the KTX interior noise with the measurements, the understanding of noise reduction mechanism due to sound-absorbing blocks is enhanced.

The characteristics of sentence reading intonations in North Korean defectors based on pitch range and an auditory-perceptual rating scale (북한이탈주민의 문장 읽기 억양 특성-음도범위와 청지각적 평가를 중심으로)

  • Kim, Damee;Kim, Shinhee;Kim, Jiseong;An, Eunsol;Cho, Yongyun;Yang, Yoonhee;Yim, Dongsun
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.9-21
    • /
    • 2019
  • This study aimed to compare the prosodic characteristics of North Korean defectors and South Koreans in three types of sentences (declarative, interrogative, and negative) in two reading tasks (short and dialogue) through acoustic analysis and auditory-perceptual evaluation. In addition, this study examined the relationship between the auditory-perceptual evaluation scores and self-assessment questionnaires on intonation for North Korean defectors. The participants were 15 North Korean defectors and 15 Korean speakers with standard Seoul accents. For statistical analysis, three-way mixed ANOVA and multivariate analysis were performed within the three types of sentences in the reading tasks through acoustic analysis and the Mann-Whitney U Test for auditory-perceptual evaluation. Pearson's product-moment correlation coefficients were also used to identify the correlations between the results of the self-assessment questionnaire on intonation and the auditory-perceptual evaluation. The North Korean defectors were found to have a significantly lower pitch range and auditory-perceptual evaluation score than South Koreans in reading tasks. Moreover, there was a significant correlation between their auditory-perceptual evaluations and self-assessment questionnaires on intonation. The study findings suggest that North Korean defectors, who face many challenges with intonation, showed a tendency to think that their intonation differed from the standard Korean intonation and showed better auditory evaluation results for interrogative sentences.