Search | Korea Science

Extraction of MFCC feature parameters based on the PCA-optimized filter bank and Korean connected 4-digit telephone speech recognition (PCA-optimized 필터뱅크 기반의 MFCC 특징파라미터 추출 및 한국어 4연숫자 전화음성에 대한 인식실험)

정성윤;김민성;손종목;배건성
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.279-283
- /
- 2004
In general, triangular shape filters are used in the filter bank when we extract MFCC feature parameters from the spectrum of the speech signal. A different approach, which uses specific filter shapes in the filter bank that are optimized to the spectrum of training speech data, is proposed by Lee et al. to improve the recognition rate. A principal component analysis method is used to get the optimized filter coefficients. Using a large amount of 4-digit telephone speech database, in this paper, we get the MFCCs based on the PCA-optimized filter bank and compare the recognition performance with conventional MFCCs and direct weighted filter bank based MFCCs. Experimental results have shown that the MFCC based on the PCA-optimized filter bank give slight improvement in recognition rate compared to the conventional MFCCs but fail to achieve better performance than the MFCCs based on the direct weighted filter bank analysis. Experimental results are discussed with our findings.
PDF KSCI

Nonlinear Characteristics Evaluation of the FBMC and UFMC System for the 5G Mobile Communication (5세대 이동통신을 위한 FBMC와 UFMC 시스템의 비선형 특성 평가)

An, Changyoung;Ryu, Heung-Gyoon
- The Journal of Korean Institute of Electromagnetic Engineering and Science
- /
- v.27 no.8
- /
- pp.725-734
- /
- 2016
Recently, novel candidate waveform techniques for spectral efficiency improvement was proposed in order to satisfy key performance indicators(KPIs) of 5th generation(5G) mobile communication. Multi-carrier based universal filtered multi-carrier(UFMC) and filter bank multi-carrier(FBMC) are very famous as 5G candidate waveform techniques. Also, weighted orthogonal frequency division multiplexing (W-OFDM) that has low-complexity is receiving the spotlight slowly. In this paper, firstly, we describe a basic OFDM system. And then, we also describe UFMC, FBMC, and W-OFDM system. Next, we evaluate and analyze spectrum and BER performance of these systems under the nonlinear high power amplifier(HPA) environment. As simulation results, spectrum characteristic and BER performance of UFMC, FBMC, and W-OFDM are similar to each other. Therefore, under the nonlinear HPA environment, W-OFDM system is more advantageous because W-OFDM system uses a simple time-domain windowing technique and has similar characteristics to the others.
https://doi.org/10.5515/KJKIEES.2016.27.8.725 인용 PDF KSCI

Application of the modified fast fourier transformation weighted with refractive index dispersion far an accurate determination of film thickness (굴절률 분산을 반영한 고속 푸리에 변환 및 막두께 정밀결정)

김상준;김상열
- Korean Journal of Optics and Photonics
- /
- v.14 no.3
- /
- pp.266-271
- /
- 2003
The reflectance spectrum of optical films thicker than a few microns shows an intensity oscillation due to interference. Since the spectral period of the oscillation is inversely related to film thickness, the thickness of an optical film can be determined from the spectral frequency of the oscillation. For rapid data processing, the spectral frequency is obtained by use of a Fast Fourier Transformation technique. The conventional method of applying a Fast Fourier Transformation to the reflectance spectrum versus photon energy is modified so as to clear the ambiguity in choosing the proper effective refractive index value and to prevent the broadening of the Fourier transformed peak due to the refractive index dispersion. This technique of modified Fast Fourier Transformation is suggested by the authors for the first time to their knowledge. From the analysis of the calculated reflectance spectrum of a 30-${\mu}{\textrm}{m}$-thick dielectric film. it is shown to improve the accuracy in determining film thickness by a great amount. The improved accuracy of the modified Fast Fourier Transformation is also confirmed from the analysis of the reflectance spectra of a sample with 80-${\mu}{\textrm}{m}$-thick cover layer and 13-${\mu}{\textrm}{m}$-thick spacer layer on a PC substrate.
https://doi.org/10.3807/KJOP.2003.14.3.266 인용 PDF KSCI

A study on skip-connection with time-frequency self-attention for improving speech enhancement based on complex-valued spectrum (복소 스펙트럼 기반 음성 향상의 성능 향상을 위한 time-frequency self-attention 기반 skip-connection 기법 연구)

Jaehee Jung;Wooil Kim
- The Journal of the Acoustical Society of Korea
- /
- v.42 no.2
- /
- pp.94-101
- /
- 2023
A deep neural network composed of encoders and decoders, such as U-Net, used for speech enhancement, concatenates the encoder to the decoder through skip-connection. Skip-connection helps reconstruct the enhanced spectrum and complement the lost information. The features of the encoder and the decoder connected by the skip-connection are incompatible with each other. In this paper, for complex-valued spectrum based speech enhancement, Self-Attention (SA) method is applied to skip-connection to transform the feature of encoder to be compatible with the features of decoder. SA is a technique in which when generating an output sequence in a sequence-to-sequence tasks the weighted average of input is used to put attention on subsets of input, showing that noise can be effectively eliminated by being applied in speech enhancement. The three models using encoder and decoder features to apply SA to skip-connection are studied. As experimental results using TIMIT database, the proposed methods show improvements in all evaluation metrics compared to the Deep Complex U-Net (DCUNET) with skip-connection only.
https://doi.org/10.7776/ASK.2023.42.2.094 인용 PDF

Radiologic-Pathologic Correlation of Unusual Lingual Masses: Part II: Benign and Malignant Tumors

Se Hyung Kim;Moon Hee Han;Sun Won Park;Kee-Hyun Chang
- Korean Journal of Radiology
- /
- v.2 no.1
- /
- pp.42-51
- /
- 2001
Because the tongue is superficially located and the initial manifestation of most diseases occurring there is mucosal change, lingual lesionscan be easily accessed and diagnosed without imaging analysis. Some lingual neoplasms, however, may manifest as a submucosal bulge and be located in a deep portion of the tongue, such as its base; their true characteristics and extent may be recognized only on cross-sectional images such as those obtained by CT or MRI. Some uncommon tongue neoplasms may have characteristic radiologic features, thus permitting quite specific radiologic diagnosis. Lipomas typically manifest at both CT and MR imaging as homogeneous nonenhancing lesions. Relative to subcutaneous fat they are isoattenuating on CT images, and all MR sequences show them as isointense. Due to the paramagnetic properties of melanin, metastases from melanotic melanoma usually demonstrate high signal intensity on T1-weighted MR images and low signal intensity on T2-weighted images. Although the radiologic findings for other submucosal neoplasms are nonspecific, CT and MR imaging can play an important role in the diagnostic work-up of these unusual tumors. Delineation of the extent of the tumor, and recognition and understanding of the spectrum of imaging and the pathologic features of these lesions, often help narrow the differential diagnosis.
PDF

Radionuclide identification based on energy-weighted algorithm and machine learning applied to a multi-array plastic scintillator

Hyun Cheol Lee ;Bon Tack Koo ;Ju Young Jeon ;Bo-Wi Cheon ;Do Hyeon Yoo ;Heejun Chung;Chul Hee Min
- Nuclear Engineering and Technology
- /
- v.55 no.10
- /
- pp.3907-3912
- /
- 2023
Radiation portal monitors (RPMs) installed at airports and harbors to prevent illicit trafficking of radioactive materials generally use large plastic scintillators. However, their energy resolution is poor and radionuclide identification is nearly unfeasible. In this study, to improve isotope identification, a RPM system based on a multi-array plastic scintillator and convolutional neural network (CNN) was evaluated by measuring the spectra of radioactive sources. A multi-array plastic scintillator comprising an assembly of 14 hexagonal scintillators was fabricated within an area of 50 × 100 cm². The energy spectra of ¹³⁷Cs, ⁶⁰Co, ²²⁶Ra, and ⁴K (KCl) were measured at speeds of 10-30 km/h, respectively, and an energy-weighted algorithm was applied. For the CNN, 700 and 300 spectral images were used as training and testing images, respectively. Compared to the conventional plastic scintillator, the multi-arrayed detector showed a high collection probability of the optical photons generated inside. A Compton maximum peak was observed for four moving radiation sources, and the CNN-based classification results showed that at least 70% was discriminated. Under the speed condition, the spectral fluctuations were higher than those under dwelling condition. However, the machine learning results demonstrated that a considerably high level of nuclide discrimination was possible under source movement conditions.
https://doi.org/10.1016/j.net.2023.07.005 인용 PDF

Magnetic Resonance Findings in Two Episodes of Repeated Cerebral Fat Embolisms in a Patient with Autologous Fat Injection into the Face

Lee, Kyung-Mi;Kim, Eui-Jong;Jahng, Geon-Ho;Chang, Dae-Il
- Journal of Korean Neurosurgical Society
- /
- v.51 no.5
- /
- pp.312-315
- /
- 2012
We report magnetic resonance image (MRI) and magnetic resonance spectroscopy (MRS) findings in a patient of cerebral fat embolism (CFE) occurred in a 26-year-old woman after an autologous fat injection into the face. After initial neurologic symptom onset, MRI and MRS data were obtained two times to investigate repeated CFE. We obtained the MRS data in the two different time intervals and two different echo times to compare the lesions with normal brain parenchyma. The results of MRS data showed that a decrease in N-acetyl-aspartate, an increase in lactate and a very high early peak of free lipids between 0.9 and 1.4 ppm were obtained at the acute infarcted lesion as compared with normal brain parenchyma. In addition, these findings were more clearly detected on short echo time spectrum rather than long spectrum. A close relationship between the clinical manifestations and MRI and MRS findings of the brain can helpful to distinguish CFE with other conditions and to evaluate the cause materials of infarctions rather than conventional MRI or diffusion-weighted imaging.
https://doi.org/10.3340/jkns.2012.51.5.312 인용 PDF KSCI

Resource Allocation and EE-SE Tradeoff for H-CRAN with NOMA-Based D2D Communications

Wang, Jingpu;Song, Xin;Dong, Li
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.4
- /
- pp.1837-1860
- /
- 2020
We propose a general framework for studying resource allocation problem and the tradeoff between spectral efficiency (SE) and energy efficiency (EE) for downlink traffic in power domain-non-orthogonal multiple access (PD-NOMA) and device to device (D2D) based heterogeneous cloud radio access networks (H-CRANs) under imperfect channel state information (CSI). The aim is jointly optimize radio remote head (RRH) selection, spectrum allocation and power control, which is formulated as a multi-objective optimization (MOO) problem that can be solved with weighted Tchebycheff method. We propose a low-complexity algorithm to solve user association, spectrum allocation and power coordination separately. We first compute the CSI for RRHs. Then we study allocating the cell users (CUs) and D2D groups to different subchannels by constructing a bipartite graph and Hungrarian algorithm. To solve the power control and EE-SE tradeoff problems, we decompose the target function into two subproblems. Then, we utilize successive convex program approach to lower the computational complexity. Moreover, we use Lagrangian method and KKT conditions to find the global optimum with low complexity, and get a fast convergence by subgradient method. Numerical simulation results demonstrate that by using PD-NOMA technique and H-CRAN with D2D communications, the system gets good EE-SE tradeoff performance.
https://doi.org/10.3837/tiis.2020.04.023 인용 PDF KSCI HTML

Digital Audio Watermarking in The Cepstrum Domain (켑스트럼 영역에서의 오디오 워터마킹 방법)

이상광;호요성
- Journal of Broadcast Engineering
- /
- v.6 no.1
- /
- pp.13-20
- /
- 2001
In this paper, we propose a new digital audio watermarking scheme In the cepstrum domain. We insert a digital watermark signal Into the cepstral components of the audio signal using a technique analogous to spread spectrum Communications, hiding a narrow band signal in a wade band channel. In our proposed method, we use pseudo-random sequences to watermark the audio signal. The watermark Is then weighted in the cepstrum domain according to the distribution of cepstral coefficients and the frequency masking characteristics of the human auditory system. The proposed watermark embedding scheme minimizes audibility of the watermark signal. and the embedded watermark is robust to mu1tip1e watermarks, MPEG audio ceding and additive noose.
PDF

Multiple-Phase Energy Detection and Effective Capacity Based Resource Allocation Against Primary User Emulation Attacks in Cognitive Radio Networks

Liu, Zongyi;Zhang, Guomei;Meng, Wei;Ma, Xiaohui;Li, Guobing
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.3
- /
- pp.1313-1336
- /
- 2020
Cognitive radio (CR) is regarded as an effective approach to avoid the inefficient use of spectrum. However, CRNs have more special security problems compared with the traditional wireless communication systems due to its open and dynamic characteristics. Primary user emulation attack (PUEA) is a common method which can hinder secondary users (SUs) from accessing the spectrum by transmitting signals who has the similar characteristics of the primary users' (PUs) signals, and then the SUs' quality of service (QoS) cannot be guaranteed. To handle this issue, we first design a multiple-phase energy detection scheme based on the cooperation of multiple SUs to detect the PUEA more precisely. Second, a joint SUs scheduling and power allocation scheme is proposed to maximize the weighted effective capacity of multiple SUs with a constraint of the average interference to the PU. The simulation results show that the proposed method can effectively improve the effective capacity of the secondary users compared with the traditional overlay scheme which cannot be aware of the existence of PUEA. Also the good delay QoS guarantee for the secondary users is provided.
https://doi.org/10.3837/tiis.2020.03.022 인용 PDF KSCI HTML

Search Result 93, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)