Search | Korea Science

Method for Spectral Enhancement by Binary Mask for Speech Recognition Enhancement Under Noise Environment (잡음환경에서 음성인식 성능향상을 위한 바이너리 마스크를 이용한 스펙트럼 향상 방법)

Choi, Gab-Keun;Kim, Soon-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.7
- /
- pp.468-474
- /
- 2010
The major factor that disturbs practical use of speech recognition is distortion by the ambient and channel noises. Generally, the ambient noise drops the performance and restricts places to use. DSR (Distributed Speech Recognition) based speech recognition also has this problem. Various noise cancelling algorithms are applied to solve this problem, but loss of spectrum and remaining noise by incorrect noise estimation at low SNR environments cause drop of recognition rate. This paper proposes methods for speech enhancement. This method uses MMSE-STSA for noise cancelling and ideal binary mask to compensate damaged spectrum. According to experiments at noisy environment (SNR 15 dB ~ 0 dB), the proposed methods showed better spectral results and recognition performance.
https://doi.org/10.7776/ASK.2010.29.7.468 인용 PDF KSCI

Development and Application of Water Balance Network Model in Agricultural Watershed (농업용수 유역 물수지 분석 모델 개발 및 적용)

Yoon, Dong-Hyun;Nam, Won-Ho;Koh, Bo-Sung;Kim, Kyung-Mo;Jo, Young-Jun;Park, Jin-Hyeon
- Journal of The Korean Society of Agricultural Engineers
- /
- v.66 no.3
- /
- pp.39-51
- /
- 2024
To effectively implement the integrated water management policy outlined in the National Water Management Act, it is essential to analyze agricultural water supply and demand at both basin and water district levels. Currently, agricultural water is primarily distributed through open canal systems and controlled by floodgates, yet the utilization-to-supply ratio remains at a mere 48%. In the case of agricultural water, when analyzing water balance through existing national basin water resource models (K-WEAP, K-MODISM), distortion of supply and regression occurs due to calculation of regression rate based on the concept of net water consumption. In addition, by simplifying the complex and diverse agricultural water supply system within the basin into a single virtual reservoir, it is difficult to analyze the surplus or shortage of agricultural water for each field within the basin. There are limitations in reflecting the characteristics and actual sites of rural water areas, such as inconsistencies with river and reservoir supply priority sites. This study focuses on the development of a model aimed at improving the deficiencies of current water balance analysis methods. The developed model aims to provide standardized water balance analysis nationwide, with initial application to the Anseo standard watershed. Utilizing data from 32 facilities within the standard watershed, the study conducted water balance analysis through watershed linkage, highlighting differences and improvements compared to existing methods.
https://doi.org/10.5389/KSAE.2024.66.3.039 인용 PDF HTML

Compensation Characteristics Depending on Extinction Ratio of RZ Pulse in Dispersion-managed Link Combined with MSSI (MSSI와 결합된 분산 제어 링크에서 RZ 펄스의 소광비에 따른 보상 특성)

Seong-Real Lee
- Journal of Advanced Navigation Technology
- /
- v.28 no.1
- /
- pp.123-128
- /
- 2024
When mid-span spectral inversion (MSSI), which inverts the propagated wave into phase-conjugated wave in the middle of the entire transmission distance, is combined with dispersion-managed link, it is very effective in compensating for the wavelength division multiplexed (WDM) signal distortion due to chromatic dispersion and nonlinear effects. In this MSSI combined dispersion-managed link, the shape of the dispersion map, channel data rate, channel wavelength and wavelength spacing, etc. affect the compensation and, consequently, determine the transmission distance and capacity of the WDM signal. In this paper, the compensation according to the extinction ratio of the return-to-zero (RZ) pulse that constitutes the WDM signal in the MSSI combined distributed control link was numerically analyzed. As a result of the simulation, it was conformed that the extinction ratio to obtain the best compensation should be determined depending on the shape of the dispersion map and the size of the residual dispersion per span, which determines the specific shape of the dispersion map. These results show a significant difference from the results in a general optical transmission system, where as the extinction ratio increases, the power difference between the '1' and '0' signals increases, thereby improving reception performance.
https://doi.org/10.12673/jant.2024.28.1.123 인용 PDF HTML

Research of Satellite Autonomous Navigation Using Star Sensor Algorithm (별 추적기 알고리즘을 활용한 위성 자율항법 연구)

Hyunseung Kim;Chul Hyun;Hojin Lee;Donggeon Kim
- Journal of Space Technology and Applications
- /
- v.4 no.3
- /
- pp.232-243
- /
- 2024
In order to perform various missions in space, including planetary exploration, estimating the position of a satellite in orbit is a very important factor because it is directly related to the success rate of mission performance. As a study for autonomous satellite navigation, this study estimated the satellite's attitude and real time orbital position using a star sensor algorithm with two star trackers and earth sensor. To implement the star sensor algorithm, a simulator was constructed and the position error of the satellite estimated through the technique presented in the paper was analyzed. Due to lens distortion and errors in the center point finding algorithm, the average attitude estimation error was at the level of 2.6 rad in the roll direction. And the position error was confirmed by attitude error, so average error in altitude direction was 516 m. It is expected that the proposed satellite attitude and position estimation technique will contribute to analyzing star sensor performance and improving position estimation accuracy.
https://doi.org/10.52912/jsta.2024.4.3.232 인용 PDF

A study on the Logical Reclassification of Parcel Service Tariffs (택배요금기준의 합리적 재설정에 관한 연구)

Cho, Yoon-Sung;Lee, Tae-Hwee
- Journal of Distribution Science
- /
- v.10 no.5
- /
- pp.45-55
- /
- 2012
In Korea, the parcel delivery service was launched officially in 1992, and the market has grown to 13.2 billion units, or 3.5 trillion won, as of 2011. The service companies accept small packages under 30 kg and deliver them on the next day in most domestic areas. This service plays an important role in business and personal activities. The parcel service companies have themselves designed the tariff for the delivery service based on two criteria: weight and the sum of three side lengths. Further, the tariff is graded in steps of three or four rate structures based on size (small, medium, large, and extra-small). However, the basic freight rate is generally decided according to the cargo's weight or measurement size, and an extra rate is added according to some factors (handling, stowability, liability, and so on). The parcel service tariff adopted by the companies is illogically designed, and this study was carried out to assess the need for redesigning the tariff structure. The cargo volume cannot be logically reflected by three side lengths. For example, two parcels measuring 160 cm based on three side lengths may have different volumes, one measuring 0.152 cbm (53.33 cm × 53.33 cm × 53.34 cm) and the other 0.05 cbm (100 cm × 50 cm × 10 cm). A small package of less than120 cm (sum of three side lengths) may have a volume of as much as 0.064 cbm (40 cm × 40 cm × 40 cm). Sample comparison showed that 17% of medium-size parcels (based on the sum of three side lengths) are small-volume packages, 24% of large-size parcels are small- or medium-volume packages, and 40% of extra-big-size parcels are big- or under-size packages. Therefore, if parcel service companies rate their services for volume cargo based on the three side lengths standard, users may have to pay higher than normal rates, particularly because a large percentage of parcels are volume cargo. According to this study, the average weight per 1 cbm is less than 300 kg. Therefore, users face an increasing risk of paying higher than logical freight charges. Generally, transportation companies are called "public interest enterprises," and parcel service companies operate as postal services. Public interest enterprises must provide the delivery service to all customers without discrimination at a reasonable service level and logical service charges. Therefore, parcels service tariffs must be designed and adopted logically. In this study, freight theories and prior research findings were used to consider the importance of freight rates, and distortion of parcel service rates based on the three side lengths system was verified through regression analysis of a parcel sample and sample comparison. In conclusion, volume sizes based on three side lengths have a higher correlation to the rate level than does the sum of three side lengths. Further, compared to the sum of three side lengths, volume size has a higher correlation to cargo weight, which is the most basic factor determining transportation cost. Therefore, the existing parcel service tariff should be changed to weight- and volume-based rates, and the tariff must be graded in steps of 8 to 10 higher rate structures for a logical freight schedule based on service cost.
PDF

Front-End Processing for Speech Recognition in the Telephone Network (전화망에서의 음성인식을 위한 전처리 연구)

Jun, Won-Suk;Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.4
- /
- pp.57-63
- /
- 1997
In this paper, we study the efficient feature vector extraction method and front-end processing to improve the performance of the speech recognition system using KT(Korea Telecommunication) database collected through various telephone channels. First of all, we compare the recognition performances of the feature vectors known to be robust to noise and environmental variation and verify the performance enhancement of the recognition system using weighted cepstral distance measure methods. The experiment result shows that the recognition rate is increasedby using both PLP(Perceptual Linear Prediction) and MFCC(Mel Frequency Cepstral Coefficient) in comparison with LPC cepstrum used in KT recognition system. In cepstral distance measure, the weighted cepstral distance measure functions such as RPS(Root Power Sums) and BPL(Band-Pass Lifter) help the recognition enhancement. The application of the spectral subtraction method decrease the recognition rate because of the effect of distortion. However, RASTA(RelAtive SpecTrAl) processing, CMS(Cepstral Mean Subtraction) and SBR(Signal Bias Removal) enhance the recognition performance. Especially, the CMS method is simple but shows high recognition enhancement. Finally, the performances of the modified methods for the real-time implementation of CMS are compared and the improved method is suggested to prevent the performance degradation.
PDF

Wavelet Video Coding Using Low-Band-Shift Method and Multiresolution Motion Estimation (저대역 이동법과 다해상도 움직임 추정을 이용한 웨이블릿 동영상 부호화)

박영덕;서석용;고형화
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.3
- /
- pp.17-24
- /
- 2004
In this paper, the wavelet video coding using Low-Band-Shift(LBS) method and multiresolution motion estimation(MRME) is proposed. To overcome shift- variant property on wavelet coefficients, the LBS was proposed. LBS method previously has superior performance in terms of rate-distortion characteristic. However, this method needs more memory and computational complexity. Therefore to reduce computational complexity of video coding using LBS, we combine MRME with LBS. When mm is applied only, it has 7 times as much as existing method's motion vector because each subband has different motion vector using property of LBS, number of motion vector decreases. Proposed method decreases motion vector, and it decreases motion compensated Prediction error by detailed motion estimation. And then it shows better coding performance. Also this method reduces computational amount by smaller search area in higher resolution. The computational complexity of the proposed method is 12.1% of that of existing method at 3-level wavelet transform. The experimental results with the proposed method show about 0.2∼9.7% improvement of MAD performance in case of lossless coding, and 0.1∼2.0㏈ improvement of PSNR performance at 4he same bit rate in case of lossy coding.
PDF KSCI

A l0b 150 MSample/s 1.8V 123 mW CMOS A/D Converter (l0b 150 MSample/s 1.8V 123 mW CMOS 파이프라인 A/D 변환기)

Kim Se-Won;Park Jong-Bum;Lee Seung-Hoon
- Journal of the Institute of Electronics Engineers of Korea SD
- /
- v.41 no.1
- /
- pp.53-60
- /
- 2004
This work describes a l0b 150 MSample/s CMOS pipelined A/D converter (ADC) based on advanced bootsuapping techniques for higher input bandwidth than a sampling rate. The proposed ADC adopts a typical multi-step pipelined architecture, employs the merged-capacitor switching technique which improves sampling rate and resolution reducing by $50\%$ the number of unit capacitors used in the multiplying digital-to-analog converter. On-chip current and voltage references for high-speed driving capability of R & C loads and on-chip decimator circuits for high-speed testability are implemented with on-chip decoupling capacitors. The proposed AU is fabricated in a 0.18 um 1P6M CMOS technology. The measured differential and integral nonlinearities are within $-0.56{\~}+0.69$ LSB and $-1.50{\~}+0.68$ LSB, respectively. The prototype ADC shows the signal-to-noise-and-distortion ratio (SNDR) of 52 dB at 150 MSample/s. The active chip area is 2.2 mm2 (= 1.4 mm ${\times}$ 1.6 mm) and the chip consumes 123 mW at 150 MSample/s.
PDF KSCI

A Fast Inter-layer Mode Decision Method inScalable Video Coding (공간적 스케일러블 비디오 부호화에서 계층간 모드 고속 결정 방법)

Lee, Bum-Shik;Hahm, Sang-Jin;Park, Chang-Seob;Park, Keun-Soo;Kim, Mun-Churl
- Journal of Broadcast Engineering
- /
- v.12 no.4
- /
- pp.360-372
- /
- 2007
We propose a fast inter-layer mode decision method by utilizing coding information of base layer upward its enhancement layer inscalable video coding (SVC), also called MPEG-4 part 10 Advanced Video Coding Amendment 3 or H.264 Scalable Extension (SE) which is being standardized. In this paper, when the motion vectors from the base layer have zero motion (0, 0) in inter-layer motion prediction or the Integer Transform coefficients of the residual between current MB and the motion compensated MB by the predicted motion vectors from the base layer are all zero, the block mode of the corresponding block to be encoded at the enhancement layer is determined to be the $16{\times}16$ mode. In addition, if the predicted mode of the MB to be encoded at the enhancement layer is not equal to the $16{\times}16$ mode, then the rate-distortion optimization is only performed on the reduced candidated modes which are same or smaller partitioned modes. Our proposed method exhibits the complexity reduction in encoding time up to 72%. Nevertheless, it shows negligible PSNR degradation and bit rate increase up to 0.25dB and 1.73%, respectively.
https://doi.org/10.5909/JBE.2007.12.4.360 인용 PDF KSCI

Block-Based Transform-Domain Measurement Coding for Compressive Sensing of Images (영상 압축센싱을 위한 블록기반 변환영역 측정 부호화)

Nguyen, Quang Hong;Nguyen, Viet Anh;Trinh, Chien Van;Dinh, Khanh Quoc;Park, Younghyeon;Jeon, Byeungwoo
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.39A no.12
- /
- pp.746-755
- /
- 2014
Compressive sensing (CS) has drawn much interest as a new sampling technique that enables signals to be sampled at a much lower than the Nyquist rate. By noting that the block-based compressive sensing can still keep spatial correlation in measurement domain, in this paper, we propose a novel encoding technique for measurement data obtained in the block-based CS of natural image. We apply discrete wavelet transform (DWT) to decorrelate CS measurements and then assign a proper quantization scheme to those DWT coefficients. Thus, redundancy of CS measurements and bitrate of system are reduced remarkably. Experimental results show improvements in rate-distortion performance by the proposed method against two existing methods of scalar quantization (SQ) and differential pulse-code modulation (DPCM). In the best case, the proposed method gains up to 4 dB, 0.9 dB, and 2.5 dB compared with the Block-based CS-Smoothed Projected Landweber plus SQ, Block-based CS-Smoothed Projected Landweber plus DPCM, and Multihypothesis Block-based CS-Smoothed Projected Landweber plus DPCM, respectively.
https://doi.org/10.7840/kics.2014.39A.12.746 인용 PDF KSCI

Search Result 819, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)