• Title/Summary/Keyword: training signal

Creation of a Voice Recognition-Based English Aided Learning Platform

  • Hui Xu
    • Journal of Information Processing Systems / v.20 no.4 / pp.491-500 / 2024
  • To address the poor quality of speech input in online spoken-English teaching, the study builds an English teaching assistance model based on automatic speech recognition using the dynamic time warping (DTW) algorithm. To improve the algorithm's efficiency, the study modifies the time-domain features of the speech signal during pre-processing and reduces the algorithm's computational effort and storage requirements. A simulation experiment then evaluates the model in application. The findings show that the revised DTW model achieves recognition rates above 95% for all phonetic symbols and performs best on voiced consonants, with rates of 98.5%, 98.8%, and 98.7% across the three tests. The improved DTW model is also more efficient, requiring less time for training and testing. In the KS value analysis, its KS value of 0.63 is the highest among the models analyzed, and its curve is the lowest among the comparative models for both test functions. These results indicate that the improved DTW model offers superior speech recognition capability, which could significantly improve online English education and lead to better teaching outcomes.
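
As a rough illustration of the matching step named in this abstract, the sketch below implements textbook DTW on one-dimensional feature sequences and picks the closest stored template. It is not the paper's revised algorithm: the function names, the absolute-difference cost, and the template dictionary are assumptions made for the example.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two 1-D sequences (absolute-difference local cost)."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            # Allowed steps: advance in a only, in b only, or in both.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(sample, templates):
    """Return the label of the template with the smallest DTW cost (hypothetical helper)."""
    return min(templates, key=lambda label: dtw_distance(sample, templates[label]))
```

The full cost table uses O(n·m) time and memory, which is the kind of computational and storage cost the abstract says the revised model reduces.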

Noise2Atom: unsupervised denoising for scanning transmission electron microscopy images

  • Feng Wang;Trond R. Henninen;Debora Keller;Rolf Erni
    • Applied Microscopy / v.50 / pp.23.1-23.9 / 2020
  • We propose an effective deep learning model to denoise scanning transmission electron microscopy (STEM) image series, named Noise2Atom, to map images from a source domain 𝓢 to a target domain 𝓒, where 𝓢 is for our noisy experimental dataset, and 𝓒 is for the desired clear atomic images. Noise2Atom uses two external networks to apply additional constraints from the domain knowledge. This model requires no signal prior, no noise model estimation, and no paired training images. The only assumption is that the inputs are acquired with identical experimental configurations. To evaluate the restoration performance of our model, as it is impossible to obtain ground truth for our experimental dataset, we propose consecutive structural similarity (CSS) for image quality assessment, based on the fact that the structures remain much the same as the previous frame(s) within small scan intervals. We demonstrate the superiority of our model by providing evaluation in terms of CSS and visual quality on different experimental datasets.
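
One possible reading of a consecutive-frame similarity score is sketched below: the mean structural similarity between each frame and the next. The abstract does not give the CSS formula, so this is an assumption; it uses a single-window (global) SSIM on flattened frames, and the names `ssim_global` and `consecutive_similarity` are hypothetical.

```python
def ssim_global(x, y, data_range=1.0):
    """Single-window SSIM between two equally sized, flattened images."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Standard SSIM stabilizing constants.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov + c2)
    denominator = (mean_x ** 2 + mean_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


def consecutive_similarity(frames, data_range=1.0):
    """Mean SSIM over each pair of neighbouring frames in a series (assumed CSS reading)."""
    scores = [ssim_global(a, b, data_range) for a, b in zip(frames, frames[1:])]
    return sum(scores) / len(scores)
```

Under this reading, a series whose structure barely changes between frames scores near 1, and frame-to-frame noise lowers the score.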

Frequency Domain Pattern Recognition Method for Damage Detection of a Steel Bridge (강교량의 손상감지를 위한 주파수 영역 패턴인식 기법)

  • Lee, Jung Whee;Kim, Sung Kon;Chang, Sung Pil
    • Journal of Korean Society of Steel Construction / v.17 no.1 s.74 / pp.1-11 / 2005
  • A bi-level damage detection algorithm is presented that uses the dynamic responses of the structure as input and a neural network (NN) as the pattern classifier. A signal anomaly index (SAI) is proposed to express the amount of change in the shape of the frequency response function (FRF) or strain frequency response function (SFRF). The SAI is calculated from the acceleration and dynamic strain responses acquired in the intact and damaged states of the structure. In the bi-level damage identification algorithm, the presence of damage is first identified from the magnitude of the SAI value, and the location of the damage is then identified using the pattern recognition capability of the NN. The proposed algorithm is applied to an experimental model bridge to demonstrate its feasibility. Numerically simulated signals are used to train the NN, and experimentally acquired signals are used to test it. The results of this example application suggest that the SAI-based pattern recognition approach may be applied to the structural health monitoring system of a real bridge.

Development of a Clinical Decision Support System Utilizing Support Vector Machine (Support Vector Machine을 이용한 생체 신호 분류기 개발)

  • Hong, Dong-Kwon;Chai, Yong-Yoong
    • The Journal of the Korea institute of electronic communication sciences / v.13 no.3 / pp.661-668 / 2018
  • Biomedical signals based on skin resistance show different characteristics depending on the stress-related disease. Diagnostic devices for stress-related diseases have been developed using these characteristics, along with devices that make the signals measured by the skin resistance meter easy to analyze. Experts in the field inspect the output signal directly to judge the likelihood of a stress disorder. However, it is very difficult for a person to determine accurately whether a subject has a stress disorder by analyzing each subject's bio-signal, and such judgments are very likely to be wrong. To solve these problems, we implemented a function that identifies stress-disorder signals using machine learning. An SVM was chosen as the classifier in consideration of the low computing power of the measurement equipment. Training and test data were randomly generated for each of 13 diseases using an error range of 5. Simulation results showed a decision accuracy of more than 90%. In the future, once the measurement equipment is applied to actual patients, the classifier can be retrained with the newly generated data.

Design and Performance Analysis of the Efficient Equalization Method for OFDM system using QAM in multipath fading channel (다중경로 페이딩 채널에서 QAM을 사용하는 OFDM시스템의 효율적인 등화기법 설계 및 성능분석)

  • 남성식;백인기;조성호
    • The Journal of Korean Institute of Communications and Information Sciences / v.25 no.6B / pp.1082-1091 / 2000
  • In this paper, an efficient equalization method for OFDM (Orthogonal Frequency Division Multiplexing) systems using QAM (Quadrature Amplitude Modulation) in a multipath fading channel is proposed in order to equalize signals received over a real channel faster and more efficiently. In general, one-tap linear equalizers in the frequency domain have been used as the equalization method for OFDM systems. With this technique, if the channel characteristics change quickly, the one-tap linear equalizers cannot compensate for the distortion caused by time-variant multipath channels. Therefore, in this paper, we use one-tap non-linear equalizers instead of one-tap linear equalizers in the frequency domain, and we also use a linear equalizer in the time domain to compensate for the rapid performance degradation at low SNR (Signal-to-Noise Ratio) that is the disadvantage of the non-linear equalizer. In the frequency domain, when QAM signals, consisting of in-phase and quadrature (out-phase) components, are sent over the complex channel, only the in-phase and quadrature components of the signals distorted by multipath fading change in the same way as signals distorted by noise, so the cross components are canceled in the frequency-domain equalizer. A time-domain equalizer and an adaptive algorithm with a lower error probability and fast convergence speed are applied to compensate for the error caused by canceling the cross components in the frequency-domain equalizer. To compensate for the performance of the frequency-domain equalizer, the time-domain equalizer equalizes the distorted signals frame by frame, using a Gold code as the training sequence in the receiver after the Gold codes are inserted into the guard signal in the transmitter. With the proposed method, we can achieve faster and more efficient equalization with reduced computational complexity and improved performance.
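
For orientation, the conventional baseline this abstract refers to, a one-tap linear equalizer in the frequency domain, can be sketched as one complex division per subcarrier. This is not the proposed non-linear or time-domain equalizer. The function names are hypothetical, and estimating the channel from a known pilot symbol is an assumption for the example (the paper uses a Gold code as its training sequence).

```python
def estimate_channel(received_pilot, known_pilot):
    """Per-subcarrier channel estimate H[k] = Y_pilot[k] / X_pilot[k] (noise ignored)."""
    return [y / x for y, x in zip(received_pilot, known_pilot)]


def one_tap_equalize(received, channel):
    """Zero-forcing one-tap equalizer: divide each subcarrier by its channel gain."""
    return [y / h for y, h in zip(received, channel)]


# Illustrative values only: four QAM symbols and an arbitrary flat-per-subcarrier channel.
sent = [1 + 1j, -1 + 1j, -1 - 1j, 1 - 1j]
channel = [0.5 + 0.5j, 1 - 0.2j, 0.8j, -0.3 + 0.9j]
pilot = [1 + 0j] * 4

received_pilot = [p * h for p, h in zip(pilot, channel)]
received = [x * h for x, h in zip(sent, channel)]
recovered = one_tap_equalize(received, estimate_channel(received_pilot, pilot))
```

Because the estimate is taken once per pilot, a channel that changes between the pilot and the data symbols leaves residual distortion, which is the limitation the abstract describes.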

A Study on Performance Improvement Method for the Multi-Model Speech Recognition System in the DSR Environment (DSR 환경에서의 다 모델 음성 인식시스템의 성능 향상 방법에 관한 연구)

  • Jang, Hyun-Baek;Chung, Yong-Joo
    • Journal of the Institute of Convergence Signal Processing / v.11 no.2 / pp.137-142 / 2010
  • Although the multi-model speech recognizer has been shown to be quite successful in noisy speech recognition, the results were based on general speech front-ends that do not take noise adaptation techniques into account. In this paper, to evaluate the multi-model based speech recognizer accurately, we adopted AFE, a noise-robust speech front-end proposed by ETSI for the noisy DSR environment. For the performance comparison, we used MTR, which is known to give good results in the DSR environment. We also modified the structure of the multi-model based speech recognizer to improve recognition performance. The N reference HMMs most similar to the input noisy speech are used as the acoustic models for recognition, to cope with errors in the selection of the reference HMMs and with the variability of the noise signal. In addition, multiple SNR levels are used to train each reference HMM to improve the robustness of the acoustic models. In experiments on the Aurora 2 database, the modified multi-model based speech recognizer achieved better recognition rates than the previous method.

Wavelet-based Statistical Noise Detection and Emotion Classification Method for Improving Multimodal Emotion Recognition (멀티모달 감정인식률 향상을 위한 웨이블릿 기반의 통계적 잡음 검출 및 감정분류 방법 연구)

  • Yoon, Jun-Han;Kim, Jin-Heon
    • Journal of IKEEE / v.22 no.4 / pp.1140-1146 / 2018
  • Recently, methods that analyze complex bio-signals with deep learning models have emerged in research on recognizing human emotions. The accuracy of emotion classification can vary with the evaluation method and with the reliability of the training data. For biological signals, data reliability is determined by the noise ratio, so the noise detection method is equally important. An appropriate emotion evaluation method is also needed, matched to the methodology used to define emotions. In this paper, we propose a wavelet-based noise threshold setting algorithm for verifying the reliability of multimodal bio-signal data labeled with Valence and Arousal, and a method for improving the emotion recognition rate by weighting the evaluation data. After the wavelet components of the signal are extracted with the wavelet transform, the skewness and kurtosis of each component are computed, noise is detected with a threshold calculated by the Hampel identifier, and the training data are selected in consideration of the noise ratio of the original signal. In addition, when classifying emotional data, the overall evaluation of the emotion recognition rate is weighted by the Euclidean distance from the median value of the Valence-Arousal plane. To verify the proposed algorithm, we use the ASCERTAIN data set and observe the improvement in the emotion recognition rate.
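
The Hampel identifier mentioned in this abstract flags values that lie far from the median, measured in units of the scaled median absolute deviation (MAD). A minimal sketch follows, assuming it is applied to a plain list of per-segment statistics (for example, the kurtosis of each wavelet component). The cutoff `k=3.0` and the function name are assumptions, not values from the paper.

```python
from statistics import median


def hampel_outliers(values, k=3.0):
    """Return indices of values farther than k scaled MADs from the median."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    # 1.4826 makes the MAD a consistent estimate of the standard deviation
    # for normally distributed data.
    threshold = k * 1.4826 * mad
    return [i for i, v in enumerate(values) if abs(v - med) > threshold]
```

If more than half of the values are identical the MAD is zero, and every value that differs from the median is flagged; a real pipeline would probably need a fallback for that case.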

Development of Collaborative Robot Control Training Medium to Improve Worker Safety and Work Convenience Using Image Processing and Machine Learning-Based Hand Signal Recognition (작업자의 안전과 작업 편리성 향상을 위한 영상처리 및 기계학습 기반 수신호 인식 협동로봇 제어 교육 매체 개발)

  • Jin-heork Jung;Hun Jeong;Gyeong-geun Park;Gi-ju Lee;Hee-seok Park;Chae-hun An
    • Journal of Practical Engineering Education / v.14 no.3 / pp.543-553 / 2022
  • A collaborative robot (cobot) is one of the production systems introduced in the 4th industrial revolution; it can maximize efficiency by combining the fine manual skills of workers with the robot's capacity for simple repetitive tasks. Research on efficient interfaces between worker and robot continues to advance, along with solutions to the safety problems that arise from sharing a workspace. In this study, a method for controlling the robot by recognizing the worker's hand signals is presented to improve the worker's convenience and concentration, and worker safety is secured by introducing the concept of a safety zone. Technologies such as robot control, PLC, image processing, machine learning, and ROS were used for the implementation. The roles and interface methods of these technologies were also defined and presented for use as educational media. Students can build and adjust the educational media system by linking the various technologies. This helps them recognize why the technologies are needed in the field and encourages in-depth learning about them. In addition, presenting a problem and letting students find their own way to solve it can lead to self-directed learning. In this way, students can learn key technologies of the 4th industrial revolution and improve their ability to solve various problems.

Low-cost Prosthetic Hand Model using Machine Learning and 3D Printing (머신러닝과 3D 프린팅을 이용한 저비용 인공의수 모형)

  • Donguk Shin;Hojun Yeom;Sangsoo Park
    • The Journal of the Convergence on Culture Technology / v.10 no.1 / pp.19-23 / 2024
  • Patients with amputations of both hands need prosthetic hands that serve both cosmetic and functional purposes. Research on prosthetic hands driven by electromyography of the remaining muscles is active, but high cost remains a problem. In this study, a prosthetic hand was manufactured from low-cost parts and software, including a surface electromyography sensor, the machine learning software Edge Impulse, an Arduino Nano 33 BLE, and 3D printing, and its performance was evaluated. Signals acquired with the surface electromyography sensors were subjected to digital signal processing in Edge Impulse, a machine learning model was trained to determine the type of finger movement, and the flexion signal for each finger was transmitted to the fingers of the prosthetic hand model. With the digital signal processing set to a 60 Hz notch filter, a 10-300 Hz bandpass filter, and a sampling frequency of 1,000 Hz, the machine learning accuracy was highest, at 82.1%. Among the finger flexion movements, the ring finger was the most likely to be confused, with a 44.7% chance of being mistaken for the movement of the index finger. More research is needed to successfully develop a low-cost prosthetic hand.
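
To make the reported filter settings concrete, the sketch below builds a 60 Hz notch for a 1,000 Hz sampling rate as a second-order (biquad) section using the widely used audio-EQ cookbook formulas. It is not Edge Impulse's processing block: the quality factor `q=30.0`, the function names, and the pure-sine test signals are assumptions, and the 10-300 Hz bandpass stage is left out.

```python
import math


def notch_coefficients(f0, fs, q=30.0):
    """Biquad notch coefficients (b, a), normalized so that a[0] == 1."""
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    b = [1.0, -2.0 * math.cos(w0), 1.0]
    a = [1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha]
    return [c / a[0] for c in b], [c / a[0] for c in a]


def biquad_filter(b, a, samples):
    """Apply the biquad sample by sample (direct form I)."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x0 in samples:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out


def sine(freq, fs, n):
    """n samples of a unit-amplitude sine at freq Hz, sampled at fs Hz."""
    return [math.sin(2.0 * math.pi * freq * i / fs) for i in range(n)]


b, a = notch_coefficients(60.0, 1000.0)
hum_out = biquad_filter(b, a, sine(60.0, 1000.0, 3000))    # mains interference
emg_out = biquad_filter(b, a, sine(150.0, 1000.0, 3000))   # stand-in for in-band EMG
```

Once the start-up transient has decayed, the 60 Hz tone is suppressed while the 150 Hz tone passes almost unchanged. A higher `q` narrows the notch but lengthens the transient.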

Development of Five Finger type Myoelectric Hand Prosthesis for State Transition-Based Multi-Hand Gestures change (다중 손동작 변환을 위한 상태 전이 기반 5손가락 근전전동의수 개발)

  • Seung-Gi Kim;Sung-Yoon Jung;Beom-ki Hong;Hyun-Jun Shin;Kyoung-Ho Kim;Se-Hoon Park
    • Journal of the Institute of Convergence Signal Processing / v.25 no.2 / pp.67-76 / 2024
  • Various assistive devices have been developed for upper limb amputees over the years. Myoelectric prostheses in particular aim to improve user convenience by enabling a range of hand gestures beyond simple grasping, tailored to the size and shape of objects. In this study, we developed a five-finger myoelectric prosthesis that mimics human hand size and finger movements and uses motor and worm gear mechanisms for stable, independent operation. On this basis, we designed a control system for independent finger control through electromyographic signal input and proposed a state transition-based hand gesture conversion algorithm by selecting eight representative hand gestures and defining the conversion condition parameters. We introduced training and usability evaluation methods and conducted usability assessments with upper limb amputees using dedicated tools. The assessments confirmed the algorithm's potential for commercial application and showed adaptive capability and high performance over iterative evaluations.
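
The general shape of a state transition-based gesture switcher can be sketched as a lookup table keyed by the current gesture and a detected EMG event. The abstract does not list the eight gestures or the conversion conditions, so every gesture name, event name, and transition below is hypothetical and only a few states are shown.

```python
# Hypothetical transition table: (current gesture, EMG event) -> next gesture.
TRANSITIONS = {
    ("rest", "flex"): "power_grip",
    ("power_grip", "co_contraction"): "pinch",
    ("pinch", "co_contraction"): "point",
    ("point", "co_contraction"): "power_grip",
    ("power_grip", "extend"): "rest",
    ("pinch", "extend"): "rest",
    ("point", "extend"): "rest",
}


def next_gesture(current, event):
    """Return the next gesture; stay in the current one if no transition is defined."""
    return TRANSITIONS.get((current, event), current)


def run_gestures(events, start="rest"):
    """Replay a sequence of events and return the gesture after each one."""
    history = []
    state = start
    for event in events:
        state = next_gesture(state, event)
        history.append(state)
    return history
```

Keeping the table separate from the control loop means gestures can be added or transitions retuned without touching the control code.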