Search | Korea Science

Reducing latency of neural automatic piano transcription models (인공신경망 기반 저지연 피아노 채보 모델)

Dasol Lee;Dasaem Jeong
- The Journal of the Acoustical Society of Korea
- /
- v.42 no.2
- /
- pp.102-111
- /
- 2023
Automatic Music Transcription (AMT) is a task that detects and recognizes musical note events from a given audio recording. In this paper, we focus on reducing the latency of real-time AMT systems on piano music. Although neural AMT models have been adapted for real-time piano transcription, they suffer from high latency, which hinders their usefulness in interactive scenarios. To tackle this issue, we explore several techniques for reducing the intrinsic latency of a neural network for piano transcription, including reducing window and hop sizes of Fast Fourier Transformation (FFT), modifying convolutional layer's kernel size, and shifting the label in the time-axis to train the model to predict onset earlier. Our experiments demonstrate that combining these approaches can lower latency while maintaining high transcription accuracy. Specifically, our modified model achieved note F1 scores of 92.67 % and 90.51 % with latencies of 96 ms and 64 ms, respectively, compared to the baseline model's note F1 score of 93.43 % with a latency of 160 ms. This methodology has potential for training AMT models for various interactive scenarios, including providing real-time feedback for piano education.
https://doi.org/10.7776/ASK.2023.42.2.102 인용 PDF

Development of Real-time Closed-loop Neurostimulation System for Epileptic Seizure Suppression (뇌전증 경련 억제를 위한 실시간 폐루프 신경 자극 시스템 설계)

Kim, Sowon;Kim, Sunhee;Lee, Yena;Hwang, Seoyoung;Kang, Taekyeong;Jun, Sang Beom;Lee, Hyang Woon;Lee, Seungjun
- Journal of Biomedical Engineering Research
- /
- v.36 no.4
- /
- pp.95-102
- /
- 2015
Epilepsy is a chronic neurological disease which produces repeated seizures. Over 30% of epileptic patients cannot be treated with anti-epileptic drugs, and surgical resection may cause loss of brain functions. Seizure suppression by electrical stimulation is currently being investigated as a new treatment method as clinical evidence has shown that electrical stimulation to brain could suppress seizure activity. In this paper, design of a real-time closed-loop neurostimulation system for epileptic seizure suppression is presented. The system records neural signals, detects seizures and delivers electrical stimulation. The system consists of a 6-channel electrode, front-end amplifiers, a data acquisition board by National Instruments, and a neurostimulator and Generic Osorio-Frei algorithm was applied for seizure detection. The algorithm was verified through simulation using electroencephalogram data, and the operation of whole system was verified through simulation and in- vivo test.
https://doi.org/10.9718/JBER.2015.36.4.95 인용 PDF KSCI

Post-ischemic Time-dependent Activity Changes of Hippocampal CA1 cells of the Mongolian Gerbils

Won, Moo-Ho;Shin, Hyung-Cheul
- The Korean Journal of Physiology and Pharmacology
- /
- v.11 no.6
- /
- pp.247-251
- /
- 2007
Changes of single unit activity of CA1 hippocampus region were investigated in anesthetized Mongolian gerbils for six days following transient ischemia. Ischemia was produced immediately before the implantation of micro-wire recording electrodes. In control animals receiving pseudo-ischemic surgery, neither spontaneous neuronal activities ($5.70{\pm}0.4Hz$) nor the number of recorded neurons per animal changed significantly for six days. Correlative firings among simultaneously recorded neurons were weak (correlation coefficient > 0.6) in the control animals. Animals subjected to ischemia exhibited a significant elevation of neural firing at post-ischemic 12 hr ($9.95{\pm}0.9Hz$) and day 1 ($8.48{\pm}0.8Hz$), but a significant depression of activity at post-ischemic day 6 ($1.84{\pm}0.3Hz$) when compared to the activities of non-ischemic control animal. Ischemia significantly (correlation coefficient > 0.6) increased correlative firings among simultaneously recorded neurons, which were prominent especially during post-ischemic days 1, 2 and 6. Although the numbers of spontaneously active neurons recorded from control group varied within normal range during the experimental period, those from ischemic group changed in post-ischemic time-dependent manner. Temporal changes of the number of cells recorded per animal between control group and ischemic group were also significantly different (p = 0.0084, t = 3.271, df = 10). Cresyl violet staining indicated significant loss of CA1 cells at post-ischemic day 7. Overall, we showed post-ischemic time-dependent, differential changes of three characteristics, including spontaneous activity, network relationship and excitability of CA1 cells, suggesting sustained neural functions. Thus, histological observation of CA1 cell death till post-ischemic day 7 may not represent actual neuronal death.
PDF KSCI

An EEG-based Deep Neural Network Classification Model for Recognizing Emotion of Users in Early Phase of Design (초기설계 단계 사용자의 감정 인식을 위한 뇌파기반 딥러닝 분류모델)

Chang, Sun-Woo;Dong, Won-Hyeok;Jun, Han-Jong
- Journal of the Architectural Institute of Korea Planning & Design
- /
- v.34 no.12
- /
- pp.85-94
- /
- 2018
The purpose of this paper was to propose a model that recognizes potential users' emotional response toward design by classifying Electroencephalography(EEG). Studies in neuroscience and psychology have made an effort to recognize subjects' emotional response by analyzing EEG data. And this approach has been adopted in design since it is critical to monitor users' subjective response in the preface of design. Moreover, the building design process cannot be reversed after construction, recognizing clients' affection toward design alternatives plays important role. An experiment was conducted to record subjects' EEG data while they view their most/least liked images of small-house designs selected by them among the eight given images. After the recording, a subjective questionnaire, PANAS, was distributed to the subjects in order to describe their own affection score in quantitative way. Google TensorFlow was used to build and train the model. Dataset for model training and testing consist of feature columns for recorded EEG data and labels for the questionnaire results. After training and testing, the measured accuracy of the model was 0.975 which was higher than the other machine learning based classification methods. The proposed model may suggest one quantitative way of evaluating design alternatives. In addition, this method may support designer while designing the facilities for people like disabled or children who are not able to express their own feelings toward alternatives.
https://doi.org/10.5659/JAIK_PD.2018.34.12.85 인용 KSCI

Application of Machine Learning on Voice Signals to Classify Body Mass Index - Based on Korean Adults in the Korean Medicine Data Center (머신러닝 기반 음성분석을 통한 체질량지수 분류 예측 - 한국 성인을 중심으로)

Kim, Junho;Park, Ki-Hyun;Kim, Ho-Seok;Lee, Siwoo;Kim, Sang-Hyuk
- Journal of Sasang Constitutional Medicine
- /
- v.33 no.4
- /
- pp.1-9
- /
- 2021
Objectives The purpose of this study was to check whether the classification of the individual's Body Mass Index (BMI) could be predicted by analyzing the voice data constructed at the Korean medicine data center (KDC) using machine learning. Methods In this study, we proposed a convolutional neural network (CNN)-based BMI classification model. The subjects of this study were Korean adults who had completed voice recording and BMI measurement in 2006-2015 among the data established at the Korean Medicine Data Center. Among them, 2,825 data were used for training to build the model, and 566 data were used to assess the performance of the model. As an input feature of CNN, Mel-frequency cepstral coefficient (MFCC) extracted from vowel utterances was used. A model was constructed to predict a total of four groups according to gender and BMI criteria: overweight male, normal male, overweight female, and normal female. Results & Conclusions Performance evaluation was conducted using F1-score and Accuracy. As a result of the prediction for four groups, The average accuracy was 0.6016, and the average F1-score was 0.5922. Although it showed good performance in gender discrimination, it is judged that performance improvement through follow-up studies is necessary for distinguishing BMI within gender. As research on deep learning is active, performance improvement is expected through future research.
https://doi.org/10.7730/JSCM.2021.33.4.1 인용 PDF KSCI

The Edge Computing System for the Detection of Water Usage Activities with Sound Classification (음향 기반 물 사용 활동 감지용 엣지 컴퓨팅 시스템)

Seung-Ho Hyun;Youngjoon Chee
- Journal of Biomedical Engineering Research
- /
- v.44 no.2
- /
- pp.147-156
- /
- 2023
Efforts to employ smart home sensors to monitor the indoor activities of elderly single residents have been made to assess the feasibility of a safe and healthy lifestyle. However, the bathroom remains an area of blind spot. In this study, we have developed and evaluated a new edge computer device that can automatically detect water usage activities in the bathroom and record the activity log on a cloud server. Three kinds of sound as flushing, showering, and washing using wash basin generated during water usage were recorded and cut into 1-second scenes. These sound clips were then converted into a 2-dimensional image using MEL-spectrogram. Sound data augmentation techniques were adopted to obtain better learning effect from smaller number of data sets. These techniques, some of which are applied in time domain and others in frequency domain, increased the number of training data set by 30 times. A deep learning model, called CRNN, combining Convolutional Neural Network and Recurrent Neural Network was employed. The edge device was implemented using Raspberry Pi 4 and was equipped with a condenser microphone and amplifier to run the pre-trained model in real-time. The detected activities were recorded as text-based activity logs on a Firebase server. Performance was evaluated in two bathrooms for the three water usage activities, resulting in an accuracy of 96.1% and 88.2%, and F1 Score of 96.1% and 87.8%, respectively. Most of the classification errors were observed in the water sound from washing. In conclusion, this system demonstrates the potential for use in recording the activities as a lifelog of elderly single residents to a cloud server over the long-term.
https://doi.org/10.9718/JBER.2023.44.2.147 인용 PDF

Research on Developing a Conversational AI Callbot Solution for Medical Counselling

Won Ro LEE;Jeong Hyon CHOI;Min Soo KANG
- Korean Journal of Artificial Intelligence
- /
- v.11 no.4
- /
- pp.9-13
- /
- 2023
In this study, we explored the potential of integrating interactive AI callbot technology into the medical consultation domain as part of a broader service development initiative. Aimed at enhancing patient satisfaction, the AI callbot was designed to efficiently address queries from hospitals' primary users, especially the elderly and those using phone services. By incorporating an AI-driven callbot into the hospital's customer service center, routine tasks such as appointment modifications and cancellations were efficiently managed by the AI Callbot Agent. On the other hand, tasks requiring more detailed attention or specialization were addressed by Human Agents, ensuring a balanced and collaborative approach. The deep learning model for voice recognition for this study was based on the Transformer model and fine-tuned to fit the medical field using a pre-trained model. Existing recording files were converted into learning data to perform SSL(self-supervised learning) Model was implemented. The ANN (Artificial neural network) neural network model was used to analyze voice signals and interpret them as text, and after actual application, the intent was enriched through reinforcement learning to continuously improve accuracy. In the case of TTS(Text To Speech), the Transformer model was applied to Text Analysis, Acoustic model, and Vocoder, and Google's Natural Language API was applied to recognize intent. As the research progresses, there are challenges to solve, such as interconnection issues between various EMR providers, problems with doctor's time slots, problems with two or more hospital appointments, and problems with patient use. However, there are specialized problems that are easy to make reservations. Implementation of the callbot service in hospitals appears to be applicable immediately.
https://doi.org/10.24225/kjai.2023.11.4.9 인용 PDF

Research on development of electroencephalography Measurement and Processing system (뇌전도 측정 및 처리 시스템 개발에 관한 연구)

Doo-hyun Lee;Yu-jun Oh;Jin-hee Hong;Jun-su chae;Young-gyu Choi
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.17 no.1
- /
- pp.38-46
- /
- 2024
In general, EEG signal analysis has been the subject of several studies due to its ability to provide an objective mode of recording brain stimulation, which is widely used in brain-computer interface research with applications in medical diagnosis and rehabilitation engineering. In this study, we developed EEG reception hardware to measure electroencephalograms and implemented a processing system, classifying it into server and data processing. It was conducted as an intermediate-stage research on the implementation of a brain-computer interface using electroencephalograms, and was implemented in the form of predicting the user's arm movements according to measured electroencephalogram data. Electroencephalogram measurements were performed using input from four electrodes through an analog-to-digital converter. After sending this to the server through a communication process, we designed and implemented a system flow in which the server classifies the electroencephalogram input using a convolutional neural network model and displays the results on the user terminal.
https://doi.org/10.17661/jkiiect.2024.17.1.38 인용 PDF HTML

Ginsenoside Rb$_1$ Reduces Spontaneous Bursting Activity in Thalamocortical Slices of the Rat

Yang, Sung-Chil;Lee, Sang-Hun;Park, Jin-Kyu;Jung, Min-Whan;Lee, Chang-Joong
- Journal of Ginseng Research
- /
- v.24 no.3
- /
- pp.134-137
- /
- 2000
Spontaneous bursting activity was studied in rat thalamocortical slices using extracellular field potential recording to test the potential utilization of ginsenoside Rb$_1$ in controlling overactivated neural systems. In order to induce bursting activity, slices were perfused with Mg$\^$2+/-free artificial cerebrospinal fluid (ACSF). Two major types of spontaneous bursting activity, simple thalamocortical burst complexes (sTBCs) and complex thalamocortical burst complexes (cTBCs), were recorded in Mg$\^$2+/ -free ACSF. Ginsenoside Rb$_1$ selectively suppressed cTBCs. Duration and occurrence rate of cTBCs were reduced by 87.3${\pm}$10.2% and 85.3${\pm}$ 14.7% in the presence of 90 ${\mu}$M ginsenoside Rb$_1$ respectively, while amplitude and intraburst frequency were slightly changed by ginsenoside Rb$_1$. In contrast, ginsenoside Rb$_1$was much less effective in reducing duration and occurrence rate of sTBCs. We also tested effects of ginsenoside Rb$_1$ on bursting activity in the presence of a GABA$\sub$A/ receptor antagonist, bicuculline methiodide (BMI). Ginsenoside Rb$_1$ had no effect in suppressing BMI-induced bursting activities. These results suggest that ginsenoside Rbi may be useful in controlling seizure-like bursting activity under pathological conditions.
PDF

Invader Detection System Using the Morphological Filtering and Difference Images Based on the Max-Valued Edge Detection Algorithm

Lee, Jae-Hyun;Kim, Sung-Shin;Kim, Jung-Min
- Journal of Advanced Marine Engineering and Technology
- /
- v.36 no.5
- /
- pp.645-661
- /
- 2012
Recently, pirates are infesting on the sea and they have been hijacking the several vessels for example Samho Dream and Samho Jewelry of Korea. One of the items to reduce the risk is to adopt the invader detection system. If the pirates break in to the ship, the detection system can monitor the pirates and then call the security alarm. The crew can gain time to hide to the safe room and the report can be automatically sent to the control room to cope with the situation. For the invader detection, an unmanned observation system was proposed using the image detection algorithm that extracts the invader image from the recording image. To detect the motion area, the difference value was calculated between the current image and the prior image of the invader, and the 'AND' operator was used in calculated image and edge line. The image noise was reduced based on the morphology operation and then the image was transformed into morphological information. Finally, a neural network model was applied to recognize the invader. In the experimental results, it was confirmed that the proposed approach can improve the performance of the recognition in the invader monitoring system.
https://doi.org/10.5916/jkosme.2012.36.5.645 인용 PDF KSCI

Search Result 81, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)