Search | Korea Science

RoutingConvNet: A Light-weight Speech Emotion Recognition Model Based on Bidirectional MFCC (RoutingConvNet: 양방향 MFCC 기반 경량 음성감정인식 모델)

Hyun Taek Lim;Soo Hyung Kim;Guee Sang Lee;Hyung Jeong Yang
- Smart Media Journal
- /
- v.12 no.5
- /
- pp.28-35
- /
- 2023
In this study, we propose a new light-weight model RoutingConvNet with fewer parameters to improve the applicability and practicality of speech emotion recognition. To reduce the number of learnable parameters, the proposed model connects bidirectional MFCCs on a channel-by-channel basis to learn long-term emotion dependence and extract contextual features. A light-weight deep CNN is constructed for low-level feature extraction, and self-attention is used to obtain information about channel and spatial signals in speech signals. In addition, we apply dynamic routing to improve the accuracy and construct a model that is robust to feature variations. The proposed model shows parameter reduction and accuracy improvement in the overall experiments of speech emotion datasets (EMO-DB, RAVDESS, and IEMOCAP), achieving 87.86%, 83.44%, and 66.06% accuracy respectively with about 156,000 parameters. In this study, we proposed a metric to calculate the trade-off between the number of parameters and accuracy for performance evaluation against light-weight.
https://doi.org/10.30693/SMJ.2023.12.5.28 인용 PDF

A Study on parameter choice system design for EEG classifications (EEG 분류를 위한 매개변수 선택형 시스템 설계에 관한 연구)

Cho, Hee-Jun;Shin, Dong-Kyoo;Shin, Dong-Il
- Proceedings of the Korean Information Science Society Conference
- /
- 2012.06a
- /
- pp.334-336
- /
- 2012
EEG 신호에 대한 연구는 의학, 신경과학, 심리학, 컴퓨터과학, 전자 공학 등 여러 학문 분야에서 많은 연구가 진행되고 있다. EEG 신호는 추출하는데 있어서 필연적으로 각종 Artifact와 분석대상이 아닌 신호가 혼재되어 분석 결과의 부정확성을 가지고 있어 EEG 신호의 활용이 주목받은지 오래되었지만 충분히 활용되지 못하고 있다. 이 문제를 해결하기 위해 각종 필터링 연산 등을 통하여 잡음을 제거하고 혼재된 신호를 분류해 내고 있지만, 잡음제거나 신호분류에 사용되는 방법이 고정된 수식을 이용하는 방법이기 때문에 유연한 측정 및 분류를 할 수 없는 것이 현실이다. 본 논문에서 제안하는 매개변수 선택형 시스템은 정제되지 않은 EEG 신호에서 잡파를 제거하고 정제된 신호에서 분석에 필요한 특징을 추출하는데 있어 사용자에게 착용된 EEG 신호 측정기기에서 전극채널, 신호발생영역 및 주파수 대역 등의 매개변수를 선택하고 필요에 따라 매개변수에 가중치를 부여함으로써, 측정의 정확성을 높이고 EEG 신호의 활용에 신뢰도를 향상 시킬 수 있다.

Feature Extraction of Shape of Image Objects in Content-based Image Retrieval (내용기반으로한 이미지 검색에서 이미지 객체들의 외형특징추출)

Cho, June-Suh
- The KIPS Transactions:PartB
- /
- v.10B no.7
- /
- pp.823-828
- /
- 2003
The main objective of this paper is to provide a methodology of feature extraction using shape of image objects for content-based image retrieval. The shape of most real-life objects is irregular, and hence there is no universal approach to quantify the shape of an arbitrary object. In particular. electronic catalogs contain many image objects for their products. In this paper, we perform feature extraction based on individual objects in images rather than on the whole image itself, since our method uses a shape-based approach of objects using RLC lines within an image. Experiments show that shape parameters distinctly represented image objects and provided better classification and discrimination among image objects in an image database compared to Texture.
https://doi.org/10.3745/KIPSTB.2003.10B.7.823 인용 PDF KSCI

Real-Time Automatic Tracking of Facial Feature (얼굴 특징 실시간 자동 추적)

박호식;배철수
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.8 no.6
- /
- pp.1182-1187
- /
- 2004
Robust, real-time, fully automatic tracking of facial features is required for many computer vision and graphics applications. In this paper, we describe a fully automatic system that tracks eyes and eyebrows in real time. The pupils are tracked using the red eye effect by an infrared sensitive camera equipped with infrared LEDs. Templates are used to parameterize the facial features. For each new frame, the pupil coordinates are used to extract cropped images of eyes and eyebrows. The template parameters are recovered by PCA analysis on these extracted images using a PCA basis, which was constructed during the training phase with some example images. The system runs at 30 fps and requires no manual initialization or calibration. The system is shown to work well on sequences with considerable head motions and occlusions.
PDF KSCI

Large Language Models-based Feature Extraction for Short-Term Load Forecasting (거대언어모델 기반 특징 추출을 이용한 단기 전력 수요량 예측 기법)

Jaeseung Lee;Jehyeok Rew
- Journal of Korea Society of Industrial Information Systems
- /
- v.29 no.3
- /
- pp.51-65
- /
- 2024
Accurate electrical load forecasting is important to the effective operation of power systems in smart grids. With the recent development in machine learning, artificial intelligence-based models for predicting power demand are being actively researched. However, since existing models get input variables as numerical features, the accuracy of the forecasting model may decrease because they do not reflect the semantic relationship between these features. In this paper, we propose a scheme for short-term load forecasting by using features extracted through the large language models for input data. We firstly convert input variables into a sentence-like prompt format. Then, we use the large language model with frozen weights to derive the embedding vectors that represent the features of the prompt. These vectors are used to train the forecasting model. Experimental results show that the proposed scheme outperformed models based on numerical data, and by visualizing the attention weights in the large language models on the prompts, we identified the information that significantly influences predictions.
https://doi.org/10.9723/jksiis.2024.29.3.051 인용 PDF

Facial Expression Recognition with Instance-based Learning Based on Regional-Variation Characteristics Using Models-based Feature Extraction (모델기반 특징추출을 이용한 지역변화 특성에 따른 개체기반 표정인식)

Park, Mi-Ae;Ko, Jae-Pil
- Journal of Korea Multimedia Society
- /
- v.9 no.11
- /
- pp.1465-1473
- /
- 2006
In this paper, we present an approach for facial expression recognition using Active Shape Models(ASM) and a state-based model in image sequences. Given an image frame, we use ASM to obtain the shape parameter vector of the model while we locate facial feature points. Then, we can obtain the shape parameter vector set for all the frames of an image sequence. This vector set is converted into a state vector which is one of the three states by the state-based model. In the classification step, we use the k-NN with the proposed similarity measure that is motivated on the observation that the variation-regions of an expression sequence are different from those of other expression sequences. In the experiment with the public database KCFD, we demonstrate that the proposed measure slightly outperforms the binary measure in which the recognition performance of the k-NN with the proposed measure and the existing binary measure show 89.1% and 86.2% respectively when k is 1.
PDF

Metric Reconstruction for Augmented Reality (증강현실을 위한 매트릭 복원)

Yu, Jeong-Jae;Kim, Hye-Mi;Park, Chang-Jun;Kim, Hong-Seok;Lee, In-Ho
- 한국HCI학회:학술대회논문집
- /
- 2007.02a
- /
- pp.649-652
- /
- 2007
이 논문에서는 영화, CF 같은 영상물 제작 시 CG/실사 합성을 위해 배경기하정보를 추출하는 알고리즘을 제안한다. Metric Reconstruction 은 카메라 자동 보정을 통해 이루어지며 이는 오랫동안 연구되어 온 분야이다. 접근방법은 영상의 특징점 추적 정보와 카메라 내부변수 가정으로부터 유도되는 자기 보정 방식과 공간상에서 미리 기하 정보를 알고 있는 보정틀을 사용하는 방식으로 크게 분류될 수 있다. CG/실사 합성의 작업 효율성을 위해서는 배경 영상에 보정틀이 보이지 않는 것이 좋은데 자연 특징점(Natural Feature)에만 의존하는 자기 보정 방식의 경우 2K 급 영상에서 CG 객체를 합성했을 때 떨림이 느껴지지 않을 만큼 정확한 결과를 얻기 힘들다. 이 논문에서는 Polleyfeys[2]가 제안하였던 영상 시퀀스를 입력으로 하는 자기 보정 시스템을 바탕으로 마야 작업 환경에서의 핀홀 카메라 모델에 맞도록 카메라 내부변수의 비선형 최적화를 수행하는 방법과 사용자 개입을 통한 카메라 변수 정확도 향상방법을 제안한다.
PDF

Performance Improvement of Polynomial Adaline Using Principal Component Analysis (주요성분분석을 이용한 Polynomial Adaline의 성능개선)

Cho, Yong-Hyun;Park, Yong-Soo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.04a
- /
- pp.313-316
- /
- 2001
본 논문에서는 입력변수들의 차원을 감소시켜 polynomial adaline의 성능을 개선하는 방법을 제안하였다. 제안된 방법에서는 적응적 학습알고리즘의 주요성분분석 기법을 이용하여 입력변수의 특징을 추출하고 이를 polynomial adaline의 학습데이터로 이용하였다. 이는 주요성분분석 기법이 가지는 대용량의 입력 데이터를 통계적으로 독립인 특징들의 집합으로 변환시키는 속성을 살려 입력데이터의 차원을 감소시킴으로서 고차원의 데이터에 따른 polynomial adaline이 가지는 제약을 해결하기 위함이다. 제안된 기법의 polynomial adaline을 5 개의 입력변수를 가진 패턴분류 문제에 적용하여 시뮬레이션한 결과, 기존의 다차원 polynomial adaline보다 더욱 우수한 분류성능이 있음을 확인할 수 있었다. 그리고 커널함수의 평활요소 설정 면에서도 우수한 특성이 있음을 확인할 수 있었다.
PDF

Analysis of EEG Signal Evoked by Auditory Stimulation (청각 자극에 의해 유발되는 뇌파신호의 분석)

Lee, Dong-Han;Kim, Jae-Wook;Lee, Chong-Ho
- Proceedings of the KIEE Conference
- /
- 2000.07d
- /
- pp.3227-3229
- /
- 2000
본 논문은 청각 자극이 제시되었을 때 변화되는 뇌파로부터 의미 있는 특징을 찾아내서 정량화 할 수 있는 변수 추출 및 분류 기법을 제시한다. 건강한 피실험자로부터 방향성 있는 청각 자극을 인가했을 때의 뇌파를 검출, 분류하였다. 뇌파의 변수 추출 방법으로는 짧은 시간영역에서의 신호의 갑작스런 변화량도 정량적으로 분석할 수 있는 Mallat's A1gorithm을 이용한 웨이블릿 변환(wavelet transform)을 적용하였고, 분류 방법으로는 그 결과로 나온 웨이블릿 계수를 변수로 하여 Neural Network을 학습하여 사용하였다. 향후 피실험자의 훈련을 통해서 청각 자극이 없이 순수한 생각만으로 방향을 검출할 수 있는 뇌파분석기를 만든다면 생각만으로도 물체의 방향을 제어할 수 있을 것이다.
PDF

A Generalized Method for Extracting Characters and Video Captions (일반화된 문자 및 비디오 자막 영역 추출 방법)

Chun, Byung-Tae;Bae, Young-Lae;Kim, Tai-Yun
- Journal of KIISE:Software and Applications
- /
- v.27 no.6
- /
- pp.632-641
- /
- 2000
Conventional character extraction methods extract character regions using methods such as color reduction, region split and merge and texture analysis from the whole image. Because these methods use many heuristic variables and thresholding values derived from a priori knowledge, it is difficult to generalize them algorithmically. In this paper, we propose a method that can extract character regions using a topographical feature extraction method and a point-line-region extension method. The proposed method can also solve the problems of conventional methods by reducing heuristic variables and generalizing thresholding values. We see that character regions can be extracted by generalized variables and thresolding values without using a priori knowledge of character region. Experimental results show that the candidate region extraction rate is 100%, and the character region extraction rate is over 98%.
PDF

Search Result 175, Processing Time 0.039 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)