통합 검색 | Korea Science

Estimation of Automatic Video Captioning in Real Applications using Machine Learning Techniques and Convolutional Neural Network

Vaishnavi, J;Narmatha, V
- International Journal of Computer Science & Network Security
- /
- 제22권9호
- /
- pp.316-326
- /
- 2022
The prompt development in the field of video is the outbreak of online services which replaces the television media within a shorter period in gaining popularity. The online videos are encouraged more in use due to the captions displayed along with the scenes for better understandability. Not only entertainment media but other marketing companies and organizations are utilizing videos along with captions for their product promotions. The need for captions is enabled for its usage in many ways for hearing impaired and non-native people. Research is continued in an automatic display of the appropriate messages for the videos uploaded in shows, movies, educational videos, online classes, websites, etc. This paper focuses on two concerns namely the first part dealing with the machine learning method for preprocessing the videos into frames and resizing, the resized frames are classified into multiple actions after feature extraction. For the feature extraction statistical method, GLCM and Hu moments are used. The second part deals with the deep learning method where the CNN architecture is used to acquire the results. Finally both the results are compared to find the best accuracy where CNN proves to give top accuracy of 96.10% in classification.
https://doi.org/10.22937/IJCSNS.2022.22.9.42 인용 PDF KSCI

Statistical Speech Feature Selection for Emotion Recognition

Kwon Oh-Wook;Chan Kwokleung;Lee Te-Won
- The Journal of the Acoustical Society of Korea
- /
- 제24권4E호
- /
- pp.144-151
- /
- 2005
We evaluate the performance of emotion recognition via speech signals when a plain speaker talks to an entertainment robot. For each frame of a speech utterance, we extract the frame-based features: pitch, energy, formant, band energies, mel frequency cepstral coefficients (MFCCs), and velocity/acceleration of pitch and MFCCs. For discriminative classifiers, a fixed-length utterance-based feature vector is computed from the statistics of the frame-based features. Using a speaker-independent database, we evaluate the performance of two promising classifiers: support vector machine (SVM) and hidden Markov model (HMM). For angry/bored/happy/neutral/sad emotion classification, the SVM and HMM classifiers yield $42.3\%\;and\;40.8\%$ accuracy, respectively. We show that the accuracy is significant compared to the performance by foreign human listeners.
PDF KSCI

연속 항공영상에서의 Image Registration (Image Registration of Aerial Image Sequences)

강민석;김준식;박래홍;이쾌희
- 전자공학회논문지B
- /
- 제29B권4호
- /
- pp.48-57
- /
- 1992
This paper addresses the estimation of the shift vector from aerial image sequences. The conventional feature-based and area-based matching methods are simulated for determining the suitable image registration scheme. Computer simulations show that the feature-based matching schemes based on the co-occurrence matrix, autoregressive model, and edge information do not give a reliable matching for aerial image sequences which do not have a suitable statistical model or significant features. In area-based matching methods we try various similarity functions for a matching measure and discuss the factors determining the matching accuracy. To reduce the estimation error of the shift vector we propose the reference window selection scheme. We also discuss the performance of the proposed algorithm based on the simulation results.
PDF

Face Representation and Face Recognition using Optimized Local Ternary Patterns (OLTP)

Raja, G. Madasamy;Sadasivam, V.
- Journal of Electrical Engineering and Technology
- /
- 제12권1호
- /
- pp.402-410
- /
- 2017
For many years, researchers in face description area have been representing and recognizing faces based on different methods that include subspace discriminant analysis, statistical learning and non-statistics based approach etc. But still automatic face recognition remains an interesting but challenging problem. This paper presents a novel and efficient face image representation method based on Optimized Local Ternary Pattern (OLTP) texture features. The face image is divided into several regions from which the OLTP texture feature distributions are extracted and concatenated into a feature vector that can act as face descriptor. The recognition is performed using nearest neighbor classification method with Chi-square distance as a similarity measure. Extensive experimental results on Yale B, ORL and AR face databases show that OLTP consistently performs much better than other well recognized texture models for face recognition.
https://doi.org/10.5370/JEET.2017.12.1.402 인용 PDF KSCI

Korean Document Classification using Characteristics of Word Information

Kim, Seok-Ki;Han, Kyung-Soo;Ahn, Jeong-Yong
- Journal of the Korean Data and Information Science Society
- /
- 제14권2호
- /
- pp.167-175
- /
- 2003
In document classification, target of analysis is not document itself but words appeared in the document. Word information, therefore, is a significant factor in document classification. In this study, we are dealing with the classification of Korean document based on words and feature vectors. First, we present the performance of document classification using nouns and keywords. Second, we compare to the results for the size of feature vectors.
PDF

Fault Detection of Reciprocating Compressor for Small-Type Refrigerators Using ART-Kohonen Networks and Wavelet Analysis

Yang, Bo-Suk;Lee, Soo-Jong;Han, Tian
- Journal of Mechanical Science and Technology
- /
- 제20권12호
- /
- pp.2013-2024
- /
- 2006
This paper proposes a condition classification system using wavelet transform, feature evaluation and artificial neural networks to detect faulty products on the production line of reciprocating compressors for refrigerators. The stationary features of vibration signals are extracted from statistical cumulants of the discrete wavelet coefficients and root mean square values of band-pass frequencies. The neural networks are trained by the sample data, including healthy or faulty compressors. Based on training, the proposed system can be used on the automatic mass production line to classify product quality instead of people inspection. The validity of this system is demonstrated by the on-site test at LG Electronics, Inc. for reciprocating compressors. According to different products, this system after some modification may be useful to increase productivity in different types of production lines.
PDF KSCI

Refinement of Disparity Map using the Rule-based Fusion of Area and Feature-based Matching Results

Um, Gi-Mun;Ahn, Chung-Hyun;Kim, Kyung-Ok;Lee, Kwae-Hi
- 대한원격탐사학회:학술대회논문집
- /
- 대한원격탐사학회 1999년도 Proceedings of International Symposium on Remote Sensing
- /
- pp.304-309
- /
- 1999
In this paper, we presents a new disparity map refinement algorithm using statistical characteristics of disparity map and edge information. The proposed algorithm generate a refined disparity map using disparity maps which are obtained from area and feature-based Stereo Matching by selecting a disparity value of edge point based on the statistics of both disparity maps. Experimental results on aerial stereo image show the better results than conventional fusion algorithms in the disparity error. This algorithm can be applied to the reconstruction of building image from the high resolution remote sensing data.
PDF

사무소용빌딩의 전력소비특성을 고려한 특징파라메터 및 회귀분석을 통한 조명 및 일반동력부하의 수용률 분석 (Recommended Practice for Demand Factor by Feature Parameters and Regression Analysis depending on Power Consumption Characteristics in Office Buildings)

김세동;남기범;정형용;신화영;김성환
- 한국조명전기설비학회:학술대회논문집
- /
- 한국조명전기설비학회 2005년도 춘계학술대회논문집
- /
- pp.331-336
- /
- 2005
It is increased electrical energy consumption with the development of intelligence society in the office buildings and thus an energy conservation through efficient use of electricity became more important. This paper shows a reasonable design demand factor in office buildings, that was made by the systematic and statistical way considering actual conditions, such as investigated electric equipment capacity, peak power consumption, demand factor, etc., for 54 office buildings and 34 electrical design offices. In this dissertation, it is necessary to analyse the key features and general trend from the investigated data. It made an analysis of the feature parameters, such as average, standard deviation, median, maximum, minimun and thus it was carried linear and nonlinear regression analysis.
PDF

사무소용빌딩의 전력소비특성을 고려한 특징파라메터 및 회귀분석을 통한 수용률과 변압기최대이용률 비교 분석 (Recommended Practice for Demand Factor and Maximum Utilization Factor by Feature Parameters and Regression Analysis depending on Power Consumption Characteristics in Office Buildings)

김세동
- 한국조명전기설비학회:학술대회논문집
- /
- 한국조명전기설비학회 2007년도 춘계학술대회 논문집
- /
- pp.241-244
- /
- 2007
It is increased electrical energy consumption with the development of intelligence society in office buildings and thus an energy conservation through efficient use of electricity became more important. This paper shows a reasonable design demand factor in office buildings, that was made by the systematic and statistical way considering actual conditions, such as investigated electric equipment capacity, peak power consumption, demand factor, etc., for 132 office buildings. In this dissertation, it is necessary to analyze the key features and general trend from the investigated data. It made an analysis of the feature parameters, such as average, standard deviation, median, maximum, minimun and thus it was carried linear and nonlinear regression analysis.
PDF

음성의 감성요소 추출을 통한 감성 인식 시스템 (The Emotion Recognition System through The Extraction of Emotional Components from Speech)

박창현;심귀보
- 제어로봇시스템학회논문지
- /
- 제10권9호
- /
- pp.763-770
- /
- 2004
The important issue of emotion recognition from speech is a feature extracting and pattern classification. Features should involve essential information for classifying the emotions. Feature selection is needed to decompose the components of speech and analyze the relation between features and emotions. Specially, a pitch of speech components includes much information for emotion. Accordingly, this paper searches the relation of emotion to features such as the sound loudness, pitch, etc. and classifies the emotions by using the statistic of the collecting data. This paper deals with the method of recognizing emotion from the sound. The most important emotional component of sound is a tone. Also, the inference ability of a brain takes part in the emotion recognition. This paper finds empirically the emotional components from the speech and experiment on the emotion recognition. This paper also proposes the recognition method using these emotional components and the transition probability.
https://doi.org/10.5302/J.ICROS.2004.10.9.763 인용 PDF KSCI

검색결과 664건 처리시간 0.034초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)