Search | Korea Science

An Extended Generative Feature Learning Algorithm for Image Recognition

Wang, Bin;Li, Chuanjiang;Zhang, Qian;Huang, Jifeng
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.11 no.8
- /
- pp.3984-4005
- /
- 2017
Image recognition has become an increasingly important topic for its wide application. It is highly challenging when facing to large-scale database with large variance. The recognition systems rely on a key component, i.e. the low-level feature or the learned mid-level feature. The recognition performance can be potentially improved if the data distribution information is exploited using a more sophisticated way, which usually a function over hidden variable, model parameter and observed data. These methods are called generative score space. In this paper, we propose a discriminative extension for the existing generative score space methods, which exploits class label when deriving score functions for image recognition task. Specifically, we first extend the regular generative models to class conditional models over both observed variable and class label. Then, we derive the mid-level feature mapping from the extended models. At last, the derived feature mapping is embedded into a discriminative classifier for image recognition. The advantages of our proposed approach are two folds. First, the resulted methods take simple and intuitive forms which are weighted versions of existing methods, benefitting from the Bayesian inference of class label. Second, the probabilistic generative modeling allows us to exploit hidden information and is well adapt to data distribution. To validate the effectiveness of the proposed method, we cooperate our discriminative extension with three generative models for image recognition task. The experimental results validate the effectiveness of our proposed approach.
https://doi.org/10.3837/tiis.2017.08.013 인용 PDF KSCI

Development of Recognition and Reaction Time Prediction Model in Road Signs using Negative Binomial Regression (음이항회귀식을 이용한 도로표지의 인지반응시간 추정모형 개발)

Park, Hyung-Jin;Lee, Ki-Young;Kim, Jung-Young
- Journal of the Ergonomics Society of Korea
- /
- v.25 no.4
- /
- pp.23-33
- /
- 2006
The purpose of this study is to determine the economical standard of road signs by verifying the difference of driver's recognition and reaction time according to the space rate of letters on the road signs. For this reason, indoor simulations was conducted to confirm difference of recognition and reaction time on six sign-targets having different space rate. Also, a negative binomial regression model was used to find the main factors which could lower the rate of misreading. For this model, increasing of legibility of sign is not only simple enlargement of sign, but also suitable match of letters and sign. The result of this study is capable of verifying the importance of the space rate in road signs, and being utilized as a effective method to determine the standard of the road signs.
https://doi.org/10.5143/JESK.2006.25.4.023 인용 PDF KSCI

A Study on the Facility Utilization and the Residents¡？Cognition of Public Open Spaces in Apartment Housing (아파트 옥외공유공간의 이용실태에 관한 조사연구)

최상호;석호태
- Journal of the Korean housing association
- /
- v.13 no.3
- /
- pp.93-101
- /
- 2002
The goal of this survey is to propose planning and design informations for the public open spaces in apartment housing, through the observation and analysis of the current situations. For this, the planning information of housing suppliers about public open spaces and the spatial utilization of users were compared and by analyzing facility utilization and resident\`s recognition. This study is also intended to guide the future directions of the research for the improvement of public open spaces. The research follows three phases; \circled1 To understand the conditions of public open spaces in apartment housing sites through survey and analysis of catalogues and references. \circled2 To study on facility utilization and resident's recognition by observation and analysis. \circled3 To propose planning guidelines for the improvement of public open space by recognition differences of facilities.
PDF KSCI

Information Processing in Primate Retinal Ganglion

Je, Sung-Kwan;Cho, Jae-Hyun;Kim, Gwang-Baek
- Journal of information and communication convergence engineering
- /
- v.2 no.2
- /
- pp.132-137
- /
- 2004
Most of the current computer vision theories are based on hypotheses that are difficult to apply to the real world, and they simply imitate a coarse form of the human visual system. As a result, they have not been showing satisfying results. In the human visual system, there is a mechanism that processes information due to memory degradation with time and limited storage space. Starting from research on the human visual system, this study analyzes a mechanism that processes input information when information is transferred from the retina to ganglion cells. In this study, a model for the characteristics of ganglion cells in the retina is proposed after considering the structure of the retina and the efficiency of storage space. The MNIST database of handwritten letters is used as data for this research, and ART2 and SOM as recognizers. The results of this study show that the proposed recognition model is not much different from the general recognition model in terms of recognition rate, but the efficiency of storage space can be improved by constructing a mechanism that processes input information.
PDF KSCI

Discriminative Training of Stochastic Segment Model Based on HMM Segmentation for Continuous Speech Recognition

Chung, Yong-Joo;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.4E
- /
- pp.21-27
- /
- 1996
In this paper, we propose a discriminative training algorithm for the stochastic segment model (SSM) in continuous speech recognition. As the SSM is usually trained by maximum likelihood estimation (MLE), a discriminative training algorithm is required to improve the recognition performance. Since the SSM does not assume the conditional independence of observation sequence as is done in hidden Markov models (HMMs), the search space for decoding an unknown input utterance is increased considerably. To reduce the computational complexity and starch space amount in an iterative training algorithm for discriminative SSMs, a hybrid architecture of SSMs and HMMs is programming using HMMs. Given the segment boundaries, the parameters of the SSM are discriminatively trained by the minimum error classification criterion based on a generalized probabilistic descent (GPD) method. With the discriminative training of the SSM, the word error rate is reduced by 17% compared with the MLE-trained SSM in speaker-independent continuous speech recognition.
PDF

A Novel Multiple Kernel Sparse Representation based Classification for Face Recognition

Zheng, Hao;Ye, Qiaolin;Jin, Zhong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.4
- /
- pp.1463-1480
- /
- 2014
It is well known that sparse code is effective for feature extraction of face recognition, especially sparse mode can be learned in the kernel space, and obtain better performance. Some recent algorithms made use of single kernel in the sparse mode, but this didn't make full use of the kernel information. The key issue is how to select the suitable kernel weights, and combine the selected kernels. In this paper, we propose a novel multiple kernel sparse representation based classification for face recognition (MKSRC), which performs sparse code and dictionary learning in the multiple kernel space. Initially, several possible kernels are combined and the sparse coefficient is computed, then the kernel weights can be obtained by the sparse coefficient. Finally convergence makes the kernel weights optimal. The experiments results show that our algorithm outperforms other state-of-the-art algorithms and demonstrate the promising performance of the proposed algorithms.
https://doi.org/10.3837/tiis.2014.04.017 인용 PDF KSCI KPUBS HTML

Multiple Human Recognition for Networked Camera based Interactive Control in IoT Space

Jin, Taeseok
- Journal of the Korean Society of Industry Convergence
- /
- v.22 no.1
- /
- pp.39-45
- /
- 2019
We propose an active color model based method for tracking motions of multiple human using a networked multiple-camera system in IoT space as a human-robot coexistent system. An IoT space is a space where many intelligent devices, such as computers and sensors(color CCD cameras for example), are distributed. Human beings can be a part of IoT space as well. One of the main goals of IoT space is to assist humans and to do different services for them. In order to be capable of doing that, IoT space must be able to do different human related tasks. One of them is to identify and track multiple objects seamlessly. In the environment where many camera modules are distributed on network, it is important to identify object in order to track it, because different cameras may be needed as object moves throughout the space and IoT space should determine the appropriate one. This paper describes appearance based unknown object tracking with the distributed vision system in IoT space. First, we discuss how object color information is obtained and how the color appearance based model is constructed from this data. Then, we discuss the global color model based on the local color information. The process of learning within global model and the experimental results are also presented.
https://doi.org/10.21289/KSIC.2019.22.1.039 인용 PDF KSCI HTML

Color Pattern Recognition and Tracking for Multi-Object Tracking in Artificial Intelligence Space (인공지능 공간상의 다중객체 구분을 위한 컬러 패턴 인식과 추적)

Tae-Seok Jin
- Journal of the Korean Society of Industry Convergence
- /
- v.27 no.2_2
- /
- pp.319-324
- /
- 2024
In this paper, the Artificial Intelligence Space(AI-Space) for human-robot interface is presented, which can enable human-computer interfacing, networked camera conferencing, industrial monitoring, service and training applications. We present a method for representing, tracking, and objects(human, robot, chair) following by fusing distributed multiple vision systems in AI-Space. The article presents the integration of color distributions into particle filtering. Particle filters provide a robust tracking framework under ambiguous conditions. We propose to track the moving objects(human, robot, chair) by generating hypotheses not in the image plane but on the top-view reconstruction of the scene.
https://doi.org/10.21289/KSIC.2024.27.2.319 인용 PDF HTML

Korean Digit Recognition Under Noise Environment Using Spectral Mapping Training (스펙트럼사상학습을 이용한 잡음환경에서의 한국어숫자음인식)

Lee, Ki-Young
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.3
- /
- pp.25-32
- /
- 1994
This paper presents the Korean digit recognition method under noise environment using the spectral mapping training based on static supervised adaptation algorithm. In the presented recognition method, as a result of spectral mapping from one space of noisy speech spectrum to another space of speech spectrum without noise, spectral distortion of noisy speech is improved, and the recognition rate is higher than that of the conventional method using VQ (vector quatization) and DTW(dynamic time warping) without noise processing, and even when SNR level is 0dB, the recognition rate is 10 times of that using the conventional method. It has been confirmed that the spectral mapping training has an ability to improve the recognition performance for speech in noise environment.
PDF

Gesture Recognition by Analyzing a Trajetory on Spatio-Temporal Space (시공간상의 궤적 분석에 의한 제스쳐 인식)

민병우;윤호섭;소정;에지마 도시야끼
- Journal of KIISE:Software and Applications
- /
- v.26 no.1
- /
- pp.157-157
- /
- 1999
Researches on the gesture recognition have become a very interesting topic in the computer vision area, Gesture recognition from visual images has a number of potential applicationssuch as HCI (Human Computer Interaction), VR(Virtual Reality), machine vision. To overcome thetechnical barriers in visual processing, conventional approaches have employed cumbersome devicessuch as datagloves or color marked gloves. In this research, we capture gesture images without usingexternal devices and generate a gesture trajectery composed of point-tokens. The trajectory Is spottedusing phase-based velocity constraints and recognized using the discrete left-right HMM. Inputvectors to the HMM are obtained by using the LBG clustering algorithm on a polar-coordinate spacewhere point-tokens on the Cartesian space .are converted. A gesture vocabulary is composed oftwenty-two dynamic hand gestures for editing drawing elements. In our experiment, one hundred dataper gesture are collected from twenty persons, Fifty data are used for training and another fifty datafor recognition experiment. The recognition result shows about 95% recognition rate and also thepossibility that these results can be applied to several potential systems operated by gestures. Thedeveloped system is running in real time for editing basic graphic primitives in the hardwareenvironments of a Pentium-pro (200 MHz), a Matrox Meteor graphic board and a CCD camera, anda Window95 and Visual C++ software environment.

Search Result 1,170, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)