Search | Korea Science

A Novel Text Sample Selection Model for Scene Text Detection via Bootstrap Learning

Kong, Jun;Sun, Jinhua;Jiang, Min;Hou, Jian
- KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.2 / pp.771-789 / 2019
Text detection has been a popular research topic in the field of computer vision. It is difficult for prevalent text detection algorithms to avoid dependence on datasets. To overcome this problem, we propose a novel unsupervised text detection algorithm inspired by bootstrap learning. First, a text candidate in a novel superpixel form is proposed to improve the text recall rate through image segmentation. Second, we propose a unique text sample selection model (TSSM) to extract text samples from the current image and eliminate database dependency. Specifically, to improve the precision of the samples, we combine maximally stable extremal regions (MSERs) and the saliency map to generate sample reference maps with a double threshold scheme. Finally, a multiple kernel boosting method is developed to generate a strong text classifier by combining multiple single-kernel SVMs based on the samples selected by TSSM. Experimental results on standard datasets demonstrate that our text detection method is robust to complex backgrounds and multilingual text and shows stable performance across different standard datasets.
https://doi.org/10.3837/tiis.2019.02.016

The FE-SM/SONN for Recognition of the Car Skid Mark (자동차 스키드마크 인식을 위한 FE-SM/SONN)

Koo, Gun-Seo
- Journal of the Korea Society of Computer and Information / v.17 no.1 / pp.125-132 / 2012
In this paper, we propose FE-SM/SONN for recognizing blurred and smeared skid mark images caused by the sudden braking of a vehicle. In blurred and smeared skid marks, the tread pattern image is ambiguous. To improve recognition of such images, FE-SM/SONN reads skid marks using fuzzy logic and a SONN (Self Organization Neural Networks) recognizer that distinguishes tread patterns. To substantiate this, 48 tire models and 144 skid marks were compared, and the overall recognition ratio was 89%. This study showed a 13.51% improvement in recognition over the existing back propagation recognizer and an 8.78% improvement over FE-MCBP. The expected effect of this research is the recognition of ambiguous images by extracting distinguishing features, and the findings indicate that even when the tread pattern image is in grey scale, fuzzy logic makes the tread pattern recognizable.
https://doi.org/10.9708/jksci.2012.17.1.125

A Method to Enhance the Recognition Rate of Marker Images in Augmented Reality (증강현실 마커 이미지의 인식률 개선 방안)

Park, Chan;Lee, Wan-Bok
- Journal of Convergence for Information Technology / v.12 no.1 / pp.1-6 / 2022
As augmented reality technology becomes more common and prevalent, marker-based AR contents are applied in various ways. However, AR contents are still hardly utilized due to the low recognition rate of marker images. In order to increase the recognition rate of AR marker images, this paper experimentally analyzed how much the recognition rate of markers could be improved when image correction and design changes were applied. The experimental results show that image correction could significantly improve the number of image characteristics and the recognition grade if the image was modified so that its saturation value was increased. Moreover, the recognition rate improved even further when a regular pattern design was added to the original marker image. In conclusion, it was possible to make the marker well recognized through proper correction of the image and an additional pattern design step in the process of producing the marker image.
https://doi.org/10.22156/CS4SMB.2022.12.01.001

Hand Shape Recognition with Disparity Pattern of Multiple Model Images (복수 모델영상의 상위도 패턴을 이용한 손형상 인식)

이칠우
- Journal of Korea Multimedia Society / v.2 no.4 / pp.400-408 / 1999
This paper describes a method for constructing the "disparity pattern", which is the basis of image matching using the brightness difference, called disparity, between multiple model images, and an algorithm that recognizes hand shape by using the pattern to measure the distance between an input image and the model images. The virtue of the algorithm is that only a simple brightness difference, calculated from multiple images by treating a whole image as the fundamental processing unit, is patterned in a two-dimensional shape and then used in the recognition process. Consequently, this method is very useful for other recognition algorithms that require comparison of large-scale images, since the correlation among multiple model images is applied simultaneously in the recognition process.

Robust Facial Expression Recognition Based on Local Directional Pattern

Jabid, Taskeed;Kabir, Md. Hasanul;Chae, Oksam
- ETRI Journal / v.32 no.5 / pp.784-794 / 2010
Automatic facial expression recognition has many potential applications in different areas of human computer interaction. However, these applications are not yet fully realized due to the lack of an effective facial feature descriptor. In this paper, we present a new appearance-based feature descriptor, the local directional pattern (LDP), to represent facial geometry and analyze its performance in expression recognition. An LDP feature is obtained by computing the edge response values in 8 directions at each pixel and encoding them into an 8-bit binary number using the relative strength of these edge responses. The LDP descriptor, a distribution of LDP codes within an image or image patch, is used to describe each expression image. The effectiveness of dimensionality reduction techniques, such as principal component analysis and AdaBoost, is also analyzed in terms of computational cost saving and classification accuracy. Two well-known machine learning methods, template matching and support vector machine, are used for classification using the Cohn-Kanade and Japanese female facial expression databases. Better classification accuracy shows the superiority of the LDP descriptor over other appearance-based feature descriptors.
https://doi.org/10.4218/etrij.10.1510.0132
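
The encoding step described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which edge operator is used or how many directions are marked, so the Kirsch compass masks, the use of absolute responses, and k = 3 are assumptions, and the names `ldp_code` and `ldp_histogram` are hypothetical.

```python
# Kirsch compass masks for 8 directions (assumed; the abstract only says
# "edge response values in 8 directions").
KIRSCH = [
    [[-3, -3, 5], [-3, 0, 5], [-3, -3, 5]],    # east
    [[-3, 5, 5], [-3, 0, 5], [-3, -3, -3]],    # north-east
    [[5, 5, 5], [-3, 0, -3], [-3, -3, -3]],    # north
    [[5, 5, -3], [5, 0, -3], [-3, -3, -3]],    # north-west
    [[5, -3, -3], [5, 0, -3], [5, -3, -3]],    # west
    [[-3, -3, -3], [5, 0, -3], [5, 5, -3]],    # south-west
    [[-3, -3, -3], [-3, 0, -3], [5, 5, 5]],    # south
    [[-3, -3, -3], [-3, 0, 5], [-3, 5, 5]],    # south-east
]


def ldp_code(patch, k=3):
    """Return an 8-bit code for the centre pixel of a 3x3 patch.

    Bit i is set when direction i is among the k strongest absolute
    edge responses (k = 3 is an assumption).
    """
    responses = [
        abs(sum(mask[r][c] * patch[r][c] for r in range(3) for c in range(3)))
        for mask in KIRSCH
    ]
    # Rank directions by response strength; ties go to the lower index.
    top = sorted(range(8), key=lambda i: (-responses[i], i))[:k]
    return sum(1 << i for i in top)


def ldp_histogram(image, k=3):
    """Histogram of codes over all interior pixels of a 2D list of intensities."""
    hist = [0] * 256
    for y in range(1, len(image) - 1):
        for x in range(1, len(image[0]) - 1):
            patch = [row[x - 1:x + 2] for row in image[y - 1:y + 2]]
            hist[ldp_code(patch, k)] += 1
    return hist
```

Because exactly k bits are set in every code, only a subset of the 256 histogram bins can ever be non-zero under these assumptions.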

An Improved LBP-based Facial Expression Recognition through Optimization of Block Weights (블록가중치의 최적화를 통해 개선된 LBP기반의 표정인식)

Park, Seong-Chun;Koo, Ja-Young
- Journal of the Korea Society of Computer and Information / v.14 no.11 / pp.73-79 / 2009
In this paper, a method is proposed that enhances the performance of facial expression recognition using template matching of Local Binary Pattern (LBP) histograms. In this method, the face image is segmented into blocks, and an LBP histogram is constructed for each block and used as its feature. Block dissimilarity is calculated between a block of the input image and the corresponding block of the model image. Image dissimilarity is defined as the weighted sum of the block dissimilarities. In conventional methods, the block weights are assigned by intuition. In this paper, a new method is proposed that optimizes the weights from training samples. An experiment shows that the recognition rate is enhanced by the proposed method.
https://doi.org/10.9708/jksci.2009.14.11.073
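
The weighted block comparison described in the abstract above can be sketched roughly as follows. The abstract does not name the block dissimilarity measure or the weight optimization procedure, so the chi-square distance used here is an assumption, the weights are simply passed in rather than learned, and all function names are hypothetical.

```python
def lbp_code(img, y, x):
    """Basic 8-neighbour LBP code: bit i is set when neighbour i >= centre."""
    c = img[y][x]
    neighbours = [
        img[y - 1][x - 1], img[y - 1][x], img[y - 1][x + 1], img[y][x + 1],
        img[y + 1][x + 1], img[y + 1][x], img[y + 1][x - 1], img[y][x - 1],
    ]
    return sum(1 << i for i, n in enumerate(neighbours) if n >= c)


def block_histograms(img, block):
    """Split the image into block x block tiles and build one LBP histogram per tile.

    Border pixels are skipped because they lack a full 3x3 neighbourhood.
    """
    h, w = len(img), len(img[0])
    hists = []
    for by in range(0, h, block):
        for bx in range(0, w, block):
            hist = [0] * 256
            for y in range(max(by, 1), min(by + block, h - 1)):
                for x in range(max(bx, 1), min(bx + block, w - 1)):
                    hist[lbp_code(img, y, x)] += 1
            hists.append(hist)
    return hists


def chi_square(p, q):
    """Chi-square distance between two histograms (assumed block dissimilarity)."""
    return sum((a - b) ** 2 / (a + b) for a, b in zip(p, q) if a + b > 0)


def image_dissimilarity(img_a, img_b, weights, block):
    """Weighted sum of per-block dissimilarities between two equal-sized images."""
    hists_a = block_histograms(img_a, block)
    hists_b = block_histograms(img_b, block)
    return sum(w * chi_square(p, q) for w, p, q in zip(weights, hists_a, hists_b))
```

With uniform weights this reduces to an unweighted block comparison; a weight of zero removes a block's influence on the match entirely.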

Fast Face Gender Recognition by Using Local Ternary Pattern and Extreme Learning Machine

Yang, Jucheng;Jiao, Yanbin;Xiong, Naixue;Park, DongSun
- KSII Transactions on Internet and Information Systems (TIIS) / v.7 no.7 / pp.1705-1720 / 2013
Human face gender recognition requires fast image processing with high accuracy. Existing face gender recognition methods that use traditional local features and machine learning methods suffer from low accuracy or slow speed. In this paper, a new framework for fast face gender recognition is proposed, based on Local Ternary Pattern (LTP) and Extreme Learning Machine (ELM). LTP is a generalization of Local Binary Pattern (LBP) that is robust in the presence of monotonic illumination variations on a face image and has high discriminative power for texture classification. It is also more discriminative and less sensitive to noise in uniform regions. On the other hand, ELM is a new learning algorithm for generalizing single hidden layer feed forward networks without tuning parameters. The main advantages of ELM are less stringent optimization constraints, faster operation, easy implementation, and usually improved generalization performance. The experimental results on public databases show that, in comparison with existing algorithms, the proposed method has higher precision and better generalization performance at extremely fast learning speed.
https://doi.org/10.3837/tiis.2013.07.011
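
As a rough illustration of why LTP is less sensitive to noise in uniform regions than LBP, the sketch below computes the commonly used form of the LTP code for one pixel: neighbours within a tolerance t of the centre are coded 0, and the ternary pattern is split into an "upper" and a "lower" binary code. The abstract does not give these details, so the tolerance value, the neighbour ordering, and the function name `ltp_codes` are assumptions.

```python
def ltp_codes(patch, t=5):
    """Return (upper, lower) 8-bit codes for the centre pixel of a 3x3 patch.

    A neighbour n is coded +1 if n >= centre + t, -1 if n <= centre - t,
    and 0 otherwise. The +1 positions form the upper code and the -1
    positions form the lower code. The tolerance t = 5 is an assumed value.
    """
    c = patch[1][1]
    # Neighbours in clockwise order starting at the top-left corner.
    neighbours = [
        patch[0][0], patch[0][1], patch[0][2], patch[1][2],
        patch[2][2], patch[2][1], patch[2][0], patch[1][0],
    ]
    upper = lower = 0
    for i, n in enumerate(neighbours):
        if n >= c + t:
            upper |= 1 << i
        elif n <= c - t:
            lower |= 1 << i
    return upper, lower
```

Small intensity fluctuations around the centre value fall inside the tolerance band and produce the all-zero pattern, whereas plain LBP would flip bits on such noise.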

Development of Non-Contacting Automatic Inspection Technology of Precise Parts (정밀부품의 비접촉 자동검사기술 개발)

Lee, Woo-Sung;Han, Sung-Hyun
- Transactions of the Korean Society of Machine Tool Engineers / v.16 no.6 / pp.110-116 / 2007
This paper presents a new technique for real-time recognition of the shapes and model numbers of parts based on an active vision approach. The main focus of this paper is to apply a 3D object recognition technique, based on pattern recognition, to the non-contacting inspection of the shape and external form of precision parts. In the field of computer vision, there have been many kinds of object recognition approaches. Most of these approaches focus on recognition using a given input image (passive vision). It is, however, hard to recognize an object among model objects that have similar aspects to each other. Recently, active vision has come to be regarded as one of the promising approaches to realizing a robust object recognition system. The performance is illustrated by experiments on several parts and models.

The Research of Shape Recognition Algorithm for Image Processing of Cucumber Harvest Robot (오이수확로봇의 영상처리를 위한 형상인식 알고리즘에 관한 연구)

Min, Byeong-Ro;Lim, Ki-Taek;Lee, Dae-Weon
- Journal of Bio-Environment Control / v.20 no.2 / pp.63-71 / 2011
Pattern recognition of cucumbers was conducted by directly detecting binary images using a thresholding method, with the threshold level set at the optimum intensity value. By restricting the conditions of the learning patterns, output patterns could be extracted from the same and similar input patterns by the algorithm. The pattern recognition algorithm was developed to determine the position of the cucumber from a real image under working conditions. The algorithm, designed and developed for this project, learned two, three, or four learning patterns, and each learning pattern was applied to twenty sample patterns. The success rate of restoring the output pattern to the sample pattern from two, three, or four learning patterns was 65.0%, 45.0%, and 12.5%, respectively. The more learning patterns there were, the more distinct output patterns were detected upon convergence. Feature patterns of the cucumber were detected by auto scanning real images of 30 by 30 pixels. Cucumber recognition took 0.5 to 1 second of computing time. In tests on five real images, the elimination rate of false patterns relative to the learning patterns ranged from 96 to 98%. Some output patterns were recognized as a cucumber by the algorithm under these conditions; the rate of false recognition ranged from 0.1 to 4.2%.

A Study on Iris Image Restoration Based on Focus Value of Iris Image (홍채 영상 초점 값에 기반한 홍채 영상 복원 연구)

Kang Byung-Jun;Park Kang-Ryoung
- Journal of the Institute of Electronics Engineers of Korea SP / v.43 no.2 s.308 / pp.30-39 / 2006
Iris recognition identifies a user based on the unique texture patterns of the iris, which dilates or contracts the pupil region. Iris recognition systems extract the iris pattern from an iris image captured by an iris recognition camera. Therefore, the performance of iris recognition is affected by the quality of the iris image that contains the iris pattern. If the iris image is blurred, the iris pattern is distorted, which increases the FRR (False Rejection Rate). Optical defocusing is the main cause of blurred iris images. Conventional iris recognition cameras use two kinds of focusing methods: fixed focusing and auto-focusing. With the fixed focusing method, users must repeatedly align their eyes within the DOF (Depth of Field) until the iris recognition system acquires a well-focused iris image, which is very inconvenient for users. With the auto-focusing method, the iris recognition camera moves the focus lens using an auto-focusing algorithm to capture the best-focused image. However, this requires additional hardware, such as a sensor that measures the distance between the user and the camera lens and a motor that moves the focus lens. The size and cost of the iris recognition camera therefore increase, and this kind of camera cannot be used in small mobile devices. To overcome these problems, we propose a method that increases the DOF using an iris image restoration algorithm based on the focus value of the iris image. When we tested the proposed algorithm with the BM-ET100 made by Panasonic, the operating range increased from 48-53 cm to 46-56 cm.
