• Title/Summary/Keyword: projection histogram

Optical Character Recognition for Hindi Language Using a Neural-network Approach

  • Yadav, Divakar;Sanchez-Cuadrado, Sonia;Morato, Jorge
    • Journal of Information Processing Systems / v.9 no.1 / pp.117-140 / 2013
  • Hindi is the most widely spoken language in India, with more than 300 million speakers. Because Hindi text, unlike English text, has no separation between characters, Optical Character Recognition (OCR) systems developed for Hindi have very poor recognition rates. In this paper we propose an OCR system for printed Hindi text in Devanagari script that uses an Artificial Neural Network (ANN) to improve recognition efficiency. One of the major reasons for the poor recognition rate is error in character segmentation. Touching characters in scanned documents further complicate segmentation and are a major obstacle to designing an effective character segmentation technique. A general OCR system follows four major steps: preprocessing, character segmentation, feature extraction, and finally classification and recognition. The preprocessing tasks considered in the paper are conversion of grayscale images to binary images, image rectification, and segmentation of the document's textual content into paragraphs, lines, words, and finally basic symbols. The basic symbols, obtained as the fundamental unit of the segmentation process, are recognized by the neural classifier. Three feature extraction techniques are used to improve the recognition rate: histogram of projection based on mean distance, histogram of projection based on pixel value, and vertical zero crossing. These techniques are powerful enough to extract features even from distorted characters or symbols. The neural classifier is a back-propagation neural network with two hidden layers, trained and tested on printed Hindi text. It achieves a correct recognition rate of approximately 90%.
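
Two of the features named in this abstract can be illustrated on a tiny binary glyph. The sketch below is not the paper's implementation: it assumes that the "histogram of projection based on pixel value" is a count of foreground pixels per row and per column, and that a "vertical zero crossing" is a 0/1 transition met while scanning a column from top to bottom. The function names are hypothetical.

```python
def projection_histograms(glyph):
    """Count foreground (1) pixels in every row and every column."""
    row_counts = [sum(row) for row in glyph]
    col_counts = [sum(col) for col in zip(*glyph)]
    return row_counts, col_counts


def vertical_zero_crossings(glyph):
    """Count 0->1 and 1->0 transitions in each column, top to bottom."""
    return [
        sum(1 for a, b in zip(col, col[1:]) if a != b)
        for col in zip(*glyph)
    ]


# A 4x4 toy glyph (1 = ink, 0 = background), assumed for illustration only.
glyph = [
    [0, 1, 1, 0],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [1, 0, 0, 1],
]

rows, cols = projection_histograms(glyph)
crossings = vertical_zero_crossings(glyph)
print(rows, cols, crossings)
```

Concatenating such vectors would give one possible input to a neural classifier, although the abstract does not say how the features are combined.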

Web-based Moving Object Tracking by Controlling Pan-Tilt Camera using Motion Detection (움직임 검출의 캠 제어에 의한 웹기반 이동 객체 추적)

  • 박천주;박희정;이재협;전병민
    • The Journal of the Korea Contents Association / v.2 no.2 / pp.17-26 / 2002
  • In this paper, we propose a method for acquiring video centered on a moving object by automatically panning and tilting a camera according to motion vectors computed from the object's motion in the video stream. We create a difference image by estimating the intensity difference at the grid points of neighboring frames. We then detect motion using both the horizontal and the vertical projection histograms of this image and determine the center of the moving region. Next, we compute the new direction and magnitude of the motion by comparing the coordinates of the current motion center with those of the previous one. By controlling the RCM with these motion vectors, we obtain a video stream in which the object stays at the center of the frame. In experiments, we continuously obtained a video stream centered on the moving object and monitored it remotely through a web-based server/client architecture.
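
The pipeline described here (difference image, projection histograms, motion center, motion vector) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' code: it compares every pixel rather than sampled grid points, takes the center as the histogram-weighted mean position, and uses a hypothetical threshold of 20 intensity levels.

```python
def difference_image(prev, curr, threshold=20):
    """Mark pixels whose intensity changed by more than `threshold`."""
    return [
        [1 if abs(c - p) > threshold else 0 for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev, curr)
    ]


def motion_center(diff):
    """Locate the motion center from the two projection histograms."""
    horizontal = [sum(row) for row in diff]        # one bin per row
    vertical = [sum(col) for col in zip(*diff)]    # one bin per column
    total = sum(horizontal)
    if total == 0:
        return None                                # no motion detected
    cy = sum(i * v for i, v in enumerate(horizontal)) / total
    cx = sum(j * v for j, v in enumerate(vertical)) / total
    return cx, cy


def motion_vector(prev_center, curr_center):
    """Displacement of the motion center between two frames."""
    return (curr_center[0] - prev_center[0], curr_center[1] - prev_center[1])
```

The resulting vector would then be mapped to pan and tilt commands, a step the abstract does not detail and this sketch leaves out.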

Skin Pigmentation Detection Using Projection Transformed Block Coefficient (투영 변환 블록 계수를 이용한 피부 색소 침착 검출)

  • Liu, Yang;Lee, Suk-Hwan;Kwon, Seong-Geun;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society / v.16 no.9 / pp.1044-1056 / 2013
  • This paper presents an approach for detecting and measuring human skin pigmentation. In the proposed scheme, we extract the skin area with a skin color model based on GMM-EM clustering, estimated from a statistical analysis of training images, and remove small noise through morphological processing. The skin area is decomposed into hemoglobin and melanin components by an independent component analysis (ICA) algorithm. We then calculate the intensities of hemoglobin and melanin using the projection transformed block coefficient and determine whether skin pigmentation is present from the global and local distributions of the two intensities. Furthermore, we measure the area and density of the detected skin pigmentation. Experimental results verify that our scheme can both detect skin pigmentation and measure its quantity, and that it takes less time because of the location histogram.

Multiple Pedestrians Detection using Motion Information and Support Vector Machine from a Moving Camera Image (이동 카메라 영상에서 움직임 정보와 Support Vector Machine을 이용한 다수 보행자 검출)

  • Lim, Jong-Seok;Park, Hyo-Jin;Kim, Wook-Hyun
    • Journal of the Institute of Convergence Signal Processing / v.12 no.4 / pp.250-257 / 2011
  • In this paper, we propose a method for detecting multiple pedestrians in images from a moving camera using motion information and a Support Vector Machine (SVM). First, we detect moving pedestrians from both the difference image and the projection histogram, compensating for camera ego-motion using corresponding feature sets. The difference image is a simple method, but it cannot detect motionless pedestrians. To address this problem, we detect motionless pedestrians using an SVM, which works particularly well on binary classification problems such as pedestrian detection. However, the SVM fails when pedestrians are adjacent to each other or move their arms and legs excessively in the image. We therefore combine motion information and the SVM to detect motionless and adjacent pedestrians as well as people making excessive movements. Experimental results on our various test video sequences demonstrate the high efficiency of our approach, with an average detection ratio of 94% and a false positive ratio of 2.8%.

Reduced-Reference Quality Assessment for Compressed Videos Based on the Similarity Measure of Edge Projections (에지 투영의 유사도를 이용한 압축된 영상에 대한 Reduced-Reference 화질 평가)

  • Kim, Dong-O;Park, Rae-Hong;Sim, Dong-Gyu
    • Journal of the Institute of Electronics Engineers of Korea SP / v.45 no.3 / pp.37-45 / 2008
  • Quality assessment aims to evaluate whether a distorted image or video has good quality by measuring the difference between the original and the distorted images or videos. In this paper, to assess the visual quality of a distorted image or video, visual features of the distorted image are compared with those of the original image instead of comparing the two images directly. We use edge projections from the two images as features, where an edge projection is easily obtained by projecting the edge pixels of an edge map along the vertical or horizontal direction. Here, edge projections are obtained using the vertical and horizontal directions of the gradients as well as the magnitude of each gradient. Experimental results show the effectiveness of the proposed quality assessment in comparison with conventional quality assessment algorithms such as structural similarity (SSIM), edge peak signal-to-noise ratio (EPSNR), and edge histogram descriptor (EHD) methods.
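
The idea of comparing compact edge projections instead of full images can be sketched as follows. Everything specific here is an assumption made for illustration: the edge map uses a simple central-difference gradient with a hypothetical threshold, and the similarity is a histogram-intersection ratio. The abstract does not state which edge detector or similarity measure the paper uses, and this sketch ignores gradient direction.

```python
def edge_map(img, threshold=50):
    """Binary edge map from central-difference gradients (borders left at 0)."""
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = img[y][x + 1] - img[y][x - 1]
            gy = img[y + 1][x] - img[y - 1][x]
            if abs(gx) + abs(gy) > threshold:
                edges[y][x] = 1
    return edges


def edge_projections(edges):
    """Project edge pixels onto the vertical axis (rows) and horizontal axis (columns)."""
    return [sum(row) for row in edges], [sum(col) for col in zip(*edges)]


def projection_similarity(p, q):
    """Histogram-intersection ratio in [0, 1]; 1.0 means identical projections."""
    denom = max(sum(p), sum(q))
    if denom == 0:
        return 1.0
    return sum(min(a, b) for a, b in zip(p, q)) / denom
```

In a reduced-reference setting only the projections of the original would need to be transmitted alongside the compressed video, which is far less data than the original frames.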

Efficient Video Retrieval Scheme with Luminance Projection Model (휘도투시모델을 적용한 효율적인 비디오 검색기법)

  • Kim, Sang Hyun
    • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.12 / pp.8649-8653 / 2015
  • A number of video indexing and retrieval algorithms have been proposed to manage large video databases efficiently. The video similarity measure is one of the most important technical factors in a video content management system. In this paper, we propose a luminance characteristics model to measure video similarity efficiently. Most video indexing algorithms use histograms, edges, or motion features, whereas the proposed algorithm employs an efficient similarity measure based on the luminance projection. To index video sequences effectively and to reduce computational complexity, we calculate video similarity using key frames extracted by the cumulative measure and compare the sets of key frames using the modified Hausdorff distance. Experimental results show that the proposed luminance projection model yields remarkably better accuracy and performance than conventional algorithms such as the histogram comparison method, at low computational complexity.
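
Comparing two sets of key frames with the modified Hausdorff distance can be sketched as below. The distance follows the commonly used definition (the larger of the two mean nearest-neighbor distances). The frame descriptor is an assumption: here each key frame is reduced to its row-mean and column-mean luminance, which may differ from the paper's luminance projection model.

```python
import math


def luminance_projection(frame):
    """Describe a frame by its mean luminance per row, then per column."""
    rows = [sum(row) / len(row) for row in frame]
    cols = [sum(col) / len(col) for col in zip(*frame)]
    return rows + cols


def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def directed_distance(set_a, set_b):
    """Mean distance from each descriptor in set_a to its nearest one in set_b."""
    return sum(min(euclidean(a, b) for b in set_b) for a in set_a) / len(set_a)


def modified_hausdorff(set_a, set_b):
    """Symmetric distance between two sets of key-frame descriptors."""
    return max(directed_distance(set_a, set_b), directed_distance(set_b, set_a))
```

Because each frame is reduced to height + width numbers, the cost of one frame comparison grows with the frame's perimeter rather than its area, which is one plausible source of the low computational complexity the abstract reports.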

A Study For Automobile License Plate Extraction Using DCT and Correlation (DCT와 Correlation을 이용한 자동차번호판 추출에 관한 연구)

  • 경보현;손태주;남궁연;남궁재찬
    • The Journal of Korean Institute of Communications and Information Sciences / v.25 no.7A / pp.1050-1056 / 2000
  • In this paper, we propose a method for extracting the automobile license plate from an automobile image obtained with a digital camera, using the Discrete Cosine Transform (DCT) and correlation. An automobile license plate consists of characters on a rectangular background. We extract the automobile edge image by DCT processing of the automobile image and obtain the license plate from the edge image by correlation processing. We then separate the characters from the license plate using the projection histogram. Compared to previous methods, we obtained good results when extracting license plates at night, in very strong light, and in bad weather.
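
Separating characters with a projection histogram usually means cutting the plate at columns that contain no ink. The sketch below shows that general technique on a binarized plate image. It is an illustration under assumptions, not the paper's procedure, and `min_pixels` is a hypothetical noise threshold.

```python
def segment_columns(plate, min_pixels=1):
    """Return (start, end) column ranges, end exclusive, of runs that contain ink."""
    col_counts = [sum(col) for col in zip(*plate)]   # column projection histogram
    segments = []
    start = None
    for x, count in enumerate(col_counts):
        if count >= min_pixels and start is None:
            start = x                                # a character begins
        elif count < min_pixels and start is not None:
            segments.append((start, x))              # a character ends
            start = None
    if start is not None:                            # character touching the right edge
        segments.append((start, len(col_counts)))
    return segments


# Toy binarized plate, 2 rows by 8 columns (1 = character pixel).
plate = [
    [1, 1, 0, 0, 1, 0, 1, 1],
    [1, 0, 0, 0, 1, 0, 0, 1],
]
print(segment_columns(plate))  # [(0, 2), (4, 5), (6, 8)]
```

Raising `min_pixels` makes the split tolerant of isolated noise pixels in the gaps, at the risk of cutting through thin strokes.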

A Recognition of the Printed Alphabet by Using Nonogram Puzzle (노노그램 퍼즐을 이용한 인쇄체 영문자 인식)

  • Sohn, Young-Sun;Kim, Bo-Sung
    • Journal of the Korean Institute of Intelligent Systems / v.18 no.4 / pp.451-455 / 2008
  • In this paper we implement a system that recognizes printed alphabet characters in two font types (Batang, Dodum) captured by a black-and-white CCD camera and converts them into editable text. The image of the captured printed sentences is binarized, the rows of each sentence are separated by vertical projection using the histogram method, and the height of the characters is normalized to 48 pixels. By applying the basic principle of the Nonogram puzzle in reverse to each normalized character, the character is covered with pixel-based squares, and its characteristics are represented as the numerical information of a Nonogram puzzle, so that the character can be recognized by comparison with standard pattern information. A test on 2609 characters in Batang and 1475 characters in Dodum yielded a 100% recognition rate.
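
Applying the Nonogram principle in reverse means deriving the puzzle's clues (the lengths of consecutive filled runs in every row and column) from an already filled grid. The sketch below shows that derivation and an exact-match lookup against stored clues. The function names, the template dictionary, and the exact-match rule are assumptions for illustration, since the abstract does not describe how the comparison is done.

```python
from itertools import groupby


def run_lengths(line):
    """Lengths of consecutive runs of 1s, i.e. the Nonogram clue for one line."""
    return [len(list(group)) for value, group in groupby(line) if value == 1]


def nonogram_clues(glyph):
    """Row clues and column clues of a binary character image."""
    row_clues = [run_lengths(row) for row in glyph]
    col_clues = [run_lengths(col) for col in zip(*glyph)]
    return row_clues, col_clues


def recognize(glyph, templates):
    """Return the label whose stored clues equal the glyph's clues, else None."""
    clues = nonogram_clues(glyph)
    for label, template_clues in templates.items():
        if clues == template_clues:
            return label
    return None


# Toy 3x3 "T" used as the only standard pattern.
letter_t = [
    [1, 1, 1],
    [0, 1, 0],
    [0, 1, 0],
]
templates = {"T": nonogram_clues(letter_t)}
print(recognize(letter_t, templates))  # T
```

Unlike a plain projection histogram, the clues keep the number and length of separate strokes on each line, so two characters with equal pixel counts per row can still be told apart.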

Particle Filtering based Object Tracking Method using Feedback and Tracking Box Correction (피드백과 박스 보정을 이용한 Particle Filtering 객체추적 방법론)

  • Ahn, Jung-Ho
    • Journal of Satellite, Information and Communications / v.8 no.1 / pp.77-82 / 2013
  • Object tracking using particle filtering has proved successful because it is based on Monte Carlo simulation to estimate the posterior distribution of the state vector, which is nonlinear and non-Gaussian in real-world situations. In this paper, we present two novel methods that improve the performance of particle filtering based object tracking. The first is a feedback method that replaces low-weighted tracking samples with the state vector estimated in the previous frame. The second is a tracking box correction method that finds a confidence interval of the back projection probability on the estimated candidate object area. A sample propagation equation, obtained by experiments, is also presented. We designed a well-organized test data set that reflects various challenging circumstances, and experiments on it show that the proposed methods improve on the traditional particle filter based object tracking method.
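
The feedback step described in this abstract can be sketched in isolation. This is a rough illustration only: the weight cutoff `min_weight` is a hypothetical parameter, the state estimate is assumed to be the weighted mean of the samples, and the paper's propagation equation and box correction are not reproduced.

```python
def weighted_mean(particles, weights):
    """State estimate as the weight-averaged particle state."""
    total = sum(weights)
    dim = len(particles[0])
    return tuple(
        sum(w * p[i] for p, w in zip(particles, weights)) / total
        for i in range(dim)
    )


def feedback_replace(particles, weights, prev_estimate, min_weight):
    """Replace samples whose weight is below `min_weight` with the previous estimate."""
    return [
        tuple(prev_estimate) if w < min_weight else p
        for p, w in zip(particles, weights)
    ]


# Toy example with 2-D (x, y) states.
particles = [(0.0, 0.0), (10.0, 10.0), (2.0, 2.0)]
weights = [0.5, 0.01, 0.49]
print(feedback_replace(particles, weights, (1.0, 1.0), min_weight=0.05))
```

Pulling stray samples back to the last estimate keeps the sample set concentrated near the object instead of spending samples on regions with negligible likelihood.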

Segmentation and Recognition of Traffic Signs using Shape Information and Edge Image in Real Image (실영상에서 형태 정보와 에지 영상을 이용한 교통 표지판 영역 추출과 인식)

  • Kwak, Hyun-Wook;Oh, Jun-Taek;Kim, Wook-Hyun
    • The KIPS Transactions:PartB / v.11B no.2 / pp.149-158 / 2004
  • This study proposes a method for segmenting and recognizing traffic signs in real images using shape information and edge images. It first segments candidate traffic sign regions with a connected component algorithm applied to binary images, which are obtained from the RGB color ratio of each pixel, and then extracts actual traffic signs based on their symmetry about the X- and Y-axes. Histogram equalization is applied to candidate regions left unsegmented because of low image contrast. In the recognition stage, it uses shape information including projection profiles on the X- and Y-axes, moments, and the number of crossings and the distances at which concentric circular patterns and 8-directional rays from the region center intersect the edges of the traffic sign. Finally, it performs recognition by measuring similarity with the templates in the database. Several experimental results show that the system is robust to environmental factors such as light and weather conditions.