Search | Korea Science

A text region extraction algorithm based on Android for real-time text recognition (실시간 글자 인식을 위한 안드로이드 기반의 글자 영역 추출 기술)

Lee, Gyu-Cheol;Lee, Sangyong;Yoo, Jisang
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.11a
- /
- pp.194-196
- /
- 2016
본 논문에서는 안드로이드 환경에서 글자 인식을 위한 전처리 과정으로 입력 영상에서 글자 영역만을 추출하는 기법을 제안한다. 대부분의 글자 인식 어플리케이션에서 글자를 인식하는 방법은 RoI(Region of Interest)에 인식하려는 글자를 위치시켜 놓고 사용자가 촬영함으로써 진행된다. 하지만 촬영된 영상 그대로를 인식에 사용하기 때문에 잡음 및 글자가 아닌 영역들을 글자로 인식하는 문제 등으로 인하여 인식률이 현저히 떨어진다. 제안하는 기법에서는 MSER(Maximally Stable Extremal Regions) 기법을 통해 각각의 글자를 추출한 후, 글자의 특성을 이용하여 글자 영역만을 추출한다. 기법의 성능 평가는 무료 OCR(Optical Character Recognition) 엔진인 Tesseract-OCR을 통해 글자 인식률을 비교하였으며, 제안하는 기법을 적용한 글자 인식 시스템이 적용하지 않은 시스템보다 글자의 인식률이 향상되는 것을 확인하였다.
PDF

Extraction of open-caption from video (비디오 자막 추출 기법에 관한 연구)

김성섭;문영식
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.04b
- /
- pp.481-483
- /
- 2001
본 논문에서는 동영상으로부터 색상, 서체, 크기와 같은 사전 지식 없이도 글자/자막을 효율적으로 추출하는 방법을 제안한다. 해상도가 낮고 복잡한 배경을 포함할 수 있는 비디오에서 글자 인식률 향상을 위해 먼저 동일한 텍스트 영역의 존재하는 프레임들을 자동적으로 추출한 후 이들의 시간적 평균영상을 만들어 향상된 영상을 얻는다. 평균영상의 외각선 영상의 투영 값을 통해 문자영역을 찾고 각 텍스트 영역에 대해 1차 배경제거 과정인 region filling을 적용하여 글자의 배경들을 제거 함으로써 글자를 추출한다. 1차 배경제거의 결과를 검증하고 추가적으로 k-means를 이용한 color clustering을 적용하여 남아있는 배경들을 효율적으로 제거 함으로써 최종 글자영상을 추출한다.
PDF

도메인 어댑테이션을 이용한 폰트 변화에 강인한 한글 분류기 개발

Park, Jaewoo;Lee, Eunji;Cho, Nam Ik
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2019.11a
- /
- pp.50-53
- /
- 2019
본 논문에서는 도메인 어댑테이션을 이용하여 폰트 변화에 강인한 한글 분류기를 학습하는 방법을 제안한다. 제안하는 네트워크 모델은 총 7 개로 이루어져 있으며 각각 이미지로부터 폰트에 무관한 정보를 추출하는 인코더, 추출된 정보의 유효성을 판단하기 위해 이미지 재합성에 사용되는 디코더, 재합성된 이미지의 글자 분류기, 폰트 분류기, 재합성된 글자의 정교함을 판단하는 판별기(discriminator), 그리고 인코더에서 추출된 정보에 대한 글자 분류기, 폰트 분류기이다. 본 논문에서는 적대적 생성 신경망의 학습법을 따르는 도메인 어댑테이션 기법을 이용하여 인코더의 추출 정보가 폰트 정보는 속이면서 글자 분류의 정확성은 높이도록 학습하였다. 학습 결과 인코더로부터 추출되는 정보들은 폰트에 무관한 성질을 지니면서 글자 분류에 높은 정확성을 띄었으며, 추가로 디코더에서 나오는 이미지들도 원본 폰트와 같은 이미지를 생성해 낼 수 있었다.
PDF

An Efficient Method to Extract Units of Manchu Characters (만주 글자의 단위를 추출하는 효율적인 방법)

Snowberger, Aaron Daniel;Lee, Choong Ho
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.617-619
- /
- 2021
Since Manchu characters are written vertically and are connected without spaces within a word, a preprocessing process is required to separate the character area and the units that make up the characters before recognizing the characters. In this paper, we describe a preprocessing method that extracts the character area and cuts off the unit of the character. Unlike existing research that presupposes a method of recognizing each word or character unit, or recognizing the remaining part after removing the stem of a continuous character, this method cuts the character into each recognizable unit. It can be applied to the method of recognizing letters by combining the units. Through an experiment, the effectiveness of this method was verified.
PDF

Extraction of Automobile License Plates and Letter Using Color Information and Red Value Change in Line-by-Line (색상정보와 행별 Red값 변화량을 이용한 자동차 번호판과 글자 추출)

Yu, SongHyun;Lee, Dokyung;Jeong, Jechang
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2014.11a
- /
- pp.138-141
- /
- 2014
본 논문에서는 색상 정보를 이용하여 배경 영역이 포함된 자동차의 전,후면 사진에서의 자동차 번호판 영역(녹색, 흰색) 추출과 추출된 번호판에서 글자를 분리해내는 방법을 제안한다. 기존의 색상 정보를 이용하여 번호판을 추출하는 방법은 흰색 번호판(신형 번호판)의 경우에는 배경 영역에서 흰색인 영역도 많고 국내 차량 중에 흰색 차량이 많기 때문에 번호판 영역과 배경 영역 사이의 명확한 구분에 어려움이 있었다. 따라서 행별 Red값 변화도를 조사하여 배경 영역과 번호판 영역 사이의 명확한 구분을 하게 하며, 흰색 번호판의 경우에 추출이 안되면 흰색의 기준을 더 낮추어서 다시 영역 추출을 할 수 있는 재추출 알고리즘을 추가해서 비교적 어두운 사진에서도 번호판영역을 추출할 수 있도록 한다. 추출된 번호판에서 글자를 추출해내는 과정에서도 이진화를 거치면 노이즈가 많이 생기기 때문에 이를 줄이고자 행별 Red값 변화도를 조사하여 번호판 영역에서 위아래 부분의 노이즈를 줄일 수 있도록 하였다.
PDF

Korean Caption Extraction with Decision Tree (의사결정 트리를 이용한 한글 자막 추출)

Jung, Je-Hee;Lee, Seun-Hoon;Kim, Jae-Kwang;Lee, Jee-Hyong
- Proceedings of the Korean Information Science Society Conference
- /
- 2008.06c
- /
- pp.527-532
- /
- 2008
자막은 영상과 관련이 있는 정보를 포함한다. 이러한 영상의 정보를 이용하기 위해서 자막을 추출하는 연구가 진행되고 있다. 기존의 자막 추출 연구는 언어 독립적인 특징으로 자막을 이루는 획의 에지는 일정한 간격을 유지하거나 수평라인으로 존재하는 글자의 분포를 이용한 방법을 제안하였다. 이러한 방법들은 획의 간격이 일정한 자막이나 하나의 글자가 하나의 획으로 이루어진 글자에서만 정상적인 동작을 보장하였다. 본 논문에서는 한글 자막 특징을 고려한 자막 추출 방법을 제안한다. 먼저, 한글 자막의 특징인 가로 획의 다수 분포를 고려한 적응형 에지 이진화를 수행하여 에지 영상을 생성하고 에지 연결 객체를 생성한다. 그 후에 생성한 연결 객체를 특징을 추출하여 사전에 생성한 의사결정 트리로 연결 객체를 자막과 비자막 연결객체로 분류한다. 의사결정 트리를 생성하기 위해서 사용한 연결 객체는 뉴스, 다큐멘터리 프로그램에서 획득하였으며, 성능 평가를 위해서 뉴스, 다큐멘터리, 스포츠 프로그램과 같은 대중 방송에서 획득한 영상에서 자막을 추출하였다. 평가 방법은 찾아진 연결 객체 중에 자막 연결 객체의 비율과 전체 자막 중에서 찾아진 자막 연결 객체의 비율로 분석하였다. 실험 결과에서는 제안한 방법이 한글 자막의 추출에 적용 가능함을 보여준다.
PDF

Psychological Disturbance caused by Letters in Double lmage and its lmplication on Perceptual Integration (글자의 이중상에 의한 심리적 혼란감과 지각 통합 과정)

Park, Sang-Ho;Chung, Chan-Sup
- Korean Journal of Cognitive Science
- /
- v.6 no.1
- /
- pp.47-71
- /
- 1995
A psychological disturbance caused by letters in double image was termed as'letter sickness'.The effects of stimulus exposure time and letter familiarity on the letter sickness were measured to test a hypothesis that disturbances in eye movement and recognition stages is the cause of letter sickness.Letter sickness incteased significantly as stimulus exposure time lengthened from 50ms,100ms,to 3000ms.It was also significantly higher with familiar Hangul letters as compared with less familiar foreign letters and meaningless words as compared with meaningful words,respectively.These experimental findings imply that letter sickness is caused by the failure of adjusting eye movements to dismiss the double images.that the more familiar the letters.the more strongly the letter-identification process is commotted,resulting in the increased effect of disturbance from double image.and that the disturbance effect of double image is amplified when it is hard to extract the meaning from familiar letters.An experiment where subjects were made to learn Braille-like symbols consisting of 6 dots to test the hypothesis that the stronger the tendency to process the meaning of a symbol,the stronger becomes letter sickness due to double image,also showed that letter sickness significantly increased as a function of learning.
PDF

An Extracting Text Area Using Adaptive Edge Enhanced MSER in Real World Image (실세계 영상에서 적응적 에지 강화 기반의 MSER을 이용한 글자 영역 추출 기법)

Park, Youngmok;Park, Sunhwa;Seo, Yeong Geon
- Journal of Digital Contents Society
- /
- v.17 no.4
- /
- pp.219-226
- /
- 2016
In our general life, what we recognize information with our human eyes and use it is diverse and massive. But even the current technologies improved by artificial intelligence are exorbitantly deficient comparing to human visual processing ability. Nevertheless, many researchers are trying to get information in everyday life, especially concentrate effort on recognizing information consisted of text. In the fields of recognizing text, to extract the text from the general document is used in some information processing fields, but to extract and recognize the text from real image is deficient too much yet. It is because the real images have many properties like color, size, orientation and something in common. In this paper, we applies an adaptive edge enhanced MSER(Maximally Stable Extremal Regions) to extract the text area in those diverse environments and the scene text, and show that the proposed method is a comparatively nice method with experiments.
https://doi.org/10.9728/dcs.2016.17.4.219 인용 PDF KSCI

Text Region Extraction from Videos using the Harris Corner Detector (해리스 코너 검출기를 이용한 비디오 자막 영역 추출)

Kim, Won-Jun;Kim, Chang-Ick
- Journal of KIISE:Software and Applications
- /
- v.34 no.7
- /
- pp.646-654
- /
- 2007
In recent years, the use of text inserted into TV contents has grown to provide viewers with better visual understanding. In this paper, video text is defined as superimposed text region located of the bottom of video. Video text extraction is the first step for video information retrieval and video indexing. Most of video text detection and extraction methods in the previous work are based on text color, contrast between text and background, edge, character filter, and so on. However, the video text extraction has big problems due to low resolution of video and complex background. To solve these problems, we propose a method to extract text from videos using the Harris corner detector. The proposed algorithm consists of four steps: corer map generation using the Harris corner detector, extraction of text candidates considering density of comers, text region determination using labeling, and post-processing. The proposed algorithm is language independent and can be applied to texts with various colors. Text region update between frames is also exploited to reduce the processing time. Experiments are performed on diverse videos to confirm the efficiency of the proposed method.
PDF KSCI

Custom Handwriting Font Creation Service (사용자 필적 맞춤형 폰트 생성 서비스)

Kim, Ye-Jin;Lee, Soo-Yeon;Sim, Kyu-Min;Jun, Kyung-Koo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.10a
- /
- pp.946-949
- /
- 2019
한 벌의 한글 글자체를 만드는데 일반적으로 많은 제작 비용과 시간이 소요된다. 따라서 폰트 제작의 어려움을 덜기 위해, 사용자가 대표 글자들을 입력하면 그 글자들의 디자인 특성을 딥러닝 기술을 이용하여 학습한 모델이 나머지 글자들을 자동 생성해주는 시스템 구축한다면 폰트 제작이 훨씬 용이해질 뿐만 아니라 저작권 문제로부터 자유로워질 것이다. 이와 관련된 선행연구를 실행하고 분석해 본 결과 데이터 전처리 과정에서 글자가 잘리거나 크기가 맞지 않아 제대로 된 데이터셋이 구축되지 않는 문제가 있음을 발견하였다. 본 논문에서는 이러한 문제를 해결하기 위해 템플릿에서 자동적으로 글자영역을 추출하고 이미지를 보정하는 전처리 과정과 함께 기존 모델에서 새로운 필터를 추가하여 학습 성능을 높이는 방법을 제안한다. 이를 통해 기존 연구에서 측정된 손실값을 낮춘 결과를 확인했으며 결과적으로 실제 글자체와 더욱 유사한 사용자 맞춤형 글자체를 제공할 수 있을 것이다.
https://doi.org/10.3745/PKIPS.y2019m10a.946 인용 PDF

Search Result 88, Processing Time 0.035 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)