통합 검색 | Korea Science

Size-Independent Caption Extraction for Korean Captions with Edge Connected Components

Jung, Je-Hee;Kim, Jaekwang;Lee, Jee-Hyong
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- 제12권4호
- /
- pp.308-318
- /
- 2012
Captions include information which relates to the images. In order to obtain the information in the captions, text extraction methods from images have been developed. However, most existing methods can be applied to captions with a fixed height or stroke width using fixed pixel-size or block-size operators which are derived from morphological supposition. We propose an edge connected components based method that can extract Korean captions that are composed of various sizes and fonts. We analyze the properties of edge connected components embedding captions and build a decision tree which discriminates edge connected components which include captions from ones which do not. The images for the experiment are collected from broadcast programs such as documentaries and news programs which include captions with various heights and fonts. We evaluate our proposed method by comparing the performance of the latent caption area extraction. The experiment shows that the proposed method can efficiently extract various sizes of Korean captions.
https://doi.org/10.5391/IJFIS.2012.12.4.308 인용 PDF KSCI

에지 및 국부적 최소/최대 변환을 이용한 자연 이미지로부터 텍스트 영역 검출 (Text Region Detection using Edge and Regional Minima/Maxima Transformation from Natural Scene Images)

박종천;이근왕
- 한국산학기술학회논문지
- /
- 제10권2호
- /
- pp.358-363
- /
- 2009
자연이미지로부터 텍스트 영역 검출은 다양한 응용분야에 활용됨으로 이 분야의 많은 연구가 필요하다. 최근의 연구 방법은 에지 및 연결요소 기반 방법을 결합하는 다양한 알고리즘을 이용하여 텍스트 영역을 검출하고 있다. 그러므로 본 논문은 이러한 결합방법으로 에지 및 국부적 최소/최대 변환 방법을 이용하여 텍스트 영역을 검출하는 알고리즘을 제안한다. 명도 이미지로부터 에지 및 국부적 최소/최대 연결성분을 검출하고, 에지 및 국부적 최소/최대 연결성분을 레이블화한다. 레이블된 영역을 분석하여 텍스트 후보 영역을 검출하고, 검출된 각각의 텍스트 후보 영역을 결합하여 단일 텍스트 후보 이미지를 생성한다. 텍스트 후보 개별문자의 인접성 및 유사도를 비교하여 검증함으로서 최종적인 텍스트 영역을 검출한다. 실험결과 제안한 알고리즘은 에지 요소 및 국부적 최소/최대 연결요소 검출 방법을 결합하여 자연 이미지로부터 텍스트 영역 검출의 정확도 및 재현률을 향상할 수 있었다.
https://doi.org/10.5762/KAIS.2009.10.2.358 인용 PDF

에지 및 형태학적 재구성에 의한 연결요소를 이용한 자연영상의 문자영역 검출 (Character Region Detection in Natural Image Using Edge and Connected Component by Morphological Reconstruction)

권교현;박종천;전병민
- 한국엔터테인먼트산업학회논문지
- /
- 제5권1호
- /
- pp.127-133
- /
- 2011
자연영상에 내포되어 있는 문자는 다양한 내용을 표현하는 중요한 정보이다. 기존의 문자 검출 알고리즘은 영상의 복잡도와 주변의 조명, 문자와 유사한 배경색 등의 환경에서 문자영역을 검출하지 못하는 문제점이 있으므로 본 논문에서는 에지 및 형태학적 재구성에 의한 연결요소를 이용한 자연영상에 포함된 문자영역을 검출하는 방법을 제안한다. 첫 번째 단계로, 명암도 영상에서 캐니에지(Canny-Edge) 검출기를 이용한 에지 성분과 형태학적 연산에 의한 지역적 최소/최대값을 갖는 연결요소를 검출하고, 각각 검출된 연결성분을 레이블링하고, 레이블링 된 각 성분에 대해 문자가 갖는 특징을 이용한 후보 문자영역을 검출한다. 마지막으로 검출된 후보 문자 영역을 서로 합병하여 하나의 후보 문자 영역을 생성하고, 후보 문자 영역의 인접성과 유사성으로 후보 문자 영역을 검증하여 최종 문자 영역을 검출한다. 실험결과 제안한 에지 및 연결요소 성분을 이용한 방법은 문자영역 검출의 정확성이 개선되었다.

Correction of Signboard Distortion by Vertical Stroke Estimation

Lim, Jun Sik;Na, In Seop;Kim, Soo Hyung
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제7권9호
- /
- pp.2312-2325
- /
- 2013
In this paper, we propose a preprocessing method that it is to correct the distortion of text area in Korean signboard images as a preprocessing step to improve character recognition. Distorted perspective in recognizing of Korean signboard text may cause of the low recognition rate. The proposed method consists of four main steps and eight sub-steps: main step consists of potential vertical components detection, vertical components detection, text-boundary estimation and distortion correction. First, potential vertical line components detection consists of four steps, including edge detection for each connected component, pixel distance normalization in the edge, dominant-point detection in the edge and removal of horizontal components. Second, vertical line components detection is composed of removal of diagonal components and extraction of vertical line components. Third, the outline estimation step is composed of the left and right boundary line detection. Finally, distortion of the text image is corrected by bilinear transformation based on the estimated outline. We compared the changes in recognition rates of OCR before and after applying the proposed algorithm. The recognition rate of the distortion corrected signboard images is 29.63% and 21.9% higher at the character and the text unit than those of the original images.
https://doi.org/10.3837/tiis.2013.09.014 인용 PDF KSCI

구성요소가 서로 종속인 네트워크시스템의 신뢰성모형과 계산알고리즘 (Reliability Modeling and Computational Algorithm of Network Systems with Dependent Components)

홍정식;이창훈
- 한국경영과학회지
- /
- 제14권1호
- /
- pp.88-96
- /
- 1989
General measure in the reliability is the k-terminal reliability, which is the probability that the specified vertices are connected by the working edges. To compute the k-terminal reliability components are usually assumed to be statistically independent. In this study the modeling and analysis of the k-terminal reliability are investigated when dependency among components is considered. As the size of the network increases, the number of the joint probability parameter to represent the dependency among components is increasing exponentially. To avoid such a difficulty the structured-event-based-reliability model (SERM) is presented. This model uses the combination of the network topology (physical representation) and reliability block diagram (logical representation). This enables us to represent the dependency among components in a network form. Computational algorithms for the k-terminal reliability in SERM are based on the factoring algorithm Two features of the ractoring algorithm are the reliability preserving reduction and the privoting edge selection strategy. The pivoting edge selction strategy is modified by two different ways to tackle the replicated edges occuring in SERM. Two algorithms are presented according to each modified pivoting strategy and illustrated by numerical example.
PDF

An Efficient Color Edge Detection Using the Mahalanobis Distance

Khongkraphan, Kittiya
- Journal of Information Processing Systems
- /
- 제10권4호
- /
- pp.589-601
- /
- 2014
The performance of edge detection often relies on its ability to correctly determine the dissimilarities of connected pixels. For grayscale images, the dissimilarity of two pixels is estimated by a scalar difference of their intensities and for color images, this is done by using the vector difference (color distance) of the three-color components. The Euclidean distance in the RGB color space typically measures a color distance. However, the RGB space is not suitable for edge detection since its color components do not coincide with the information human perception uses to separate objects from backgrounds. In this paper, we propose a novel method for color edge detection by taking advantage of the HSV color space and the Mahalanobis distance. The HSV space models colors in a manner similar to human perception. The Mahalanobis distance independently considers the hue, saturation, and lightness and gives them different degrees of contribution for the measurement of color distances. Therefore, our method is robust against the change of lightness as compared to previous approaches. Furthermore, we will introduce a noise-resistant technique for determining image gradients. Various experiments on simulated and real-world images show that our approach outperforms several existing methods, especially when the images vary in lightness or are corrupted by noise.
https://doi.org/10.3745/JIPS.02.0010 인용 PDF KSCI

신경망을 이용한 자막 크기에 무관한 연결 객체 기반의 자막 추출 (Connected Component-Based and Size-Independent Caption Extraction with Neural Networks)

정제희;윤태복;김동문;이지형
- 한국지능시스템학회논문지
- /
- 제17권7호
- /
- pp.924-929
- /
- 2007
영상에 나타나는 자막은 영상과 관계가 있는 정보를 포함한다. 이러한 영상과 관련 있는 정보를 이용하기 위해 영상으로부터 자막을 추출하는 연구는 근래에 들어 활발히 진행되고 있다. 기존의 연구는 일정한 높이의 자막이나 획의 두께를 지닌 자막에서만 정상적인 작동을 한다. 본 논문에서는 일정 크기 이상의 자막에 대해서 적용할 수 있는 크기에 무관한 자막 추출 방법을 제안한다. 먼저, 자막 연결 객체의 패턴 추출을 위해서 자막이 포함된 영상을 수집하고, 신경망을 이용해서 자막의 패턴을 분석한다. 그 후로는 사전에 추출한 패턴을 이용하여 입력 영상에서 자막을 추출한다. 실험에 사용된 영상은 뉴스, 다큐멘터리, 쇼 프로그램과 같은 대중 방송에서 수집하였다. 실험 결과는 다양한 크기의 자막을 포함한 영상을 사용하여 실험하였고, 자막 추출의 결과는 찾아진 연결객체 중에 자막의 비율과 자막 중에 찾아진 자막의 비율로 분석하였다. 실험 결과를 보면 제안한 방법에 의해 다양한 크기의 자막을 추출할 수 있음을 보여준다.
https://doi.org/10.5391/JKIIS.2007.17.7.924 인용 PDF KSCI

자연 영상에서 획 너비 추정 기반 텍스트 영역 이진화 (The Binarization of Text Regions in Natural Scene Images, based on Stroke Width Estimation)

;김정환;이귀상
- 스마트미디어저널
- /
- 제1권4호
- /
- pp.27-34
- /
- 2012
In this paper, a novel text binarization is presented that can deal with some complex conditions, such as shadows, non-uniform illumination due to highlight or object projection, and messy backgrounds. To locate the target text region, a focus line is assumed to pass through a text region. Next, connected component analysis and stroke width estimation based on location information of the focus line is used to locate the bounding box of the text region, and each box of connected components. A series of classifications are applied to identify whether each CC(Connected component) is text or non-text. Also, a modified K-means clustering method based on an HCL color space is applied to reduce the color dimension. A text binarization procedure based on location of text component and seed color pixel is then used to generate the final result.
PDF

에지 및 컬러 양자화를 이용한 모바일 폰 카메라 기반장면 텍스트 검출 (Mobile Phone Camera Based Scene Text Detection Using Edge and Color Quantization)

박종천;이근왕
- 한국산학기술학회논문지
- /
- 제11권3호
- /
- pp.847-852
- /
- 2010
자연 영상 내에 포함된 텍스트는 영상의 다양하고 중요한 특징을 갖는다. 그러므로 텍스트를 검출하고 추출하여 인식하는 것이 중요한 연구대상으로 연구되고 있다. 최근 모바일 폰 카메라를 기반으로 다양한 분야에서 많은 응용 기술이 연구 개발되고 있다. 본 논문은 에지 및 연결요소를 이용한 장면 텍스트 검출 방법을 제안한다. 그레이스케일 영상으로부터 에지 성분 검출과 지역적 표준편차를 이용하여 텍스트 영역의 경계선을 검출하고, RGB 컬러공간의 유클리디안 거리를 기준으로 연결요소를 검출한다. 검출된 에지 및 연결요소를 레이블링하고 각각 영역의 외곽사각형을 구한다. 텍스트의 휴리스틱 이용하여 후보 텍스트를 추출한다. 후보 텍스트 영역을 병합하여 하나의 후보 텍스트 영역을 생성하고, 후보 텍스트의 지역적 인접성과 구조적 유사성으로 후보 텍스트를 검증함으로서 최종적인 텍스트 영역을 검출하였다. 실험결과 에지 및 컬러 연결요소 특징을 상호 보완함으로서 텍스트 영역의 검출률을 향상시켰다.
https://doi.org/10.5762/KAIS.2010.11.3.847 인용 PDF KSCI

신경회로망 제어기와 동적 베이시안 네트워크를 이용한 시변 및 비정치 확률시스템의 제어 (Control of Time-varying and Nonstationary Stochastic Systems using a Neural Network Controller and Dynamic Bayesian Network Modeling)

조현철;이진우;이영진;이권순
- 한국지능시스템학회논문지
- /
- 제17권7호
- /
- pp.930-938
- /
- 2007
영상에 나타나는 자막은 영상과 관계가 있는 정보를 포함한다. 이러한 영상과 관련 있는 정보를 이용하기 위해 영상으로부터 자막을 추출하는 연구는 근래에 들어 활발히 진행되고 있다. 기존의 연구는 일정한 높이의 자막이나 획의 두께를 지닌 자막에서만 정상적인 작동을 한다. 본 논문에서는 일정 크기 이상의 자막에 대해서 적용할 수 있는 크기에 무관한 자막 추출 방법을 제안한다. 먼저, 자막 연결 객체의 패턴 추출을 위해서 자막이 포함된 영상을 수집하고, 신경망을 이용해서 자막의 패턴을 분석한다. 그 후로는 사전에 추출한 패턴을 이용하여 입력 영상에서 자막을 추출한다. 실험에 사용된 영상은 뉴스, 다큐멘터리, 쇼 프로그램과 같은 대중 방송에서 수집하였다. 실험 결과는 다양한 크기의 자막을 포함한 영상을 사용하여 실험하였고, 자막 추출의 결과는 찾아진 연결객체 중에 자막의 비율과 자막 중에 찾아진 자막의 비율로 분석하였다. 실험 결과를 보면 제안한 방법에 의해 다양한 크기의 자막을 추출할 수 있음을 보여준다.
https://doi.org/10.5391/JKIIS.2007.17.7.930 인용 PDF KSCI

검색결과 37건 처리시간 0.01초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)