• Title/Summary/Keyword: Text line information

Search Result 147, Processing Time 0.028 seconds

Caption Detection and Recognition for Video Image Information Retrieval (비디오 영상 정보 검색을 위한 문자 추출 및 인식)

  • 구건서
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.7
    • /
    • pp.901-914
    • /
    • 2002
  • In this paper, We propose an efficient automatic caption detection and location method, caption recognition using FE-MCBP(Feature Extraction based Multichained BackPropagation) neural network for content based retrieval of video. Frames are selected at fixed time interval from video and key frames are selected by gray scale histogram method. for each key frames, segmentation is performed and caption lines are detected using line scan method. lastly each characters are separated. This research improves speed and efficiency by color segmentation using local maximum analysis method before line scanning. Caption detection is a first stage of multimedia database organization and detected captions are used as input of text recognition system. Recognized captions can be searched by content based retrieval method.

  • PDF

An Efficient Correction Method for Misrecognized Words in Off-line Hangul Character Recognition (오프라인 한글 문자 인식을 위한 효율적인 오인식 단어 교정 방법)

  • Lee, Byeong-Hui;Kim, Tae-Gyun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.6
    • /
    • pp.1598-1606
    • /
    • 1996
  • In order to achieve high accuracy of off-line character recognition(OCR) systems, the recognized text must be processed through a post-processing stage using contextual information. In this paper, we reclassify Korean word classes in terms of OCR word correction. And we collect combinations of Korean particles(approximately 900) linguistic verbal from(around 800). We aggregate 9 Korean irregular verbal phrases defined from a Korean linguistic point of view. Using these Korean word information and a Head-tail method, we can correct misrecognized words. A Korean character recognizer demonstrates 93.7% correct character recognition without a post-processing stage. The entire recognition rate of our system with a post-processing stage exceeds 97% correct character recognition.

  • PDF

The Online Game Coined Profanity Filtering System by using Semi-Global Alignment (반 전역 정렬을 이용한 온라인 게임 변형 욕설 필터링 시스템)

  • Yoon, Tai-Jin;Cho, Hwan-Gue
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.12
    • /
    • pp.113-120
    • /
    • 2009
  • Currently the verbal abuse in text message over on-line game is so serious. However we do not have any effective policy or technical tools yet. Till now in order to cope with this problem, the online game service providers have accumulated a set of forbidden words and applied this list on the textual word used in on-line game, which is called 'Swear filter'. But young on-line game players easily avoid this filtering method by coining another words which is not kept in the list. Especially Korean is very easy to make new variations of a vulgar word. In this paper, we propose one smart filtering algorithm to identify newly coined profanities. Important features of our method include the canonical form transformation of coined profanities, semi-global alignment between in the level of consonant and vowel units. For experiment, we have collected more than 1000 newly coined vulgar words in on-line gaming sites and tested these word against our methods. where our system have successfully filtered more than 90% of those newly coined vulgar words.

Development of Intelligent Learning Tool based on Human eyeball Movement Analysis for Improving Foreign Language Competence (외국어 능력 향상을 위한 사용자 안구운동 분석 기반의 지능형 학습도구 개발)

  • Shin, Jihye;Jang, Young-Min;Kim, Sangwook;Mallipeddi, Rammohan;Bae, Jungok;Choi, Sungmook;Lee, Minho
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.11
    • /
    • pp.153-161
    • /
    • 2013
  • Recently, there has been a tremendous increase in the availability of educational materials for foreign language learning. As part of this trend, there has been an increase in the amount of electronically mediated materials available. However, conventional educational contents developed using computer technology has provided typically one-way information, which is not the most helpful thing for users. Providing the user's convenience requires additional off-line analysis for diagnosing an individual user's learning. To improve the user's comprehension of texts written in a foreign language, we propose an intelligent learning tool based on the analysis of the user's eyeball movements, which is able to diagnose and improve foreign language reading ability by providing necessary supplementary aid just when it is needed. To determine the user's learning state, we correlate their eye movements with findings from research in cognitive psychology and neurophysiology. Based on this, the learning tool can distinguish whether users know or do not know words when they are reading foreign language sentences. If the learning tool judges a word to be unknown, it immediately provides the student with the meaning of the word by extracting it from an on-line dictionary. The proposed model provides a tool which empowers independent learning and makes access to the meanings of unknown words automatic. In this way, it can enhance a user's reading achievement as well as satisfaction with text comprehension in a foreign language.

A study on the type of navigation interface design for information search in e-commerce (이커머스에서 정보 탐색을 위한 네비게이션 인터페이스 디자인 유형 연구)

  • Jung, Da-Young;Kim, Seung-In
    • Journal of Digital Convergence
    • /
    • v.19 no.10
    • /
    • pp.411-418
    • /
    • 2021
  • In this study, information search methods and user interface types provided to users were investigated for the top 100 e-commerce services selected by Statista and the National Retail Federation. And the characteristics of each type were derived by analyzing the interaction method of the user's manipulation with the visualization elements constituting the interface. The research results are as follows. First, as the information provision method, spread format was more often used as the number and hierarchy of information increased, and drop-down and mega menu methods were used more often as the number and hierarchy of information decreased. Second, as a visual classification method according to the information hierarchy, the background color, font change, and line were often used, and there were many cases where the background color and line were used at the same time. Third, there were various elements such as background color, text color, and line as an interaction method for user manipulation, and two or more of them were applied at the same time the most. This study is meaningful in that it defines the characteristics of each type through the analysis of the types of interfaces for e-commerce information search and items that can be the selection criteria for detailed elements.

Analysis of News Agenda Using Text mining and Semantic Network Analysis: Focused on COVID-19 Emotions (텍스트 마이닝과 의미 네트워크 분석을 활용한 뉴스 의제 분석: 코로나 19 관련 감정을 중심으로)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.47-64
    • /
    • 2021
  • The global spread of COVID-19 around the world has not only affected many parts of our daily life but also has a huge impact on many areas, including the economy and society. As the number of confirmed cases and deaths increases, medical staff and the public are said to be experiencing psychological problems such as anxiety, depression, and stress. The collective tragedy that accompanies the epidemic raises fear and anxiety, which is known to cause enormous disruptions to the behavior and psychological well-being of many. Long-term negative emotions can reduce people's immunity and destroy their physical balance, so it is essential to understand the psychological state of COVID-19. This study suggests a method of monitoring medial news reflecting current days which requires striving not only for physical but also for psychological quarantine in the prolonged COVID-19 situation. Moreover, it is presented how an easier method of analyzing social media networks applies to those cases. The aim of this study is to assist health policymakers in fast and complex decision-making processes. News plays a major role in setting the policy agenda. Among various major media, news headlines are considered important in the field of communication science as a summary of the core content that the media wants to convey to the audiences who read it. News data used in this study was easily collected using "Bigkinds" that is created by integrating big data technology. With the collected news data, keywords were classified through text mining, and the relationship between words was visualized through semantic network analysis between keywords. Using the KrKwic program, a Korean semantic network analysis tool, text mining was performed and the frequency of words was calculated to easily identify keywords. The frequency of words appearing in keywords of articles related to COVID-19 emotions was checked and visualized in word cloud 'China', 'anxiety', 'situation', 'mind', 'social', and 'health' appeared high in relation to the emotions of COVID-19. In addition, UCINET, a specialized social network analysis program, was used to analyze connection centrality and cluster analysis, and a method of visualizing a graph using Net Draw was performed. As a result of analyzing the connection centrality between each data, it was found that the most central keywords in the keyword-centric network were 'psychology', 'COVID-19', 'blue', and 'anxiety'. The network of frequency of co-occurrence among the keywords appearing in the headlines of the news was visualized as a graph. The thickness of the line on the graph is proportional to the frequency of co-occurrence, and if the frequency of two words appearing at the same time is high, it is indicated by a thick line. It can be seen that the 'COVID-blue' pair is displayed in the boldest, and the 'COVID-emotion' and 'COVID-anxiety' pairs are displayed with a relatively thick line. 'Blue' related to COVID-19 is a word that means depression, and it was confirmed that COVID-19 and depression are keywords that should be of interest now. The research methodology used in this study has the convenience of being able to quickly measure social phenomena and changes while reducing costs. In this study, by analyzing news headlines, we were able to identify people's feelings and perceptions on issues related to COVID-19 depression, and identify the main agendas to be analyzed by deriving important keywords. By presenting and visualizing the subject and important keywords related to the COVID-19 emotion at a time, medical policy managers will be able to be provided a variety of perspectives when identifying and researching the regarding phenomenon. It is expected that it can help to use it as basic data for support, treatment and service development for psychological quarantine issues related to COVID-19.

Implementation of JBIG2 CODEC using Segmentation for Effective Compression (효율적인 압축을 위한 영역 세그먼트를 이용한 JBIG2 CODEC 구현)

  • 백옥규;고형화
    • Proceedings of the IEEK Conference
    • /
    • 2001.09a
    • /
    • pp.37-40
    • /
    • 2001
  • JBIG2 표준은 그레이 문서를 고압축의 이진 영상으로 부호화 하기위하여 선 영역(region of line-art), 하프톤 영역(region of Halftone), 텍스트 영역(region of Text)으로 세그먼트하여 각각 영역에 최적화 모드를 사용하여 부호화한다. 본 논문에서는 JBIG2에서 제공하는 세가지 모드의 코딩, 즉, 제네릭 영역(region of Generic) 코딩, 텍스트 영역을 위한 패턴 매칭(Pattern Matching) 코딩, 하프톤 영역을 위한 하프톤 코딩을 모두 구현하였다. 그리고, 각 영역을 세그먼트하는 방법을 개선하여 적용하여 세그먼트의 성능 향상을 이루었다. 특히, 부호화량이 많은 하프톤 영역의 세그먼트를 향상시켜 최적화 모드로 부호화 하도록 구현하였다. 팩스 테스트 영상(IEEE-l67a)으로 구현한 JBIC2 CODEC을 실험한 결과, 각 영역에 대한 세그먼트가 [6]의 방법에 의한 세그먼트보다 더 효율적으로 이루어졌으며 주관적 화질 또한 우수하였다.

  • PDF

Hansel and English Text Font Recognition Using Geometrical Pattern Vector (기하학적 패턴 벡터를 이용한 한.영 글꼴 문자인식)

  • 석영수;홍창희;조정락;강기섭;민종규;이응주
    • Proceedings of the IEEK Conference
    • /
    • 2001.09a
    • /
    • pp.425-428
    • /
    • 2001
  • 본 논문에서는 문서 위의 문자를 Off-Line방식으로 컴퓨터에 저장할 수 있도록 기하학적 패턴 벡터를 이용하여 한·영문자 및 글꼴을 인식하는 알고리즘을 제안하였다. 일반적으로 문서에서는 여러 가지 글꼴에 따라 글자의 형태가 다르므로 대표적인 한·영 세 가지 글꼴을 기하학적 패턴(Geometrical Pattern Vector)을 이용하여 크기와 이동에 인식하도록 하였다. 이진 입력 한영혼용 영상에서 잡음을 제거하고 수평·수직 투영 기법을 이용하여 한 문자를 분할하여 문자의 폭에 따라 기하학적 패턴을 추출한다. 추출한 패턴은 각 합계를 계산하여 기준 패턴 합계와 비교한 후 기준 패턴 문자와 글꼴을 인식하게 된다. 마지막으로 제안한 알고리즘의 성능을 평가하기 위해 크기, 이동 변형이 있는 대표적인 한·영 글꼴(신명조, 궁서, 고딕)체와 영어 Time New Roman체를 대상으로 모의 실험을 수행하였다. 제안한 알고리즘은 기존의 원형 패턴 알고리즘보다 문자인식률과 글꼴 그리고 영어의 대·소문자를 구별하는 우수함을 보였다.

  • PDF

Recognition and classification of dimension set for automatic input of mechanical drawings (기계 도면의 자동 입력을 위한 치수 집합의 인식 및 분류)

  • 정윤수;박길흠
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.34S no.11
    • /
    • pp.114-125
    • /
    • 1997
  • This paper presents a method that automatically recognizes dimension sets from the mechanical drawings, and that classifies 6 types dimension sets according to functional purpose. In the proposed method, the object and closed-loop symbols are separated from the character-free drawings. Then object lines and interpretation lines are vectorized. And, after recognizing dimension sets(consistings of arrowhead, shape line, tail lines, extension lines, text-string, and feature control frame), we classify recognized dimension sets as horizontal, vertical, angular, diametral, radial, and leader dimension sets. Finally the proposed method converts classified dimension sets into AutoCAD data by using AutoLisp language. By using the methods of geometric modeling, the proposed method readily recognized and classifies dimension sets from complex drawings. Experimetnal results are presented, which are obtained by applying the proposed method to drawings drawn in compliance with the KS drafting standard.

  • PDF

Application of Electronic Retailing in Apparel (의류를 중심으로 한 전자상거래의 활용 실태에 관한 연구)

  • Won, Myung-Sim
    • Korean Journal of Human Ecology
    • /
    • v.8 no.3
    • /
    • pp.511-524
    • /
    • 1999
  • This research examines 13 Korean Web sites and 15 foreign Web Sites to explore how companies present apparel products by both layout of graphics and information at the Web sites. The results show that most Web sites display tiny icons next to the item's text description. Clicking on these icons takes the customers to another web page, where the full size photograph of the item appears. The results also revealed that most web sites offer shopping bag function and payment options such as on-line and credit cards. The results indicate that Web sites are constantly evolving and following functions such as virtual dressing room, FAQ, the links, E-Cash payment, currency converter and multilingual sites are becoming standards in the near future.

  • PDF