• Title/Summary/Keyword: Text-to-Image

Search Result 889, Processing Time 0.166 seconds

Augmenting Text Document by Controlling Its IR-Reflectance (적외선 반사 특성 제어를 통한 텍스트 문서 증강)

  • Park, Hanhoon;Moon, Kwang-Seok
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.6
    • /
    • pp.882-892
    • /
    • 2017
  • Locally Likely Arrangement Hashing (LLAH) is a method that describes image features based on the geometry between their neighbors. Thus, it has been preferred to implement augmented reality on poorly-textured objects such as text documents. However, LLAH strongly requires that image features be detected with high repeatability and located at a distance from one another. To fulfill the requirement for text document, this paper proposes a method that facilitates the word detection in infrared (IR) range by adjusting the IR-reflectance of words. Specifically, the words are printed out with two different black inks: one is using the K(carbon black) ink only, the other is mixing the C(cyan), M(magenta), Y(yellow) inks. Since only the words printed out with the K ink is visible in IR range, a part of words are selected in advance to be used as features and printed out the K ink. The selected words can be robustly detected with high repeatability in IR range and this enables to implement augmented reality on text documents with high fidelity. The validity of the proposed method was verified through experiments.

Research and Development of Document Recognition System for Utilizing Image Data (이미지데이터 활용을 위한 문서인식시스템 연구 및 개발)

  • Kwag, Hee-Kue
    • The KIPS Transactions:PartB
    • /
    • v.17B no.2
    • /
    • pp.125-138
    • /
    • 2010
  • The purpose of this research is to enhance document recognition system which is essential for developing full-text retrieval system of the document image data stored in the digital library of a public institution. To achieve this purpose, the main tasks of this research are: 1) analyzing the document image data and then developing its image preprocessing technology and document structure analysis one, 2) building its specialized knowledge base consisting of document layout and property, character model and word dictionary, respectively. In addition, developing the management tool of this knowledge base, the document recognition system is able to handle the various types of the document image data. Currently, we developed the prototype system of document recognition which is combined with the specialized knowledge base and the library of document structure analysis, respectively, adapted for the document image data housed in National Archives of Korea. With the results of this research, we plan to build up the test-bed and estimate the performance of document recognition system to maximize the utilization of full-text retrieval system.

Text Verification Based on Sub-Image Matching (부분 영상 매칭에 기반한 텍스트 검증)

  • Son Hwa Jeong;Jeong Seon Hwa;Kim Soo Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.2 s.98
    • /
    • pp.115-122
    • /
    • 2005
  • The sub-mage matching problem in which one image contains some part of the other image, has been mostly investigated on natural images. In this paper, we propose two sub-image matching techniques: mesh-based method and correlation-based method, that are efficiently used to match text images. Mesh-based method consists of two stages, box alignment and similarity measurement by extracting the mesh feature from the two images. Correlation-based method determines the similarity using the correlation of the two images based on FFT function. We have applied the two methods to the text verification in a postal automation system and observed that the accuracy of correlation-based method is $92.7\%$ while that of mesh-based method is $90.1\%$.

Web Image Caption Extraction using Positional Relation and Lexical Similarity (위치적 연관성과 어휘적 유사성을 이용한 웹 이미지 캡션 추출)

  • Lee, Hyoung-Gyu;Kim, Min-Jeong;Hong, Gum-Won;Rim, Hae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.4
    • /
    • pp.335-345
    • /
    • 2009
  • In this paper, we propose a new web image caption extraction method considering the positional relation between a caption and an image and the lexical similarity between a caption and the main text containing the caption. The positional relation between a caption and an image represents how the caption is located with respect to the distance and the direction of the corresponding image. The lexical similarity between a caption and the main text indicates how likely the main text generates the caption of the image. Compared with previous image caption extraction approaches which only utilize the independent features of image and captions, the proposed approach can improve caption extraction recall rate, precision rate and 28% F-measure by including additional features of positional relation and lexical similarity.

Interactive Typography System using Combined Corner and Contour Detection

  • Lim, Sooyeon;Kim, Sangwook
    • International Journal of Contents
    • /
    • v.13 no.1
    • /
    • pp.68-75
    • /
    • 2017
  • Interactive Typography is a process where a user communicates by interacting with text and a moving factor. This research covers interactive typography using real-time response to a user's gesture. In order to form a language-independent system, preprocessing of entered text data presents image data. This preprocessing is followed by recognizing the image data and the setting interaction points. This is done using computer vision technology such as the Harris corner detector and contour detection. User interaction is achieved using skeleton information tracked by a depth camera. By synchronizing the user's skeleton information acquired by Kinect (a depth camera,) and the typography components (interaction points), all user gestures are linked with the typography in real time. An experiment was conducted, in both English and Korean, where users showed an 81% satisfaction level using an interactive typography system where text components showed discrete movements in accordance with the users' gestures. Through this experiment, it was possible to ascertain that sensibility varied depending on the size and the speed of the text and interactive alteration. The results show that interactive typography can potentially be an accurate communication tool, and not merely a uniform text transmission system.

A Study on the Semiotic Application about the Image Vestmental (의상 이미지의 응용 기호론적 연구(I)-엘자 스키아파렐리의 3가지 의상 이미지에 관하여-)

  • 최인순
    • Journal of the Korean Society of Costume
    • /
    • v.38
    • /
    • pp.101-122
    • /
    • 1998
  • The purpose of this study is to define the fundamentals of one symbolic concept, so calles vestment-sign, based on the logical relationship of sign system about the trichotomy by charles S. Peice's sign concept for the communication system of meaning in the non-linguistic image domain. To prove the argument of vestment-sign, I selected 3 type of vestment language by styliste, Elsa Schiaparel-li. The third image vestmental chosen here, titled“Larme-Illusion(1938)”,printed by Salvad-or Dali will produce one symbolic proposition as a logical result which is generated and developed through the interpretation of other images. First of all the text, which is manifested by Elsa Schiaparelli's first image vestmental, tit-led“Notation Musical(1937)”and is symbolized as one category in the representation of the form, is regarded symbolic and metaphorical from a standpoint that the title and the meaning is connected to the form. The second image vestment, titled“Ruches Noirs(1938)”represents externally splendid feminity man-ifested by the symbolic and metaphorical expression. And the purity of sensitivity aiming to humanity in the detail of the poetic feeling of naturalism makes us imagine the battle fild of furious sensitivity. Like as the result of the battle, the third image stimulated our eyesight with the“absence”of dressing function. The proposition of the text,《Death》which the third image delivers, constructs sign system to bring up a meaning with the disappearance of physical“signifier”. This establishment of the symbolic concept presents the etymological authority of symbol generation called“Design”.

  • PDF

A Display Method of Image Information and URL Using the Message Structures of Emergency Alert Broadcasts for 5G Cellular Communications (5G 이동통신 용 재난경보 방송의 메시지 구조를 이용한 이미지 정보 및 URL 표출기법)

  • Chang, Sekchin
    • Journal of Broadcast Engineering
    • /
    • v.26 no.5
    • /
    • pp.592-598
    • /
    • 2021
  • Current cellular systems rely on a CBS protocol for emergency alert broadcast services. However, the CBS protocol just specifies the delivery of a limited text message. Therefore, foreigners, who are unfamiliar with local characters, may have some difficulties in understanding the received CBS text message. The CBS protocol also reveals a distinct restriction in delivering abundant information because of a limited number of text characters. In order to overcome the weak points of the current CBS protocol, we propose a display method of image information and URL on the screens of mobile terminals for the received CBS text message in this paper. The presented approach effectively utilizes the message structure of CBS for 5G cellular systems.

Character Recognition Algorithm in Low-Quality Legacy Contents Based on Alternative End-to-End Learning (대안적 통째학습 기반 저품질 레거시 콘텐츠에서의 문자 인식 알고리즘)

  • Lee, Sung-Jin;Yun, Jun-Seok;Park, Seon-hoo;Yoo, Seok Bong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.11
    • /
    • pp.1486-1494
    • /
    • 2021
  • Character recognition is a technology required in various platforms, such as smart parking and text to speech, and many studies are being conducted to improve its performance through new attempts. However, with low-quality image used for character recognition, a difference in resolution of the training image and test image for character recognition occurs, resulting in poor accuracy. To solve this problem, this paper designed an end-to-end learning neural network that combines image super-resolution and character recognition so that the character recognition model performance is robust against various quality data, and implemented an alternative whole learning algorithm to learn the whole neural network. An alternative end-to-end learning and recognition performance test was conducted using the license plate image among various text images, and the effectiveness of the proposed algorithm was verified with the performance test.

Deep-Learning Approach for Text Detection Using Fully Convolutional Networks

  • Tung, Trieu Son;Lee, Gueesang
    • International Journal of Contents
    • /
    • v.14 no.1
    • /
    • pp.1-6
    • /
    • 2018
  • Text, as one of the most influential inventions of humanity, has played an important role in human life since ancient times. The rich and precise information embodied in text is very useful in a wide range of vision-based applications such as the text data extracted from images that can provide information for automatic annotation, indexing, language translation, and the assistance systems for impaired persons. Therefore, natural-scene text detection with active research topics regarding computer vision and document analysis is very important. Previous methods have poor performances due to numerous false-positive and true-negative regions. In this paper, a fully-convolutional-network (FCN)-based method that uses supervised architecture is used to localize textual regions. The model was trained directly using images wherein pixel values were used as inputs and binary ground truth was used as label. The method was evaluated using ICDAR-2013 dataset and proved to be comparable to other feature-based methods. It could expedite research on text detection using deep-learning based approach in the future.

On the Study of Textual Classics and Artistic Creation - Taking Buddhist Art Dunhuang Grottoes as an Example

  • Liu Tingting
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.205-210
    • /
    • 2023
  • Stone cave paintings are continuous interactions as independent mediums in places such as text, images and stone cave architecture. Unlike Buddha statues, the narrative of the text always fascinates and guides the viewer to the timeliness of the image, that is, the narrative. In particular, in Buddhist art, Buddha statues are never simple images, and murals are never simple paintings. Before the Tang Dynasty, most unknown artists were artisans, and many artists still worked on murals in temples and palaces, and independent paintings such as scrolls and sides became an important form of painting after the Tang Dynasty, changing the mechanism of painting creation. In this paper, the graphic creation process prioritizes dedication and service, but we can still feel the creativity of the painters strongly. The historical resources of how to paint these paintings, the clues to the copies, and the precursor to the foreground, encourage the painters to constantly try to resemble each other and discover problems...Therefore, in this paper, it was confirmed that reinvention and creativity are very important, and that Dunhuang Buddhist art is the basis for artists' creation and the source of vitality.