• Title/Summary/Keyword: text image

Detecting Rectangular Image Regions in a Window Image for 3D Conversion (3D 변환을 위한 윈도우영상에서 사각 이미지 영역 검출)

  • Gil, Jong In;Lee, Jun Seok;Kim, Manbae
    • Journal of Broadcast Engineering / v.18 no.6 / pp.795-807 / 2013
  • In recent years, 2D-to-3D conversion techniques have attracted much attention. Most conventional methods focus on natural images such as movies and animation. However, it is difficult to apply these techniques to window images that mix text, images, logos, and icons. In addition, assigning different depth values to text pixels causes distortion, so a proper 3D image cannot be delivered in some situations. To solve this problem, we propose a method that classifies a given image as either a window image or a natural image. For a window image, only rectangular image regions (RIR) are detected and converted to 3D; the remaining text and background are displayed in 2D. The proposed method was tested on more than 10,000 images. In the experimental results, the detection ratio for window images reaches 97% and the RIR detection ratio is 87%.

A Study about Inter-Textuality in Modern Hair Style - Focused on Collections - (현대 헤어스타일에 표현된 텍스트의 다원화 현상에 관한 연구 - 컬렉션을 중심으로 -)

  • Kim, Sung-Ah;Yoo, Tae-Soon
    • Fashion & Textile Research Journal / v.11 no.6 / pp.934-941 / 2009
  • The purpose of this study is to examine how the pluralistic phenomenon in text operates in hair style, in comparison with fashion, in collections. The results indicate that the pluralistic image in text shown in modern fashion appears as a pluralistic phenomenon by gender, T.P.O, coordination, and material. The pluralistic image in text for hair style appears as a pluralistic phenomenon by gender and by material and cultural category. As for the method, the study was limited to fashion collections from 2001 to 2007; it analyzed hair-style features in photos taken from style.com, an online site specializing in fashion, and carried out a literature review of the theoretical background on intertextuality. Analyzing works in terms of the pluralistic phenomenon in text made it possible to view them from a new perspective, different from past perceptions, and opened up the possibility of understanding many unfamiliar representations that could not be understood before. The process of imitating and reconstructing each text according to compositional principles showed the need for an artist's ability to realize an original creative world.

Adversarial Shade Generation and Training Text Recognition Algorithm that is Robust to Text in Brightness (밝기 변화에 강인한 적대적 음영 생성 및 훈련 글자 인식 알고리즘)

  • Seo, Minseok;Kim, Daehan;Choi, Dong-Geol
    • The Journal of Korea Robotics Society / v.16 no.3 / pp.276-282 / 2021
  • Systems for recognizing text in natural scenes have been applied in various industries. However, text recognition performance drops significantly under the brightness changes that occur in natural scenes, such as light reflection and shadow. To solve this problem, we propose an adversarial shadow generation and training algorithm that is robust to shadow changes. The algorithm divides the entire image into a total of 9 grids and adjusts the brightness with 4 trainable parameters for each grid. Training is then conducted in an adversarial relationship between the text recognition model and the shaded image generator. As training progresses, increasingly difficult shaded grid combinations occur. Training in this curriculum-learning manner not only improved performance by more than 3% on the ICDAR2015 public benchmark dataset, but also improved performance on our own Android application text recognition dataset.
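
The grid-wise brightness adjustment described in this abstract can be illustrated with a minimal sketch. The abstract does not say what the 4 trainable parameters per grid are, so the sketch below swaps in a single multiplicative gain per cell as a simplification; the function name `shade_image`, the `gains` table, and the sample values are all hypothetical and are not taken from the paper. No training or adversarial loop is shown, only how a 3 x 3 partition could map per-cell brightness factors onto pixels.

```python
def shade_image(image, gains, grid=3):
    """Scale pixel brightness cell by cell on a grid x grid partition.

    image: list of rows of grayscale values in [0, 255].
    gains: grid x grid list of multiplicative brightness factors.
           Assumption: one gain per cell stands in for the four
           trainable parameters per grid mentioned in the abstract.
    """
    height, width = len(image), len(image[0])
    shaded = []
    for y, row in enumerate(image):
        # Index of the grid row that contains pixel row y.
        cell_y = min(y * grid // height, grid - 1)
        new_row = []
        for x, value in enumerate(row):
            # Index of the grid column that contains pixel column x.
            cell_x = min(x * grid // width, grid - 1)
            scaled = value * gains[cell_y][cell_x]
            # Clamp to the valid 8-bit range.
            new_row.append(max(0, min(255, round(scaled))))
        shaded.append(new_row)
    return shaded


# Made-up 6 x 6 flat gray image and made-up gains, for illustration only.
image = [[100] * 6 for _ in range(6)]
gains = [[0.5, 1.0, 1.5],
         [1.0, 1.0, 1.0],
         [2.0, 1.0, 3.0]]
shaded = shade_image(image, gains)
```

In an adversarial setup of the kind the abstract outlines, the gains would be produced by a generator trained to make recognition harder; here they are fixed constants so the mapping itself is easy to inspect.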

Pill Identification Algorithm Based on Deep Learning Using Imprinted Text Feature (음각 정보를 이용한 딥러닝 기반의 알약 식별 알고리즘 연구)

  • Seon Min, Lee;Young Jae, Kim;Kwang Gi, Kim
    • Journal of Biomedical Engineering Research / v.43 no.6 / pp.441-447 / 2022
  • In this paper, we propose a pill identification model that uses an engraved text feature together with image features such as shape and color, and compare it with an identification model that does not use the engraved text feature, to verify whether improving the recognition rate of the engraved text can improve identification performance. The data consisted of 100 classes with 10 images per class. The engraved text feature was acquired through deep-learning-based Keras OCR and a 1D CNN, and the image features were acquired through a 2D CNN. According to the identification results, the accuracy of the text recognition model was 90%. The accuracy of the comparative model and the proposed model was 91.9% and 97.6%, respectively. The accuracy, precision, recall, and F1-score of the proposed model were better than those of the comparative model, with statistical significance. As a result, we confirmed that expanding the range of features improved the performance of the identification model.

Geriatric Dwelling Depression Measurement Based on Projective Image Analysis Modeling

  • Lee, Yewon;Park, Chongwook;Woo, Sungju
    • International Journal of Advanced Culture Technology / v.6 no.4 / pp.323-330 / 2018
  • The growth of the older population is expected to further increase social problems associated with population aging, such as isolation, poverty, and depression. The emerging issues associated with the older population are also expected to give further momentum to studies of the dwelling environment as a factor that ensures the health of older people and improves their quality of life. Therefore, approaches for explaining the issues of the older age group should be diversified using a variety of factors and appropriate analytic tools. Studies on measuring depression have principally focused on assessing an objective self-report questionnaire, usually in a highly structured, textual form that may not reflect the cognitive impairment of older adults. The aim of this study was to define and measure dwelling depression among older adults in Korea. The study has two specific hypotheses: (a) there will be statistically significant relationships between dwelling dissatisfaction and depression, and (b) the dwelling depression tools containing text and images, respectively, will each be assessment tools with a good construct, content validity, and reliability. In the first experiment, to define and measure dwelling depression, 301 people over 65 years old living in single and two-person households were surveyed using a text-based dwelling depression questionnaire from September 1 to 30, 2017. In the second experiment, to examine whether the projective image questionnaire could serve as a suitable replacement for the text-based questionnaire, the same participants were surveyed from January 22 to February 2, 2018. The results show that depression has a close correlation with dwelling dissatisfaction. In addition, the geriatric dwelling depression index (GDDI) based on the projective image was refined, and the projective image questionnaire has a close correlation with the text-based questionnaire. Finally, ROC curve analysis showed that the projective image questionnaire can accurately predict a depression group. To this end, this preliminary study examined the validity of the projective image questionnaire in older adults to make this instrument feasible for older populations and to contribute to a profound understanding of geriatric depression due to the living environment. We hope these findings will provide a basis for further research on psychological diagnoses using projective images.

The Text Analysis of Plasticity Expressed in the Modern Art to Wear (Part II) - Focused on the West Art Works since 1980s - (현대 예술의상에 표현된 조형성의 텍스트 분석 (제2보) - 1980년대 이후 서구 작가 작품을 중심으로 -)

  • Seo, Seung-Mi;Yang, Sook-Hi
    • Journal of the Korean Society of Clothing and Textiles / v.29 no.7 s.144 / pp.926-937 / 2005
  • The analysis categories of Art to Wear were derived by text analysis of research material consisting of 100 projects compiled by fashion specialists. The general features of Art to Wear were identified, compared, and analyzed in a semiotic context. According to this analysis, the formative features of modern Art to Wear fall into three dimensions from a semiotic perspective. In the syntactic dimension, the formative features of modern Art to Wear are divided into the open constructed shape of Space Extension, non-typical Deformation, and Geometrical Plasticity. In the semantic dimension, the formative features express symbolic meaning through metaphorical signs. These signs reflect the body image of life and death and its objective of Abjection, the Hybrid of discultural appearance, and the image of Hyper-reality, which are features used to comprehend the inner meaning. In the pragmatic dimension, the formative features are divided into the Emotive Image, which delivers the artist's emotion and meaning system; the Phatic Image, which arouses inner signification; and the Poetic Image, which contains artistic and aesthetic meaning.

Design and Development of a Multimodal Biomedical Information Retrieval System

  • Demner-Fushman, Dina;Antani, Sameer;Simpson, Matthew;Thoma, George R.
    • Journal of Computing Science and Engineering / v.6 no.2 / pp.168-177 / 2012
  • The search for relevant and actionable information is a key to achieving clinical and research goals in biomedicine. Biomedical information exists in different forms: as text and illustrations in journal articles and other documents, in images stored in databases, and as patients' cases in electronic health records. This paper presents ways to move beyond conventional text-based searching of these resources, by combining text and visual features in search queries and document representation. A combination of techniques and tools from the fields of natural language processing, information retrieval, and content-based image retrieval allows the development of building blocks for advanced information services. Such services enable searching by textual as well as visual queries, and retrieving documents enriched by relevant images, charts, and other illustrations from the journal literature, patient records and image databases.

Implementation of the Embedded System Screen Control using Mobile Network (모바일 네트워크를 이용한 임베디드 전광판제어기의 구현)

  • Lee Yeon-Seok;Kim Yang-Woo
    • 한국정보통신설비학회:학술대회논문집 / 2006.08a / pp.269-273 / 2006
  • In this paper, remote screen control over mobile networks is implemented on an embedded system. For this system, a server program is ported to the embedded system, which is connected to the Internet, and a client program is ported to the mobile phone using GVM. The embedded system can display text and images from the mobile phone on its LCD. In the implemented system, text and image data from the GVM emulator are sent to the embedded system and displayed on its LCD. The realized embedded system can display text and images from a working mobile phone.

Web Image Caption Extraction using Positional Relation and Lexical Similarity (위치적 연관성과 어휘적 유사성을 이용한 웹 이미지 캡션 추출)

  • Lee, Hyoung-Gyu;Kim, Min-Jeong;Hong, Gum-Won;Rim, Hae-Chang
    • Journal of KIISE:Software and Applications / v.36 no.4 / pp.335-345 / 2009
  • In this paper, we propose a new web image caption extraction method that considers the positional relation between a caption and an image and the lexical similarity between a caption and the main text containing it. The positional relation represents where the caption is located in terms of distance and direction from the corresponding image. The lexical similarity indicates how likely the main text is to generate the caption of the image. Compared with previous image caption extraction approaches, which use only independent features of images and captions, the proposed approach can improve the recall rate and precision rate of caption extraction, and improve the F-measure by 28%, by adding the features of positional relation and lexical similarity.
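
The two features named in this abstract, positional relation and lexical similarity, can be combined as in the following minimal sketch. It is not the authors' model: token overlap stands in for the lexical similarity, an inverse-distance term with a made-up direction weight stands in for the positional relation, and the mixing weight `alpha`, the function names, and the sample candidates are all hypothetical.

```python
import math


def lexical_similarity(caption, main_text):
    """Fraction of caption tokens that also occur in the main text.

    Assumption: simple token overlap is used here as a stand-in for
    the lexical similarity feature described in the abstract.
    """
    caption_tokens = set(caption.lower().split())
    text_tokens = set(main_text.lower().split())
    if not caption_tokens:
        return 0.0
    return len(caption_tokens & text_tokens) / len(caption_tokens)


def positional_score(dx, dy):
    """Score a candidate by its offset (in pixels) from the image.

    Closer candidates score higher. The direction weight is an
    illustrative assumption: candidates below the image (dy >= 0)
    keep their score, candidates above it are halved.
    """
    closeness = 1.0 / (1.0 + math.hypot(dx, dy) / 100.0)
    direction_weight = 1.0 if dy >= 0 else 0.5
    return closeness * direction_weight


def best_caption(candidates, main_text, alpha=0.5):
    """Return the candidate text with the highest combined score.

    candidates: list of (text, dx, dy) tuples.
    alpha: hypothetical weight between the two features.
    """
    def score(candidate):
        text, dx, dy = candidate
        return (alpha * positional_score(dx, dy)
                + (1 - alpha) * lexical_similarity(text, main_text))
    return max(candidates, key=score)[0]


# Made-up page content, for illustration only.
main_text = "The photo shows Mount Halla at sunrise during winter"
candidates = [
    ("Login to your account", 0, -300),
    ("Mount Halla at sunrise", 0, 20),
]
chosen = best_caption(candidates, main_text)
```

A linear combination keeps the contribution of each feature easy to inspect; a trained classifier over the same features would be the more usual choice once labeled captions are available.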

A Study on Radiological Image Retrieval System (방사선 의료영상 검색 시스템에 관한 연구)

  • Park, Byung-Rae;Shin, Yong-Won
    • Journal of radiological science and technology / v.28 no.1 / pp.19-24 / 2005
  • The purpose of this study was to design and implement a useful annotation-based radiological image retrieval system that gives radiological technologists accurate access to educational and image information. For better retrieval performance on large image databases, we present an indexing technique that integrates the $B^+$-tree proposed by Bayer, for indexing simple attributes, with an inverted file structure for medical text keywords acquired from additional descriptive information about radiological images. We implemented the proposed retrieval system in Delphi under Windows XP. End users, namely radiological technologists, can store simple attribute information such as doctor name, operator name, body part, and disease, additional text-based descriptions, and the radiological image itself, and can retrieve the desired results from large image databases by simple attributes and text keywords through a graphical user interface. Consequently, the proposed system can be used for effective clinical decisions on radiological images, for reducing education time by organizing knowledge, and for well-organized education in clinical fields. In addition, it is expected to develop into a decision support system through the construction of a web-based integrated imaging system that includes general images and special contrast images.

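
The two-part indexing idea in the radiological image retrieval abstract (one index for simple attributes, an inverted file for description keywords) can be sketched as follows. This is a minimal illustration, not the system from the paper: a plain dictionary stands in for the $B^+$-tree, and the class name `ImageIndex`, the field names, and the sample records are hypothetical.

```python
from collections import defaultdict


class ImageIndex:
    """Toy index over annotated images.

    Assumption: a dict keyed by (field, value) replaces the B+-tree
    used for simple attributes; the keyword index is an inverted
    file mapping each word to the ids of images that mention it.
    """

    def __init__(self):
        self.records = {}
        self.by_attribute = defaultdict(set)  # (field, value) -> image ids
        self.by_keyword = defaultdict(set)    # keyword -> image ids

    def add(self, image_id, attributes, description):
        """Store one image's attributes and free-text description."""
        self.records[image_id] = (attributes, description)
        for field, value in attributes.items():
            self.by_attribute[(field, value)].add(image_id)
        for word in description.lower().split():
            self.by_keyword[word].add(image_id)

    def search(self, attributes=None, keywords=()):
        """Return sorted ids matching every attribute and keyword given."""
        result = set(self.records)
        for field, value in (attributes or {}).items():
            result &= self.by_attribute.get((field, value), set())
        for word in keywords:
            result &= self.by_keyword.get(word.lower(), set())
        return sorted(result)


# Made-up records, for illustration only.
index = ImageIndex()
index.add(1, {"body_part": "chest"}, "nodule in left upper lobe")
index.add(2, {"body_part": "chest"}, "normal study")
index.add(3, {"body_part": "knee"}, "fracture of left patella")

chest_nodules = index.search({"body_part": "chest"}, ["nodule"])
left_sided = index.search(keywords=["left"])
```

Intersecting the posting sets keeps a combined attribute and keyword query cheap, since only the ids already narrowed by one condition are checked against the next.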