Search | Korea Science

Performance Improvement of TextFuseNet using Image Sharpening (선명화 기법을 이용한 TextFuseNet 성능 향상)

Jeong, Ji-Yeon;Cheon, Ji-Eun;Jung, Yuchul
- Proceedings of the Korean Society of Computer Information Conference / 2021.01a / pp.71-73 / 2021
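In this paper, we propose applying image sharpening, an image-processing technique, to TextFuseNet, a new framework for scene text detection. Scene text detection is a technology for recognizing text against arbitrary backgrounds such as outdoor signboards and road signs, and TextFuseNet is one of its frameworks. TextFuseNet detects text at the character, word, and global levels; our goal is to improve its performance by applying sharpening. We used two sharpening techniques, the conventional sharpening filter and unsharp masking, and confirmed that AP improved by 0.9% when the sharpening filter was applied.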
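The two sharpening techniques named in this abstract can be sketched as follows. This is a minimal illustration on a grayscale NumPy array, not the authors' preprocessing code: the 3x3 kernel values, the box blur used inside the unsharp mask, and the `amount` parameter are all assumptions, since the abstract does not give them.

```python
import numpy as np

def filter2d_same(image, kernel):
    """Apply a small symmetric kernel to a 2-D image, edge-padded so the shape is preserved."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(image.astype(np.float64), ((ph, ph), (pw, pw)), mode="edge")
    out = np.zeros(image.shape, dtype=np.float64)
    for dy in range(kh):
        for dx in range(kw):
            out += kernel[dy, dx] * padded[dy:dy + image.shape[0], dx:dx + image.shape[1]]
    return out

# Assumed kernel: a common Laplacian-based sharpening kernel whose weights sum to 1.
SHARPEN_KERNEL = np.array([[0, -1, 0],
                           [-1, 5, -1],
                           [0, -1, 0]], dtype=np.float64)

def sharpening_filter(image):
    """Sharpen by filtering with a fixed high-boost kernel."""
    return np.clip(filter2d_same(image, SHARPEN_KERNEL), 0, 255)

def unsharp_mask(image, amount=1.0):
    """Sharpen by adding back the difference between the image and a blurred copy."""
    box_blur = np.full((3, 3), 1.0 / 9.0)  # assumed blur; a Gaussian is also common
    blurred = filter2d_same(image, box_blur)
    return np.clip(image + amount * (image - blurred), 0, 255)
```

Both functions leave flat regions unchanged and exaggerate intensity differences at edges, which is the property a text detector might benefit from.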

Joint-transform Correlator Multiple-image Encryption System Based on Quick-response Code Key

Chen, Qi;Shen, Xueju;Cheng, Yue;Huang, Fuyu;Lin, Chao;Liu, HeXiong
- Current Optics and Photonics / v.3 no.4 / pp.320-328 / 2019
A method for joint-transform correlator (JTC) multiple-image encryption based on a quick-response (QR) code key is proposed. The QR codes converted from different texts are used as key masks to encrypt and decrypt multiple images. Not only can Chinese text and English text be used as key text, but also symbols can be used. With this method, users have no need to transmit the whole key mask; they only need to transmit the text that is used to generate the key. The correlation coefficient is introduced to evaluate the decryption performance of our proposed cryptosystem, and we explore the sensitivity of the key mask and the capability for multiple-image encryption. Robustness analysis is also conducted in this paper. Computer simulations and experimental results verify the correctness of this method.
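The abstract evaluates decryption quality with a correlation coefficient but does not state its formula. A minimal sketch, assuming the usual Pearson correlation between the original and decrypted images (the function name and the zero-variance convention are illustrative choices):

```python
import numpy as np

def correlation_coefficient(original, decrypted):
    """Pearson correlation between two images of the same shape (assumed definition)."""
    a = np.asarray(original, dtype=np.float64).ravel()
    b = np.asarray(decrypted, dtype=np.float64).ravel()
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    if denom == 0.0:
        return 0.0  # convention chosen here for constant images
    return float((a * b).sum() / denom)
```

Under this definition a value near 1 indicates that the decrypted image closely matches the original, while a value near 0 is what one would expect when a wrong key mask is used.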
https://doi.org/10.3807/COPP.2019.3.4.320

Study on Generation of Children's Hand Drawing Learning Model for Text-to-Image (Text-to-Image를 위한 아동 손그림 학습 모델 생성 연구)

Lee, Eunchae;Moon, Mikyeong
- Proceedings of the Korean Society of Computer Information Conference / 2022.07a / pp.505-506 / 2022
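Artificial intelligence is advancing at an increasing pace and its application areas are expanding, so its role in the creative industries is growing and it is influencing art, film, and other creative fields. AI techniques exist that generate images in various styles from a text description, but they cannot generate pictures in the style of drawings made by children themselves. This paper describes the process of training a Text-to-Image model on children's hand-drawing data to create a new learned model. We expect that, through this research, generated pixels can be combined to produce a single child-style hand drawing from text.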

Efficient Text Localization using MLP-based Texture Classification (신경망 기반의 텍스춰 분석을 이용한 효율적인 문자 추출)

Jung, Kee-Chul;Kim, Kwang-In;Han, Jung-Hyun
- Journal of KIISE:Software and Applications / v.29 no.3 / pp.180-191 / 2002
We present a new method for localizing text in images using a multi-layer perceptron (MLP) and a multiple continuously adaptive mean shift (MultiCAMShift) algorithm. An automatically constructed MLP-based texture classifier generates a text probability image for various types of images without explicit feature extraction. The MultiCAMShift algorithm, which operates on the text probability image produced by the MLP, can place bounding boxes efficiently without analyzing the texture properties of the entire image.

Design and Implementation of Image Gallery using Text Embedded JPEG (Text Embedded JPEG를 이용한 Image Gallery의 설계 및 구현)

천시영;곽미라;조동섭
- Proceedings of the Korea Multimedia Society Conference / 2003.05b / pp.724-727 / 2003
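Image galleries on the web today often include a title or description alongside each image. In this paper, to strengthen gallery functions such as search and sorting and to integrate images with their information, we devised Text Embedded JPEG, which extends the JPEG image header to embed text information such as the image's author, creation date, description, and file size. The web gallery built on Text Embedded JPEG is designed so that users can view more detailed information about each image, sort by each of these fields, and modify the image information.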
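The abstract does not describe how the authors extend the JPEG header. As a stand-in, the sketch below embeds text in a standard JPEG comment (COM, marker 0xFFFE) segment, placed after any leading APPn segments. The function names and the free-form text layout are hypothetical and are not the paper's Text Embedded JPEG format.

```python
import struct

SOI = b"\xff\xd8"   # start-of-image marker
COM = b"\xff\xfe"   # comment segment marker

def embed_text(jpeg_bytes, text):
    """Return a copy of the JPEG with `text` stored in a COM segment after the APPn segments."""
    if not jpeg_bytes.startswith(SOI):
        raise ValueError("not a JPEG stream")
    payload = text.encode("utf-8")
    if len(payload) > 65533:
        raise ValueError("text too long for a single segment")
    # The 2-byte length field counts itself plus the payload.
    segment = COM + struct.pack(">H", len(payload) + 2) + payload
    pos = 2
    # Skip APP0..APP15 so that a JFIF/Exif header stays directly after SOI.
    while (pos + 4 <= len(jpeg_bytes) and jpeg_bytes[pos] == 0xFF
           and 0xE0 <= jpeg_bytes[pos + 1] <= 0xEF):
        (length,) = struct.unpack(">H", jpeg_bytes[pos + 2:pos + 4])
        pos += 2 + length
    return jpeg_bytes[:pos] + segment + jpeg_bytes[pos:]

def read_text(jpeg_bytes):
    """Return the first COM segment's text, or None if the header has none."""
    pos = 2
    while pos + 4 <= len(jpeg_bytes) and jpeg_bytes[pos] == 0xFF:
        marker = jpeg_bytes[pos + 1]
        if marker == 0xDA:  # start of scan: header segments are over
            break
        (length,) = struct.unpack(">H", jpeg_bytes[pos + 2:pos + 4])
        if marker == 0xFE:
            return jpeg_bytes[pos + 4:pos + 2 + length].decode("utf-8")
        pos += 2 + length
    return None
```

A gallery could call `read_text` on each file to obtain sortable fields without a separate database, which is the kind of integration the abstract describes.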

Recent Development in Text-based Medical Image Retrieval (텍스트 기반 의료영상 검색의 최근 발전)

Hwang, Kyung Hoon;Lee, Haejun;Koh, Geon;Kim, Seog Gyun;Sun, Yong Han;Choi, Duckjoo
- Journal of Biomedical Engineering Research / v.36 no.3 / pp.55-60 / 2015
An effective image retrieval system is needed as the amount of medical imaging data continues to grow. The authors review recent developments in text-based medical image retrieval, including the use of controlled vocabularies such as RadLex (Radiology Lexicon) and FMA (Foundational Model of Anatomy), natural language processing, semantic ontology, and image annotation and markup.
https://doi.org/10.9718/JBER.2015.36.3.55

A Study on Visual Behavior for Presenting Consumer-Oriented Information on an Online Fashion Store

Kim, Dahyun;Lee, Seunghee
- Journal of the Korean Society of Clothing and Textiles / v.44 no.5 / pp.789-809 / 2020
Growth in online channels has created fierce competition; consequently, retailers have to invest an increasing amount of effort into attracting consumers. In this study, eye-tracking technology was used to examine consumers' visual behavior in order to understand how they search for information when exploring fashion products. Product attribute information was classified into two image-based elements (model image information and detail image information) and two text-based elements (basic text information and detail text information), after which consumers' visual behavior for each information element was analyzed. Whether involvement affects consumers' information search behavior was also investigated. The results demonstrated that model image information attracted visual attention the quickest, while detail text information and model image information received the most visual attention. Additionally, high-involvement consumers tended to pay more attention to detailed information, while low-involvement consumers tended to pay more attention to image-based and basic information. This study is expected to broaden the understanding of consumer behavior and to provide implications for establishing strategies on how to efficiently organize product information for online fashion stores.
https://doi.org/10.5850/JKSCT.2020.44.5.789

Simple Image Stenography Technology for Large Scale Text (대용량 텍스트를 위한 손실 없는 영상 은닉기술)

Rhee, Keun-Moo
- Proceedings of the Korea Information Processing Society Conference / 2008.05a / pp.1104-1107 / 2008
Image and document hiding techniques are being studied for a variety of purposes and uses across all types of digital data, including documents, images, and audio. This study implements a simple hiding technique that can carry a large volume of text requiring only a low level of security. The text image was first combined with, and encoded into, a color image of 24-bit depth and then restored; analysis with a correlation technique confirmed that the loss in the text image was slight.
https://doi.org/10.3745/PKIPS.y2008m05a.1104

A Still Image Compression System with a High Quality Text Compression Capability (고 품질 텍스트 압축 기능을 지원하는 정지영상 압축 시스템)

Lee, Je-Myung;Lee, Ho-Suk
- Journal of KIISE:Software and Applications / v.34 no.3 / pp.275-302 / 2007
We propose a novel still image compression system that supports high-quality text compression. The system segments the text from the image and compresses the text at high quality. It achieves a high compression ratio of 48:1 using context-based adaptive binary arithmetic coding, which is applied to the codeblocks in each bitplane. The system offers two input modes: a segmentation mode and an ROI (Region Of Interest) mode. In segmentation mode, the input image is segmented into a foreground consisting of text and a background consisting of the remaining region. In ROI mode, the input image is represented by the region-of-interest window. The combination of high-quality text compression and a high compression ratio shows that the proposed system is comparable to JPEG2000 products. The system also uses Gray coding to improve the compression ratio.
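The abstract mentions Gray coding without saying where in the pipeline it is applied. The sketch below shows only the standard binary-reflected Gray code conversion. The link to compression, namely that neighbouring intensity values differ in a single bit and so individual bitplanes tend to contain fewer transitions, is the usual motivation and is an assumption about this system.

```python
def to_gray(n):
    """Convert a non-negative integer to its binary-reflected Gray code."""
    return n ^ (n >> 1)

def from_gray(g):
    """Invert to_gray by XOR-ing together g and all of its right shifts."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n

def bit_difference(a, b):
    """Count the bit positions in which two non-negative integers differ."""
    return bin(a ^ b).count("1")
```

For example, the 8-bit intensities 127 and 128 differ in all eight bits in plain binary, but their Gray codes (64 and 192) differ in one bit only.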

The Effectiveness of High-level Text Features in SOM-based Web Image Clustering (SOM 기반 웹 이미지 분류에서 고수준 텍스트 특징들의 효과)

Cho Soo-Sun
- The KIPS Transactions:PartB / v.13B no.2 s.105 / pp.121-126 / 2006
In this paper, we propose an approach to improve the clustering of Web images by using high-level semantic features drawn from text information relevant to the images, in addition to low-level visual features of the images themselves. These high-level text features can be obtained from image URLs and file names, page titles, hyperlinks, and surrounding text. The self-organizing map (SOM) proposed by Kohonen is used as the clustering engine. In SOM-based clustering using both high-level text features and low-level visual features, 200 images from 10 categories are effectively divided into suitable clusters. To evaluate clustering performance, we propose simple but novel measures that indicate the degree to which images from the same category are scattered and the degree to which they accumulate together. From the experimental results, we find that the high-level text features are more useful in SOM-based Web image clustering.
https://doi.org/10.3745/KIPSTB.2006.13B.2.121

Search Result 973, Processing Time 0.03 seconds
