• Title/Abstract/Keywords: smart phone images

Search Result 88, Processing Time 0.033 seconds

Distortion Corrected Black and White Document Image Generation Based on Camera (카메라기반의 왜곡이 보정된 흑백 문서 영상 생성)

  • Kim, Jin-Ho
    • The Journal of the Korea Contents Association / v.15 no.11 / pp.18-26 / 2015
  • Document images captured with a camera instead of a scanner can contain geometric distortion and shadows caused by the capture angle. This paper proposes an algorithm that generates a clean black-and-white document image from a camera capture by correcting distortion and eliminating shadows. To correct geometric distortion, that is, to straighten boundary lines bent by radial lens distortion and to remove the outlying area included because of the camera direction, a document boundary detection method based on a second-derivative filter is developed. Black-and-white images are then generated with an adaptive binarization method that eliminates the shadow effect. Experiments on document images captured with a smartphone camera show very good results for both geometric distortion recovery and shadow elimination.
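The abstract does not say which adaptive binarization method is used. As a rough illustration of why a locally adaptive threshold copes with shadows where a single global threshold does not, here is a minimal sketch using local-mean thresholding. The function name `adaptive_binarize`, the `radius` and `offset` parameters, and the toy page are all assumptions made for this example, not details from the paper.

```python
def adaptive_binarize(img, radius=1, offset=15):
    """Threshold each pixel against the mean of its local window.

    img is a list of rows of grayscale values (0-255). A pixel becomes
    white (255) if it is brighter than (local mean - offset), else black (0).
    This is plain local-mean thresholding, used here as a stand-in.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ys = range(max(0, y - radius), min(h, y + radius + 1))
            xs = range(max(0, x - radius), min(w, x + radius + 1))
            window = [img[j][i] for j in ys for i in xs]
            local_mean = sum(window) / len(window)
            out[y][x] = 255 if img[y][x] > local_mean - offset else 0
    return out


# Toy page: the background darkens from left to right (a shadow),
# with one ink pixel in the bright part and one in the shaded part.
row = [200, 180, 160, 140, 120, 100, 80]
page = [row[:], row[:], row[:]]
page[1][1] = 100  # ink under bright light
page[1][5] = 20   # ink inside the shadow

binary = adaptive_binarize(page)

# A single global threshold turns the whole shaded area black.
global_binary = [[255 if v > 128 else 0 for v in r] for r in page]
```

On this toy page the local threshold keeps only the two ink pixels black, while the global threshold also blackens the three shaded background columns.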

An Image Denoising Algorithm Using Multiple Images for Mobile Smartphone Cameras (스마트폰 카메라에서 다중 영상을 이용한 영상 잡음 제거 알고리즘)

  • Kim, Sung-Un
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.9 no.10 / pp.1189-1195 / 2014
  • This study proposes an image denoising algorithm for mobile smartphones that uses information obtained from multiple images captured in the same environment. We also devise a multiple-image registration method for smartphone cameras with limited computing ability, and present an effective denoising algorithm that combines the information obtained from the registered images. The proposed algorithm achieves a much better PSNR than a single-image method. We verified that the proposed approach delivers good denoising quality and runs at a practical speed on Android smartphones.
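The abstract does not describe how the registered images are combined. The simplest form of multi-image denoising is a pixel-wise average, and the sketch below uses it only to show why several frames can beat one frame on PSNR. It assumes the frames are already registered and flattened to 1-D lists; the synthetic data and the names `average_frames` and `psnr` are illustrative, not taken from the paper.

```python
import math
import random


def average_frames(frames):
    """Pixel-wise mean of equally sized, already registered frames."""
    n = len(frames)
    return [sum(px) / n for px in zip(*frames)]


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB between two flattened images."""
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    return float("inf") if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)


# Synthetic test: a flat gray image observed 8 times with Gaussian noise.
rng = random.Random(0)
clean = [128.0] * 1000
frames = [[p + rng.gauss(0, 10) for p in clean] for _ in range(8)]

single_psnr = psnr(clean, frames[0])
fused_psnr = psnr(clean, average_frames(frames))
```

Averaging N frames with independent noise reduces the noise variance by about a factor of N, so `fused_psnr` comes out higher than `single_psnr` here.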

A Manually Captured and Modified Phone Screen Image Dataset for Widget Classification on CNNs

  • Byun, SungChul;Han, Seong-Soo;Jeong, Chang-Sung
    • Journal of Information Processing Systems / v.18 no.2 / pp.197-207 / 2022
  • The applications and user interfaces (UIs) of smart mobile devices are constantly diversifying. Deep learning can be an innovative way to classify widgets in screen images and thereby increase convenience. To this end, the present research uses captured images and the ReDraw dataset to build deep learning datasets for image classification. First, validation of the datasets with ResNet50 and EfficientNet shows that the dataset composed in this study is helpful for classifying widgets according to their functionality. Widget detection and classification are then implemented on RetinaNet and EfficientNet. Finally, the research proposes the Widg-C and Widg-D datasets, deep learning datasets for identifying the widgets of smart devices, and applies them to representative convolutional neural network models.

Content-based Image Retrieval using Spatial-Color and Gabor Texture on A Mobile Device (모바일 디바이스상에서 공간-칼라와 가버 질감을 이용한 내용-기반 영상 검색)

  • Lee, Yong-Hwan;Lee, June-Hwan;Cho, Han-Jin;Kwon, Oh-Kin;Kim, Youngseop
    • Journal of the Semiconductor & Display Technology / v.13 no.4 / pp.91-96 / 2014
  • Mobile image retrieval is one of the most exciting and fastest growing research fields in multimedia technology. As the amount of digital content continues to grow, users are experiencing increasing difficulty in finding specific images in their image libraries. This paper proposes a new efficient and effective mobile image retrieval method that applies a weighted combination of color and texture, using spatial-color and second-order statistics. The mobile image search system runs in real time on an iPhone and can easily be used to find a specific image. To evaluate the new method, we assessed the performance of the iPhone simulation in terms of average precision and recall on several image databases and compared the results with those of existing methods. Experimental trials revealed that the proposed descriptor improved retrieval effectiveness by more than 13% over the best of the other descriptors.
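The abstract gives neither the descriptors nor the weights. The sketch below only illustrates the general idea of ranking images by a weighted sum of a color distance and a texture distance, and of scoring a result list with precision and recall. The L1 distance, the `w_color` parameter, and the tiny feature vectors are assumptions made for this example.

```python
def l1(a, b):
    """L1 (city-block) distance between two equal-length feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def combined_distance(query, item, w_color=0.5):
    """Weighted sum of color and texture distances (weights are hypothetical)."""
    return (w_color * l1(query["color"], item["color"])
            + (1.0 - w_color) * l1(query["texture"], item["texture"]))


def precision_recall(retrieved, relevant):
    """Precision and recall of a retrieved list against the relevant set."""
    hits = len(set(retrieved) & set(relevant))
    return hits / len(retrieved), hits / len(relevant)


query = {"color": [0.5, 0.5], "texture": [0.2, 0.8]}
database = {
    "a": {"color": [0.5, 0.5], "texture": [0.2, 0.8]},  # same as the query
    "b": {"color": [0.9, 0.1], "texture": [0.2, 0.8]},  # differs in color
    "c": {"color": [0.5, 0.5], "texture": [0.8, 0.2]},  # differs in texture
}


def rank(w_color):
    """Return database keys ordered from most to least similar."""
    return sorted(database, key=lambda k: combined_distance(query, database[k], w_color))


balanced = rank(0.5)      # color and texture weighted equally
color_heavy = rank(0.9)   # color mismatches penalized more
```

Changing the weight changes the ranking: with equal weights the color-mismatched image `b` ranks above `c`, while a color-heavy weight pushes `b` to the bottom.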

Develop 3D Prostate Cancer Visualization Tool in Smart Care System (스마트 케어 시스템에서의 3차원 전립선 암 가시화 도구 개발)

  • Ahn, Byung Uk;Shin, Seung Won;Choi, Moon Hyung;Jung, Seung Eun;Kim, Kwang Gi
    • Journal of Korea Multimedia Society / v.19 no.2 / pp.163-169 / 2016
  • In Korea, prostate cancer has the second-highest rate of increase in incidence after thyroid cancer, owing to Westernized dietary habits. The survival rate after clinical treatment depends on follow-up management. Telemedicine has been applied to substitute for medical specialists in rural areas and to respond quickly to emergencies. Our study developed a prostate three-dimensional (3D) visualization program and designed an architecture for a prostate aftercare system, called smart care, that uses any device with Internet access. The region of interest (ROI) in the prostate was manually segmented by physicians, rendered as 3D objects, and sent to a PACS server as DICOM images. Medical personnel could therefore review patients' data along with the 3D images not only on the PACS system but also on a portable device such as a smartphone. We provided the aftercare service to 98 patients and visualized 3D prostate images. The 3D images made it easier to grasp intuitively where a lesion is and helped patients understand the state of their disease. In the future, the aftercare service should be extended to more patients, and numerical indices should be obtained through follow-up studies to allow an accurate analysis.

The Study on Creative Tutoring Service Design to Improve Self-presentation and Learning Abilities for Kids Focusing on Visual Association and Storytelling

  • Lee, Dong-Min;Park, Hye-Jung;Cho, Sung-Bae
    • Journal of the Ergonomics Society of Korea / v.31 no.1 / pp.117-124 / 2012
  • Objective: The goal of this study is to design a creative tutoring service that helps children gain confidence and creativity through learning activities. Background: Nowadays most children grow up in a very competitive environment shaped by their parents' zeal for education. A stressful environment can deter a child from confidently taking on challenges and can lead to depression, anxiety, and feelings of inadequacy. Art therapy helps children work through these issues; however, the process is led by instructors or parents, and children still feel anxious as they study adults' faces to read their thoughts. Method: To help children address challenges, a creative tutoring service application can provide images with specific tasks instead of asking them to fill in blank areas. The tasks set by the service system are 1) to visualize the children's own experience using images visually associated with given images and 2) to create an illustrated story by modifying and recomposing given images. Another task is to learn basic math and words with numbers and letters in customized colors. By completing each task, children collect awards that allow them to graduate to higher levels of challenge. The outcomes of the tasks are sent to the main server system and reviewed by analysts. The results are then sent to the children's parents as a text message on their smartphones. Results: Visual implication using images inspires children to make creative stories based on their own experience. Children can also find their own patterns of reaching answers by using synaesthetic imagery through repeated practice of creative thinking tasks. Conclusion: When designers approach service design for children, they should carefully consider how children feel about doing tasks in certain environments and should assess them in varied situations. By focusing on how to tutor children in creative ways, rather than on the expected outcome, creative service applications can be designed to reduce children's stress and encourage self-expression. Children are expected to gain confidence by using the service without worrying about being compared with others. Application: The creative tutoring service needs to be developed and tested with various types of children.

View Synthesis Error Removal for Comfortable 3D Video Systems (편안한 3차원 비디오 시스템을 위한 영상 합성 오류 제거)

  • Lee, Cheon;Ho, Yo-Sung
    • Smart Media Journal / v.1 no.3 / pp.36-42 / 2012
  • Recently, smart applications such as smartphones and smart TVs have become a hot issue in the IT consumer market. In particular, smart TVs provide 3D video services, so efficient coding methods for 3D video data are required. Three-dimensional (3D) video uses stereoscopic or multi-view images to provide a depth experience through 3D display systems. Binocular cues are perceived by rendering proper viewpoint images obtained at slightly different view angles. Since the number of viewpoints in multi-view video is limited, 3D display devices must generate arbitrary viewpoint images from the available adjacent view images. In this paper, after briefly explaining a view synthesis method, we propose a new algorithm that compensates for view synthesis errors around object boundaries. We describe a 3D warping technique that exploits the depth map for viewpoint shifting, and a hole-filling method that uses multi-view images. We then propose an algorithm that removes the boundary noise generated by mismatches between object edges in the color and depth images. The proposed method reduces annoying boundary noise near object edges by replacing erroneous textures with alternative textures from the other reference image. With the proposed method, we can generate perceptually improved images for 3D video systems.

Adaptive Character Segmentation to Improve Text Recognition Accuracy on Mobile Phones (모바일 시스템에서 텍스트 인식 위한 적응적 문자 분할)

  • Kim, Jeong Sik;Yang, Hyung Jeong;Kim, Soo Hyung;Lee, Guee Sang;Do, Luu Ngoc;Kim, Sun Hee
    • Smart Media Journal / v.1 no.4 / pp.59-71 / 2012
  • As mobile phones have become common communication devices, their applications are increasingly important in daily life. Using a smartphone camera to collect information about the everyday environment is a goal of many applications, such as text recognition, object recognition, and context awareness. Studies have been conducted to provide important information by recognizing text that appears, artificially or naturally, in images and videos acquired with mobile phones. This study proposes a character segmentation method that improves character recognition accuracy in images obtained from mobile phone cameras. The method first classifies the text in a given image into printed and handwritten letters, since the two require different segmentation approaches. For printed letters, a rough segmentation is performed, and the segmented regions are then merged, deleted, and re-segmented. For handwritten letters, segmentation is performed after skew is corrected and the characters are classified by integrating them. Experimental results show that the method performs well on both printed and handwritten letters, at 95.9% and 84.7%, respectively.

AR Anchor System Using Mobile Based 3D GNN Detection

  • Jeong, Chi-Seo;Kim, Jun-Sik;Kim, Dong-Kyun;Kwon, Soon-Chul;Jung, Kye-Dong
    • International Journal of Internet, Broadcasting and Communication / v.13 no.1 / pp.54-60 / 2021
  • AR (Augmented Reality) is a technology that overlays virtual content on the real world and attaches additional information to objects in real time through 3D content. In the past, a high-performance device was required to experience AR, but AR has become easier to implement as mobile performance has improved and devices have been fitted with various sensors such as ToF (Time-of-Flight). The importance of mobile augmented reality is also growing with the commercialization of high-speed wireless Internet such as 5G. This paper therefore proposes a system that provides AR services via a GNN (Graph Neural Network) using the cameras and sensors on mobile devices. The ToF sensor of the mobile device is used to capture depth maps. A 3D point cloud is created using RGB images so that specific colors of objects can be distinguished. The point clouds created from the RGB images and the depth map are downsampled for smooth communication between the mobile device and the server. The point clouds sent to the server are used for 3D object detection. The detection process determines the class of each object and uses one point in the 3D bounding box as an anchor point. AR content is then provided through an app and the web according to the class and anchor of the detected object.
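The abstract does not name the downsampling scheme. One common choice for shrinking a point cloud before sending it over a network is voxel-grid downsampling, where every occupied grid cell is replaced by the centroid of its points. The sketch below assumes that scheme; `voxel_downsample` and `voxel_size` are illustrative names, and color channels are left out for brevity.

```python
import math
from collections import defaultdict


def voxel_downsample(points, voxel_size):
    """Replace all points that fall in the same voxel with their centroid.

    points is a list of (x, y, z) tuples; voxel_size is the cube edge length.
    """
    buckets = defaultdict(list)
    for p in points:
        # Integer voxel index of the point along each axis.
        key = tuple(math.floor(c / voxel_size) for c in p)
        buckets[key].append(p)
    return [
        tuple(sum(axis) / len(pts) for axis in zip(*pts))
        for pts in buckets.values()
    ]


cloud = [
    (0.25, 0.25, 0.25),
    (0.75, 0.75, 0.75),  # same unit voxel as the first point
    (1.5, 0.0, 0.0),     # a different voxel
]
reduced = voxel_downsample(cloud, voxel_size=1.0)
```

A larger `voxel_size` sends fewer points to the server at the cost of geometric detail, which is the trade-off the abstract's "smooth communication" points to.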

Recognition of Passport MRZ Information Using Combined Neural Networks (결합 신경망을 이용한 여권 MRZ 정보 인식)

  • Kim, Jinho
    • Journal of Korea Society of Digital Industry and Information Management / v.15 no.4 / pp.149-157 / 2019
  • When a passport is read with a smartphone rather than a dedicated passport reader, MRZ (Machine Readable Zone) character recognition can be difficult: character strokes may be broken, touching, or blurred depending on the lighting, and the position and size of the MRZ character lines vary with camera distance and angle. This paper proposes an effective algorithm for recognizing passport MRZ information in passport images of various sizes and skews, using a combined recognizer built from a CNN (Convolutional Neural Network) and an ANN (Artificial Neural Network). MRZ line detection based on connected component analysis and skew correction based on a perspective transform are also designed to achieve effective character segmentation. Each MRZ field recognition result is verified against the five check digits to decide whether the recognition process should be retried. The implemented algorithm achieved excellent recognition performance in experiments in both PC offline mode and smartphone online mode.
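The check-digit verification mentioned in the abstract can be illustrated with the standard MRZ check-digit rule from ICAO Doc 9303: characters are weighted 7, 3, 1 repeatedly, digits keep their value, letters A to Z map to 10 to 35, the filler `<` counts as 0, and the weighted sum is taken modulo 10. The abstract does not describe the authors' implementation, so the function names and the retry decision in `field_is_valid` below are a sketch under that assumption.

```python
def mrz_char_value(ch):
    """Numeric value of one MRZ character under the ICAO 9303 rule."""
    if "0" <= ch <= "9":
        return ord(ch) - ord("0")
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0
    raise ValueError(f"invalid MRZ character: {ch!r}")


def mrz_check_digit(field):
    """Check digit of an MRZ field: weights 7, 3, 1 repeating, sum mod 10."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(c) * weights[i % 3] for i, c in enumerate(field))
    return total % 10


def field_is_valid(field, check_char):
    """True if the recognized check character matches the computed digit.

    A recognizer could retry recognition when this returns False.
    """
    return check_char.isdigit() and int(check_char) == mrz_check_digit(field)


# Fields from the ICAO 9303 specimen passport.
doc_number_ok = field_is_valid("L898902C3", "6")
birth_date_ok = field_is_valid("740812", "2")
misread = field_is_valid("740B12", "2")  # '8' misrecognized as 'B'
```

A single misread character usually changes the weighted sum modulo 10, so a mismatch is a cheap signal that the field should be recognized again. The check does not catch every error, since some substitutions leave the sum unchanged.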