Search | Korea Science

A Study to Improve the Accuracy of Segmentation and Classification of Mosaic Images over the Korean Peninsula (한반도 모자이크 영상의 분할 및 분류 정확도 향상을 위한 연구)

Moon, Jiyoon;Lee, Kwang Jae
- Korean Journal of Remote Sensing / v.37 no.6_3 / pp.1943-1949 / 2021
In recent years, demand for high-resolution satellite images has grown with the miniaturization of satellites and their deployment in constellations, and various efforts are under way to help users work with satellite images more conveniently. Accordingly, the Korea Aerospace Research Institute produces and provides mosaic images of the Korean Peninsula every year to make satellite imagery more convenient for public-sector users and to promote its use. To increase the utilization of these mosaic images, this study attempted satellite image segmentation and classification using them. However, because the mosaic images provide only R, G, and B bands and undergo processing such as image sharpening and color balancing, the spectral information of the original images is distorted. To compensate, various indices were extracted from the R, G, and B bands and used in classification. The classification accuracy using only the mosaic images was about 72%, whereas the accuracy when the indices extracted from the R, G, and B bands were used together was about 79%. This confirms that, when classifying mosaic images of the Korean Peninsula, results can be improved by also using indices extracted from the R, G, and B bands. These results are expected to apply not only to mosaic images but also to other images with limited spectral information or with only R, G, and B bands.
https://doi.org/10.7780/kjrs.2021.37.6.3.3
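
The abstract above does not name the indices it derives from the R, G, and B bands. As a minimal sketch, assuming two commonly used visible-band indices, Excess Green (ExG = 2g - r - b on normalized chromatic coordinates) and VARI = (G - R) / (G + R - B), the extra per-pixel features might be computed as follows. The function names and the feature layout are hypothetical and are not taken from the paper.

```python
def rgb_indices(r, g, b, eps=1e-9):
    """Return (ExG, VARI) for one pixel with R, G, B values in 0-255.

    ExG and VARI are assumed here for illustration; the paper's abstract
    does not say which indices were actually used.
    """
    total = r + g + b
    if total == 0:
        return 0.0, 0.0  # black pixel: no chromatic information
    rn, gn, bn = r / total, g / total, b / total  # chromatic coordinates
    exg = 2 * gn - rn - bn
    denom = g + r - b
    vari = (g - r) / denom if abs(denom) > eps else 0.0
    return exg, vari


def augment_pixel(r, g, b):
    """Stack the raw bands with the derived indices into one feature vector."""
    exg, vari = rgb_indices(r, g, b)
    return [r, g, b, exg, vari]


if __name__ == "__main__":
    # A greenish pixel scores high on both indices; a gray pixel scores zero.
    print(augment_pixel(50, 150, 50))
    print(augment_pixel(100, 100, 100))
```

A classifier would then be trained on the five-element vectors instead of the three raw bands, which is the general idea the abstract describes.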

Study on Development of LED Camping Light Design Based on IOT and Emotional Lighting Contents (IOT 및 감성조명 콘텐츠 기반의 LED 캠핑등 디자인 개발에 관한 연구)

Kim, Hee-Jun
- The Journal of the Korea Contents Association / v.18 no.12 / pp.332-342 / 2018
This study presents the technical choices involved in designing LED camping lights based on emotional lighting content, bringing together IoT and design, fields that play a central role in creative and knowledge-based industries, and the procedure for realizing them. 'i-Light,' a portable LED camping light, is 'connected lighting' that links people, space, and emotion, and a smart camping light based on IoT and emotional lighting content. 'i-Light' has two main functions: lighting that adjusts color and color temperature naturally, and safety through detection of harmful gases. It also offers various emotional functions for experiencing interaction and the feel of light. To this end, portable LED camping lights were designed first, and then a high-color-rendering, full-color lighting module, a smart sensor module, and an IoT device platform were developed. In addition, detailed data on emotional lighting content were established and a web application based on them was developed. Finally, prototypes of the portable LED camping light were built and submitted to related organizations for test-bench and usability evaluation. According to the results, all 12 developed emotional lighting contents and all three IoT safety sensors were found suitable, and the prototypes were satisfactory. This paper suggests a direction for practical technical choices in developing content and products that integrate artificial intelligence and big data, and for the procedure for realizing them.
https://doi.org/10.5392/JKCA.2018.18.12.332

Speech Visualization of Korean Vowels Based on the Distances Among Acoustic Features (음성특징의 거리 개념에 기반한 한국어 모음 음성의 시각화)

Pok, Gouchol
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.12 no.5 / pp.512-520 / 2019
Visual representation of speech is useful both for learners of foreign languages and for hearing-impaired people who cannot hear speech directly, and many studies on it have been reported in the literature. These studies, however, go no further than representing the characteristics of speech with colors or showing the changing shape of the lips and mouth through animation. As a result, such methods cannot tell users how far their pronunciation is from the standard one, and they make it technically difficult to build a system in which users can correct their pronunciation interactively. To address these drawbacks, this paper proposes a speech visualization model based on the relative distance between the user's speech and the standard speech, and suggests implementation directions by applying the proposed model to the visualization of Korean vowels. The method extracts three formants, F1, F2, and F3, from speech signals and feeds them into Kohonen's SOM, which maps the results onto a 2-D screen so that each utterance is represented as a point. We present a working system, implemented with open-source formant analysis software and applied to the speech of a Korean instructor and several foreign students studying Korean, with a user interface built in JavaScript for the screen display.
https://doi.org/10.17661/jkiiect.2019.12.5.512
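
The relative-distance idea in the abstract above can be illustrated without the SOM step. The sketch below is a simplification: it measures the plain Euclidean distance between a user's (F1, F2, F3) formant vector and a set of reference vowels, and omits the SOM mapping to 2-D entirely. The reference formant values are hypothetical placeholders, not measurements from the paper.

```python
import math


def formant_distance(user, standard):
    """Euclidean distance between two (F1, F2, F3) tuples, in Hz."""
    return math.sqrt(sum((u - s) ** 2 for u, s in zip(user, standard)))


def nearest_vowel(user, references):
    """Return (label, distance) for the reference vowel closest to the user."""
    scored = ((label, formant_distance(user, f)) for label, f in references.items())
    return min(scored, key=lambda pair: pair[1])


# Placeholder reference formants (hypothetical, for illustration only).
REFERENCES = {
    "a": (800.0, 1300.0, 2600.0),
    "i": (300.0, 2300.0, 3000.0),
    "u": (350.0, 800.0, 2400.0),
}

if __name__ == "__main__":
    label, dist = nearest_vowel((320.0, 2250.0, 2950.0), REFERENCES)
    print(label, round(dist, 1))
```

In a real system the formants would differ in scale and perceptual weight, so some normalization would likely be needed before distances are compared; that choice is left open here.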

A Study on the Restoration of Korean Traditional Palace Image by Adjusting the Receptive Field of Pix2Pix (Pix2Pix의 수용 영역 조절을 통한 전통 고궁 이미지 복원 연구)

Hwang, Won-Yong;Kim, Hyo-Kwan
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.15 no.5 / pp.360-366 / 2022
This paper presents an AI model structure for restoring photographs of traditional Korean palaces, which survive only in black and white, to color using Pix2Pix, a generative adversarial network technique. Pix2Pix combines a generator model that produces synthetic images with a discriminator model that determines whether an image is real or fake. This paper builds a model by adjusting the receptive field of the discriminator and analyzes the results in light of the characteristics of old palace photographs. The receptive field of Pix2Pix, when used to restore black-and-white photographs, has commonly been fixed in size, but a fixed receptive field is not well suited to photographs whose content varies widely within an image. This paper observes the effect of changing the size of the previously fixed receptive field in order to identify a discriminator size that reflects the characteristics of old palaces. In the experiment, the receptive field of the discriminator was adjusted based on a prepared set of old palace photographs. The paper measures the model loss as the discriminator's receptive field changes and examines the restored photographs produced by the well-trained model from the experiments.
https://doi.org/10.17661/jkiiect.2022.15.5.360
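
To make "adjusting the receptive field of the discriminator" concrete, the sketch below computes the receptive field of a stack of convolution layers with the standard backward recurrence rf = (rf - 1) * stride + kernel. It assumes the commonly used PatchGAN layout (a number of 4x4 stride-2 convolutions followed by two 4x4 stride-1 convolutions); the abstract does not state which configurations the authors actually tested, and the helper names are hypothetical.

```python
def receptive_field(layers):
    """Receptive field (in input pixels) of one output unit.

    `layers` is a list of (kernel, stride) pairs ordered from input to output.
    """
    rf = 1
    for kernel, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel
    return rf


def patchgan_layers(n_strided):
    """Assumed layout: n_strided stride-2 convs, then two stride-1 convs (4x4 kernels)."""
    return [(4, 2)] * n_strided + [(4, 1)] * 2


if __name__ == "__main__":
    for n in (1, 2, 3, 4):
        size = receptive_field(patchgan_layers(n))
        print(f"{n} strided layer(s): {size}x{size} receptive field")
```

Under this assumed layout, three strided layers give the familiar 70x70 patch, and adding or removing a strided layer roughly doubles or halves the region each discriminator output judges.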

Implementation of CNN-based Classification Training Model for Unstructured Fashion Image Retrieval using Preprocessing with MASK R-CNN (비정형 패션 이미지 검색을 위한 MASK R-CNN 선형처리 기반 CNN 분류 학습모델 구현)

Seunga, Cho;Hayoung, Lee;Hyelim, Jang;Kyuri, Kim;Hyeon-Ji, Lee;Bong-Ki, Son;Jaeho, Lee
- Journal of Korea Society of Industrial Information Systems / v.27 no.6 / pp.13-23 / 2022
In this paper, we propose an algorithm that classifies the detailed components of each fashion item, for unstructured data retrieval in the fashion field. AI-based online shopping malls have been increasing recently because of the COVID-19 environment. However, existing keyword search and personalized style recommendations based on users' browsing behavior are limited in how accurately they can search unstructured data. In this study, images crawled from online shopping sites were preprocessed with Mask R-CNN, and the components of each fashion item were then classified with a CNN. We obtained accuracies of 93.28% for shirt collar, 98.10% for shirt pattern, 91.73% for the 3-class jeans fit, 81.59% for the 4-class jeans fit, and 93.91% for jeans color. For decorative attributes, we obtained accuracies of 91.20% for jeans washing and 92.96% for jeans damage.
https://doi.org/10.9723/jksiis.2022.27.6.013

A Mobile Landmarks Guide : Outdoor Augmented Reality based on LOD and Contextual Device (모바일 랜드마크 가이드 : LOD와 문맥적 장치 기반의 실외 증강현실)

Zhao, Bi-Cheng;Rosli, Ahmad Nurzid;Jang, Chol-Hee;Lee, Kee-Sung;Jo, Geun-Sik
- Journal of Intelligence and Information Systems / v.18 no.1 / pp.1-21 / 2012
In recent years, mobile phones have evolved extremely quickly. They are equipped with high-quality color displays, high-resolution cameras, and real-time accelerated 3D graphics, along with features such as GPS sensors and digital compasses. This evolution helps application developers use the power of smartphones to create rich environments that offer a wide range of services and exciting possibilities. In outdoor mobile AR research to date, there are many popular location-based AR services, such as Layar and Wikitude. These systems have a major limitation: the AR content is rarely overlaid precisely on the real target. Another line of research is context-based AR services that use image recognition and tracking. There the AR content is precisely overlaid on the real target, but real-time performance is restricted by the retrieval time, and such services are hard to implement over a large area. In our work, we combine the advantages of location-based AR and context-based AR. The system first finds the surrounding landmarks and then performs recognition and tracking on them. The proposed system consists of two major parts: a landmark browsing module and an annotation module. In the landmark browsing module, users can view augmented virtual information (information media) such as text, pictures, and video on their smartphone viewfinder when they point the smartphone at a building or landmark. For this, a landmark recognition technique is applied. SURF point-based features are used in the matching process because of their robustness. To ensure that image retrieval and matching are fast enough for real-time tracking, we exploit information from the contextual devices (GPS and digital compass). This information is used to select from the database the landmarks that are nearest and lie in the direction the device is pointing. The query image is matched only against this selected data, so matching speed increases significantly.
The second part is the annotation module. In addition to viewing the augmented information media, users can create virtual annotations based on linked data. Full knowledge of the landmark is not required; users can simply look up an appropriate topic by keyword search in the linked data. This helps the system find the target URI and generate the correct AR content. To recognize target landmarks, images of the selected building or landmark are captured from different angles and distances. This procedure amounts to building a connection between the real building and the virtual information in the Linked Open Data. In our experiments, the search range in the database is reduced by clustering images into groups according to their coordinates. A grid-based clustering method and user location information are used to restrict the retrieval range. In existing research using clustering and GPS information, the retrieval time is around 70~80ms. Experimental results show that our approach reduces the retrieval time to around 18~20ms on average, so the total processing time falls from 490~540ms to 438~480ms. The performance improvement will become more pronounced as the database grows. This demonstrates that the proposed system is efficient and robust in many cases.
https://doi.org/10.13088/jiis.2012.18.1.001
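
The grid-based restriction of the retrieval range described in the abstract above might look roughly like the following sketch. It assumes a fixed-size latitude/longitude grid and a search over the user's cell plus its eight neighbors; the cell size, function names, and sample coordinates are all hypothetical, and the compass-based orientation filter is left out.

```python
import math
from collections import defaultdict


def cell_of(lat, lon, cell_deg=0.001):
    """Map a coordinate to its integer grid cell (cell size is an assumption)."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


def build_index(landmarks, cell_deg=0.001):
    """Group (name, lat, lon) landmark records by grid cell."""
    index = defaultdict(list)
    for name, lat, lon in landmarks:
        index[cell_of(lat, lon, cell_deg)].append(name)
    return index


def candidates(index, lat, lon, cell_deg=0.001):
    """Return landmarks in the user's cell and the eight surrounding cells."""
    ci, cj = cell_of(lat, lon, cell_deg)
    found = []
    for di in (-1, 0, 1):
        for dj in (-1, 0, 1):
            found.extend(index.get((ci + di, cj + dj), []))
    return found


if __name__ == "__main__":
    # Hypothetical landmark records, for illustration only.
    landmarks = [
        ("near_building", 37.4505, 126.6505),
        ("far_building", 37.5605, 126.9805),
    ]
    index = build_index(landmarks)
    print(candidates(index, 37.4507, 126.6503))
```

Only the images attached to the returned candidates would then go through feature matching, which is why the retrieval cost stays bounded as the database grows.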

3D Facial Animation with Head Motion Estimation and Facial Expression Cloning (얼굴 모션 추정과 표정 복제에 의한 3차원 얼굴 애니메이션)

Kwon, Oh-Ryun;Chun, Jun-Chul
- The KIPS Transactions:PartB / v.14B no.4 / pp.311-320 / 2007
This paper presents a vision-based 3D facial expression animation technique and system that provide robust 3D head pose estimation and real-time facial expression control. Much research on 3D face animation has addressed facial expression control itself rather than 3D head motion tracking. However, head motion tracking is one of the critical issues to be solved in developing realistic facial animation. In this research, we developed an integrated animation system that performs 3D head motion tracking and facial expression control at the same time. The proposed system consists of three major phases: face detection, 3D head motion tracking, and facial expression control. For face detection, the non-parametric HT skin color model and template matching allow the facial region to be detected efficiently from video frames. For 3D head motion tracking, we use a cylindrical head model that is projected onto the initial head motion template. Given an initial reference template of the face image and the corresponding head motion, the cylindrical head model is created and the full head motion is traced using the optical flow method. For facial expression cloning we use a feature-based method. The major facial feature points are detected from the geometric information of the face with template matching and traced by optical flow. Since the locations of the varying feature points combine head motion and facial expression information, the animation parameters that describe the variation of the facial features are acquired from a geometrically transformed frontal head pose image. Finally, facial expression cloning is done by a two-step fitting process: the control points of the 3D model are moved by applying the animation parameters to the face model, and the non-feature points around the control points are adjusted using a Radial Basis Function (RBF).
The experiments show that the developed vision-based animation system can create realistic facial animation with robust head pose estimation and facial variation from input video.
https://doi.org/10.3745/KIPSTB.2007.14-B.4.311

A Study on Effective Information Delivery of Digital Sign Systems in General Hospitals (종합병원 디지털 정보안내사인의 효과적 정보전달을 위한 연구)

Kim, Hwa Sil;Paik, Jin Kyung
- Korea Science and Art Forum / v.19 / pp.281-292 / 2015
For this study, I conducted a survey of the current situation, a user preference survey, and a field experiment. Hospitals that had been using digital sign systems for at least five years were selected, and the study examined the visual elements (layout, type, color) used in waiting areas and the system elements (time, video timeline). The field survey showed that the digital sign systems used contrasting type and background colors to increase clarity and make the content easy to understand. In addition, Gothic typefaces with relatively high legibility were adopted. For time and video timeline, the elements that characterize digital sign systems, the screens showed hospital advertising and medical treatment guidance at regular intervals. Moreover, the user satisfaction survey showed that, although most users had previous experience with digital sign systems, a majority of respondents had difficulty understanding the information conveyed because of the rotation speed setting or the small type size. The highest proportion of respondents (n=86, 86%) answered that information about medical departments was what they sought most frequently and that this kind of information should be given priority in digital sign systems. For the experiment, new samples were created by restructuring the content of current digital sign systems while keeping their design unchanged, and these samples were tested. Study participants were in their 20s through 50s. When the type size was larger under otherwise identical conditions, participants in all age groups found the desired information approximately 3.5 seconds faster.
In addition, participants in their 20s-30s and those in their 40s-50s differed by 4.7 seconds for small type and 6 seconds for large type, which suggests that the time taken to find the desired information on a rotating digital sign system differs by age regardless of type size.
https://doi.org/10.17548/ksaf.2015.03.19.281

Region-based Multi-level Thresholding for Color Image Segmentation (영역 기반의 Multi-level Thresholding에 의한 컬러 영상 분할)

Oh, Jun-Taek;Kim, Wook-Hyun
- Journal of the Institute of Electronics Engineers of Korea SP / v.43 no.6 s.312 / pp.20-27 / 2006
Multi-level thresholding is widely used in image segmentation. However, most existing methods are not suited to direct use in applied fields and have not been extended to a full image segmentation step. This paper proposes region-based multi-level thresholding as an image segmentation method. First, we classify the pixels of each color channel into two clusters using the EWFCM (Entropy-based Weighted Fuzzy C-Means) algorithm, an improved FCM algorithm that uses spatial information between pixels. To obtain better segmentation results, the number of clusters is then reduced by a region-based reclassification step based on the similarity between the regions in a cluster and the other clusters. The clusters are created from the per-channel classification information of the pixels. Finally, because many regions still remain in the image, we perform region merging as a post-processing step, using a Bayesian algorithm based on the Kullback-Leibler distance between a region and its neighboring regions. Experiments show that region-based multi-level thresholding is superior to cluster-based and pixel-based multi-level thresholding and to the existing method, and that the post-processing method yields much better segmentation results.

A Study on the Improvement of Wireless Internet Service Tariff Scheme (무선인터넷 데이터 서비스 과금 체계 개선 연구)

Min, Gyeong-Ju;Kim, Jeong-Ho;Park, Jin-Yang
- Journal of the Korea Computer Industry Society / v.5 no.9 / pp.1101-1110 / 2004
In the first quarter of 2004, there were about 1.348 billion mobile phone users worldwide, a penetration rate of only 29%. Korea ranks among the highest in mobile communication use, with over 36 million mobile phone subscribers and a penetration rate of 75% as of May 2004. Since wireless Internet service was introduced in May 1999, the number of subscribers has risen to 34.5 million, with 95.3% of all mobile phone subscribers using wireless Internet services in May 2004, largely owing to continued investment by telecommunication service providers, improvements in mobile handsets (color and digital camera phones), and the implementation of mobile number portability policies. In the Korean wireless Internet market, there are many user complaints because service providers compete through TV commercials and handset discounts rather than by improving call quality, services, and billing systems. Therefore, there is a growing need to improve billing systems through means such as reasonable payment plans based on consumer use, a wireless Internet billing system that can predict the number of users, and pricing standards for control data (head, tail, etc.) as well as menu information, established by testing the text, multimedia, video, and other types of content provided by the three major mobile communication companies. The purpose of this study is to promote wireless Internet services and protect user rights by proposing a reasonable way to improve billing systems for wireless Internet services, based on a comparative analysis of the file sizes and billing data of each service provider obtained through a verification test of a packet billing system for wireless Internet services.
