• Title/Summary/Keyword: Data Labeling

Search Result 469, Processing Time 0.021 seconds

A Prediction System of Skin Pore Labeling Using CNN and Image Processing (합성곱 신경망 및 영상처리 기법을 활용한 피부 모공 등급 예측 시스템)

  • Tae-Hee, Lee;Woo-Sung, Hwang;Myung-Ryul, Choi
    • Journal of IKEEE
    • /
    • v.26 no.4
    • /
    • pp.647-652
    • /
    • 2022
  • In this paper, we propose a prediction system for skin pore labeling based on a CNN(Convolution Neural Network) model, where a data set is constructed by processing skin images taken by users, and a pore feature image is generated by the proposed image processing algorithm. The skin image data set was labeled for pore characteristics based on the visual classification criteria of skin beauty experts. The proposed image processing algorithm was applied to generate pore feature images from skin images and to train a CNN model that predicts pore feature ratings. The prediction results with pore features by the proposed CNN model is similar to experts visual classification results, where less learning time and higher prediction results were obtained than the results by the comparison model (Resnet-50). In this paper, we describe the proposed image processing algorithm and CNN model, the results of the prediction system and future research plans.

Urban Environment change detection through landscape indices derived from Landsat TM data

  • Iisaka, Joji
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.696-701
    • /
    • 2002
  • This paper describes some results of change detection in Tokyo metropolitan area, Japan , using the Landsat TM data, and methods to quantify the ground cover classes. The changes are analyzed using the measures of not only conventional spectral classes but also a set of landscape indices to describe spatial properties of ground cove types using fractal dimension of objects, entropy in the specific windows defining the neighbors of focusing locations. In order eliminate the seasonal radiometric effects on TM data, an automated class labeling method is also attempted. Urban areas are also delineated automatically by defining the boundaries of the urban area. These procedures for urban change detection were implemented by the unified image computing methods proposed by the author, they can be automated in coherent and systematic ways, and it is anticipated to automate the whole procedures. The results of this analysis suggest that Tokyo metropolitan area was extended to the suburban areas along the new transportation networks and the high density area of Tokyo were also very much extended during the period between 1985 and 1995.

  • PDF

Deep-Learning-Based Molecular Imaging Biomarkers: Toward Data-Driven Theranostics

  • Choi, Hongyoon
    • Progress in Medical Physics
    • /
    • v.30 no.2
    • /
    • pp.39-48
    • /
    • 2019
  • Deep learning has been applied to various medical data. In particular, current deep learning models exhibit remarkable performance at specific tasks, sometimes offering higher accuracy than that of experts for discriminating specific diseases from medical images. The current status of deep learning applications to molecular imaging can be divided into a few subtypes in terms of their purposes: differential diagnostic classification, enhancement of image acquisition, and image-based quantification. As functional and pathophysiologic information is key to molecular imaging, this review will emphasize the need for accurate biomarker acquisition by deep learning in molecular imaging. Furthermore, this review addresses practical issues that include clinical validation, data distribution, labeling issues, and harmonization to achieve clinically feasible deep learning models. Eventually, deep learning will enhance the role of theranostics, which aims at precision targeting of pathophysiology by maximizing molecular imaging functional information.

Detection of Political Manipulation through Unsupervised Learning

  • Lee, Sihyung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.4
    • /
    • pp.1825-1844
    • /
    • 2019
  • Political campaigns circulate manipulative opinions in online communities to implant false beliefs and eventually win elections. Not only is this type of manipulation unfair, it also has long-lasting negative impacts on people's lives. Existing tools detect political manipulation based on a supervised classifier, which is accurate when trained with large labeled data. However, preparing this data becomes an excessive burden and must be repeated often to reflect changing manipulation tactics. We propose a practical detection system that requires moderate groundwork to achieve a sufficient level of accuracy. The proposed system groups opinions with similar properties into clusters, and then labels a few opinions from each cluster to build a classifier. It also models each opinion with features deduced from raw data with no additional processing. To validate the system, we collected over a million opinions during three nation-wide campaigns in South Korea. The system reduced groundwork from 200K to nearly 200 labeling tasks, and correctly identified over 90% of manipulative opinions. The system also effectively identified transitions in manipulative tactics over time. We suggest that online communities perform periodic audits using the proposed system to highlight manipulative opinions and emerging tactics.

Importance-Performance Analysis(IPA) of the selection attributes of functional cosmetics (기능성화장품 선택속성의 IPA(중요도-만족도) 분석)

  • Han, Do-Kyung;Lee, Hyun-Jun;Paik, Hyun-Dong;Shin, Dong-Kyoo;Park, Dae-Sub;Hwang, Hye-Sun;Hong, Wan-Soo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.6
    • /
    • pp.527-536
    • /
    • 2016
  • This study aims to generate baseline data for vitalizing the sales of functional cosmetics through an Importance-Performance Analysis (IPA) of the selection attributes of functional cosmetics. From the analysis of consumers' selection criteria, the study will assist functional cosmetics companies in reflecting consumer demands and therefore securing competitiveness. For this, general consumers aged over 20 years were surveyed for 5 weeks from Feb 23 through Mar 30, 2015, and 447 empirical data (response rate 88.9%) were processed through SPSS WIN 21.0 program for analysis. To conduct gender difference analysis on the IPA of the selection attributes of functional cosmetics, 17 selection attributes were categorized into 4 factors: functionality, labeling, popularity, and product. Cronbach's alpha for all factors was 0.5, proving the internal consistency and reliability of the survey. The survey results showed that while the entire average came out significantly higher for females (5.89/7points) than for males (5.66/7points) (p<0.001), the selection attributes 'anti-wrinkling', 'whitening function', 'functionality', 'expiration date', 'full ingredient labeling system' and 'various promotional events' showed significant gender differences. IPA results pertaining to gender showed 'price', 'functionality', 'spreadability' and 'full ingredient labeling system' as 2nd quadrant attributes, whereas female consumers selected 'price', 'whitening function', 'anti-wrinkling', 'functionality' and 'full ingredient labeling system' as attributes. Results show that businesses in the field of cosmetics and related areas need to prioritize improving the following factors that received low satisfaction from all consumers: 'price', 'functionality', and 'total labeling.' In particular, the 'price' aspects are considered to require reasonable and affordable pricing.

Dialogic Male Voice Triphone DB Construction (남성 음성 triphone DB 구축에 관한 연구)

  • Kim, Yu-Jin;Baek, Sang-Hoon;Han, Min-Soo;Chung, Jae-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.2
    • /
    • pp.61-71
    • /
    • 1996
  • In this paper, dialogic triphone data base construction for triphone synthesis system is discussed. Particularly, in this work, dialogic speech data is collected from the broadcast media, and three different transcription steps are taken. Total 10 hours of speech data are collected. Among them, six hours of speech data are used for the triphone data base construction, and the rest four hours of data are reserved. Dialogic speech data base construction is far different from the reciting speech data base construction. This paper describes various steps that necessary for the dialogic triphone data base construction from collecting speech data to triphone unit labeling.

  • PDF

Development of a Steel Plate Surface Defect Detection System Based on Small Data Deep Learning (소량 데이터 딥러닝 기반 강판 표면 결함 검출 시스템 개발)

  • Gaybulayev, Abdulaziz;Lee, Na-Hyeon;Lee, Ki-Hwan;Kim, Tae-Hyong
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.17 no.3
    • /
    • pp.129-138
    • /
    • 2022
  • Collecting and labeling sufficient training data, which is essential to deep learning-based visual inspection, is difficult for manufacturers to perform because it is very expensive. This paper presents a steel plate surface defect detection system with industrial-grade detection performance by training a small amount of steel plate surface images consisting of labeled and non-labeled data. To overcome the problem of lack of training data, we propose two data augmentation techniques: program-based augmentation, which generates defect images in a geometric way, and generative model-based augmentation, which learns the distribution of labeled data. We also propose a 4-step semi-supervised learning using pseudo labels and consistency training with fixed-size augmentation in order to utilize unlabeled data for training. The proposed technique obtained about 99% defect detection performance for four defect types by using 100 real images including labeled and unlabeled data.

Implementation of CNN-based Masking Algorithm for Post Processing of Aerial Image

  • CHOI, Eunsoo;QUAN, Zhixuan;JUNG, Sangwoo
    • Korean Journal of Artificial Intelligence
    • /
    • v.9 no.2
    • /
    • pp.7-14
    • /
    • 2021
  • Purpose: To solve urban problems, empirical research is being actively conducted to implement a smart city based on various ICT technologies, and digital twin technology is needed to effectively implement a smart city. A digital twin is essential for the realization of a smart city. A digital twin is a virtual environment that intuitively visualizes multidimensional data in the real world based on 3D. Digital twin is implemented on the premise of the convergence of GIS and BIM, and in particular, a lot of time is invested in data pre-processing and labeling in the data construction process. In digital twin, data quality is prioritized for consistency with reality, but there is a limit to data inspection with the naked eye. Therefore, in order to improve the required time and quality of digital twin construction, it was attempted to detect a building using Mask R-CNN, a deep learning-based masking algorithm for aerial images. If the results of this study are advanced and used to build digital twin data, it is thought that a high-quality smart city can be realized.

Relation Extraction Model for Noisy Data Handling on Distant Supervision Data based on Reinforcement Learning (원격지도학습데이터의 오류를 처리하는 강화학습기반 관계추출 모델)

  • Yoon, Sooji;Nam, Sangha;Kim, Eun-kyung;Choi, Key-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2018.10a
    • /
    • pp.55-60
    • /
    • 2018
  • 기계학습 기반인 관계추출 모델을 설계할 때 다량의 학습데이터를 빠르게 얻기 위해 원격지도학습 방식으로 데이터를 수집한다. 이러한 데이터는 잘못 분류되어 학습데이터로 사용되기 때문에 모델의 성능에 부정적인 영향을 끼칠 수 있다. 본 논문에서는 이러한 문제를 강화학습 접근법을 사용해 해결하고자 한다. 본 논문에서 제안하는 모델은 오 분류된 데이터로부터 좋은 품질의 데이터를 찾는 문장선택기와 선택된 문장들을 가지고 학습이 되어 관계를 추출하는 관계추출기로 구성된다. 문장선택기는 지도학습데이터 없이 관계추출기로부터 피드백을 받아 학습이 진행된다. 이러한 방식은 기존의 관계추출 모델보다 좋은 성능을 보여주었고 결과적으로 원격지도학습데이터의 단점을 해결한 방법임을 보였다.

  • PDF

An Effective WSSENet-Based Similarity Retrieval Method of Large Lung CT Image Databases

  • Zhuang, Yi;Chen, Shuai;Jiang, Nan;Hu, Hua
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.7
    • /
    • pp.2359-2376
    • /
    • 2022
  • With the exponential growth of medical image big data represented by high-resolution CT images(CTI), the high-resolution CTI data is of great importance for clinical research and diagnosis. The paper takes lung CTI as an example to study. Retrieving answer CTIs similar to the input one from the large-scale lung CTI database can effectively assist physicians to diagnose. Compared with the conventional content-based image retrieval(CBIR) methods, the CBIR for lung CTIs demands higher retrieval accuracy in both the contour shape and the internal details of the organ. In traditional supervised deep learning networks, the learning of the network relies on the labeling of CTIs which is a very time-consuming task. To address this issue, the paper proposes a Weakly Supervised Similarity Evaluation Network (WSSENet) for efficiently support similarity analysis of lung CTIs. We conducted extensive experiments to verify the effectiveness of the WSSENet based on which the CBIR is performed.