• Title/Summary/Keyword: learning through the image

Search Result 951, Processing Time 0.028 seconds

Learning efficiency checking system by measuring human motion detection (사람의 움직임 감지를 측정한 학습 능률 확인 시스템)

  • Kim, Sukhyun;Lee, Jinsung;Yu, Eunsang;Park, Seon-u;Kim, Eung-Tae
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • fall
    • /
    • pp.290-293
    • /
    • 2021
  • In this paper, we implement a learning efficiency verification system to inspire learning motivation and help improve concentration by detecting the situation of the user studying. To this aim, data on learning attitude and concentration are measured by extracting the movement of the user's face or body through a real-time camera. The Jetson board was used to implement the real-time embedded system, and a convolutional neural network (CNN) was implemented for image recognition. After detecting the feature part of the object using a CNN, motion detection is performed. The captured image is shown in a GUI written in PYQT5, and data is collected by sending push messages when each of the actions is obstructed. In addition, each function can be executed on the main screen made with the GUI, and functions such as a statistical graph that calculates the collected data, To do list, and white noise are performed. Through learning efficiency checking system, various functions including data collection and analysis of targets were provided to users.

  • PDF

A New Image Analysis Method based on Regression Manifold 3-D PCA (회귀 매니폴드 3-D PCA 기반 새로운 이미지 분석 방법)

  • Lee, Kyung-Min;Lin, Chi-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.2
    • /
    • pp.103-108
    • /
    • 2022
  • In this paper, we propose a new image analysis method based on regression manifold 3-D PCA. The proposed method is a new image analysis method consisting of a regression analysis algorithm with a structure designed based on an autoencoder capable of nonlinear expansion of manifold 3-D PCA and PCA for efficient dimension reduction when entering large-capacity image data. With the configuration of an autoencoder, a regression manifold 3-DPCA, which derives the best hyperplane through three-dimensional rotation of image pixel values, and a Bayesian rule structure similar to a deep learning structure, are applied. Experiments are performed to verify performance. The image is improved by utilizing the fine dust image, and accuracy performance evaluation is performed through the classification model. As a result, it can be confirmed that it is effective for deep learning performance.

An Analysis of Youth EEG based on the Emotional Color Scheme Images by Different Space of Community Facilities (공동주택 커뮤니티시설의 공간별 감성색채배색 이미지에 따른 청소년의 뇌파분석)

  • Hwang, Yeon-Sook;Kim, Sun-Young;Kim, Ju-Yeon
    • Korean Institute of Interior Design Journal
    • /
    • v.22 no.5
    • /
    • pp.171-178
    • /
    • 2013
  • In this study, we sought to find out the effect of different emotional interior images of the community facilities in an apartment complex on a youth brain wave by analyzing an Electroencephalograph (EEG). Based on the frequency of usage, we selected learning facilities, cultural facilities, and sport facilities. For brain stimulation, the visual stimulants with three different emotional words, cheerful, gentle, and elegant, were used based on I.R.I image scale. Overall, total nine different emotional images were used. Based on our findings, we conclude that: first, in order to improve learning concentration of the youth, a learning facility for the youth needs to be designed by skillfully combining the soft and comfortable colors from the gentle image and the murky and dark colors from the elegant image. Second, when designing a cultural facility, it is preferable to consider the elegant image for a calm and comfortable space. Third, a sport facility design needs to preclude dark colors and apply light colors to create a dynamic and lively space. Furthermore, we found out that the youth has established static images of each functionally different facility through their experience and learning. Therefore, it is imperative to plan community facilities in an apartment complex in a way to connect the space function with the emotional characteristics of the youth in order to support and encourage energetic activities and learning of the community youth.

Ensemble Learning Based on Tumor Internal and External Imaging Patch to Predict the Recurrence of Non-small Cell Lung Cancer Patients in Chest CT Image (흉부 CT 영상에서 비소세포폐암 환자의 재발 예측을 위한 종양 내외부 영상 패치 기반 앙상블 학습)

  • Lee, Ye-Sel;Cho, A-Hyun;Hong, Helen
    • Journal of Korea Multimedia Society
    • /
    • v.24 no.3
    • /
    • pp.373-381
    • /
    • 2021
  • In this paper, we propose a classification model based on convolutional neural network(CNN) for predicting 2-year recurrence in non-small cell lung cancer(NSCLC) patients using preoperative chest CT images. Based on the region of interest(ROI) defined as the tumor internal and external area, the input images consist of an intratumoral patch, a peritumoral patch and a peritumoral texture patch focusing on the texture information of the peritumoral patch. Each patch is trained through AlexNet pretrained on ImageNet to explore the usefulness and performance of various patches. Additionally, ensemble learning of network trained with each patch analyzes the performance of different patch combination. Compared with all results, the ensemble model with intratumoral and peritumoral patches achieved the best performance (ACC=98.28%, Sensitivity=100%, NPV=100%).

Performance Comparison of Convolution Neural Network by Weight Initialization and Parameter Update Method1 (가중치 초기화 및 매개변수 갱신 방법에 따른 컨벌루션 신경망의 성능 비교)

  • Park, Sung-Wook;Kim, Do-Yeon
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.4
    • /
    • pp.441-449
    • /
    • 2018
  • Deep learning has been used for various processing centered on image recognition. One core algorithms of the deep learning, convolutional neural network is an deep neural network that specialized in image recognition. In this paper, we use a convolutional neural network to classify forest insects and propose an optimization method. Experiments were carried out by combining two weight initialization and six parameter update methods. As a result, the Xavier-SGD method showed the highest performance with an accuracy of 82.53% in the 12 different combinations of experiments. Through this, the latest learning algorithms, which complement the disadvantages of the previous parameter update method, we conclude that it can not lead to higher performance than existing methods in all application environments.

GAN-based shadow removal using context information

  • Yoon, Hee-jin;Kim, Kang-jik;Chun, Jun-chul
    • Journal of Internet Computing and Services
    • /
    • v.20 no.6
    • /
    • pp.29-36
    • /
    • 2019
  • When dealing with outdoor images in a variety of computer vision applications, the presence of shadow degrades performance. In order to understand the information occluded by shadow, it is essential to remove the shadow. To solve this problem, in many studies, involves a two-step process of shadow detection and removal. However, the field of shadow detection based on CNN has greatly improved, but the field of shadow removal has been difficult because it needs to be restored after removing the shadow. In this paper, it is assumed that shadow is detected, and shadow-less image is generated by using original image and shadow mask. In previous methods, based on CGAN, the image created by the generator was learned from only the aspect of the image patch in the adversarial learning through the discriminator. In the contrast, we propose a novel method using a discriminator that judges both the whole image and the local patch at the same time. We not only use the residual generator to produce high quality images, but we also use joint loss, which combines reconstruction loss and GAN loss for training stability. To evaluate our approach, we used an ISTD datasets consisting of a single image. The images generated by our approach show sharp and restored detailed information compared to previous methods.

Study on the Content Development of Mobile AR_HMD through a Real Time 360 Image Processing. (360° 실시간 영상처리를 통한 모바일 AR_HMD 콘텐츠 개발을 위한 연구)

  • Lee, Changhyun;Kim, Youngseop;Kim, Yeonmin;Park, Inho;Choi, JaeHak;Lee, Yonghwan;Han, Woori
    • Journal of the Semiconductor & Display Technology
    • /
    • v.15 no.2
    • /
    • pp.66-69
    • /
    • 2016
  • Recently, augmented reality and virtual reality in the ICT sector have been highlighted. So also interested in related HMD areas to facilitate contact with the VR content is being attend. This paper proposes a method for implementing to the virtual reality through the mobile HMD device with the real time 360 image. This system is required the real time 360 image streaming server configuration and image processing for augmented reality and virtual reality. The configuration of the streaming server is configured the DB server to store images and the relay server that can relay images to other devices. Augmented image processing module is composed based on markerless tracking, and there are four modules that are recognition, tracking, detecting and learning module. Also, the purpose of this paper is shown the augmented 360 image processing through the Mobile HMD.

Prerequisite Research for the Development of an End-to-End System for Automatic Tooth Segmentation: A Deep Learning-Based Reference Point Setting Algorithm (자동 치아 분할용 종단 간 시스템 개발을 위한 선결 연구: 딥러닝 기반 기준점 설정 알고리즘)

  • Kyungdeok Seo;Sena Lee;Yongkyu Jin;Sejung Yang
    • Journal of Biomedical Engineering Research
    • /
    • v.44 no.5
    • /
    • pp.346-353
    • /
    • 2023
  • In this paper, we propose an innovative approach that leverages deep learning to find optimal reference points for achieving precise tooth segmentation in three-dimensional tooth point cloud data. A dataset consisting of 350 aligned maxillary and mandibular cloud data was used as input, and both end coordinates of individual teeth were used as correct answers. A two-dimensional image was created by projecting the rendered point cloud data along the Z-axis, where an image of individual teeth was created using an object detection algorithm. The proposed algorithm is designed by adding various modules to the Unet model that allow effective learning of a narrow range, and detects both end points of the tooth using the generated tooth image. In the evaluation using DSC, Euclid distance, and MAE as indicators, we achieved superior performance compared to other Unet-based models. In future research, we will develop an algorithm to find the reference point of the point cloud by back-projecting the reference point detected in the image in three dimensions, and based on this, we will develop an algorithm to divide the teeth individually in the point cloud through image processing techniques.

Anomaly-based Alzheimer's disease detection using entropy-based probability Positron Emission Tomography images

  • Husnu Baris Baydargil;Jangsik Park;Ibrahim Furkan Ince
    • ETRI Journal
    • /
    • v.46 no.3
    • /
    • pp.513-525
    • /
    • 2024
  • Deep neural networks trained on labeled medical data face major challenges owing to the economic costs of data acquisition through expensive medical imaging devices, expert labor for data annotation, and large datasets to achieve optimal model performance. The heterogeneity of diseases, such as Alzheimer's disease, further complicates deep learning because the test cases may substantially differ from the training data, possibly increasing the rate of false positives. We propose a reconstruction-based self-supervised anomaly detection model to overcome these challenges. It has a dual-subnetwork encoder that enhances feature encoding augmented by skip connections to the decoder for improving the gradient flow. The novel encoder captures local and global features to improve image reconstruction. In addition, we introduce an entropy-based image conversion method. Extensive evaluations show that the proposed model outperforms benchmark models in anomaly detection and classification using an encoder. The supervised and unsupervised models show improved performances when trained with data preprocessed using the proposed image conversion method.

Enhancing Underwater Images through Deep Curve Estimation (깊은 곡선 추정을 이용한 수중 영상 개선)

  • Muhammad Tariq Mahmood;Young Kyu Choi
    • Journal of the Semiconductor & Display Technology
    • /
    • v.23 no.2
    • /
    • pp.23-27
    • /
    • 2024
  • Underwater images are typically degraded due to color distortion, light absorption, scattering, and noise from artificial light sources. Restoration of these images is an essential task in many underwater applications. In this paper, we propose a two-phase deep learning-based method, Underwater Deep Curve Estimation (UWDCE), designed to effectively enhance the quality of underwater images. The first phase involves a white balancing and color correction technique to compensate for color imbalances. The second phase introduces a novel deep learning model, UWDCE, to learn the mapping between the color-corrected image and its best-fitting curve parameter maps. The model operates iteratively, applying light-enhancement curves to achieve better contrast and maintain pixel values within a normalized range. The results demonstrate the effectiveness of our method, producing higher-quality images compared to state-of-the-art methods.

  • PDF