• Title/Summary/Keyword: block learning

Performance Evaluation of ResNet-based Pneumonia Detection Model with the Small Number of Layers Using Chest X-ray Images (흉부 X선 영상을 이용한 작은 층수 ResNet 기반 폐렴 진단 모델의 성능 평가)

  • Youngeun Choi;Seungwan Lee
    • Journal of radiological science and technology / v.46 no.4 / pp.277-285 / 2023
  • In this study, pneumonia identification networks with a small number of layers were constructed using chest X-ray images. The networks had similar numbers of trainable parameters, and the performance of the trained models was quantitatively evaluated as the network architecture was modified. A total of 6 networks were constructed: a convolutional neural network (CNN), VGGNet, GoogleNet, a residual network (ResNet) with identity blocks, a ResNet with bottleneck blocks, and a ResNet with identity and bottleneck blocks. The trainable parameters of the 6 networks were set in the range of 273,921-294,817 by adjusting the output channels of the convolution layers. The networks were trained with the binary cross entropy (BCE) loss function, the sigmoid activation function, and the adaptive moment estimation (Adam) optimizer for 100 epochs. The performance of the trained models was evaluated in terms of training time, accuracy, precision, recall, specificity, and F1-score. The results showed that the trained models with a small number of layers precisely detect pneumonia from chest X-ray images. In particular, the overall quantitative performance of the ResNet-based models was above 0.9, and their performance was similar or superior to that of the models based on the CNN, VGGNet, and GoogleNet. The residual blocks also affected the performance of the ResNet-based models. This study therefore demonstrated that object detection networks with a small number of layers are suitable for detecting pneumonia from chest X-ray images, and that ResNet-based models can be optimized by applying appropriate residual blocks.
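The evaluation metrics listed in this abstract all follow from the four counts of a binary confusion matrix. The sketch below uses the standard definitions; the function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts.

    tp/fp/tn/fn are true/false positive/negative counts
    (here "positive" would mean "pneumonia").
    """
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)        # of predicted positives, how many are correct
    recall = tp / (tp + fn)           # of actual positives, how many are found
    specificity = tn / (tn + fp)      # of actual negatives, how many are found
    f1 = 2 * precision * recall / (precision + recall)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical counts for illustration only (not results from the paper).
m = binary_metrics(tp=90, fp=10, tn=80, fn=20)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

The function assumes every denominator is non-zero; a model that never predicts the positive class would need explicit handling.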

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Model (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems / v.25 no.4 / pp.141-154 / 2019
  • Internet technology and social media are growing rapidly. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor from high-quality content through text data about products, and it has proliferated within text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning them to predefined categories such as positive and negative. It has been studied in various directions in terms of accuracy, from simple rule-based approaches to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active research areas in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether a website's posts are positive or negative, the customer response is reflected in sales, so companies try to identify this information. However, the reviews on a website are not always good, and they are difficult to identify. Earlier studies in this research area used review data from the Amazon.com shopping mall, but recent studies use data on stock market trends, blogs, news articles, weather forecasts, IMDB, Facebook, etc. However, a lack of accuracy is recognized because sentiment calculations change according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify sentiment polarity into positive and negative categories and to increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. 
First, popular machine learning algorithms for text classification related to sentiment analysis, such as NB (naive Bayes), SVM (support vector machines), XGBoost, RF (random forests), and Gradient Boost, are adopted as comparative models. Second, deep learning has demonstrated the ability to extract complex, discriminative features from data. Representative algorithms are CNN (convolutional neural networks), RNN (recurrent neural networks), and LSTM (long short-term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but it does not consider the sequential attributes of data. RNN handles ordered data well because it takes into account the time information of the data, but it suffers from the long-term dependency problem. LSTM is used to solve the long-term dependency problem. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although the algorithms have many parameters, we examined the relationship between parameter values and precision to find the optimal combination. We also tried to figure out how well the models work for sentiment analysis and how they work. This study proposes an integrated CNN and LSTM algorithm to extract the positive and negative features of text. The reasons for combining these two algorithms are as follows. CNN can automatically extract features for classification by applying convolution layers and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the input, output, and forget gates of the LSTM can be opened and closed at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. 
Furthermore, when LSTM is used in CNN's pooling layer, the model has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. The combined CNN-LSTM model achieved 90.33% accuracy. It is slower than CNN but faster than LSTM. The presented model was more accurate than the other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can compensate for the weaknesses of each model, and it has the advantage of improving learning layer by layer using the end-to-end structure of LSTM. For these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.
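The "faucet" behaviour of the input, forget, and output gates can be illustrated with a single step of a scalar LSTM cell. This is a minimal sketch of the standard LSTM update equations, not the authors' CNN-LSTM model; the function name `lstm_step`, the weight layout, and the example values are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, w):
    """One step of a scalar LSTM cell.

    w maps a gate name to (w_x, w_h, bias). Each gate is a value in (0, 1)
    that scales how much information passes, like a partly opened faucet.
    """
    def pre(name):
        w_x, w_h, b = w[name]
        return w_x * x + w_h * h_prev + b

    i = sigmoid(pre("input"))         # how much new information enters the cell
    f = sigmoid(pre("forget"))        # how much of the old cell state is kept
    o = sigmoid(pre("output"))        # how much of the cell state is exposed
    g = math.tanh(pre("candidate"))   # candidate value to write into the cell
    c = f * c_prev + i * g            # updated memory (cell state)
    h = o * math.tanh(c)              # updated hidden state
    return h, c


# Hypothetical weights: forget gate wide open, input gate almost closed,
# so the cell state is carried over nearly unchanged.
weights = {
    "input": (0.0, 0.0, -20.0),
    "forget": (0.0, 0.0, 20.0),
    "output": (0.0, 0.0, 0.0),
    "candidate": (1.0, 0.0, 0.0),
}
h, c = lstm_step(x=0.7, h_prev=0.0, c_prev=2.0, w=weights)
print(h, c)
```

Keeping the forget gate open while the input gate stays closed is what lets a cell carry information across many time steps.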

COMPARISON OF KEDI-WISC AND BGT PERFORMANCE BETWEEN THE ASPERGER'S DISORDER AND PDD NOS CHILDREN (아스퍼거장애와 비전형 자폐장애 아동의 KEDI-WISC와 BGT 수행의 비교)

  • Yang, Yoon-Ran;Shin, Min-Sup
    • Journal of the Korean Academy of Child and Adolescent Psychiatry / v.9 no.2 / pp.165-173 / 1998
  • Objectives: This study was conducted to compare the cognitive characteristics and visual-motor coordination ability of children with Asperger’s disorder (AS) with those of children with PDD NOS. Methods: 27 children (13 in the AS group and 14 in the PDD NOS group) were individually assessed using the K-WISC and BGT, and the results of those tests were analyzed. Results: The mean FSIQ of the AS group was significantly higher than that of the PDD NOS group. There was also a large discrepancy between VIQ and PIQ in the PDD NOS group, while there was no significant discrepancy in the AS group. The AS group was distinguished from the PDD NOS group by significantly higher scores on the Vocabulary and Comprehension subscales and a lower score on Block Design. Also, compared with the PDD NOS group, the AS group showed more difficulties in visual-motor coordination. Conclusion: The AS group showed relatively good verbal and learning ability, while the PDD NOS group showed relatively superior ability in visuospatial function and visual-motor coordination. The findings indicate that the K-WISC and BGT might be useful assessment tools for differentiating AS from PDD NOS.

Development of Convergence Education Program for Elementary School Gifted Education Based on Mathematics and Science (초등학교 영재교육을 위한 수학·과학 중심의 융합교육 프로그램 개발)

  • Ryu, Sung-Rim;Lee, Jong-Hak;Yoon, Ma-Byong;Kim, Hak-Sung
    • Journal of the Korea Convergence Society / v.9 no.10 / pp.217-228 / 2018
  • The purpose of this study is to develop a STEAM program for gifted education by combining educational content from the humanities, arts, engineering, technology, and design into various subjects, focusing on the elementary school mathematics-science curriculum. The achievement standards and content of the elementary mathematics-science curriculum were analyzed with the 2015 revised national curriculum in mind. Then, a 16 class-hour convergence education program consisting of 3-hour block time was developed by applying the STEAM model with 4 steps. The validity of the program developed through this process was verified, and four educational experts evaluated whether the program can be applied in elementary school. Based on the evaluation results, the convergence education program was finalized. As a result of implementing the gifted education program for mathematics-science, students achieved the objectives and values of convergence education, such as creative design, self-directed participation, cooperative learning, and interest in class activities (games, making). If this convergence education program is applied to regular classes, creative experiential classes, or classes for gifted children, students can develop their scientific creativity, artistic sensitivity, design sense, and so on.

A study on the characteristics of cyanobacteria in the mainstream of Nakdong river using decision trees (의사결정나무를 이용한 낙동강 본류 구간의 남조류 발생특성 연구)

  • Jung, Woo Suk;Jo, Bu Geon;Kim, Young Do;Kim, Sung Eun
    • Journal of Wetlands Research / v.21 no.4 / pp.312-320 / 2019
  • The occurrence of cyanobacteria causes problems such as oxygen depletion and an increase of organic matter in the water body due to mass proliferation and die-off. Each year, algal bloom warnings are issued due to the effects of summer heat and drought. For proactive algal bloom management in the main stream of the Nakdong River, the occurrence of cyanobacteria needs to be characterized quantitatively. In this study, we analyzed the major factors influencing cyanobacterial blooms using visualization and correlation analysis. A decision tree, a machine learning method, was used to quantitatively analyze the conditions under which cyanobacteria occur according to the influencing factors. In all the weirs, the meteorological factors, temperature and the SPI drought index, were significantly correlated with cyanobacterial cell number. An increase in the number of heat-wave and drought days blocks the mixing of water in the water body and, through stratification, promotes the development of cyanobacteria. In the long term, it is necessary to proactively manage cyanobacteria in consideration of meteorological impacts.

The Effect of Training Patch Size and ConvNeXt application on the Accuracy of CycleGAN-based Satellite Image Simulation (학습패치 크기와 ConvNeXt 적용이 CycleGAN 기반 위성영상 모의 정확도에 미치는 영향)

  • Won, Taeyeon;Jo, Su Min;Eo, Yang Dam
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.40 no.3 / pp.177-185 / 2022
  • A deep learning method was proposed for restoring occluded areas in high-resolution optical satellite images by referring to images taken with the same type of sensor. To keep the simulated image of the occluded region naturally continuous with the surrounding image while maintaining the pixel distribution of the original image as much as possible in the patch-segmented image, the CycleGAN (Cycle Generative Adversarial Network) method with the ConvNeXt block applied was used to analyze three experimental regions. In addition, we compared the experimental results for a training patch size of 512*512 pixels and a doubled size of 1024*1024 pixels. In experiments on three regions with different characteristics, the ConvNeXt CycleGAN methodology showed an improved R2 value compared with the existing CycleGAN-applied image and the histogram-matching image. In the experiment on training patch size, an R2 value of about 0.98 was obtained for a patch of 1024*1024 pixels. Furthermore, a comparison of the pixel distribution for each image band showed that the simulation result trained with the large patch size had a histogram distribution more similar to that of the original image. Therefore, ConvNeXt CycleGAN, which improves on the image produced with the existing CycleGAN method and on the histogram-matching image, makes it possible to derive simulation results similar to the original image and to perform a successful simulation.
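The R2 values quoted above compare simulated pixel values with the original image. The abstract does not state which definition was used, so the sketch below assumes the common coefficient of determination, 1 - SS_res / SS_tot; the function name `r_squared` and the pixel values are hypothetical.

```python
def r_squared(original, simulated):
    """Coefficient of determination between original and simulated pixel values.

    Assumes R2 = 1 - SS_res / SS_tot; the paper may use a different
    definition (for example, the squared Pearson correlation).
    """
    mean = sum(original) / len(original)
    ss_tot = sum((y - mean) ** 2 for y in original)
    ss_res = sum((y - s) ** 2 for y, s in zip(original, simulated))
    return 1.0 - ss_res / ss_tot


# Hypothetical pixel values for one image band (not data from the paper).
original_band = [10, 20, 30, 40]
simulated_band = [12, 18, 33, 39]
print(r_squared(original_band, simulated_band))
```

A value close to 1 means the simulated band reproduces the original pixel values with little residual error.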

Design and Implementation of BNN based Human Identification and Motion Classification System Using CW Radar (연속파 레이다를 활용한 이진 신경망 기반 사람 식별 및 동작 분류 시스템 설계 및 구현)

  • Kim, Kyeong-min;Kim, Seong-jin;NamKoong, Ho-jung;Jung, Yun-ho
    • Journal of Advanced Navigation Technology / v.26 no.4 / pp.211-218 / 2022
  • Continuous wave (CW) radar has advantages in reliability and accuracy compared to other sensors such as cameras and lidar. In addition, a binarized neural network (BNN) dramatically reduces memory usage and complexity compared to other deep learning networks. Therefore, this paper proposes a BNN-based human identification and motion classification system using CW radar. After a signal is received from the CW radar, a spectrogram is generated through a short-time Fourier transform (STFT). Based on this spectrogram, we propose an algorithm that detects whether a person approaches the radar. We also designed an optimized BNN model that achieves an accuracy of 90.0% for human identification and 98.3% for motion classification. To accelerate BNN operation, we designed a BNN hardware accelerator on a field programmable gate array (FPGA). The accelerator was implemented with 1,030 logics, 836 registers, and 334.904 Kbit of block memory, and real-time operation was confirmed with a total calculation time of 6 ms from inference to transfer of the result.
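The spectrogram step can be sketched as follows: slice the signal into overlapping frames, apply a window, and take the magnitude of each frame's discrete Fourier transform. This is a pure-Python illustration with a Hann window and a direct DFT; the frame length, hop size, and test tone are assumptions, not parameters from the paper.

```python
import cmath
import math


def stft_magnitude(signal, frame_len, hop):
    """Magnitude spectrogram via a short-time Fourier transform.

    Returns one list per frame, holding the magnitudes of DFT bins
    0 .. frame_len // 2. A direct O(N^2) DFT keeps the sketch
    dependency-free; real systems would use an FFT.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]  # periodic Hann window
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        seg = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            acc = sum(seg[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Hypothetical test tone: 4 cycles per 32-sample frame.
tone = [math.sin(2 * math.pi * 4 * n / 32) for n in range(128)]
spec = stft_magnitude(tone, frame_len=32, hop=16)
peak_bins = [max(range(len(f)), key=f.__getitem__) for f in spec]
print(len(spec), peak_bins)
```

For a Doppler radar signal, the peak bin in each frame tracks the dominant Doppler frequency over time, which is the kind of pattern a classifier can learn from.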

Analysis of Satisfaction of Pre-service and In-service Elementary Teachers with Artificial Intelligence Education using App Inventor

  • Junghee, Jo
    • Journal of the Korea Society of Computer and Information / v.28 no.3 / pp.189-196 / 2023
  • This paper analyzes the level of satisfaction of two groups of teachers who were educated about artificial intelligence using App Inventor. The participants were 13 pre-service and 9 in-service elementary school teachers, and the data were collected using a questionnaire. The results showed that in-service teachers were more satisfied than pre-service teachers in terms of interest, difficulty, and participation in the education. In addition, responses to the questions on whether the education helped motivate learning of artificial intelligence and whether there is a willingness to apply it to elementary classes in the future were also more positive for in-service teachers than for pre-service teachers. In general, pre-service teachers had somewhat more negative views than in-service teachers, but they were more positive than in-service teachers regarding whether the education helped improve their understanding of artificial intelligence and whether they were willing to participate in additional education. A Mann-Whitney test of whether satisfaction differed significantly between the two groups showed no significant difference. This may be because most of the participants in the two groups already had block-type or text-type programming experience, so they were able to participate in the education without any particular resistance or difficulty with App Inventor, resulting in high levels of satisfaction in both groups. The results of this study can provide basic data for the future development and operation of artificial intelligence education programs for both pre-service and in-service elementary school teachers.
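The Mann-Whitney test compares two independent groups by counting, over all cross-group pairs, how often a value from one group exceeds a value from the other. The sketch below computes only the U statistics from that definition, with ties counted as one half; the function name `mann_whitney_u` and the Likert-style scores are hypothetical, and no p-value is computed.

```python
def mann_whitney_u(group_a, group_b):
    """Mann-Whitney U statistics for two independent samples.

    U_a counts the pairs (a, b) with a > b, adding 0.5 for each tie.
    U_a + U_b always equals len(group_a) * len(group_b).
    """
    u_a = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                u_a += 1.0
            elif a == b:
                u_a += 0.5
    u_b = len(group_a) * len(group_b) - u_a
    return u_a, u_b


# Hypothetical satisfaction scores on a 5-point scale (not the study's data).
in_service = [4, 5, 5, 3]
pre_service = [3, 4, 2]
u_in, u_pre = mann_whitney_u(in_service, pre_service)
print(u_in, u_pre)
```

In practice the smaller of the two U values is compared against a critical value or converted to a p-value, for example with a statistics library.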

Multi-View 3D Human Pose Estimation Based on Transformer (트랜스포머 기반의 다중 시점 3차원 인체자세추정)

  • Seoung Wook Choi;Jin Young Lee;Gye Young Kim
    • Smart Media Journal / v.12 no.11 / pp.48-56 / 2023
  • Three-dimensional human pose estimation technology is used in sports, motion recognition, and special effects for video media. Among the various methods, multi-view 3D human pose estimation is essential for precise estimation even in complex real-world environments. However, existing models for multi-view 3D human pose estimation have the disadvantage of high time complexity because they use 3D feature maps. This paper proposes a method that extends an existing Transformer-based monocular multi-frame model with lower time complexity to multi-view 3D human pose estimation. To extend to multiple viewpoints, the proposed method first generates 8-dimensional joint coordinates by concatenating the 2-dimensional joint coordinates of 17 joints at 4 viewpoints, acquired using the 2-dimensional human pose detector CPN (Cascaded Pyramid Network). It then converts them into 17×32 data with patch embedding and finally feeds the data into a transformer model. Consequently, the MLP (Multi-Layer Perceptron) block that outputs the 3D human pose simultaneously updates the 3D human pose estimates for the 4 viewpoints at every iteration. Compared to Zheng[5]'s method, the number of model parameters of the proposed method was 48.9%, MPJPE (Mean Per Joint Position Error) was reduced by 20.6 mm (43.8%), and the average learning time per epoch was more than 20 times faster.
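The input construction described in this abstract (17 joints, 4 viewpoints, 2 coordinates each, then a 17×32 embedding) can be sketched in plain Python. The patch embedding is assumed here to be a per-joint linear projection from 8 to 32 dimensions, which the abstract does not confirm; the function names, random weights, and dummy coordinates are all hypothetical.

```python
import random

NUM_JOINTS, NUM_VIEWS, EMBED_DIM = 17, 4, 32


def concat_views(views):
    """views[v][j] is the (x, y) of joint j seen from view v.

    Returns 17 tokens of 8 values: (x, y) from each of the 4 views.
    """
    return [[c for v in range(NUM_VIEWS) for c in views[v][j]]
            for j in range(NUM_JOINTS)]


def patch_embed(tokens, weight, bias):
    """Assumed patch embedding: a linear projection of each 8-D token to 32-D."""
    return [[sum(t[i] * weight[i][d] for i in range(len(t))) + bias[d]
             for d in range(EMBED_DIM)]
            for t in tokens]


# Dummy 2D detections standing in for CPN output (not real data).
views = [[(v * 100 + j, v * 100 + j + 0.5) for j in range(NUM_JOINTS)]
         for v in range(NUM_VIEWS)]
tokens = concat_views(views)

# Random projection weights; in a real model these would be learned.
rng = random.Random(0)
weight = [[rng.uniform(-0.1, 0.1) for _ in range(EMBED_DIM)]
          for _ in range(NUM_VIEWS * 2)]
bias = [0.0] * EMBED_DIM
embedded = patch_embed(tokens, weight, bias)
print(len(tokens), len(tokens[0]), len(embedded), len(embedded[0]))
```

Working on a 17×8 token matrix rather than 3D feature maps is what keeps the input to the transformer small.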

Development and Application of Creative Education Learning Program Using Creative Thinking Methods (창의적 사고기법을 활용한 창의교육 수업프로그램 개발 및 적용)

  • Han, Shin;Kim, Hyoungbum;Lee, Chang-Hwan
    • Journal of the Korean Society of Earth Science Education / v.13 no.2 / pp.162-174 / 2020
  • This study aimed to develop a creative education class program using metaphor, one of the creative thinking techniques, and to examine the effectiveness of the program with 338 randomly sampled students in six middle schools. The creative education class program using metaphor was developed based on the 'astronomy' content elements of the 2015 revised science curriculum in South Korea. The program was tested for validity after being modified and supplemented three times by a group of experts, and the final version of the program was applied in schools over four class periods, including block time. To determine the effectiveness of the program and its implementation, a creative education class satisfaction test and a creative thinking process test were conducted. Specifically, the creative education class satisfaction test was conducted before treatment and the creative thinking process test was conducted both before and after treatment. The results of the study are as follows. First, the program was developed with an emphasis on students voluntarily and actively participating in creative education programs while utilizing creative thinking methods. Second, the comparison of pre- and post-class scores for the creative education program using metaphor showed statistically significant results (p<.05). In other words, the dependent two-sample test on students' pre- and post-class scores showed a statistically significant difference (p<.05). The creative education program using metaphor thus had a positive impact on the research participants. Third, regarding the results of the creative education class satisfaction test, 101 out of 338 students (30%) answered 'Strongly Agree' and 137 (41%) answered 'Agree', indicating that the subjects' satisfaction with the class was generally high. 
On the other hand, concerning the difficulties of the creative class, 137 (41%) answered "Lack of time" as the main factor, followed by 98 (30%) who answered "Difficulties of problems they were required to solve", 73 (22%) who answered "Conflicts with friends", and 24 (7%) who answered "Difficulties of contents." These responses were taken into account as considerations for the further development of creative education programs.