• Title/Summary/Keyword: 특징 추출부 학습

Search Result 47, Processing Time 0.025 seconds

Spam-Mail Filtering System Using Weighted Bayesian Classifier (가중치가 부여된 베이지안 분류자를 이용한 스팸 메일 필터링 시스템)

  • 김현준;정재은;조근식
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.8
    • /
    • pp.1092-1100
    • /
    • 2004
  • An E-mails have regarded as one of the most popular methods for exchanging information because of easy usage and low cost. Meanwhile, exponentially growing unwanted mails in user's mailbox have been raised as main problem. Recognizing this issue, Korean government established a law in order to prevent e-mail abuse. In this paper we suggest hybrid spam mail filtering system using weighted Bayesian classifier which is extended from naive Bayesian classifier by adding the concept of preprocessing and intelligent agents. This system can classify spam mails automatically by using training data without manual definition of message rules. Particularly, we improved filtering efficiency by imposing weight on some character by feature extraction from spam mails. Finally, we show efficiency comparison among four cases - naive Bayesian, weighting on e-mail header, weighting on HTML tags, weighting on hyperlinks and combining all of four cases. As compared with naive Bayesian classifier, the proposed system obtained 5.7% decreased precision, while the recall and F-measure of this system increased by 33.3% and 31.2%, respectively.

Analysis of High School Students' Conceptual Differentiation Patterns using Concept map (개념도를 이용한 고등학생의 개념 분화 유형 분석)

  • Sim, Jae-Ho;Chung, Wan-Ho;Lee, Kil-Jae;Hong, Jun-Euy
    • Journal of The Korean Association For Science Education
    • /
    • v.24 no.2
    • /
    • pp.246-257
    • /
    • 2004
  • The purpose of this qualitative study was to identify high school students' conceptual differentiation patterns on human digestion system. The subjects were 124 high school students and this group was guided to independently construct concept maps. Among them, 19 were selected for an in-depth interview and a short test. The concept maps, interview transcripts and the results of short-test were analyzed to identify conceptual differentiation patterns. The results were as follows. Mainly three distinct conceptual differentiation patterns were identified. The first pattern can be named as an 'Free-flow type'. The group belongs to this pattern expressed numerous examples than meaningful concepts with unclear understanding of hierarchial relation between each concepts. Also, this group had difficulties in grasp interrelations of different concepts. The second pattern can be identified as 'Sequence type'. This group constructed concept maps by featuring conceptual sequence. The group applied meaningful learning, yet assembled concept maps primarily according to sequence of learning and exhibited less organized concept maps than hierarchial type. The third pattern can be named as 'Hierarchial type'. All students elaborated concept maps after lessons. The sequence type changed hierarchial type or sequence mixed with hierarchial type but free-flow type was hardly changed.

Development and evaluation of Pre-Parenthood Education Program for high school students based on Home Economics subject (고등학생을 위한 가정교과 기반 예비부모교육 프로그램 개발 및 평가)

  • Noh, Heui-Yeon;Cho, Jae Soon;Chae, Jung Hyun
    • Journal of Korean Home Economics Education Association
    • /
    • v.29 no.4
    • /
    • pp.161-193
    • /
    • 2017
  • The purpose of this study was to develop and evaluate pre-parenthood education program(PPEP) based on Home Economics(HE) subject for high school students. The development and evaluation of PPEP based on HE subject in this study followed ADDIE model except implementation through 4 processes such as analysis, design, development, and evaluation. First, program development directions were set in three aspects such as 'general development', 'contents', and 'teaching and learning methods'. Themes of the program are 11 in total such as '1. Parenting, what is being a parent', '2. Choosing your spouse, happy marital relationship, the best gift to your children', '3. Pregnancy and birth, a moving meeting with a new life', '4. Taking care of a new born infant for 24 hours', '5. Taking care of infants, relationship with my lovely baby, attachment', '6. Taking care of young children, my child from another planet', '7. Parents and children in healthy family', '8. Parent-child relationship, wise parents to make effective interaction with their children', '9. Parents safety manager at home,', '10. Practice to take care of infants', and '11. Practice of community nurturing support service development'. In particular, learning activities of the program have major characteristics such as 1) utilization of cases including practice problems related to parenting, 2) community exchange activities utilizing learned knowledge and techniques, 3) actual life project activities utilizing learning contents related with parenting, 4) activities inducing positive changes in current life of high school students, and 5) practice activities for the necessities of life such as food, clothing and shelter supporting development of children. Second, the program was developed according to the design. Teaching-learning plans and materials for 17 classes were developed according to 11 themes. The developed plans include class flow and teacher's reference. It starts with receiving a class-related message from a virtual child at the introduction stage and ended with replying to the message by summarizing contents of the class and making a promise as a parent-to-be. That is the basic frame of class flow. Learning materials included various plans and reports necessary for learning activities and they are prepared in details so that they can be play the role of textbooks in regular curriculum. Third, evaluation of developed program was executed by a 5 point Likert scale survey on 13 HE experts on two aspects of program development process and program development results. In the evaluation of development process, mean value was 4.61 and index of content validity was 97.4%. For development results, mean value was 4.37 and index of content validity was 86.9%. These values showed that validity in the development process and results in this study was highly secured and confirmed that PPEP based on HE was appropriate and valid to enhance parent qualifications of high school learners.

Analysis of Verbal Interaction Between Teachers and Students in Middle School Science Classroom (중학교 과학 수업에서 교사와 학생의 언어적 상호작용 분석)

  • Choi, Kyung-Hee;Park, Jong-Yoon;Choi, Byung-Soon;Nam, Jeong-Hee;Choi, Kyung-Soon;Lee, Ki-Soon
    • Journal of The Korean Association For Science Education
    • /
    • v.24 no.6
    • /
    • pp.1039-1048
    • /
    • 2004
  • The purpose of this study is to analyze verbal interaction between teachers and students in order to collect qualitative data on the characteristics of the interaction to enhance teaching efficacy. Total of 12 classes of eight science teachers were observed and were interviewed. The classes were video taped and all the verbal interactions were transcribed. The transcribed content and interviews were further analyzed to draw any conclusions on the verbal interaction between teachers and students. Analysis criteria for the data on the class and interview were developed based on the literature review and applied to analyze the collected content. The analyzed data showed that verbal interactions composed of confirmation questions for memorization, students' short responses and teacher's immediate feedbacks. The results of the study also suggested that there needs to be further studies on the interactional techniques for teacher in utilizing the class materials and activities. The teachers should acknowledge the importance of the questions and feedbacks of teachers for students to stimulate their sound learning through literatures.

The Accuracy Assessment of Species Classification according to Spatial Resolution of Satellite Image Dataset Based on Deep Learning Model (딥러닝 모델 기반 위성영상 데이터세트 공간 해상도에 따른 수종분류 정확도 평가)

  • Park, Jeongmook;Sim, Woodam;Kim, Kyoungmin;Lim, Joongbin;Lee, Jung-Soo
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1407-1422
    • /
    • 2022
  • This study was conducted to classify tree species and assess the classification accuracy, using SE-Inception, a classification-based deep learning model. The input images of the dataset used Worldview-3 and GeoEye-1 images, and the size of the input images was divided into 10 × 10 m, 30 × 30 m, and 50 × 50 m to compare and evaluate the accuracy of classification of tree species. The label data was divided into five tree species (Pinus densiflora, Pinus koraiensis, Larix kaempferi, Abies holophylla Maxim. and Quercus) by visually interpreting the divided image, and then labeling was performed manually. The dataset constructed a total of 2,429 images, of which about 85% was used as learning data and about 15% as verification data. As a result of classification using the deep learning model, the overall accuracy of up to 78% was achieved when using the Worldview-3 image, the accuracy of up to 84% when using the GeoEye-1 image, and the classification accuracy was high performance. In particular, Quercus showed high accuracy of more than 85% in F1 regardless of the input image size, but trees with similar spectral characteristics such as Pinus densiflora and Pinus koraiensis had many errors. Therefore, there may be limitations in extracting feature amount only with spectral information of satellite images, and classification accuracy may be improved by using images containing various pattern information such as vegetation index and Gray-Level Co-occurrence Matrix (GLCM).

Fault Detection Technique for PVDF Sensor Based on Support Vector Machine (서포트벡터머신 기반 PVDF 센서의 결함 예측 기법)

  • Seung-Wook Kim;Sang-Min Lee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.5
    • /
    • pp.785-796
    • /
    • 2023
  • In this study, a methodology for real-time classification and prediction of defects that may appear in PVDF(Polyvinylidene fluoride) sensors, which are widely used for structural integrity monitoring, is proposed. The types of sensor defects appearing according to the sensor attachment environment were classified, and an impact test using an impact hammer was performed to obtain an output signal according to the defect type. In order to cleary identify the difference between the output signal according to the defect types, the time domain statistical features were extracted and a data set was constructed. Among the machine learning based classification algorithms, the learning of the acquired data set and the result were analyzed to select the most suitable algorithm for detecting sensor defect types, and among them, it was confirmed that the highest optimization was performed to show SVM(Support Vector Machine). As a result, sensor defect types were classified with an accuracy of 92.5%, which was up to 13.95% higher than other classification algorithms. It is believed that the sensor defect prediction technique proposed in this study can be used as a base technology to secure the reliability of not only PVDF sensors but also various sensors for real time structural health monitoring.

Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront (비정형 정보와 CNN 기법을 활용한 이진 분류 모델의 고객 행태 예측: 전자상거래 사례를 중심으로)

  • Kim, Seungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.221-241
    • /
    • 2018
  • Deep learning is getting attention recently. The deep learning technique which had been applied in competitions of the International Conference on Image Recognition Technology(ILSVR) and AlphaGo is Convolution Neural Network(CNN). CNN is characterized in that the input image is divided into small sections to recognize the partial features and combine them to recognize as a whole. Deep learning technologies are expected to bring a lot of changes in our lives, but until now, its applications have been limited to image recognition and natural language processing. The use of deep learning techniques for business problems is still an early research stage. If their performance is proved, they can be applied to traditional business problems such as future marketing response prediction, fraud transaction detection, bankruptcy prediction, and so on. So, it is a very meaningful experiment to diagnose the possibility of solving business problems using deep learning technologies based on the case of online shopping companies which have big data, are relatively easy to identify customer behavior and has high utilization values. Especially, in online shopping companies, the competition environment is rapidly changing and becoming more intense. Therefore, analysis of customer behavior for maximizing profit is becoming more and more important for online shopping companies. In this study, we propose 'CNN model of Heterogeneous Information Integration' using CNN as a way to improve the predictive power of customer behavior in online shopping enterprises. In order to propose a model that optimizes the performance, which is a model that learns from the convolution neural network of the multi-layer perceptron structure by combining structured and unstructured information, this model uses 'heterogeneous information integration', 'unstructured information vector conversion', 'multi-layer perceptron design', and evaluate the performance of each architecture, and confirm the proposed model based on the results. In addition, the target variables for predicting customer behavior are defined as six binary classification problems: re-purchaser, churn, frequent shopper, frequent refund shopper, high amount shopper, high discount shopper. In order to verify the usefulness of the proposed model, we conducted experiments using actual data of domestic specific online shopping company. This experiment uses actual transactions, customers, and VOC data of specific online shopping company in Korea. Data extraction criteria are defined for 47,947 customers who registered at least one VOC in January 2011 (1 month). The customer profiles of these customers, as well as a total of 19 months of trading data from September 2010 to March 2012, and VOCs posted for a month are used. The experiment of this study is divided into two stages. In the first step, we evaluate three architectures that affect the performance of the proposed model and select optimal parameters. We evaluate the performance with the proposed model. Experimental results show that the proposed model, which combines both structured and unstructured information, is superior compared to NBC(Naïve Bayes classification), SVM(Support vector machine), and ANN(Artificial neural network). Therefore, it is significant that the use of unstructured information contributes to predict customer behavior, and that CNN can be applied to solve business problems as well as image recognition and natural language processing problems. It can be confirmed through experiments that CNN is more effective in understanding and interpreting the meaning of context in text VOC data. And it is significant that the empirical research based on the actual data of the e-commerce company can extract very meaningful information from the VOC data written in the text format directly by the customer in the prediction of the customer behavior. Finally, through various experiments, it is possible to say that the proposed model provides useful information for the future research related to the parameter selection and its performance.