• Title/Summary/Keyword: Supervised Classification

검색결과 415건 처리시간 0.026초

손동작 인식 시스템을 위한 동적 학습 알고리즘 (Dynamic Training Algorithm for Hand Gesture Recognition System)

  • 배철수
    • 한국정보통신학회논문지
    • /
    • 제11권7호
    • /
    • pp.1348-1353
    • /
    • 2007
  • 본 논문에서는 카메라-투영 시스템에서 비전에 기반을 둔 손동작 인식을 위한 새로운 알고리즘을 제안하고 있다. 제안된 인식방법은 정적인 손동작 분류를 위하여 푸리에 변환을 사용하였다. 손 분할은 개선된 배경 제거 방법을 사용하였다. 대부분의 인식방법들이 같은 피검자에 의해 학습과 실험이 이루어지고 상호작용에 이전에 학습단계가 필요하다. 그러나 학습되지 않은 다양한 상황에 대해서도 상호작용을 위해 동작 인식이 요구된다. 그러므로 본 논문에서는 인식 작업 중에 검출된 불완전한 동작들을 정정하여 적용하였다. 그 결과 사용자와 독립되게 동작을 인식함으로써 새로운 사용자에게 신속하게 온라인 적용이 가능하였다.

Analysis of urbanization factor in river boundary using aerial image

  • Lee, Geun-Sang;Lee, Hyun-Seok;Chae, Hyo-Sok;Hwang, Eui-Ho
    • 대한원격탐사학회지
    • /
    • 제22권5호
    • /
    • pp.421-425
    • /
    • 2006
  • It can be important framework data to monitor the change of land-use pattern of river boundary in design and management of river. This study analyzed the change of land-use pattern of Gab and Yudeung River using time-series aerial images. To do this, we carried out radiation and geometric correction of image, and estimated land-use changes in inland and floodplain. As the analysis of inland, the ratio of residential, commercial, industrial, educational and public area, that is urbanized element, increases, but that of agricultural area shows a decline on the basis of 1990. Also, Minimum Distance Method, which is a kind of supervised classification method, is applied to extract water-body and sand bar layer in floodplain. As the analysis of land-use, the ratio of level-upped riverside land and water-body increases, but that of sand bar decreases. These time-series land use information can be important decision making data to evaluate the urbanization of river boundary, and especially it gives us goodness in river development project such as the composition of ecological habitat.

Will You Buy It Now?: Predicting Passengers that Purchase Premium Promotions Using the PAX Model

  • Al Emadi, Noora;Thirumuruganathan, Saravanan;Robillos, Dianne Ramirez;Jansen, Bernard Jim
    • Journal of Smart Tourism
    • /
    • 제1권1호
    • /
    • pp.53-64
    • /
    • 2021
  • Upselling is often a critical factor in revenue generation for businesses in the tourism and travel industry. Utilizing passenger data from a major international airline company, we develop the PAX (Passenger, Airline, eXternal) model to predict passengers that are most likely to accept an upgrade offer from economy to premium. Formulating the problem as an extremely unbalanced, cost-sensitive, supervised binary classification, we predict if a customer will take an upgrade offer. We use a feature vector created from the historical data of 3 million passenger records from 2017 to 2019, in which passengers received approximately 635,000 upgrade offers worth more than $422,000,000 U.S. dollars. The model has an F1-score of 0.75, outperforming the airline's current rule-based approach. Findings have several practical applications, including identifying promising customers for upselling and minimizing the number of indiscriminate emails sent to customers. Accurately identifying the few customers who will react positively to upgrade offers is of paramount importance given the airline 'industry's razor-thin margins. Research results have significant real-world impacts because there is the potential to improve targeted upselling to customers in the airline and related industries.

커널 밀도 측정에서의 나이브 베이스 접근 방법 (Naive Bayes Approach in Kernel Density Estimation)

  • 샹총량;유샹루;아메드 압둘하킴 알-압시;강대기
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2014년도 춘계학술대회
    • /
    • pp.76-78
    • /
    • 2014
  • 나이브 베이스 학습은 유명하면서도, 빠르면서도 효과적인 지도 학습 방법으로, 다소 잡음을 가진 라벨이 있는 데이터집합을 다루는 데 좋은 성능을 보인다. 그러나, 나이브 베이스의 조건적 독립성 가정은 실세계 데이터를 다루는 데 필요한 특성에 다소 제약사항을 가지게 한다. 지금까지 연구자들이 이 조건적 독립성 가정을 완화시키는 방법들을 제안해 왔다. 이러한 방법들은 어트리뷰트 가중치, 커널 밀도 측정 등이 있다. 본 논문에서, 우리는 커널 밀도 측정과 어트리뷰트 가증치를 이용하여 나이브 베이스의 학습 효과를 개선하기 위한 NB Based on Attribute Weighting in Kernel Density Estimation (NBAWKDE) 이라는 새로운 접근 방법을 제안한다.

  • PDF

Image-to-Image Translation with GAN for Synthetic Data Augmentation in Plant Disease Datasets

  • Nazki, Haseeb;Lee, Jaehwan;Yoon, Sook;Park, Dong Sun
    • 스마트미디어저널
    • /
    • 제8권2호
    • /
    • pp.46-57
    • /
    • 2019
  • In recent research, deep learning-based methods have achieved state-of-the-art performance in various computer vision tasks. However, these methods are commonly supervised, and require huge amounts of annotated data to train. Acquisition of data demands an additional costly effort, particularly for the tasks where it becomes challenging to obtain large amounts of data considering the time constraints and the requirement of professional human diligence. In this paper, we present a data level synthetic sampling solution to learn from small and imbalanced data sets using Generative Adversarial Networks (GANs). The reason for using GANs are the challenges posed in various fields to manage with the small datasets and fluctuating amounts of samples per class. As a result, we present an approach that can improve learning with respect to data distributions, reducing the partiality introduced by class imbalance and hence shifting the classification decision boundary towards more accurate results. Our novel method is demonstrated on a small dataset of 2789 tomato plant disease images, highly corrupted with class imbalance in 9 disease categories. Moreover, we evaluate our results in terms of different metrics and compare the quality of these results for distinct classes.

Obesity Level Prediction Based on Data Mining Techniques

  • Alqahtani, Asma;Albuainin, Fatima;Alrayes, Rana;Al muhanna, Noura;Alyahyan, Eyman;Aldahasi, Ezaz
    • International Journal of Computer Science & Network Security
    • /
    • 제21권3호
    • /
    • pp.103-111
    • /
    • 2021
  • Obesity affects individuals of all gender and ages worldwide; consequently, several studies have performed great works to define factors causing it. This study develops an effective method to trace obesity levels based on supervised data mining techniques such as Random Forest and Multi-Layer Perception (MLP), so as to tackle this universal epidemic. Notably, the dataset was from countries like Mexico, Peru, and Colombia in the 14- 61year age group, with varying eating habits and physical conditions. The data includes 2111 instances and 17 attributes labelled using NObesity, which facilitates categorization of data using Overweight Levels l I and II, Insufficient Weight, Normal Weight, as well as Obesity Type I to III. This study found that the highest accuracy was achieved by Random Forest algorithm in comparison to the MLP algorithm, with an overall classification rate of 96.7%.

An Efficient Machine Learning-based Text Summarization in the Malayalam Language

  • P Haroon, Rosna;Gafur M, Abdul;Nisha U, Barakkath
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권6호
    • /
    • pp.1778-1799
    • /
    • 2022
  • Automatic text summarization is a procedure that packs enormous content into a more limited book that incorporates significant data. Malayalam is one of the toughest languages utilized in certain areas of India, most normally in Kerala and in Lakshadweep. Natural language processing in the Malayalam language is relatively low due to the complexity of the language as well as the scarcity of available resources. In this paper, a way is proposed to deal with the text summarization process in Malayalam documents by training a model based on the Support Vector Machine classification algorithm. Different features of the text are taken into account for training the machine so that the system can output the most important data from the input text. The classifier can classify the most important, important, average, and least significant sentences into separate classes and based on this, the machine will be able to create a summary of the input document. The user can select a compression ratio so that the system will output that much fraction of the summary. The model performance is measured by using different genres of Malayalam documents as well as documents from the same domain. The model is evaluated by considering content evaluation measures precision, recall, F score, and relative utility. Obtained precision and recall value shows that the model is trustable and found to be more relevant compared to the other summarizers.

Tri-training algorithm based on cross entropy and K-nearest neighbors for network intrusion detection

  • Zhao, Jia;Li, Song;Wu, Runxiu;Zhang, Yiying;Zhang, Bo;Han, Longzhe
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권12호
    • /
    • pp.3889-3903
    • /
    • 2022
  • To address the problem of low detection accuracy due to training noise caused by mislabeling when Tri-training for network intrusion detection (NID), we propose a Tri-training algorithm based on cross entropy and K-nearest neighbors (TCK) for network intrusion detection. The proposed algorithm uses cross-entropy to replace the classification error rate to better identify the difference between the practical and predicted distributions of the model and reduce the prediction bias of mislabeled data to unlabeled data; K-nearest neighbors are used to remove the mislabeled data and reduce the number of mislabeled data. In order to verify the effectiveness of the algorithm proposed in this paper, experiments were conducted on 12 UCI datasets and NSL-KDD network intrusion datasets, and four indexes including accuracy, recall, F-measure and precision were used for comparison. The experimental results revealed that the TCK has superior performance than the conventional Tri-training algorithms and the Tri-training algorithms using only cross-entropy or K-nearest neighbor strategy.

Field Test of Automated Activity Classification Using Acceleration Signals from a Wristband

  • Gong, Yue;Seo, JoonOh
    • 국제학술발표논문집
    • /
    • The 8th International Conference on Construction Engineering and Project Management
    • /
    • pp.443-452
    • /
    • 2020
  • Worker's awkward postures and unreasonable physical load can be corrected by monitoring construction activities, thereby increasing the safety and productivity of construction workers and projects. However, manual identification is time-consuming and contains high human variance. In this regard, an automated activity recognition system based on inertial measurement unit can help in rapidly and precisely collecting motion data. With the acceleration data, the machine learning algorithm will be used to train classifiers for automatically categorizing activities. However, input acceleration data are extracted either from designed experiments or simple construction work in previous studies. Thus, collected data series are discontinuous and activity categories are insufficient for real construction circumstances. This study aims to collect acceleration data during long-term continuous work in a construction project and validate the feasibility of activity recognition algorithm with the continuous motion data. The data collection covers two different workers performing formwork at the same site. An accelerator, as well as portable camera, is attached to the worker during the entire working session for simultaneously recording motion data and working activity. The supervised machine learning-based models are trained to classify activity in hierarchical levels, which reaches a 96.9% testing accuracy of recognizing rest and work and 85.6% testing accuracy of identifying stationary, traveling, and rebar installation actions.

  • PDF

Leveraging Analytics for Talent Acquisition: Case of IT Sector in India

  • Avik Ghosh;Bhaskar Basu
    • Asia pacific journal of information systems
    • /
    • 제30권4호
    • /
    • pp.879-918
    • /
    • 2020
  • One of the challenges faced by Talent Acquisition teams today pertains to the acquisition of human resources by matching job descriptions and skillsets desired. It is more so in the case of competitive sectors like the Indian IT sector. There can be various channels for Talent Acquisition and accordingly, the cost and benefits might vary. However, the consequences of a mismatch have an impact on the quality of deliverables, high recruitment expenses and loss of revenue for the organization. With increased and diverse sources of data that are available to organizations today, there is ample opportunity to apply analytics for informed decision making in this field. This paper reveals useful insights that help streamline the Talent Acquisition process in the Indian IT Industry. The paper adopts a data-centric approach to examine the critical determinants for efficient and effective Talent Acquisition process in IT organizations. Selected supervised machine learning algorithms are applied for the analysis of the dataset. The study is likely to help organizations in reassessing their talent acquisition strategy with respect to key parameters like expected cost to company (CTC), candidate sourcing channels and optimal joining period.