• Title/Summary/Keyword: one class classification

Search Result 348, Processing Time 0.029 seconds

Feature Selection Using Submodular Approach for Financial Big Data

  • Attigeri, Girija;Manohara Pai, M.M.;Pai, Radhika M.
    • Journal of Information Processing Systems
    • /
    • v.15 no.6
    • /
    • pp.1306-1325
    • /
    • 2019
  • As the world is moving towards digitization, data is generated from various sources at a faster rate. It is getting humungous and is termed as big data. The financial sector is one domain which needs to leverage the big data being generated to identify financial risks, fraudulent activities, and so on. The design of predictive models for such financial big data is imperative for maintaining the health of the country's economics. Financial data has many features such as transaction history, repayment data, purchase data, investment data, and so on. The main problem in predictive algorithm is finding the right subset of representative features from which the predictive model can be constructed for a particular task. This paper proposes a correlation-based method using submodular optimization for selecting the optimum number of features and thereby, reducing the dimensions of the data for faster and better prediction. The important proposition is that the optimal feature subset should contain features having high correlation with the class label, but should not correlate with each other in the subset. Experiments are conducted to understand the effect of the various subsets on different classification algorithms for loan data. The IBM Bluemix BigData platform is used for experimentation along with the Spark notebook. The results indicate that the proposed approach achieves considerable accuracy with optimal subsets in significantly less execution time. The algorithm is also compared with the existing feature selection and extraction algorithms.

Credit Card Bad Debt Prediction Model based on Support Vector Machine (신용카드 대손회원 예측을 위한 SVM 모형)

  • Kim, Jin Woo;Jhee, Won Chul
    • Journal of Information Technology Services
    • /
    • v.11 no.4
    • /
    • pp.233-250
    • /
    • 2012
  • In this paper, credit card delinquency means the possibility of occurring bad debt within the certain near future from the normal accounts that have no debt and the problem is to predict, on the monthly basis, the occurrence of delinquency 3 months in advance. This prediction is typical binary classification problem but suffers from the issue of data imbalance that means the instances of target class is very few. For the effective prediction of bad debt occurrence, Support Vector Machine (SVM) with kernel trick is adopted using credit card usage and payment patterns as its inputs. SVM is widely accepted in the data mining society because of its prediction accuracy and no fear of overfitting. However, it is known that SVM has the limitation in its ability to processing the large-scale data. To resolve the difficulties in applying SVM to bad debt occurrence prediction, two stage clustering is suggested as an effective data reduction method and ensembles of SVM models are also adopted to mitigate the difficulty due to data imbalance intrinsic to the target problem of this paper. In the experiments with the real world data from one of the major domestic credit card companies, the suggested approach reveals the superior prediction accuracy to the traditional data mining approaches that use neural networks, decision trees or logistics regressions. SVM ensemble model learned from T2 training set shows the best prediction results among the alternatives considered and it is noteworthy that the performance of neural networks with T2 is better than that of SVM with T1. These results prove that the suggested approach is very effective for both SVM training and the classification problem of data imbalance.

A Model for Effective Customer Classification Using LTV and Churn Probability : Application of Holistic Profit Method (고객의 이탈 가능성과 LTV를 이용한 고객등급화 모형개발에 관한 연구)

  • Lee, HoonYoung;Yang, JooHwan;Ryu, Chi Hun
    • Journal of Intelligence and Information Systems
    • /
    • v.12 no.4
    • /
    • pp.109-126
    • /
    • 2006
  • An effective customer classification has been essential for the successful customer relationship management. The typical customer rating is carried out by the proportionally allocating the customers into classes in terms of their life time values. However, since this method does not accurately reflect the homogeneity within a class along with the heterogeneity between classes, there would be many problems incurred due to the misclassification. This paper suggests a new method of rating customer using Holistic profit technique, and validates the new method using the customer data provided by an insurance company. Holistic profit is one of the methods used for deciding the cutoff score in screening the loan application. By rating customers using the proposed techniques, insurance companies could effectively perform customer relationship management and diverse marketing activities.

  • PDF

Correlation of Clinical Class with Duplex Ultrasound Findings in Lower Limb Chronic Venous Disease

  • Hong, Ki Pyo
    • Journal of Chest Surgery
    • /
    • v.55 no.3
    • /
    • pp.233-238
    • /
    • 2022
  • Background: This study investigated the distribution of valve incompetence in patients with chronic venous disease (CVD) and its correlation with the clinical category of the clinical, etiological, anatomical, and pathophysiological (CEAP) classification. Methods: In total, 1,386 limbs with clinically suspected CVD were categorized according to the CEAP classification and consecutively underwent duplex ultrasonography between April 2017 and December 2020. Results: There were 362 limbs in male patients and 1,024 limbs in female patients. The limbs were classified as C0s-C1 (608 limbs, 43.8%), C2 (727 limbs, 52.5%), or C3-C6 (51 limbs, 3.7%). The prevalence of saphenous vein incompetence in CEAP C0s-C1 limbs was 43.6%. The saphenofemoral junction (SFJ) was competent in 37% of CEAP C2-C6 limbs. The CEAP C3-C6 category was not correlated with reflux patterns of the saphenous vein system (Cramer's V=0.07), incompetent SFJ (Cramer's V=0.07), deep vein reflux (Cramer's V=0.03), or the distribution of incompetent segments in the great saphenous vein (GSV) (Cramer's V=0.11). Conclusion: Duplex ultrasonography is necessary to formulate a proper treatment plan for limbs categorized as CEAP C0s-C1. The SFJ was competent in more than one-third of CEAP C2-C6 limbs with GSV reflux; as such, flush ligation of the GSV may be unnecessary in these patients. The CEAP C3-C6 category showed no correlations with reflux patterns of the saphenous vein system, SFJ reflux, deep vein reflux, or the distribution of incompetent segments in the GSV.

Research of Deep Learning-Based Multi Object Classification and Tracking for Intelligent Manager System (지능형 관제시스템을 위한 딥러닝 기반의 다중 객체 분류 및 추적에 관한 연구)

  • June-hwan Lee
    • Smart Media Journal
    • /
    • v.12 no.5
    • /
    • pp.73-80
    • /
    • 2023
  • Recently, intelligent control systems are developing rapidly in various application fields, and methods for utilizing technologies such as deep learning, IoT, and cloud computing for intelligent control systems are being studied. An important technology in an intelligent control system is recognizing and tracking objects in images. However, existing multi-object tracking technology has problems in accuracy and speed. In this paper, a real-time intelligent control system was implemented using YOLO v5 and YOLO v6 based on a one-shot architecture that increases the accuracy of object tracking and enables fast and accurate tracking even when objects overlap each other or when there are many objects belonging to the same class. The experiment was evaluated by comparing YOLO v5 and YOLO v6. As a result of the experiment, the YOLO v6 model shows performance suitable for the intelligent control system.

APPLICATION OF SUPPORT VECTOR MACHINE TO THE PREDICTION OF GEO-EFFECTIVE HALO CMES

  • Choi, Seong-Hwan;Moon, Yong-Jae;Vien, Ngo Anh;Park, Young-Deuk
    • Journal of The Korean Astronomical Society
    • /
    • v.45 no.2
    • /
    • pp.31-38
    • /
    • 2012
  • In this study we apply Support Vector Machine (SVM) to the prediction of geo-effective halo coronal mass ejections (CMEs). The SVM, which is one of machine learning algorithms, is used for the purpose of classification and regression analysis. We use halo and partial halo CMEs from January 1996 to April 2010 in the SOHO/LASCO CME Catalog for training and prediction. And we also use their associated X-ray flare classes to identify front-side halo CMEs (stronger than B1 class), and the Dst index to determine geo-effective halo CMEs (stronger than -50 nT). The combinations of the speed and the angular width of CMEs, and their associated X-ray classes are used for input features of the SVM. We make an attempt to find the best model by using cross-validation which is processed by changing kernel functions of the SVM and their parameters. As a result we obtain statistical parameters for the best model by using the speed of CME and its associated X-ray flare class as input features of the SVM: Accuracy=0.66, PODy=0.76, PODn=0.49, FAR=0.72, Bias=1.06, CSI=0.59, TSS=0.25. The performance of the statistical parameters by applying the SVM is much better than those from the simple classifications based on constant classifiers.

The Learning Styles and Curriculum for Environmental Experience-Based Learning in Classroom of the Small Scale (소규모 학급의 환경 체험 학습을 위한 학습 유형화와 그 교육 과정)

  • Kwak, Hong-Tak;Lee, Ok-Hee
    • Hwankyungkyoyuk
    • /
    • v.19 no.3
    • /
    • pp.40-56
    • /
    • 2006
  • The purpose of this study is to enhance elementary students' awareness of environment-friendly life and help them to prepare for a better life in the future. To achieve this purpose we examined the effect typical environmental experience-based learning activities, which were based on the local circumstances with high environmental-educational potential, have on the attitudes toward environment-friendly life. This study was carried out on the basis of typical environmental experience-based learning in the small class size. The research group used was composed of one sixth grade elementary school class called Sangroksu, whose total students were 9. The research period lasted from March 2005 to February 2006. To analyze the result of this study, two research methods were applied simultaneously : quantitative research methods and qualitative research methods. Especially statistical analysis in quantitative research methods by self-administrated questionnaire was done with SAS program. Qualitative research methods were analyzed in a cyclic pattern, including the processes of domain analysis, classification analysis, and factor analysis which continued to be associated with data-collecting methods. This research shows the following results. First of all, students have shown meaningful differences after typical environmental experience-based learning activities.(p<.05). Followings are fields of the differences - students‘ interest on the subject, their understanding levels of necessity for basic environmental facilities around us as well as for the kinds of environmental experience-based learning, awareness levels of various environmental problems, consciousness on environment conservation, and the practicing ability of environment - friendly lifestyles. Secondly, We have discovered improvements in the following fields after this study - the knowledge and understanding levels on our environment and human relationships, students' fundamental abilities to work out environmental problems, right ideas and appropriate attitudes on environment protection, the practicing ability of environment-friendly life styles, and their parents' understanding levels on the education related to environment. In conclusion, typical environmental experience-based learning activities have a positive effect on the improvement of elementary school students' environment-friendly life styles.

  • PDF

A Study on the Effects of Experiential Learning for Environment Based on Living Area (지역기반 환경체험학습의 효과에 관한 연구)

  • Lee, Dong-Yab;Kim, Hee-Cheol;Park, Man-Guen;An, A-Yeong;Lee, Ji-Suk;Lee, Ji-Hee;Cheong, Cheol
    • Hwankyungkyoyuk
    • /
    • v.20 no.1
    • /
    • pp.19-27
    • /
    • 2007
  • This study was intended to answer the question, 'What kinds of effects will be aroused by experiential learning for environment based on living area?'. Experiential learning for environment was operated to 17 elementary school students in 4th grade in Kyeong-san city. The results were drawn analyzing the mind map for the changes of environmental consciousness before and after learning, and they are as below. First, it had an effect to change the meaning association of the relationship between 'river and me'. Meaning association was 'river-a thing' before experiential learning, but it was developed as 'river-a thing-me' after learning. This means that students expanded understanding of the world that they were belonging and self-spatialization was promoted. The expansion of meaning association would be a start point and a method to promote their segmentation for each student. Second, students could self-directly modify misconception and preconception after experiential learning. It showed that students could find meanings in the world that they were belonging by experiential learning for environment, and misconception obtained by concept learning without actual situation could be revised through the truth recognition in meanings, and student could see what things displayed. Therefore preconception would be corrected. Of course, everything would not be completed by just one time of experiential learning, and consistent experience learning should be operated. Third, experiential learning promoted the change of sensitivity. Students had shallow sensitivity, which appeared in the relation with things, since having learned only inside of class without a direct observation. However their sensitivity could be increased by experiencing specific things. Fourth, there was the change of classification recognition. Students found properties of things with a direct observation. It raised their ability to classify things, and to understand an individual thing in 'a class'.

  • PDF

A Fuzzy Weights Decision Method based on Degree of Contribution for Recognition of Insect Footprints (곤충 발자국 인식을 위한 기여도 기반의 퍼지 가중치 결정 방법)

  • Shin, Bok-Suk;Cha, Eui-Young;Woo, Young-Woon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.12
    • /
    • pp.55-62
    • /
    • 2009
  • This paper proposes a decision method of fuzzy weights by utilizing degrees of contribution in order to classify insect footprint patterns having difficulties to classify species clearly. Insect footprints revealed delicately in the form of scattered spots since they are very small. Therefore it is not easy to define shape of footprints unlike other species, and there are lots of noises in the footprint patterns so that it is difficult to distinguish those from correct data. For these reasons, the extracted feature set has obvious feature values with some uncertain feature values, so we estimate weights according to degrees of contribution. If the one of feature values has distinct difference enough to decide a class among other classes, high weight is assigned to make classification. A calculated weight determines the membership values by fuzzy functions and objects are classified into the class having a superior value.atu present experimental resultseighrontribution. Iinsect footprints with noises by the proposed method.

Concrete Reinforcement Modeling with IFC for Automated Rebar Fabrication

  • LIU, Yuhan;AFZAL, Muhammad;CHENG, Jack C.P.;GAN, Vincent J.L.
    • International conference on construction engineering and project management
    • /
    • 2020.12a
    • /
    • pp.157-166
    • /
    • 2020
  • Automated rebar fabrication, which requires effective information exchange between model designers and fabricators, has brought the integration and interoperability of data from different sources to the notice of both academics and industry practitioners. Industry Foundation Classes (IFC) was one of the most commonly used data formats to represent the semantic information of prefabricated components in buildings, whereas the data format utilized by rebar fabrication machine is BundesVereinigung der Bausoftware (BVBS), which is a numerical data structure exchanging reinforcement information through ASCII encoded files. Seamless transformation between IFC and BVBS empowers the automated rebar fabrication and improve the construction productivity. In order to improve data interoperability between IFC and BVBS, this study presents an IFC extension based on the attributes required by automated rebar fabrication machines with the help of Information Delivery Manual (IDM) and Model View Definition (MVD). IDM is applied to describe and display the information needed for the design, construction and operation of projects, whereas MVD is a subset of IFC schema used to describe the automated rebar fabrication workflow. Firstly, with a rich pool of vocabularies practitioners, OmniClass is used in information exchange between IFC and BVBS, providing a hierarchy classification structure for reinforcing elements. Then, using International Framework for Dictionaries (IFD), the usage of each attribute is defined in a more consistent manner to assist the data mapping process. Besides, in order to address missing information within automated fabrication process, a schematic data mapping diagram has been made to deliver IFC information from BIM models to BVBS format for better data interoperability among different software agents. A case study based on the data mapping will be presented to demonstrate the proposed IFC extension and how it could assist/facilitate the information management.

  • PDF