• Title/Summary/Keyword: Self Organizing Map

Search Result 425, Processing Time 0.024 seconds

A Customer Segmentation Scheme Base on Big Data in a Bank (빅데이터를 활용한 은행권 고객 세분화 기법 연구)

  • Chang, Min-Suk;Kim, Hyoung Joong
    • Journal of Digital Contents Society
    • /
    • v.19 no.1
    • /
    • pp.85-91
    • /
    • 2018
  • Most banks use only demographic information such as gender, age, occupation and address to segment customers, but they do not reflect financial behavior patterns of customers. In this study, we aim to solve the problems by using various big data in a bank and to develop customer segmentation method which can be widely used in many banks in the future. In this paper, we propose an approach of segmenting clustering blocks with bottom-up method. This method has an advantage that it can accurately reflect various financial needs of customers based on various transaction patterns, channel contact patterns, and existing demographic information. Based on this, we will develop various marketing models such as product recommendation, financial need rating calculation, and customer churn-out prediction based on this, and we will adapt this models for the marketing strategy of NH Bank.

Runtime Prediction Based on Workload-Aware Clustering (병렬 프로그램 로그 군집화 기반 작업 실행 시간 예측모형 연구)

  • Kim, Eunhye;Park, Ju-Won
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.38 no.3
    • /
    • pp.56-63
    • /
    • 2015
  • Several fields of science have demanded large-scale workflow support, which requires thousands of CPU cores or more. In order to support such large-scale scientific workflows, high capacity parallel systems such as supercomputers are widely used. In order to increase the utilization of these systems, most schedulers use backfilling policy: Small jobs are moved ahead to fill in holes in the schedule when large jobs do not delay. Since an estimate of the runtime is necessary for backfilling, most parallel systems use user's estimated runtime. However, it is found to be extremely inaccurate because users overestimate their jobs. Therefore, in this paper, we propose a novel system for the runtime prediction based on workload-aware clustering with the goal of improving prediction performance. The proposed method for runtime prediction of parallel applications consists of three main phases. First, a feature selection based on factor analysis is performed to identify important input features. Then, it performs a clustering analysis of history data based on self-organizing map which is followed by hierarchical clustering for finding the clustering boundaries from the weight vectors. Finally, prediction models are constructed using support vector regression with the clustered workload data. Multiple prediction models for each clustered data pattern can reduce the error rate compared with a single model for the whole data pattern. In the experiments, we use workload logs on parallel systems (i.e., iPSC, LANL-CM5, SDSC-Par95, SDSC-Par96, and CTC-SP2) to evaluate the effectiveness of our approach. Comparing with other techniques, experimental results show that the proposed method improves the accuracy up to 69.08%.

Postprocessing Algorithm of Fingerprint Image Using Isometric SOM Neural Network (Isometric SOM 신경망을 이용한 지문 영상의 후처리 알고리듬)

  • Kim, Sang-Hee;Kim, Yung-Jung;Lee, Sung-Koo
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.45 no.5
    • /
    • pp.110-116
    • /
    • 2008
  • This paper presents a new postprocessing method to eliminate the false minutiae, that caused by the skelectonization of fingerprint image, and an image compression method using Isometric Self Organizing Map(ISOSOM). Since the SOM has simple structure, fast encoding time, and relatively good classification characteristics, many image processing areas adopt this such as image compression and pattern classification, etc. But, the SOM shows limited performances in pattern classification because of it's single layer structure. To maximize the performance of the pattern classification with small code book, we a lied the Isometric SOM with the isometry of the fractal theory. The proposed Isometric SOM postprocessing and compression algorithm of fingerprint image showed good performances in the elimination of false minutiae and the image compression simultaneously.

Customer's Job Identification using the Usage Patterns of Mobile Telecommunication (이동통신 사용패턴을 이용한 고객의 직업판정)

  • Lee Jae Sik;Cho You Jung
    • Journal of Intelligence and Information Systems
    • /
    • v.10 no.3
    • /
    • pp.115-132
    • /
    • 2004
  • Recently, as most companies recognize the importance of the customer relationship management, they strongly believe that they must know who their customers are. The job of a customer is very important information for us to understand the customer. However, since most customers are reluctant to reveal them-selves, they do not let us know their jobs, and even provide false information about their jobs. The target domain of our research is mobile telecommunication. In this research, we developed a system that identifies the customer's job by utilizing the Call Detail Record. Using artificial neural networks, we developed a two-step Job Identification System. In the first step, it identifies the four job classes, then in the second step, it subdivides these four job classes into seven jobs. The accuracy of identifying the seven jobs was $71.9\%$.

  • PDF

A Personalized Dietary Coaching Method Using Food Clustering Analysis (음식 군집분석을 통한 개인맞춤형 식이 코칭 기법)

  • Oh, Yoori;Choi, Jieun;Kim, Yoonhee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.6
    • /
    • pp.289-294
    • /
    • 2016
  • In recent times, as most people develop keen interest in health management, the importance of cultivating dietary habits to prevent various chronic diseases is emphasized. Subsequently, dietary management systems using a variety of mobile and web application interfaces have emerged. However, these systems are difficult to apply in real world and also do not provide personalized information reflective of the user's situation. Hence it is necessary to develop a personalized dietary management and recommendation method that considers user's body state information, food analysis and other essential statistics. In this paper, we analyze nutrition using self-organizing map (SOM) and prepare data about nutrition using clustering. We provide a substitute food recommendation method and also give feedback about the food that user wants to eat based on personalized criteria. The experiment results show that the distance between input food and recommended food of the proposed method is short compared to the recommended food results using general methods and proved that nutritional similar food is recommended.

Considering Customer Buying Sequences to Enhance the Quality of Collaborative Filtering (구매순서를 고려한 개선된 협업필터링 방법론)

  • Cho, Yeong-Bin;Cho, Yoon-Ho
    • Journal of Intelligence and Information Systems
    • /
    • v.13 no.2
    • /
    • pp.69-80
    • /
    • 2007
  • The preferences of customers change over time. However, existing collaborative filtering (CF) systems are static, since they only incorporate information regarding whether a customer buys a product during a certain period and do not make use of the purchase sequences of customers. Therefore, the quality of the recommendations of the typical CF could be improved through the use of information on such sequences. In this study, we propose a new methodology for enhancing the quality of CF recommendation that uses customer purchase sequences. The proposed methodology is applied to a large department store in Korea and compared to existing CF techniques. Various experiments using real-world data demonstrate that the proposed methodology provides higher quality recommendations than do typical CF techniques with better performance.

  • PDF

A Study on Clustering Representative Color of Natural Environment of Korean Peninsula for Optimal Camouflage Pattern Design (최적 위장무늬 디자인을 위한 한반도 자연환경 대표 색상 군집화 연구)

  • Chun, Sungkuk;Kim, Hoemin;Yoon, Seon Kyu;Yun, Jeongrok;Kim, Un Yong
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2019.07a
    • /
    • pp.315-316
    • /
    • 2019
  • 전투복, 군용 천막 등에 사용되는 위장무늬는 군 작전 수행 시 주변 환경의 색상, 패턴을 모사하여 개인병사 및 무기체계의 위장 기능을 극대화하고, 이를 통해 아군의 생명과 시설피해를 최소화하기 위한 목적으로 사용된다. 특히 최근 들어 군의 작전환경과 임무가 복잡하고 다양해짐에 따라, 작전환경에 대한 데이터의 취득 및 정량적 분석을 통해 전장 환경에 최적화된 위장무늬 패턴 및 색상 추출에 대한 연구의 필요성이 증대되고 있다. 본 논문에서는 한반도 자연환경 영상에 대한 자기 조직화 지도(SOM, Self-organizing Map) 기반의 한반도 자연환경 대표 색상 군집화 연구 방법에 대해 서술한다. 이를 위해 한반도 내 위도를 고려한 장소에서 시간별, 계절별 자연환경 영상 수집을 진행하며, 수집된 영상 내 다수의 화소의 군집화를 위해 2차원 SOM을 활용한다. 영상 내 각 화소의 색상 값에 대한 SOM의 학습 시, RGB공간상의 색차/색상 인지 왜곡을 피하기 위하여 CIEDE2000 색차 식을 통해 군집화를 진행한다. 실험결과에서는 온라인상으로 수집한 여름 및 가을철 대표 색상 군집화 결과와, 현재까지 수집된 계절별 자연환경 사진 내 6억 7648개 화소에 대한 대표 색상 군집화 결과를 보여준다.

  • PDF

Using Tower Flux Data to Assess the Impact of Land Use and Land Cover Change on Carbon Exchange in Heterogeneous Haenam Cropland (비균질한 해남 농경지의 탄소교환에 미치는 토지사용 및 피복변화의 영향에 대한 미기상학 자료의 활용에 관하여)

  • Indrawati, Yohana Maria;Kang, Minseok;Kim, Joon
    • Proceedings of The Korean Society of Agricultural and Forest Meteorology Conference
    • /
    • 2013.11a
    • /
    • pp.30-31
    • /
    • 2013
  • Land use and land cover change (LULCC) due to human activities directly affects natural systems and contributes to changes in carbon exchange and climate through a range of feedbacks. How land use and land cover changes affect carbon exchanges can be assessed using multiyear measurement data from micrometeorological flux towers. The objective of the research is to assess the impact of land use and land cover change on carbon exchange in a heterogeneous cropland area. The heterogeneous cropland area in Haenam, South Korea is also subjected to a land conversion due to rural development. Therefore, the impact of the change in land utilization in this area on carbon exchange should be assessed to monitor the cycle of energy, water, and carbon dioxide between this key agricultural ecosystem and the atmosphere. We are currently conducting the research based on 10 years flux measurement data from Haenam Koflux site and examining the LULCC patterns in the same temporal scale to evaluate whether the LULCC in the surrounding site and the resulting heterogeneity (or diversity) have a significant impact on carbon exchange. Haenam cropland is located near the southwestern coast of the Korean Peninsula with land cover types consisting of scattered rice paddies and various croplands (seasonally cultivated crops). The LULCC will be identified and quantified using remote sensing satellite data and then analyzing the relationships between LULCC and flux footprint of $CO_2$ from tower flux measurement. We plan to calculate annual flux footprint climatology map from 2003 to 2012 from the 10 years flux observation database. Eventually, these results will be used to quantify how the system's effective performance and reserve capacity contribute to moving the system towards more sustainable configuration. Broader significance of this research is to understand the co-evolution of the Haenam agricultural ecosystem and its societal counterpart which are assumed to be self-organizing hierarchical open systems.

  • PDF

Financial Fraud Detection using Data Mining: A Survey

  • Sudhansu Ranjan Lenka;Bikram Kesari Ratha
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.9
    • /
    • pp.169-185
    • /
    • 2024
  • Due to levitate and rapid growth of E-Commerce, most of the organizations are moving towards cashless transaction Unfortunately, the cashless transactions are not only used by legitimate users but also it is used by illegitimate users and which results in trouncing of billions of dollars each year worldwide. Fraud prevention and Fraud Detection are two methods used by the financial institutions to protect against these frauds. Fraud prevention systems (FPSs) are not sufficient enough to provide fully security to the E-Commerce systems. However, with the combined effect of Fraud Detection Systems (FDS) and FPS might protect the frauds. However, there still exist so many issues and challenges that degrade the performances of FDSs, such as overlapping of data, noisy data, misclassification of data, etc. This paper presents a comprehensive survey on financial fraud detection system using such data mining techniques. Over seventy research papers have been reviewed, mainly within the period 2002-2015, were analyzed in this study. The data mining approaches employed in this research includes Neural Network, Logistic Regression, Bayesian Belief Network, Support Vector Machine (SVM), Self Organizing Map(SOM), K-Nearest Neighbor(K-NN), Random Forest and Genetic Algorithm. The algorithms that have achieved high success rate in detecting credit card fraud are Logistic Regression (99.2%), SVM (99.6%) and Random Forests (99.6%). But, the most suitable approach is SOM because it has achieved perfect accuracy of 100%. But the algorithms implemented for financial statement fraud have shown a large difference in accuracy from CDA at 71.4% to a probabilistic neural network with 98.1%. In this paper, we have identified the research gap and specified the performance achieved by different algorithms based on parameters like, accuracy, sensitivity and specificity. Some of the key issues and challenges associated with the FDS have also been identified.

Status and Implications of Hydrogeochemical Characterization of Deep Groundwater for Deep Geological Disposal of High-Level Radioactive Wastes in Developed Countries (고준위 방사성 폐기물 지질처분을 위한 해외 선진국의 심부 지하수 환경 연구동향 분석 및 시사점 도출)

  • Jaehoon Choi;Soonyoung Yu;SunJu Park;Junghoon Park;Seong-Taek Yun
    • Economic and Environmental Geology
    • /
    • v.55 no.6
    • /
    • pp.737-760
    • /
    • 2022
  • For the geological disposal of high-level radioactive wastes (HLW), an understanding of deep subsurface environment is essential through geological, hydrogeological, geochemical, and geotechnical investigations. Although South Korea plans the geological disposal of HLW, only a few studies have been conducted for characterizing the geochemistry of deep subsurface environment. To guide the hydrogeochemical research for selecting suitable repository sites, this study overviewed the status and trends in hydrogeochemical characterization of deep groundwater for the deep geological disposal of HLW in developed countries. As a result of examining the selection process of geological disposal sites in 8 countries including USA, Canada, Finland, Sweden, France, Japan, Germany, and Switzerland, the following geochemical parameters were needed for the geochemical characterization of deep subsurface environment: major and minor elements and isotopes (e.g., 34S and 18O of SO42-, 13C and 14C of DIC, 2H and 18O of water) of both groundwater and pore water (in aquitard), fracture-filling minerals, organic materials, colloids, and oxidation-reduction indicators (e.g., Eh, Fe2+/Fe3+, H2S/SO42-, NH4+/NO3-). A suitable repository was selected based on the integrated interpretation of these geochemical data from deep subsurface. In South Korea, hydrochemical types and evolutionary patterns of deep groundwater were identified using artificial neural networks (e.g., Self-Organizing Map), and the impact of shallow groundwater mixing was evaluated based on multivariate statistics (e.g., M3 modeling). The relationship between fracture-filling minerals and groundwater chemistry also has been investigated through a reaction-path modeling. However, these previous studies in South Korea had been conducted without some important geochemical data including isotopes, oxidationreduction indicators and DOC, mainly due to the lack of available data. Therefore, a detailed geochemical investigation is required over the country to collect these hydrochemical data to select a geological disposal site based on scientific evidence.