• Title/Summary/Keyword: Case-Based Reasoning Algorithm

Search Result 80, Processing Time 0.03 seconds

Response Modeling for the Marketing Promotion with Weighted Case Based Reasoning Under Imbalanced Data Distribution (불균형 데이터 환경에서 변수가중치를 적용한 사례기반추론 기반의 고객반응 예측)

  • Kim, Eunmi;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.29-45
    • /
    • 2015
  • Response modeling is a well-known research issue for those who have tried to get more superior performance in the capability of predicting the customers' response for the marketing promotion. The response model for customers would reduce the marketing cost by identifying prospective customers from very large customer database and predicting the purchasing intention of the selected customers while the promotion which is derived from an undifferentiated marketing strategy results in unnecessary cost. In addition, the big data environment has accelerated developing the response model with data mining techniques such as CBR, neural networks and support vector machines. And CBR is one of the most major tools in business because it is known as simple and robust to apply to the response model. However, CBR is an attractive data mining technique for data mining applications in business even though it hasn't shown high performance compared to other machine learning techniques. Thus many studies have tried to improve CBR and utilized in business data mining with the enhanced algorithms or the support of other techniques such as genetic algorithm, decision tree and AHP (Analytic Process Hierarchy). Ahn and Kim(2008) utilized logit, neural networks, CBR to predict that which customers would purchase the items promoted by marketing department and tried to optimized the number of k for k-nearest neighbor with genetic algorithm for the purpose of improving the performance of the integrated model. Hong and Park(2009) noted that the integrated approach with CBR for logit, neural networks, and Support Vector Machine (SVM) showed more improved prediction ability for response of customers to marketing promotion than each data mining models such as logit, neural networks, and SVM. This paper presented an approach to predict customers' response of marketing promotion with Case Based Reasoning. The proposed model was developed by applying different weights to each feature. We deployed logit model with a database including the promotion and the purchasing data of bath soap. After that, the coefficients were used to give different weights of CBR. We analyzed the performance of proposed weighted CBR based model compared to neural networks and pure CBR based model empirically and found that the proposed weighted CBR based model showed more superior performance than pure CBR model. Imbalanced data is a common problem to build data mining model to classify a class with real data such as bankruptcy prediction, intrusion detection, fraud detection, churn management, and response modeling. Imbalanced data means that the number of instance in one class is remarkably small or large compared to the number of instance in other classes. The classification model such as response modeling has a lot of trouble to recognize the pattern from data through learning because the model tends to ignore a small number of classes while classifying a large number of classes correctly. To resolve the problem caused from imbalanced data distribution, sampling method is one of the most representative approach. The sampling method could be categorized to under sampling and over sampling. However, CBR is not sensitive to data distribution because it doesn't learn from data unlike machine learning algorithm. In this study, we investigated the robustness of our proposed model while changing the ratio of response customers and nonresponse customers to the promotion program because the response customers for the suggested promotion is always a small part of nonresponse customers in the real world. We simulated the proposed model 100 times to validate the robustness with different ratio of response customers to response customers under the imbalanced data distribution. Finally, we found that our proposed CBR based model showed superior performance than compared models under the imbalanced data sets. Our study is expected to improve the performance of response model for the promotion program with CBR under imbalanced data distribution in the real world.

EXPERT SYSTEM FOR A NUCLEAR POWER PLANT ACCIDENT DIAGNOSIS USING A FUZZY INFERENCE METHOD

  • Lee, Mal-Rey;Oh, Jong-Chul
    • Journal of applied mathematics & informatics
    • /
    • v.8 no.2
    • /
    • pp.505-518
    • /
    • 2001
  • The huge and complicated plants such as nuclear power stations are likely to cause the operators to make mistakes due to a variety of inexplicable reasons and symptoms in case of emergency. That’s why the prevention system assisting the operators is being developed for. First of all. I suggest an improved fuzzy diagnosis. Secondly, I want to demonstrate that a classification system of nuclear plant’s accident investigating the causes of accidents foresees possible problems, and maintains the reliability of the diagnostic reports in spite of improper working in part. In the event of emergency in a nuclear plant, a lot of operational steps enable the operators to find out what caused the problems based on an emergent operating plan. Our system is able to classify their types within twenty to thirty seconds. As so, we expect the system to put down the accidents right after the rapid detection of the damage control-method concerned.

On the Design of R&D Proposal Screening System (연구제안서 스크리닝 시스템의 설계에 관한 연구)

  • 최창우;김선우;김혜리;박용태
    • Proceedings of the Technology Innovation Conference
    • /
    • 2003.06a
    • /
    • pp.3-11
    • /
    • 2003
  • As the size and scope of R&D investment explodes, the strategic and managerial importance of R&D proposal screening becomes highlighted. This point is particularly true for a large-scale research center that deals with multi-product and multi-technology R&D projects. Despite the importance, however, previous research has focused on project evaluation and selection stage. In this research, we propose a R&D proposal screening system. The main objective of the system is to filter R&D proposals that are identified to be duplications of past or existing projects. To this end, the algorithm of the system employs text mining, multivariate statistical method, and case-based reasoning.

  • PDF

Development of Music Recommendation System based on Customer Sentiment Analysis (소비자 감성 분석 기반의 음악 추천 알고리즘 개발)

  • Lee, Seung Jun;Seo, Bong-Goon;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.197-217
    • /
    • 2018
  • Music is one of the most creative act that can express human sentiment with sound. Also, since music invoke people's sentiment to get empathized with it easily, it can either encourage or discourage people's sentiment with music what they are listening. Thus, sentiment is the primary factor when it comes to searching or recommending music to people. Regard to the music recommendation system, there are still lack of recommendation systems that are based on customer sentiment. An algorithm's that were used in previous music recommendation systems are mostly user based, for example, user's play history and playlists etc. Based on play history or playlists between multiple users, distance between music were calculated refer to basic information such as genre, singer, beat etc. It can filter out similar music to the users as a recommendation system. However those methodology have limitations like filter bubble. For example, if user listen to rock music only, it would be hard to get hip-hop or R&B music which have similar sentiment as a recommendation. In this study, we have focused on sentiment of music itself, and finally developed methodology of defining new index for music recommendation system. Concretely, we are proposing "SWEMS" index and using this index, we also extracted "Sentiment Pattern" for each music which was used for this research. Using this "SWEMS" index and "Sentiment Pattern", we expect that it can be used for a variety of purposes not only the music recommendation system but also as an algorithm which used for buildup predicting model etc. In this study, we had to develop the music recommendation system based on emotional adjectives which people generally feel when they listening to music. For that reason, it was necessary to collect a large amount of emotional adjectives as we can. Emotional adjectives were collected via previous study which is related to them. Also more emotional adjectives has collected via social metrics and qualitative interview. Finally, we could collect 134 individual adjectives. Through several steps, the collected adjectives were selected as the final 60 adjectives. Based on the final adjectives, music survey has taken as each item to evaluated the sentiment of a song. Surveys were taken by expert panels who like to listen to music. During the survey, all survey questions were based on emotional adjectives, no other information were collected. The music which evaluated from the previous step is divided into popular and unpopular songs, and the most relevant variables were derived from the popularity of music. The derived variables were reclassified through factor analysis and assigned a weight to the adjectives which belongs to the factor. We define the extracted factors as "SWEMS" index, which describes sentiment score of music in numeric value. In this study, we attempted to apply Case Based Reasoning method to implement an algorithm. Compare to other methodology, we used Case Based Reasoning because it shows similar problem solving method as what human do. Using "SWEMS" index of each music, an algorithm will be implemented based on the Euclidean distance to recommend a song similar to the emotion value which given by the factor for each music. Also, using "SWEMS" index, we can also draw "Sentiment Pattern" for each song. In this study, we found that the song which gives a similar emotion shows similar "Sentiment Pattern" each other. Through "Sentiment Pattern", we could also suggest a new group of music, which is different from the previous format of genre. This research would help people to quantify qualitative data. Also the algorithms can be used to quantify the content itself, which would help users to search the similar content more quickly.

A Method of Assigning Weight Values for Qualitative Attributes in CBR Cost Model (사례기반추론 코스트 모델의 정성변수 속성가중치 산정방법)

  • Lee, Hyun-Soo;Kim, Soo-Young;Park, Moon-Seo;Ji, Sae-Hyun;Seong, Ki-Hoon;Pyeon, Jae-Ho
    • Korean Journal of Construction Engineering and Management
    • /
    • v.12 no.1
    • /
    • pp.53-61
    • /
    • 2011
  • For construction projects, the importance of early cost estimates is highly recognized by the project team and sponsoring organization because early cost estimates are frequently a foundation of business decisions as well as a basis for identifying any changes as the project progresses from design to construction. However, it is difficult to accurately estimate construction cost in the early stage of a project due to various uncertainties in construction. To deal with these uncertainties, cost estimates should be made several times over the course of the project. In particular, early cost estimates are essential process for successful project management. For accurate construction cost estimates, it is necessary to compare cost estimates with actual costs based on historical project data. In this context, case-based reasoning (CBR), which is the process of solving new problems based on the solutions of similar past problems, can be considered as an effective method for cost estimating. To obtain this, it is also required to define the attribute similarities and the attribute weights. However, no existing method is capable of determining attribute weights of qualitative variables. Consequently, it has been a well-known barrier of accurate early cost estimates. Using Genetic Algorithms (GA), this research suggests the method of determining the attribute weight of qualitative variables. Based on building project case studies, the proposed methodology was validated.

Decision Support Method in Dynamic Car Navigation Systems by Q - Learning

  • Hong, Soo-Jung;Hong, Eon-Joo;Oh, Kyung-Whan
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2002.05a
    • /
    • pp.6-9
    • /
    • 2002
  • 오랜 세월동안 위대한 이동수단을 만들어내고자 하는 인간의 끓은 오늘날 눈부신 각종 운송기구를 만들어 내는 결실을 얻고 있다. 자동차 네비게이션 시스템도 그러한 결실중의 한 예라고 할 수 있을 것이다. 지능적으로 판단하고 정보를 처리할 수 있는 자동차 네비게이션 시스템을 부착함으로써 한단계 발전한 운송수단으로 진화할 수 있을 것이다. 이러한 자동차 네비게이션 시스템의 단점이라면 한정된 리 소스만으로 여러 가지 작업을 수행해야만 하는 어려움이다. 그래서 네비게이션 시스템의 주요 작업중의 하나인 경로를 추출하는 경로추출(Route Planing) 작업은 한정된 리 소스에서도 최적의 경로를 찾을 수 있는 지능적인 방법이어야만 한다. 이러한 경로를 추출하는 작업을 하는 데 기존에 일반적으로 쓰였던 두 가지 방법에는 Dijkstra's algorithm과 A* algorithm이 있다. 이 두 방법은 최적의 경로를 찾아 낸다는 점은 있지만 경로를 찾기 위해서 알고리즘의 특성상 각각, 넓은 영역에 대하여 탐색작업을 해야하고 또한 수행시간이 많이 걸린다는 단점과 또한 경로를 계산하기 위해서 Heuristic function을 추가적인 정보로 계산을 해야 한다는 단점이 있다. 본 논문에서는 적은 탐색 영역을 가지면서 또한 최적의 경로를 추출하는 데 드는 수행시간은 작으며 나아가 동적인 교통환경에서도 최적의 경로를 추출할 수 있는 최적 경로 추출방법을 강화학습의 일종인 Q- Learning을 이용하여 구현해 보고자 한다.

  • PDF

u-Mentoring System에서 속성 온톨로지와 CBR을 사용한 M3 알고리즘

  • Son, Mi-Ae;Gang, Cho-Rong
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2007.11a
    • /
    • pp.479-486
    • /
    • 2007
  • 멘토링은 조직이나 사회 구성원들의 발전을 돕기 위한 프로그램으로서, 조언자, 상담자 및 후원자 역할을 하는 '멘토(mentor)'와 도움을 얻고자 하는 '멘티(mentee)'가 긴밀한 관계를 맺고 유지함으로써 상호 발전을 위해 수행된다. 현재 이루어지고 있는 대부분의 멘토링은 면대면 (face-to-face) 시스템이거나 웹 기반의 e-mentoring 시스템으로, 전자는 시간적 그리고 지역적 한계를 극복해야만 하고 후자는 멘토나 멘티가 멘토링 사이트에 접속하여 게시판을 확인하지 않으면 제대로 된 멘토링을 수행할 수 없다는 한계를 가지고 있다. 또한 멘토와 멘티의 매칭은 무작위로 이루어지거나 코디네이터라고 불리는 사람이 수행하기 때문에, 비용이 많이 소용될 뿐 아니라 개인적인 편견이나 오류가 개입될 여지가 상존한다. 이에 본 연구에서는 시간과 장소의 제약에 구애 받지 않는 u-Mentoring 시스템을 개발하고자 하며, 그 첫 단계로써 멘토와 멘티간의 매칭을 지원하는 새로운 알고리즘(M3 Algorithm, Mentor-Mentee Matching Algorithm)을 제안하고자 한다. 본 연구에서 제안하는 알고리즘은 매칭의 정확도와 멘토-멘티의 매칭 만족도를 높이기 위해 멘토-멘티 온톨로지(M-Ontology)와 사례기반추론 기법을 사용하였다. 즉, 멘토-멘티의 효과적인 매칭을 위해, 멘토-멘티간 매칭 사례가 없는 초기 단계에는 멘토와 멘티의 속성 비교를 통한 추천 방식을 사용하고, 멘토링이 종료되어 충분한 멘토-멘티간 매칭사례가 수집되면 그 결과를 재사용해 추후 매칭에 활용한다. 본 논문에서는 제안한 매칭 알고리즘이 내장된 u-Mentoring system의 포로토타입을 보여주고자 한다.

  • PDF

Optimizing Similarity Threshold and Coverage of CBR (사례기반추론의 유사 임계치 및 커버리지 최적화)

  • Ahn, Hyunchul
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.8
    • /
    • pp.535-542
    • /
    • 2013
  • Since case-based reasoning(CBR) has many advantages, it has been used for supporting decision making in various areas including medical checkup, production planning, customer classification, and so on. However, there are several factors to be set by heuristics when designing effective CBR systems. Among these factors, this study addresses the issue of selecting appropriate neighbors in case retrieval step. As the criterion for selecting appropriate neighbors, conventional studies have used the preset number of neighbors to combine(i.e. k of k-nearest neighbor), or the relative portion of the maximum similarity. However, this study proposes to use the absolute similarity threshold varying from 0 to 1, as the criterion for selecting appropriate neighbors to combine. In this case, too small similarity threshold value may make the model rarely produce the solution. To avoid this, we propose to adopt the coverage, which implies the ratio of the cases in which solutions are produced over the total number of the training cases, and to set it as the constraint when optimizing the similarity threshold. To validate the usefulness of the proposed model, we applied it to a real-world target marketing case of an online shopping mall in Korea. As a result, we found that the proposed model might significantly improve the performance of CBR.

Evaluating LIMU System Quality with Interval Evidence and Input Uncertainty

  • Xiangyi Zhou;Zhijie Zhou;Xiaoxia Han;Zhichao Ming;Yanshan Bian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.11
    • /
    • pp.2945-2965
    • /
    • 2023
  • The laser inertial measurement unit is a precision device widely used in rocket navigation system and other equipment, and its quality is directly related to navigation accuracy. In the quality evaluation of laser inertial measurement unit, there is inevitably uncertainty in the index input information. First, the input numerical information is in interval form. Second, the index input grade and the quality evaluation result grade are given according to different national standards. So, it is a key step to transform the interval information input by the index into the data form consistent with the evaluation result grade. In the case of uncertain input, this paper puts forward a method based on probability distribution to solve the problem of asymmetry between the reference grade given by the index and the evaluation result grade when evaluating the quality of laser inertial measurement unit. By mapping the numerical relationship between the designated reference level and the evaluation reference level of the index information under different distributions, the index evidence symmetrical with the evaluation reference level is given. After the uncertain input information is transformed into evidence of interval degree distribution by this method, the information fusion of interval degree distribution evidence is carried out by interval evidential reasoning algorithm, and the evaluation result is obtained by projection covariance matrix adaptive evolution strategy optimization. Taking a five-meter redundant laser inertial measurement unit as an example, the applicability and effectiveness of this method are verified.

The development of automatic system using multimodel in hazard analysis (위험성 분석에서의 다중모델을 이용한 자동화 시스템의 개발)

  • Kang Kyung Wook;Kang Byung Kwan;Suh Jung Chul;Yoon En Sup
    • Journal of the Korean Institute of Gas
    • /
    • v.1 no.1
    • /
    • pp.87-94
    • /
    • 1997
  • There are many kinds of complicated equipments in the chemical plants. So the chemical plants have high possibility of accidents. Hazard analysis is one of the basic tasks to ensure the safety of chemical plants. However, it has many shortcomings. To overcome the problems, there have been attempts to automate this work by utilizing computer technology, particularly knowledge-based technique. However, many of the past approaches are lacking in properties: safeguard consideration, accident diversity, cause and consequence diversity, pathway leading to accidents, and various hazard analysis reasoning. Therefore, in this study, three analysis algorithms were proposed using multimodel approach, and a hazard analysis system, AHA, was developed on G2. The case study was solved with AHA system successfully.

  • PDF