• Title/Abstract/Keywords: Business Classification Systems

341 search results (processing time: 0.028 seconds)

An Application of Support Vector Machines to Customer Loyalty Classification of Korean Retailing Company Using R Language

  • 응위엔푸티엔;이영찬
    • 한국정보시스템학회지:정보시스템연구 / Vol. 26, No. 4 / pp.17-37 / 2017
  • Purpose: Customer loyalty is the most important factor in customer relationship management (CRM), especially in the retailing industry, where customers have many options for where to spend their money. Classifying loyal customers from customer data can help retailing companies build more efficient marketing strategies and gain competitive advantages. This study aims to construct classification models that distinguish loyal customers within a Korean retailing company using data mining techniques in the R language. Design/methodology/approach: To classify retailing customers, we used a combination of support vector machines (SVMs) and other machine learning (ML) classification algorithms, supported by recursive feature elimination (RFE). In particular, we first cleaned the dataset to remove outliers and impute missing values. We then used an RFE framework to select the most significant predictors. Finally, we constructed models with the classification algorithms, tuned their parameters, and compared their performance. Findings: The results reveal that ML classification techniques can work well with CRM data in the Korean retailing industry. Moreover, customer loyalty is affected not only by a single factor such as the net promoter score but also by purchase habits such as a preference for expensive goods or visits to multiple branches. We also show that, on the retailing customer dataset, the model constructed with the SVM algorithm performs better than the others. We expect that the models in this study can be used by other retailing companies to classify their customers so that they can focus their services on this potential VIP group. We also hope that the results of applying these ML algorithms in R will be useful to other researchers in selecting appropriate ML algorithms.
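
The recursive feature elimination step described in this abstract can be sketched roughly as follows. This is an illustration only, not the authors' R code: a bias-free perceptron stands in for the linear SVM so that the example needs nothing beyond the Python standard library, and the function names and toy data are made up for the sketch.

```python
from typing import List, Sequence


def train_perceptron(X: Sequence[Sequence[float]], y: Sequence[int],
                     epochs: int = 10) -> List[float]:
    """Train a bias-free perceptron and return one weight per feature."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi))
            if yi * score <= 0:  # misclassified or on the boundary: update
                w = [wj + yi * xj for wj, xj in zip(w, xi)]
    return w


def recursive_feature_elimination(X: Sequence[Sequence[float]],
                                  y: Sequence[int],
                                  n_keep: int) -> List[int]:
    """Return the indices of the n_keep features that survive elimination."""
    remaining = list(range(len(X[0])))
    while len(remaining) > n_keep:
        # Refit the model on the surviving features only.
        X_sub = [[row[j] for j in remaining] for row in X]
        w = train_perceptron(X_sub, y)
        # Drop the feature whose weight has the smallest magnitude.
        weakest = min(range(len(remaining)), key=lambda k: abs(w[k]))
        remaining.pop(weakest)
    return remaining


# Toy data (made up): feature 0 determines the label, feature 1 is weak,
# and feature 2 is constant and carries no information.
X = [[2.0, 0.5, 0.0], [2.0, -0.5, 0.0], [-2.0, 0.5, 0.0], [-2.0, -0.5, 0.0]]
y = [1, 1, -1, -1]
selected = recursive_feature_elimination(X, y, n_keep=1)
print(selected)
```

In practice the ranking model would be the classifier under study (for example a linear SVM), and the number of features to keep would be chosen by cross-validation rather than fixed in advance.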

머신러닝 기반의 기업 리뷰 다중 분류: 부분 문법 적용을 중심으로 (Multi-Label Classification for Corporate Review Text: A Local Grammar Approach)

  • 백혜연;장영균
    • 경영정보학연구 / Vol. 25, No. 3 / pp.27-41 / 2023
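  • Recently, research on machine learning has been active in many fields, and a considerable share of these studies propose new methodologies for improving the performance of learning models. Noting that improving the "quality" of the training data fed into machine learning is as important as developing methodologies, this study presents a way to improve training data quality through the "local grammar" process frequently used in corpus analysis. We collected a large volume of unstructured corporate review text posted on a recruitment platform by employees of Korea's top 100 companies, improved the data quality with the local grammar process, and confirmed that the classification model with local grammar applied outperforms the model without it. The classification categories were set as five factors of employee engagement, and we analyzed whether the frequency of Korean employees' corporate reviews differs across these categories. In addition, we analyzed how review patterns changed before and after the COVID-19 pandemic. By automatically identifying and classifying the vivid workplace experiences of Korean employees, this study provides a basis for inferring the patterns and causes of major organizational culture phenomena, including turnover.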

Applications of Machine Learning Models on Yelp Data

  • Ruchi Singh;Jongwook Woo
    • Asia pacific journal of information systems / Vol. 29, No. 1 / pp.35-49 / 2019
  • This paper documents the application of relevant machine learning (ML) models to the Yelp dataset (Yelp is a crowd-sourced local business review and social networking site) to analyze, predict, and recommend businesses. Two cloud platforms were used strategically to minimize the effort and time required for the project: seven machine learning algorithms were implemented in Azure ML, four of which were also implemented in Databricks Spark ML. The analyzed Yelp business dataset contained 70 business attributes for more than 350,000 registered businesses. Additionally, review tips and likes from 500,000 users were processed for the project. A recommendation model was built to provide Yelp users with recommendations for business categories based on their previous business ratings as well as the business ratings of other users. A classification model was implemented to predict the popularity of a business, defining a popular business as one with more than 3 stars and an unpopular business as one with fewer than 3 stars. A text analysis model was developed by comparing two algorithms, uni-gram feature extraction and n-feature extraction in Azure ML Studio, and a logistic regression model in Spark. Comparative conclusions are drawn about the efficiency of Spark ML and Azure ML for these models.
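
As a small illustration of two ideas in this abstract, the sketch below labels a business from its star rating using the stated thresholds and extracts word uni-grams from review text. It is not the authors' Azure ML or Spark code; the function names are hypothetical, leaving exactly 3 stars unlabeled is an assumption (the abstract does not say how that case is handled), and the bigram call is shown only as a generalization of uni-grams.

```python
from typing import List, Optional


def popularity_label(stars: float) -> Optional[str]:
    """Label a business by its star rating; exactly 3 stars is left unlabeled."""
    if stars > 3:
        return "popular"
    if stars < 3:
        return "unpopular"
    return None  # assumption: the abstract does not define the 3-star case


def ngrams(text: str, n: int) -> List[str]:
    """Split text on whitespace and return its lowercase word n-grams."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


print(popularity_label(4.5))
print(ngrams("Great food and service", 1))  # uni-grams
print(ngrams("Great food and service", 2))  # bigrams, for comparison
```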

건설 CALS/EC 환경에서의 STEP 데이터 활용방안에 관한 연구 - 도면정보와 자재정보 연계 중심으로 - (A Study on the Usage of STEP data on the Construction CALS/EC Environment - Focusing on linking the Drawing Information and Material Information -)

  • 서종철;김인한
    • 한국전자거래학회지 / Vol. 8, No. 1 / pp.121-139 / 2003
  • Currently, STEP-based product information is rarely used in construction drawing files, despite the importance and potential of using various product data in drawing files in the CALS/EC environment. This paper aims to demonstrate a construction drawing information management system based on ISO 10303/STEP. To achieve this aim, the authors analyzed the construction drawing information classification hierarchies currently in wide use domestically and internationally, examined the mechanism for connecting material data within CAD drawing data, and investigated the management systems for construction documents and drawings in public companies. The expected benefit of the proposed system is that STEP drawing information management will be standardized and that STEP construction drawing information can be managed and shared, and can support design work through the material data connection.


Research on Chinese Microblog Sentiment Classification Based on TextCNN-BiLSTM Model

  • Haiqin Tang;Ruirui Zhang
    • Journal of Information Processing Systems / Vol. 19, No. 6 / pp.842-857 / 2023
  • Currently, most sentiment classification models on microblogging platforms analyze sentence parts of speech and emoticons without comprehending users' emotional inclinations and grasping moral nuances. This study proposes a hybrid sentiment analysis model. Given the distinct nature of microblog comments, the model employs a combined stop-word list and word2vec for word vectorization. To mitigate local information loss, the TextCNN model, devoid of pooling layers, is employed for local feature extraction, while BiLSTM is utilized for contextual feature extraction. Subsequently, microblog comment sentiments are categorized using a classification layer. Given the binary classification task at the output layer and the numerous hidden layers within BiLSTM, the Tanh activation function is adopted in this model. Experimental findings demonstrate that the enhanced TextCNN-BiLSTM model attains a precision of 94.75%. This represents a 1.21%, 1.25%, and 1.25% enhancement in precision, recall, and F1 values, respectively, compared with the standalone TextCNN model. Furthermore, it outperforms the standalone BiLSTM model by 0.78%, 0.9%, and 0.9% in precision, recall, and F1 values, respectively.
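
For readers unfamiliar with the three metrics reported in this abstract, the following minimal sketch computes precision, recall, and F1 for a binary sentiment task. The labels and predictions are invented for illustration and have no connection to the paper's data or results.

```python
from typing import Sequence, Tuple


def precision_recall_f1(y_true: Sequence[int], y_pred: Sequence[int],
                        positive: int = 1) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 for the given positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Made-up labels: 1 = positive sentiment, 0 = negative sentiment.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(precision, recall, f1)
```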

Robust Real-time Intrusion Detection System

  • Kim, Byung-Joo;Kim, Il-Kon
    • Journal of Information Processing Systems / Vol. 1, No. 1 / pp.9-13 / 2005
  • Computer security has become a critical issue with the rapid development of business and other transaction systems over the Internet. The application of artificial intelligence, machine learning, and data mining techniques to intrusion detection systems has been increasing recently, but most research focuses on improving the classification performance of a classifier. Selecting important features from the input data simplifies the problem and leads to faster and more accurate detection, so feature selection is an important issue in intrusion detection. Another issue is that most intrusion detection systems operate off-line, which is not suitable for real-time intrusion detection. In this paper, we develop a real-time intrusion detection system that combines an on-line feature extraction method with a Least Squares Support Vector Machine classifier. Applying the proposed system to the KDD CUP 99 data, experimental results show that it achieves remarkable feature extraction and classification performance compared with existing off-line intrusion detection systems.
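
The classifier named in this abstract, the Least Squares Support Vector Machine, replaces the quadratic program of a standard SVM with a single linear system. The sketch below shows that idea under several assumptions: a linear kernel, the regression-on-labels form of the system with labels in {-1, +1}, a hand-picked regularization value `gamma`, and invented toy data. It does not reproduce the paper's on-line feature extraction step or its KDD CUP 99 experiments.

```python
from typing import List, Sequence, Tuple


def solve_linear_system(A: List[List[float]], b: List[float]) -> List[float]:
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def linear_kernel(u: Sequence[float], v: Sequence[float]) -> float:
    return sum(a * b for a, b in zip(u, v))


def train_ls_svm(X: Sequence[Sequence[float]], y: Sequence[int],
                 gamma: float = 10.0) -> Tuple[float, List[float]]:
    """Solve [[0, 1^T], [1, K + I/gamma]] [bias; alphas] = [0; y]."""
    n = len(X)
    A = [[0.0] + [1.0] * n]
    for i in range(n):
        row = [1.0]
        for j in range(n):
            ridge = 1.0 / gamma if i == j else 0.0
            row.append(linear_kernel(X[i], X[j]) + ridge)
        A.append(row)
    solution = solve_linear_system(A, [0.0] + [float(v) for v in y])
    return solution[0], solution[1:]


def predict(x: Sequence[float], X: Sequence[Sequence[float]],
            bias: float, alphas: Sequence[float]) -> int:
    """Classify x by the sign of the kernel expansion."""
    score = bias + sum(a * linear_kernel(x, xi) for a, xi in zip(alphas, X))
    return 1 if score >= 0 else -1


# Toy data (made up): +1 could stand for "attack" and -1 for "normal".
X = [[-2.0, -1.0], [-1.0, -2.0], [1.0, 2.0], [2.0, 1.0]]
y = [-1, -1, 1, 1]
bias, alphas = train_ls_svm(X, y)
print([predict(x, X, bias, alphas) for x in X])
```

Because training reduces to one linear solve, this family of classifiers is often paired with incremental preprocessing, but how the paper couples the two is not specified in the abstract.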

해외 건설산업의 사례에 의한 국내 건설 업종 분류체계의 비교 분석 (Analysis of the Domestic Construction Industry Classification System through an Overseas Construction Industry Case Study)

  • 김정욱;김규용;최민수;남정수;이상수
    • 한국건축시공학회지 / Vol. 22, No. 5 / pp.463-471 / 2022
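  • Compared with other industries, the construction industry has a structure in which information asymmetry problems are complex and widespread. The current classification system of construction business types used for construction business registration in Korea has gone through the creation, integration, and abolition of categories, and consolidation is being pursued under a division into general construction and specialty construction businesses. Because the classification system serves as a guideline for entering the construction market and as a criterion by which construction clients select suppliers who can provide appropriate services, it is very important, in light of the purpose and intent of the construction business registration system, that the classification system be reviewed and set up rationally. This study aims to examine the potential for improving Korea's classification system of registered construction business types, taking into account the risk factors associated with it. To this end, we surveyed the classification systems used in construction licensing and registration systems overseas, including Japan, the United States, and Australia, compared them with the domestic classification system, and reviewed the implications and directions for improvement.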

전통적인 챗봇과 ChatGPT 연계 서비스 방안 연구 (A Study on the Service Integration of Traditional Chatbot and ChatGPT)

  • 정천수
    • Journal of Information Technology Applications and Management / Vol. 30, No. 4 / pp.11-28 / 2023
  • This paper proposes a method of integrating ChatGPT with traditional chatbot systems to enhance conversational artificial intelligence (AI) and create more efficient conversational systems. Traditional chatbot systems are primarily based on classification models and are limited to intent classification and simple response generation. In contrast, ChatGPT is a state-of-the-art AI technology for natural language generation that can generate more natural and fluent conversations. In this paper, we analyze the business service areas in which ChatGPT and traditional chatbots can be integrated, and present methods for conducting conversational scenarios through case studies of service types. Additionally, we suggest ways to integrate ChatGPT with traditional chatbot systems for intent recognition, conversation flow control, and response generation. We provide a practical implementation example of integrating ChatGPT with traditional chatbots, making the integration methods easier to understand and build and encouraging active use of ChatGPT with existing chatbots.
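
One plausible way to combine the two kinds of system is to let the intent classifier answer when it is confident and to hand everything else to a generative model. The sketch below illustrates that routing pattern only; it is an assumption about the general architecture, not the integration method of the paper. The intents, keywords, threshold, and `fake_generative_model` stub are all hypothetical, and no real ChatGPT API is called.

```python
from typing import Callable, Dict, Optional, Tuple

# Hypothetical intents and scripted answers, made up for illustration.
INTENT_KEYWORDS: Dict[str, Tuple[str, ...]] = {
    "check_balance": ("balance", "account"),
    "store_hours": ("hours", "open"),
}
SCRIPTED_RESPONSES: Dict[str, str] = {
    "check_balance": "Your balance is shown on the account page.",
    "store_hours": "We are open from 9 am to 6 pm.",
}


def classify_intent(message: str) -> Tuple[Optional[str], float]:
    """Return (intent, confidence) from a naive keyword-overlap score."""
    words = set(message.lower().split())
    best_intent: Optional[str] = None
    best_score = 0.0
    for intent, keywords in INTENT_KEYWORDS.items():
        score = len(words & set(keywords)) / len(keywords)
        if score > best_score:
            best_intent, best_score = intent, score
    return best_intent, best_score


def respond(message: str, generate: Callable[[str], str],
            threshold: float = 0.5) -> str:
    """Use the scripted answer for confident intents, else fall back."""
    intent, confidence = classify_intent(message)
    if intent is not None and confidence >= threshold:
        return SCRIPTED_RESPONSES[intent]
    return generate(message)


def fake_generative_model(message: str) -> str:
    """Stand-in for a generative model; no external service is called."""
    return f"[generated reply to: {message}]"


print(respond("what is my account balance", fake_generative_model))
print(respond("tell me a joke", fake_generative_model))
```

The `generate` callable is passed in rather than hard-coded so that the routing logic can be tested without network access and the generative backend can be swapped.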

Feature Selection for Multi-Class Support Vector Machines Using an Impurity Measure of Classification Trees: An Application to the Credit Rating of S&P 500 Companies

  • Hong, Tae-Ho;Park, Ji-Young
    • Asia pacific journal of information systems / Vol. 21, No. 2 / pp.43-58 / 2011
  • Support vector machines (SVMs), a machine learning technique, have been applied not only to binary classification problems such as bankruptcy prediction but also to multi-class problems such as corporate credit rating. In general, however, the performance of SVMs can easily fall below that of the best alternative model depending on the selection of predictors, even though SVMs are distinguished by successful classification and prediction in many dichotomous and multi-class problems. To overcome this weakness, this study proposes an approach to selecting features for multi-class SVMs that uses the impurity measures of classification trees. To select the input features, we employed the C4.5 and CART algorithms, as well as the stepwise method of discriminant analysis, a well-known feature selection method. We built a multi-class SVM model for credit rating using the above method and present experimental results on data from S&P 500 companies.
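
To make the idea of impurity-based feature selection concrete, the sketch below scores each feature by the largest decrease in Gini impurity that any single threshold split on it can achieve, then ranks the features by that score. Gini impurity is the measure used by CART; C4.5 uses an entropy-based criterion instead, which is not shown. The function names and the toy data (three rating classes, two numeric features) are invented, and this is not the authors' procedure.

```python
from collections import Counter
from typing import List, Sequence


def gini(labels: Sequence[str]) -> float:
    """Gini impurity of a set of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_gini_decrease(values: Sequence[float], labels: Sequence[str]) -> float:
    """Largest impurity decrease over all threshold splits on one feature."""
    parent = gini(labels)
    n = len(labels)
    best = 0.0
    for threshold in sorted(set(values))[:-1]:
        left = [l for v, l in zip(values, labels) if v <= threshold]
        right = [l for v, l in zip(values, labels) if v > threshold]
        children = (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, parent - children)
    return best


def rank_features(X: Sequence[Sequence[float]], y: Sequence[str]) -> List[int]:
    """Return feature indices sorted from most to least informative."""
    scores = [best_gini_decrease([row[j] for row in X], y)
              for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


# Toy data (made up): feature 0 separates the classes, feature 1 does not.
X = [[0.1, 5.0], [0.2, 1.0], [0.5, 4.0], [0.6, 2.0], [0.9, 3.0], [1.0, 6.0]]
y = ["A", "A", "B", "B", "C", "C"]
ranking = rank_features(X, y)
print(ranking)
```

The top-ranked features would then be passed to the multi-class SVM as its input variables.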

전력시스템 고조파 외란의 자동식별 (Automatic Classification of Power System Harmonic Disturbances)

  • 김병철;김현수;남상원
    • 제어로봇시스템학회논문지 / Vol. 6, No. 7 / pp.551-558 / 2000
  • In this paper, a systematic approach to the automatic classification of power system harmonic disturbances is proposed. The approach consists of the following three steps: (i) detecting and localizing each harmonic disturbance by applying the discrete wavelet transform (DWT); (ii) extracting an efficient feature vector from each detected disturbance waveform using the FFT and principal component analysis (PCA) along with Fisher's criterion; and (iii) classifying the type of each harmonic disturbance by recognizing the pattern of each feature vector. To demonstrate the performance and applicability of the proposed classification procedure, simulation results are provided from an analysis of eight classes of power system harmonic disturbances generated with the Matlab power system blockset.
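
Fisher's criterion, mentioned in step (ii), scores a feature by how far apart the class means are relative to the spread within each class. The sketch below uses the common two-class form, (difference of means)^2 / (sum of variances), as a simplification: the paper deals with eight disturbance classes, and its exact formulation is not given in the abstract. The feature values are invented.

```python
from statistics import mean, pvariance
from typing import List, Sequence


def fisher_score(class_a: Sequence[float], class_b: Sequence[float]) -> float:
    """Two-class Fisher criterion for one feature."""
    between = (mean(class_a) - mean(class_b)) ** 2
    within = pvariance(class_a) + pvariance(class_b)
    return between / within


def rank_by_fisher(features_a: Sequence[Sequence[float]],
                   features_b: Sequence[Sequence[float]]) -> List[int]:
    """Rank feature indices by Fisher score, highest first.

    features_a[j] and features_b[j] hold the values of feature j
    for the samples of class A and class B respectively.
    """
    scores = [fisher_score(a, b) for a, b in zip(features_a, features_b)]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


# Made-up values for two features (e.g. two spectral components).
features_a = [[1.0, 2.0, 3.0], [1.0, 5.0, 9.0]]
features_b = [[7.0, 8.0, 9.0], [2.0, 6.0, 10.0]]
print(fisher_score(features_a[0], features_b[0]))
print(rank_by_fisher(features_a, features_b))
```

A feature with a high score separates the two classes well on its own, which is why the criterion is useful for pruning a feature vector before classification.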
