• Title/Summary/Keyword: Machine selection

Search Result 913, Processing Time 0.04 seconds

Establishment of Bank Channel Strategy using Correspondence Analysis : Based on the Customer's Choice Factors of Bank Channel (대응분석을 이용한 은행 채널전략 수립연구 : 고객의 은행채널 선택요인을 바탕으로)

  • Park, Un Hak;Park, Young Bae
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.6
    • /
    • pp.151-171
    • /
    • 2023
  • For the efficient establishment of a channel strategy for banks, this study aims to propose a channel model by classifying channels into types, and carrying out a correspondence analysis per type. A survey of bankers was conducted to visualize categorical data and create a positioning map. As a result, first, 12 banking channels were classified into 4 types based on business processing subjects and places, which were then, further grouped into the categories of full-banking and self-banking. Second, a correspondence analysis according to the classified types was carried out, and it was found that the branch-type is suitable for product description and customer management, while the banking-type is suitable for efficient business processing without time and space constraints. Furthermore, the analysis also showed that the machine-type and banking-type are inappropriate for customer management, and the mobility-type demonstrates low operational effectiveness due to a lack of awareness. The aforementioned findings suggest the need for a hybrid convergence channel that reflects the characteristics of banking tasks and fills in the gaps between the different channels. Third, a channel model was derived by adding a common area to the 2×2 model consisting of the business processing subjects and places. Therefore, this study is meaningful in that it examines the diversification of channels and factors in the division of roles by channel type based on customers' banking channel selection factors, and presents basic research findings for future channel strategy establishment and efficient channel operation.

A Study on Efficient AI Model Drift Detection Methods for MLOps (MLOps를 위한 효율적인 AI 모델 드리프트 탐지방안 연구)

  • Ye-eun Lee;Tae-jin Lee
    • Journal of Internet Computing and Services
    • /
    • v.24 no.5
    • /
    • pp.17-27
    • /
    • 2023
  • Today, as AI (Artificial Intelligence) technology develops and its practicality increases, it is widely used in various application fields in real life. At this time, the AI model is basically learned based on various statistical properties of the learning data and then distributed to the system, but unexpected changes in the data in a rapidly changing data situation cause a decrease in the model's performance. In particular, as it becomes important to find drift signals of deployed models in order to respond to new and unknown attacks that are constantly created in the security field, the need for lifecycle management of the entire model is gradually emerging. In general, it can be detected through performance changes in the model's accuracy and error rate (loss), but there are limitations in the usage environment in that an actual label for the model prediction result is required, and the detection of the point where the actual drift occurs is uncertain. there is. This is because the model's error rate is greatly influenced by various external environmental factors, model selection and parameter settings, and new input data, so it is necessary to precisely determine when actual drift in the data occurs based only on the corresponding value. There are limits to this. Therefore, this paper proposes a method to detect when actual drift occurs through an Anomaly analysis technique based on XAI (eXplainable Artificial Intelligence). As a result of testing a classification model that detects DGA (Domain Generation Algorithm), anomaly scores were extracted through the SHAP(Shapley Additive exPlanations) Value of the data after distribution, and as a result, it was confirmed that efficient drift point detection was possible.

Mapping Mammalian Species Richness Using a Machine Learning Algorithm (머신러닝 알고리즘을 이용한 포유류 종 풍부도 매핑 구축 연구)

  • Zhiying Jin;Dongkun Lee;Eunsub Kim;Jiyoung Choi;Yoonho Jeon
    • Journal of Environmental Impact Assessment
    • /
    • v.33 no.2
    • /
    • pp.53-63
    • /
    • 2024
  • Biodiversity holds significant importance within the framework of environmental impact assessment, being utilized in site selection for development, understanding the surrounding environment, and assessing the impact on species due to disturbances. The field of environmental impact assessment has seen substantial research exploring new technologies and models to evaluate and predict biodiversity more accurately. While current assessments rely on data from fieldwork and literature surveys to gauge species richness indices, limitations in spatial and temporal coverage underscore the need for high-resolution biodiversity assessments through species richness mapping. In this study, leveraging data from the 4th National Ecosystem Survey and environmental variables, we developed a species distribution model using Random Forest. This model yielded mapping results of 24 mammalian species' distribution, utilizing the species richness index to generate a 100-meter resolution map of species richness. The research findings exhibited a notably high predictive accuracy, with the species distribution model demonstrating an average AUC value of 0.82. In addition, the comparison with National Ecosystem Survey data reveals that the species richness distribution in the high-resolution species richness mapping results conforms to a normal distribution. Hence, it stands as highly reliable foundational data for environmental impact assessment. Such research and analytical outcomes could serve as pivotal new reference materials for future urban development projects, offering insights for biodiversity assessment and habitat preservation endeavors.

A Study on the Prediction Models of Used Car Prices Using Ensemble Model And SHAP Value: Focus on Feature of the Vehicle Type (앙상블 모델과 SHAP Value를 활용한 국내 중고차 가격 예측 모델에 관한 연구: 차종 특성을 중심으로)

  • Seungjun Yim;Joungho Lee;Choonho Ryu
    • Journal of Service Research and Studies
    • /
    • v.14 no.1
    • /
    • pp.27-43
    • /
    • 2024
  • The market share of online platform services in the used car market continues to expand. And The used car online platform service provides service users with specifications of vehicles, accident history, inspection details, detailed options, and prices of used cars. SUV vehicle type's share in the domestic automobile market will be more than 50% in 2023, Sales of Hybrid vehicle type are doubled compared to last year. And these vehicle types are also gaining popularity in the used car market. Prior research has proposed a used car price prediction model by executing a Machine Learning model for all vehicles or vehicles by brand. On the other hand, the popularity of SUV and Hybrid vehicles in the domestic market continues to rise, but It was difficult to find a study that proposed a used car price prediction model for these vehicle type. This study selects a used car price prediction model by vehicle type using vehicle specifications and options for Sedans, SUV, and Hybrid vehicles produced by domestic brands. Accordingly, after selecting feature through the Lasso regression model, which is a feature selection, the ensemble model was sequentially executed with the same sampling, and the best model by vehicle type was selected. As a result, the best model for all models was selected as the CBR model, and the contribution and direction of the features were confirmed by visualizing Tree SHAP Value for the best model for each model. The implications of this study are expected to propose a used car price prediction model by vehicle type to sales officials using online platform services, confirm the attribution and direction of features, and help solve problems caused by asymmetry fo information between them.

The study of CFD Modelling and numerical analysis for MSW in MBT system (생활폐기물 전처리시스템(MBT)의 동역학적 수치해석 및 모델링에 대한 연구)

  • Lee, Keon joo;Cho, Min tae;Na, Kyung Deok
    • Journal of the Korea Organic Resources Recycling Association
    • /
    • v.18 no.3
    • /
    • pp.77-86
    • /
    • 2010
  • In this study, the model of the indirect wind suction waste sorting machine for characteristics of the screening of waste was studied using computational fluid dynamics and the drag coefficient for the model and the suction wind speed were obtained. The wind separator are developing by installing a cyclone air outlet to the suction blower impeller waste is selective in a way that does not pass the features and characteristics of the inlet pipe of the pressure loss and separation efficiency can have a significant impact on. Using Wind separator for selection of waste in the waste prior research on the aerodynamic properties are essential. For plastic cases, it is reasonable to take the drag coefficient between 0.8 and 1.0, and for cans, compression depending on whether the cans, the drag coefficient is in the range from 0.2 to 0.7. The separation efficiency of waste as change suction speed was the highest efficiency when the suction speed was 25~26 m/s. Shape of the inlet, depending on how the transfer pipe of the duct pressure loss occurs because the inlet velocity changes through the appropriate design standards to allow for continued research is needed.

Incidence and Risk Factors for Extended-Spectrum ${\beta}-Lactamase-Producing$ Escherichia coli in Community-acquired Childhood Urinary Tract Infection (지역사회 획득 소아 요로 감염에서 Extended-Spectrum ${\beta}-Lactamase$ 생성)

  • Lee Jung-Won;Shin Jee-Sun;Seo Jeong-Wan;Lee Mi-Ae;Lee Seung-Joo
    • Childhood Kidney Diseases
    • /
    • v.8 no.2
    • /
    • pp.214-222
    • /
    • 2004
  • Purpose: Appropriate antibiotic therapy is important in childhood urinary tract infection and the selection of anibiotics is based on antimicrobial sensitivity of Escherichia coli. Extended-Spectrum ${\beta}-Lactamase(ESBL)$ is an enzyme produced by gram-negative bacilli that has the ability to hydrolyse penicillins, broad-spectrum cephalosporin and monobactam. There have been many reports of outbreaks of hospital infection by ESBL-producing organism. However, community-acquired infection with ESBL-producing organism are rare. This study was performed to retrospectively identify the incidence, characteristics and risk factors of ESBL (+) E. coli in community-acquired childhood UTI. Methods: In 288 children admitted in Ewha Womans University Hospital with E. coli UTI from Mar 2001 to February 2003, ESBL was isolated. ESBL was confirmed by the utilization of an automatized machine(Vitek GNS 433 card) using liquid medium dilution method according to National Committee for Clinical Laboratory Standard. The clinical characteristics, risk factors, antimicrobial resistance and treatment effectiveness were compared with ESBL(-) E. coli UTI. Results: Of 288 E. coli isolates, 31(10.8%) produced ESBL and 93.5%(29/31) occurred in infants younger than 6 month of age(P<0.01). No significant differences were noted in prior antibiotic use, prior admission history and underlying urogenital anomaly. Antimicrobial resistance was significantly higher in ESBL(+) E. coli compared with control patients (P<0.05). Although ceftriaxone showed 100% resistance in ESBL(+) E. coli, bacteriologic sterilization rate after ceftriaxone therapy was higher(96.8%). However, the recurrence rate of febrile UTI within 6 months was higher(25.8%) than control patients(6.6%). Conclusion: Epidemiologic study is required to find out any new risk factors of community-acquired ESBL(+) E. coli UTI and changes in selection of empirical antibiotics should be considered.

  • PDF

The Empirical Exploration of the Conception on Nursing (간호개념에 대한 기초조사)

  • 백혜자
    • Journal of Korean Academy of Nursing
    • /
    • v.11 no.1
    • /
    • pp.65-87
    • /
    • 1981
  • The study is aimed at exploring concept held by clinical nurses of nursing. The data were collected from 225 nurses conviniently selected from the population of nurses working in Kang Won province. Findings include. 1) Nurse's Qualification. The respondents view that specialized knowledge is more important qualification of the nurse. Than warm personality. Specifically, 92.9% of the respondents indicated specialized knowledge as the most important qualification while only 43.1% indicated warm personality. 2) On Nursing Profession. The respondents view that nursing profession as health service oriented rather than independent profession specifically. This suggests that nursing profession is not consistentic present health care delivery system nor support nurses working independently. 3) On Clients of Nursing Care The respondents include patients, family and the community residents in the category of nursing care. Specifically, 92.0% of the respondents view that patient is the client, while only 67.1% of nursing student and 74.7% of herself. This indicates the lack of the nurse's recognition toward their clients. 4) On the Priority of Nursing care. Most of the respondents view the clients physical psychological respects as important component of nursing care but not the spiritual ones. Specially, 96.0% of the respondents indicated the physical respects, 93% psychological ones, while 64.1% indicated the spiritual ones. This means the lack of comprehensive conception on nursing aimension. 5) On Nursing Care. 91.6% of the respondents indicated that nursing care is the activity decreasing pain or helping to recover illness, while only 66.2% indicated earring out the physicians medical orders. 6) On Purpose of Nursing Care. 89.8% of the respondents indicated preventing illness and than 76.6% of them decreasing 1;ai of clients. On the other hand, maintaining health has the lowest selection at the degree of 13.8%. This means the lack of nurses' recognition for maintaining health as the most important point. 7) On Knowledge Needed in Nursing Care. Most of the respondents view that the knowledge faced with the spot of nursing care is needed. Specially, 81.3% of the respondents indicated simple curing method and 75.1%, 73.3%, 71.6% each indicated child nursing, maternal nursing and controlling for the communicable disease. On the other hand, knowledge w hick has been neglected in the specialized courses of nursing education, that is, thinking line among com-w unity members, overcoming style against between stress and personal relation in each home, and administration, management have a low selection at the depree of 48.9%,41.875 and 41.3%. 8) On Nursing Idea. The highest degree of selection is that they know themselves rightly, (The mean score measuring distribution was 4.205/5) In the lowest degree,3.016/5 is that devotion is the essential element of nursing, 2.860/5 the religious problems that human beings can not settle, such as a fatal ones, 2,810/5 the nursing profession is worth trying in one's life. This means that the peculiarly essential ideas on the professional sense of value. 9) On Nursing Services. The mean score measuring distribution for the nursing services showed that the inserting of machine air way is 2.132/5, the technique and knowledge for surviving heart-lung resuscitating is 2.892/s, and the preventing air pollution 3.021/5. Specially, 41.1% of the respondents indicated the lack of the replied ratio. 10) On Nurses' Qualifications. The respondents were selected five items as the most important qualifications. Specially, 17.4% of the respondents indicated specialized knowledge, 15.3% the nurses' health, 10.6% satisfaction for nursing profession, 9.8% the experience need, 9.2% comprehension and cooperation, while warm personality as nursing qualifications have a tendency of being lighted. 11) On the Priority of Nursing Care The respondents were selected three items as the most important component. Most of the respondents view the client's physical, spiritual: economic points as important components of nursing care. They showed each 36.8%, 27.6%, 13.8% while educational ones showed 1.8%. 12) On Purpose of Nursing Care. The respondents were selected four items as the most important purpose. Specially,29.3% of the respondents indicated curing illness for clients, 21.3% preventing illness for client 17.4% decreasing pain, 15.3% surviving. 13) On the Analysis of Important Nursing Care Ranging from 5 point to 25 point, the nurses' qualification are concentrated at the degree of 95.1%. Ranging from 3 point to 25, the priorities of nursing care are concentrated at the degree of 96.4%. Ranging from 4 point to 16, the purpose of nursing care is concentrated at the degree of 84.0%. 14) The Analysis, of General Characteristics and Facts of Nursing Concept. The correlation between the educational high level and nursing care showed significance. (P < 0.0262). The correction between the educational low level and purpose of nursing care showed significance. (P < 0.002) The correlation between nurses' working yeras and the degree of importance for the purpose of nursing care showed significance (P < 0.0155) Specially, the most affirmative answers were showed from two years to four ones. 15) On Nunes' qualification and its Degree of Importance The correlation between nurses' qualification and its degree of importance showed significance. (r = 0.2172, p< 0.001) 0.005) B. General characteristics of the subjects The mean age of the subject was 39 ; with 38.6% with in the age range of 20-29 ; 52.6% were male; 57.9% were Schizophrenia; 35.1% were graduated from high school or high school dropouts; 56.l% were not have any religion; 52.6% were unmarried; 47.4% were first admission; 91.2% were involuntary admission patients. C. Measurement of anxiety variables. 1. Measurement tools of affective anxiety in this study demonstrated high reliability (.854). 2. Measurement tools of somatic anxiety in this study demonstrated high reliability (.920). D. Relationship between the anxiety variables and the general characteristics. 1. Relationship between affective anxiety and general characteristics. 1) The level of female patients were higher than that of the male patient (t = 5.41, p < 0.05). 2) Frequencies of admission were related to affective anxiety, so in the first admission the anxiety level was the highest. (F = 5.50, p < 0.005). 2, Relationship between somatic anxiety and general characteristics. 1) The age range of 30-39 was found to have the highest level of the somatic anxiety. (F = 3.95, p < 0.005). 2) Frequencies of admission were related to the somatic anxiety, so .in first admission the anxiety level was the highest. (F = 9.12, p < 0.005) 0. Analysis of significant anxiety symptoms for nursing intervention. 1. Seven items such as dizziness, mental integration, sweating, restlessness, anxiousness, urinary frequency and insomnia, init. accounted for 96% of the variation within the first 24 hours after admission. 2. Seven items such as fear, paresthesias, restlessness, sweating insomnia, init., tremors and body aches and pains accounted for 84% of the variation on the 10th day after admission.

  • PDF

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.187-204
    • /
    • 2016
  • Document classification based on emotional polarity has become a welcomed emerging task owing to the great explosion of data on the Web. In the big data age, there are too many information sources to refer to when making decisions. For example, when considering travel to a city, a person may search reviews from a search engine such as Google or social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide on whether or not to make a trip. Sentiment analysis of customer reviews has become an important research topic as datamining technology is widely accepted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques, such as the decision tree, neural networks, and support vector machines (SVMs). is used to determine the attitude, position, and sensibility of people who write articles about various topics that are published on the Web. Regardless of the polarity of customer reviews, emotional reviews are very helpful materials for analyzing the opinions of customers through their reviews. Sentiment analysis helps with understanding what customers really want instantly through the help of automated text mining techniques. Sensitivity analysis utilizes text mining techniques on text on the Web to extract subjective information in the text for text analysis. Sensitivity analysis is utilized to determine the attitudes or positions of the person who wrote the article and presented their opinion about a particular topic. In this study, we developed a model that selects a hot topic from user posts at China's online stock forum by using the k-means algorithm and self-organizing map (SOM). In addition, we developed a detecting model to predict a hot topic by using machine learning techniques such as logit, the decision tree, and SVM. We employed sensitivity analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sensitivity analysis calculates a sentimental value from a document based on contrast and classification according to the polarity sentimental dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movement by analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to utilize sentiment analysis. One hundred forty-four topics were selected among 21 categories at online forums about stock. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. The interest index was defined to select the hot topics, and the k-means algorithm and SOM presented equivalent results with this data. We developed a decision tree model to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel function by a grid search to detect the hot topics. The detection of hot topics by using sentiment analysis provides the latest trends and hot topics in the stock forum for investors so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful to rapidly determine customers' signals or attitudes towards government policy and firms' products and services.

A Comparative Evaluation of Multiple Meteorological Datasets for the Rice Yield Prediction at the County Level in South Korea (우리나라 시군단위 벼 수확량 예측을 위한 다종 기상자료의 비교평가)

  • Cho, Subin;Youn, Youjeong;Kim, Seoyeon;Jeong, Yemin;Kim, Gunah;Kang, Jonggu;Kim, Kwangjin;Cho, Jaeil;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.2
    • /
    • pp.337-357
    • /
    • 2021
  • Because the growth of paddy rice is affected by meteorological factors, the selection of appropriate meteorological variables is essential to build a rice yield prediction model. This paper examines the suitability of multiple meteorological datasets for the rice yield modeling in South Korea, 1996-2019, and a hindcast experiment for rice yield using a machine learning method by considering the nonlinear relationships between meteorological variables and the rice yield. In addition to the ASOS in-situ observations, we used CRU-JRA ver. 2.1 and ERA5 reanalysis. From the multiple meteorological datasets, we extracted the four common variables (air temperature, relative humidity, solar radiation, and precipitation) and analyzed the characteristics of each data and the associations with rice yields. CRU-JRA ver. 2.1 showed an overall agreement with the other datasets. While relative humidity had a rare relationship with rice yields, solar radiation showed a somewhat high correlation with rice yields. Using the air temperature, solar radiation, and precipitation of July, August, and September, we built a random forest model for the hindcast experiments of rice yields. The model with CRU-JRA ver. 2.1 showed the best performance with a correlation coefficient of 0.772. The solar radiation in the prediction model had the most significant importance among the variables, which is in accordance with the generic agricultural knowledge. This paper has an implication for selecting from multiple meteorological datasets for rice yield modeling.

Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront (비정형 정보와 CNN 기법을 활용한 이진 분류 모델의 고객 행태 예측: 전자상거래 사례를 중심으로)

  • Kim, Seungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.221-241
    • /
    • 2018
  • Deep learning is getting attention recently. The deep learning technique which had been applied in competitions of the International Conference on Image Recognition Technology(ILSVR) and AlphaGo is Convolution Neural Network(CNN). CNN is characterized in that the input image is divided into small sections to recognize the partial features and combine them to recognize as a whole. Deep learning technologies are expected to bring a lot of changes in our lives, but until now, its applications have been limited to image recognition and natural language processing. The use of deep learning techniques for business problems is still an early research stage. If their performance is proved, they can be applied to traditional business problems such as future marketing response prediction, fraud transaction detection, bankruptcy prediction, and so on. So, it is a very meaningful experiment to diagnose the possibility of solving business problems using deep learning technologies based on the case of online shopping companies which have big data, are relatively easy to identify customer behavior and has high utilization values. Especially, in online shopping companies, the competition environment is rapidly changing and becoming more intense. Therefore, analysis of customer behavior for maximizing profit is becoming more and more important for online shopping companies. In this study, we propose 'CNN model of Heterogeneous Information Integration' using CNN as a way to improve the predictive power of customer behavior in online shopping enterprises. In order to propose a model that optimizes the performance, which is a model that learns from the convolution neural network of the multi-layer perceptron structure by combining structured and unstructured information, this model uses 'heterogeneous information integration', 'unstructured information vector conversion', 'multi-layer perceptron design', and evaluate the performance of each architecture, and confirm the proposed model based on the results. In addition, the target variables for predicting customer behavior are defined as six binary classification problems: re-purchaser, churn, frequent shopper, frequent refund shopper, high amount shopper, high discount shopper. In order to verify the usefulness of the proposed model, we conducted experiments using actual data of domestic specific online shopping company. This experiment uses actual transactions, customers, and VOC data of specific online shopping company in Korea. Data extraction criteria are defined for 47,947 customers who registered at least one VOC in January 2011 (1 month). The customer profiles of these customers, as well as a total of 19 months of trading data from September 2010 to March 2012, and VOCs posted for a month are used. The experiment of this study is divided into two stages. In the first step, we evaluate three architectures that affect the performance of the proposed model and select optimal parameters. We evaluate the performance with the proposed model. Experimental results show that the proposed model, which combines both structured and unstructured information, is superior compared to NBC(Naïve Bayes classification), SVM(Support vector machine), and ANN(Artificial neural network). Therefore, it is significant that the use of unstructured information contributes to predict customer behavior, and that CNN can be applied to solve business problems as well as image recognition and natural language processing problems. It can be confirmed through experiments that CNN is more effective in understanding and interpreting the meaning of context in text VOC data. And it is significant that the empirical research based on the actual data of the e-commerce company can extract very meaningful information from the VOC data written in the text format directly by the customer in the prediction of the customer behavior. Finally, through various experiments, it is possible to say that the proposed model provides useful information for the future research related to the parameter selection and its performance.