• Title/Summary/Keyword: 추천 모형

Search Result 233, Processing Time 0.03 seconds

Crop Recommendation Service based on Agriculture Environment Data (농업 환경 데이터에 기반한 농작물 추천 서비스)

  • Bae, Jiwon;Lee, Sangwook;Lee, Sywan;Lee, Yeji;Choi, Jun Hyung;Cho, Pil Kuk;Gil, Joon-Min
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.193-195
    • /
    • 2021
  • 최근 우리나라에서 재배되고 있는 농작물은 지구 온난화 등의 영향으로 점점 북상하고 있다. 이러한 농업 환경의 변화에 적극적으로 대처하기 위해 본 논문에서는 농업 재배지의 환경 데이터를 수집하고 분석하여 현재 농업 재배지에 최적화된 농작물을 추천할 수 있는 농작물 추천 서비스를 제안한다. 이를 위해 농작물 추천 서비스에 활용하기 위해 농업 환경 데이터의 모니터링과 농작물 데이터 관리 스마트팜 모형을 설계 및 구축한다.

A Study on Prediction Model of Subjective Well-Being Using Collaborative Filtering (협력적 필터링을 이용한 주관적 행복감 예측 모형연구)

  • Lee Sangyeop;Kim Jiyeon;Ryu dong in;Gi Hyeon Han;Park Saehan;Koo Jee Hyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.552-553
    • /
    • 2024
  • 협력적 필터링은 추천시스템을 구축하는 알고리즘으로 고객별 선호도를 예측하는데 사용되고 있다. 이에 본 연구는 행복감에 영향을 주는 요인인 자존감과 생활여건을 사용하여, 협력적 필터링을 기반으로 한 예측정확도가 높은 모형을 연구하고자 한다. 이를 위해, 자존감과 생활여건에 대한 응답자 간의 유사도 가중치를 각각 계산한 후, 자존감 유사도 가중치를 적용한 모형으로 행복감을 예측하고, 자존감 유사도 가중치에 생활여건 유사도 가중치를 부여한 유사도 가중치를 적용한 모형으로 행복감을 예측하였다. 그 결과 전자의 모형이 후자의 모형보다 예측정확도가 높게 나타났다.

Comparison of Product and Customer Feature Selection Methods for Content-based Recommendation in Internet Storefronts (인터넷 상점에서의 내용기반 추천을 위한 상품 및 고객의 자질 추출 성능 비교)

  • Ahn Hyung-Jun;Kim Jong-Woo
    • The KIPS Transactions:PartD
    • /
    • v.13D no.2 s.105
    • /
    • pp.279-286
    • /
    • 2006
  • One of the widely used methods for product recommendation in Internet storefronts is matching product features against target customer profiles. When using this method, it's very important to choose a suitable subset of features for recommendation efficiency and performance, which, however, has not been rigorously researched so far. In this paper, we utilize a dataset collected from a virtual shopping experiment in a Korean Internet book shopping mall to compare several popular methods from other disciplines for selecting features for product recommendation: the vector-space model, TFIDF(Term Frequency-Inverse Document Frequency), the mutual information method, and the singular value decomposition(SVD). The application of SVD showed the best performance in the analysis results.

Recommender Systems using SVD with Social Network Information (사회연결망정보를 고려하는 SVD 기반 추천시스템)

  • Kim, Min-Gun;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.1-18
    • /
    • 2016
  • Collaborative Filtering (CF) predicts the focal user's preference for particular item based on user's preference rating data and recommends items for the similar users by using them. It is a popular technique for the personalization in e-commerce to reduce information overload. However, it has some limitations including sparsity and scalability problems. In this paper, we use a method to integrate social network information into collaborative filtering in order to mitigate the sparsity and scalability problems which are major limitations of typical collaborative filtering and reflect the user's qualitative and emotional information in recommendation process. In this paper, we use a novel recommendation algorithm which is integrated with collaborative filtering by using Social SVD++ algorithm which considers social network information in SVD++, an extension algorithm that can reflect implicit information in singular value decomposition (SVD). In particular, this study will evaluate the performance of the model by reflecting the real-world user's social network information in the recommendation process.

How to Recommend Online Shopping Consumers the Best of Many Sellers? : Online Seller Recommendation System Using DEA Method (DEA 방법론을 이용한 온라인 판매자 추천 시스템의 구축)

  • An, Jung-Nam;Rho, Sang-Kyu;Yoo, Byung-Joon
    • The Journal of Society for e-Business Studies
    • /
    • v.16 no.3
    • /
    • pp.191-209
    • /
    • 2011
  • In a buyer-seller transaction process, 'value for money,' a measure of quality-price-ratio, is one of the most important criteria for buyers' purchasing decisions. The purpose of this paper is to suggest a method which helps online shoppers choose the best of several sellers offering homogeneous goods. We suggest FDH (free disposal hull) model, an applied model of data envelopment analysis (DEA), for online buyer-seller transactions and verify it with the data from an Internet comparison shopping site. For this purpose, we analyze consumer choice behaviors by examining how consumers respond to different sale conditions such as price, brand, or delivery time. Then, we implement a seller recommendation system to support buyers' purchasing decisions. We expect our FDH model to provide valuable information for rational buyers who want to pay the least price for high quality products/services and to be used in implementing automated evaluation processes in micro transactions. Moreover, we expect that our results can be utilized for sellers' benchmarking strategies which help sellers be more competitive by showing them how to attract buyers.

Conditional Generative Adversarial Network based Collaborative Filtering Recommendation System (Conditional Generative Adversarial Network(CGAN) 기반 협업 필터링 추천 시스템)

  • Kang, Soyi;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.157-173
    • /
    • 2021
  • With the development of information technology, the amount of available information increases daily. However, having access to so much information makes it difficult for users to easily find the information they seek. Users want a visualized system that reduces information retrieval and learning time, saving them from personally reading and judging all available information. As a result, recommendation systems are an increasingly important technologies that are essential to the business. Collaborative filtering is used in various fields with excellent performance because recommendations are made based on similar user interests and preferences. However, limitations do exist. Sparsity occurs when user-item preference information is insufficient, and is the main limitation of collaborative filtering. The evaluation value of the user item matrix may be distorted by the data depending on the popularity of the product, or there may be new users who have not yet evaluated the value. The lack of historical data to identify consumer preferences is referred to as data sparsity, and various methods have been studied to address these problems. However, most attempts to solve the sparsity problem are not optimal because they can only be applied when additional data such as users' personal information, social networks, or characteristics of items are included. Another problem is that real-world score data are mostly biased to high scores, resulting in severe imbalances. One cause of this imbalance distribution is the purchasing bias, in which only users with high product ratings purchase products, so those with low ratings are less likely to purchase products and thus do not leave negative product reviews. Due to these characteristics, unlike most users' actual preferences, reviews by users who purchase products are more likely to be positive. Therefore, the actual rating data is over-learned in many classes with high incidence due to its biased characteristics, distorting the market. Applying collaborative filtering to these imbalanced data leads to poor recommendation performance due to excessive learning of biased classes. Traditional oversampling techniques to address this problem are likely to cause overfitting because they repeat the same data, which acts as noise in learning, reducing recommendation performance. In addition, pre-processing methods for most existing data imbalance problems are designed and used for binary classes. Binary class imbalance techniques are difficult to apply to multi-class problems because they cannot model multi-class problems, such as objects at cross-class boundaries or objects overlapping multiple classes. To solve this problem, research has been conducted to convert and apply multi-class problems to binary class problems. However, simplification of multi-class problems can cause potential classification errors when combined with the results of classifiers learned from other sub-problems, resulting in loss of important information about relationships beyond the selected items. Therefore, it is necessary to develop more effective methods to address multi-class imbalance problems. We propose a collaborative filtering model using CGAN to generate realistic virtual data to populate the empty user-item matrix. Conditional vector y identify distributions for minority classes and generate data reflecting their characteristics. Collaborative filtering then maximizes the performance of the recommendation system via hyperparameter tuning. This process should improve the accuracy of the model by addressing the sparsity problem of collaborative filtering implementations while mitigating data imbalances arising from real data. Our model has superior recommendation performance over existing oversampling techniques and existing real-world data with data sparsity. SMOTE, Borderline SMOTE, SVM-SMOTE, ADASYN, and GAN were used as comparative models and we demonstrate the highest prediction accuracy on the RMSE and MAE evaluation scales. Through this study, oversampling based on deep learning will be able to further refine the performance of recommendation systems using actual data and be used to build business recommendation systems.

Predicting the Performance of Recommender Systems through Social Network Analysis and Artificial Neural Network (사회연결망분석과 인공신경망을 이용한 추천시스템 성능 예측)

  • Cho, Yoon-Ho;Kim, In-Hwan
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.159-172
    • /
    • 2010
  • The recommender system is one of the possible solutions to assist customers in finding the items they would like to purchase. To date, a variety of recommendation techniques have been developed. One of the most successful recommendation techniques is Collaborative Filtering (CF) that has been used in a number of different applications such as recommending Web pages, movies, music, articles and products. CF identifies customers whose tastes are similar to those of a given customer, and recommends items those customers have liked in the past. Numerous CF algorithms have been developed to increase the performance of recommender systems. Broadly, there are memory-based CF algorithms, model-based CF algorithms, and hybrid CF algorithms which combine CF with content-based techniques or other recommender systems. While many researchers have focused their efforts in improving CF performance, the theoretical justification of CF algorithms is lacking. That is, we do not know many things about how CF is done. Furthermore, the relative performances of CF algorithms are known to be domain and data dependent. It is very time-consuming and expensive to implement and launce a CF recommender system, and also the system unsuited for the given domain provides customers with poor quality recommendations that make them easily annoyed. Therefore, predicting the performances of CF algorithms in advance is practically important and needed. In this study, we propose an efficient approach to predict the performance of CF. Social Network Analysis (SNA) and Artificial Neural Network (ANN) are applied to develop our prediction model. CF can be modeled as a social network in which customers are nodes and purchase relationships between customers are links. SNA facilitates an exploration of the topological properties of the network structure that are implicit in data for CF recommendations. An ANN model is developed through an analysis of network topology, such as network density, inclusiveness, clustering coefficient, network centralization, and Krackhardt's efficiency. While network density, expressed as a proportion of the maximum possible number of links, captures the density of the whole network, the clustering coefficient captures the degree to which the overall network contains localized pockets of dense connectivity. Inclusiveness refers to the number of nodes which are included within the various connected parts of the social network. Centralization reflects the extent to which connections are concentrated in a small number of nodes rather than distributed equally among all nodes. Krackhardt's efficiency characterizes how dense the social network is beyond that barely needed to keep the social group even indirectly connected to one another. We use these social network measures as input variables of the ANN model. As an output variable, we use the recommendation accuracy measured by F1-measure. In order to evaluate the effectiveness of the ANN model, sales transaction data from H department store, one of the well-known department stores in Korea, was used. Total 396 experimental samples were gathered, and we used 40%, 40%, and 20% of them, for training, test, and validation, respectively. The 5-fold cross validation was also conducted to enhance the reliability of our experiments. The input variable measuring process consists of following three steps; analysis of customer similarities, construction of a social network, and analysis of social network patterns. We used Net Miner 3 and UCINET 6.0 for SNA, and Clementine 11.1 for ANN modeling. The experiments reported that the ANN model has 92.61% estimated accuracy and 0.0049 RMSE. Thus, we can know that our prediction model helps decide whether CF is useful for a given application with certain data characteristics.

A Study on the Real-time Recommendation Box Recommendation of Fulfillment Center Using Machine Learning (기계학습을 이용한 풀필먼트센터의 실시간 박스 추천에 관한 연구)

  • Dae-Wook Cha;Hui-Yeon Jo;Ji-Soo Han;Kwang-Sup Shin;Yun-Hong Min
    • The Journal of Bigdata
    • /
    • v.8 no.2
    • /
    • pp.149-163
    • /
    • 2023
  • Due to the continuous growth of the E-commerce market, the volume of orders that fulfillment centers have to process has increased, and various customer requirements have increased the complexity of order processing. Along with this trend, the operational efficiency of fulfillment centers due to increased labor costs is becoming more important from a corporate management perspective. Using historical performance data as training data, this study focused on real-time box recommendations applicable to packaging areas during fulfillment center shipping. Four types of data, such as product information, order information, packaging information, and delivery information, were applied to the machine learning model through pre-processing and feature-engineering processes. As an input vector, three characteristics were used as product specification information: width, length, and height, the characteristics of the input vector were extracted through a feature engineering process that converts product information from real numbers to an integer system for each section. As a result of comparing the performance of each model, it was confirmed that when the Gradient Boosting model was applied, the prediction was performed with the highest accuracy at 95.2% when the product specification information was converted into integers in 21 sections. This study proposes a machine learning model as a way to reduce the increase in costs and inefficiency of box packaging time caused by incorrect box selection in the fulfillment center, and also proposes a feature engineering method to effectively extract the characteristics of product specification information.

Development of probability distribution for simulation of monthly characteristics of torrential rainfall events (집중호우사상의 월별 발생특성 모의를 위한 확률분포 개발)

  • Kim, Sang Ug;Kim, Hyeong Bae
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2016.05a
    • /
    • pp.246-246
    • /
    • 2016
  • 본 연구에서는 최근 기후변화로 인한 집중호우의 발생횟수의 경향을 확률적으로 분석함에 있어 1개월 동안 80 mm/day 이상의 강우사상을 집중호우로 정의하여, 대구 및 부산 강우관측소로부터 수집된 384개월 동안의 집중호우를 분석하였다. 집중호우 월별 발생횟수와 같은 형식의 자료의 확률적 분석은 대개 Poisson 분포 (POI)가 사용되나 자료에 포함된 0자료의 과잉은 확률분포를 왜곡시키는 문제를 발생시킨다. 본 연구에서는 이 문제를 개선하기 위하여 개발된 일반화 Poisson 확률분포 (GPD), 0-과잉 Poisson 확률분포 (ZIP), 0-과잉 일반화 Poisson 확률분포 (ZIGP), Bayesian 0-과잉 일반화 Poisson 확률분포 (Bayesian ZIGP)를 집중호우 자료에 적용하고, 5개 모형의 특성을 비교분석하였으며, Bayesian ZIGP 모형의 구축에 있어서는 정보적 사전분포를 사용함으로써 모형의 정확도를 개선하였다. 분석결과 분석하고자 하는 자료에 0이 과다하게 포함되어 있는 경우 POI 및 GPD 분포는 관측결과와는 다른 결과를 제시하여 적절한 모형으로 고려되지 못함을 알 수 있었다. 5가지 모형 중 정보적 사전분포를 탑재한 Bayesian ZIGP 모형이 가장 관측 자료와 유사한 결과를 도출하였으나 모형의 구축에 수반되는 실용적인 측면을 고려하면 ZIP 모형도 충분히 사용될 수 있는 모형으로 추천되었다.

  • PDF

Automatic TV Program Recommendation using LDA based Latent Topic Inference (LDA 기반 은닉 토픽 추론을 이용한 TV 프로그램 자동 추천)

  • Kim, Eun-Hui;Pyo, Shin-Jee;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.270-283
    • /
    • 2012
  • With the advent of multi-channel TV, IPTV and smart TV services, excessive amounts of TV program contents become available at users' sides, which makes it very difficult for TV viewers to easily find and consume their preferred TV programs. Therefore, the service of automatic TV recommendation is an important issue for TV users for future intelligent TV services, which allows to improve access to their preferred TV contents. In this paper, we present a recommendation model based on statistical machine learning using a collaborative filtering concept by taking in account both public and personal preferences on TV program contents. For this, users' preference on TV programs is modeled as a latent topic variable using LDA (Latent Dirichlet Allocation) which is recently applied in various application domains. To apply LDA for TV recommendation appropriately, TV viewers's interested topics is regarded as latent topics in LDA, and asymmetric Dirichlet distribution is applied on the LDA which can reveal the diversity of the TV viewers' interests on topics based on the analysis of the real TV usage history data. The experimental results show that the proposed LDA based TV recommendation method yields average 66.5% with top 5 ranked TV programs in weekly recommendation, average 77.9% precision in bimonthly recommendation with top 5 ranked TV programs for the TV usage history data of similar taste user groups.