Product Recommender Systems using Multi-Model Ensemble Techniques (다중모형조합기법을 이용한 상품추천시스템)
- Journal of Intelligence and Information Systems / v.19 no.2 / pp.39-54 / 2013
The recent explosive growth of electronic commerce gives customers many advantageous purchase opportunities. In this situation, customers who do not know enough about what they are buying may accept product recommendations. Product recommender systems automatically reflect a user's preferences and provide a recommendation list to the user. The product recommender system in an online shopping store has therefore become known as one of the most popular tools for one-to-one marketing. However, recommender systems that do not properly reflect a user's preferences disappoint users and waste their time. In this study, we propose a novel recommender system that uses data mining and multi-model ensemble techniques to improve recommendation performance by reflecting user preferences more precisely. The research data are collected from a real-world online shopping store that sells products from famous art galleries and museums in Korea. The data initially contain 5,759 transaction records, of which 3,167 remain after records with null values are deleted. We transform the categorical variables into dummy variables and exclude outliers. The proposed model consists of two steps. The first step predicts which customers are highly likely to purchase products in the online shopping store. In this step, we first use logistic regression, decision trees, and artificial neural networks to predict the customers who are highly likely to purchase products in each product group. We run these data mining techniques using SAS E-Miner software. We partition the dataset into modeling and validation sets for logistic regression and decision trees, and into training, test, and validation sets for the artificial neural network model. The validation dataset is the same in all experiments.
We then combine the results of the individual predictors using multi-model ensemble techniques such as bagging and bumping. Bagging is short for "Bootstrap Aggregation"; it combines the outputs of several machine learning techniques to raise the performance and stability of prediction or classification, and is a special form of the averaging method. Bumping is short for "Bootstrap Umbrella of Model Parameter"; it considers only the model with the lowest error value. The results show that bumping outperforms bagging and the other predictors except in the "Poster" product group, where the artificial neural network model performs better than the other models. In the second step, we use market basket analysis to extract association rules for co-purchased products. We extract thirty-one association rules according to the values of the Lift, Support, and Confidence measures. We set the minimum transaction frequency to support an association to 5%, the maximum number of items in an association to 4, and the minimum confidence for rule generation to 10%. We also exclude extracted association rules with a lift value below 1. After excluding duplicate rules, we finally obtain fifteen association rules. Of these fifteen rules, eleven contain associations between products in the "Office Supplies" product group, one contains an association between the "Office Supplies" and "Fashion" product groups, and the other three contain associations between the "Office Supplies" and "Home Decoration" product groups. Finally, the proposed product recommender system provides a list of recommendations to the appropriate customers. We test the usability of the proposed system using a prototype and real-world transaction and profile data. To this end, we construct the prototype system using ASP, JavaScript, and Microsoft Access.
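The rule filtering in the second step can be illustrated with a minimal Python sketch. It computes support, confidence, and lift for one candidate rule and applies the thresholds stated above (support of at least 5%, confidence of at least 10%, lift of at least 1). The function names `rule_metrics` and `keep_rule` and the toy baskets are hypothetical and are not taken from the study, which used SAS E-Miner.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    transactions is a list of sets of item names.
    """
    n = len(transactions)
    a, c = set(antecedent), set(consequent)
    count_a = sum(1 for t in transactions if a <= t)
    count_c = sum(1 for t in transactions if c <= t)
    count_ac = sum(1 for t in transactions if (a | c) <= t)
    support = count_ac / n                       # P(A and C)
    confidence = count_ac / count_a if count_a else 0.0   # P(C | A)
    lift = confidence / (count_c / n) if count_c else 0.0  # P(C | A) / P(C)
    return support, confidence, lift


def keep_rule(support, confidence, lift,
              min_support=0.05, min_confidence=0.10, min_lift=1.0):
    """Apply the thresholds reported in the abstract."""
    return (support >= min_support
            and confidence >= min_confidence
            and lift >= min_lift)


# Hypothetical toy baskets, for illustration only.
baskets = [
    {"notebook", "pen"},
    {"notebook", "pen", "scarf"},
    {"notebook"},
    {"pen"},
    {"scarf", "frame"},
]

support, confidence, lift = rule_metrics(baskets, {"notebook"}, {"pen"})
print(f"support={support:.2f} confidence={confidence:.2f} lift={lift:.2f}")
print("keep rule:", keep_rule(support, confidence, lift))
```

A lift above 1 means the two items are bought together more often than their individual frequencies would suggest, which is why rules with lift below 1 are discarded.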
In addition, we survey user satisfaction with the recommended product list from the proposed system and with randomly selected product lists. The survey participants are 173 people who use MSN Messenger, Daum Caf
Sea-surface wind is an important variable in ocean-atmosphere interactions, driving changes in ocean surface currents and circulation, mixed layers, and heat flux. With the development of satellite technology, sea-surface wind data retrieved from scatterometer observations have been used for various purposes. In a complex marine environment such as the coast of the Korean Peninsula, scatterometer-observed sea-surface wind is an important factor for analyzing ocean and atmospheric phenomena. Therefore, the validation results of wind accuracy can be used for diverse applications. In this study, the sea-surface winds derived from ASCAT (Advanced SCATterometer) mounted on MetOp-A/B (METeorological Operational Satellite-A/B) were validated against in-situ wind measurements at 16 marine buoy stations around the Korean Peninsula from January to December 2020. The buoy winds measured at a height of 4-5 m above the sea surface were converted to 10-m neutral winds using the LKB (Liu-Katsaros-Businger) model. The matchup procedure produced 5,544 and 10,051 collocation points for MetOp-A and MetOp-B, respectively. The root mean square errors (RMSE) of wind speed were 1.36 and 1.28 m s⁻¹, and the biases were 0.44 and 0.65 m s⁻¹ for MetOp-A and MetOp-B, respectively. The wind directions of both scatterometers exhibited negative biases of -8.03° and -6.97° and RMSE values of 32.46° and 36.06° for MetOp-A and MetOp-B, respectively. These errors were likely associated with the stratification and dynamics of the marine-atmospheric boundary layer. In the seas around the Korean Peninsula, the ASCAT sea-surface winds tended to overestimate the in-situ wind speeds, particularly at weak wind speeds. In addition, the errors grew larger closer to the coast.
The present results could contribute to the development of prediction models by providing improved input data, and to the understanding of air-sea interaction and the impact of typhoons in the coastal regions around the Korean Peninsula.
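The bias and RMSE statistics reported above can be sketched in Python as follows. This is a minimal illustration, not the study's processing chain: it assumes the buoy winds have already been converted to 10-m neutral winds, that differences are taken as satellite minus buoy (so a positive speed bias means overestimation), and that direction differences are wrapped to [-180°, 180°). The function names and the sample values are hypothetical.

```python
import math


def wrap_direction_diff(sat_deg, buoy_deg):
    """Signed difference (satellite minus buoy) wrapped to [-180, 180) degrees."""
    return (sat_deg - buoy_deg + 180.0) % 360.0 - 180.0


def bias_and_rmse(diffs):
    """Mean difference (bias) and root mean square error of a list of differences."""
    n = len(diffs)
    bias = sum(diffs) / n
    rmse = math.sqrt(sum(d * d for d in diffs) / n)
    return bias, rmse


# Hypothetical collocated samples, for illustration only.
sat_speed = [5.0, 7.5, 3.0, 10.0]    # m/s
buoy_speed = [4.5, 7.0, 2.0, 10.5]   # m/s, assumed already 10-m neutral winds
sat_dir = [350.0, 10.0, 180.0, 90.0]     # degrees
buoy_dir = [10.0, 350.0, 170.0, 100.0]   # degrees

speed_bias, speed_rmse = bias_and_rmse(
    [s - b for s, b in zip(sat_speed, buoy_speed)])
dir_bias, dir_rmse = bias_and_rmse(
    [wrap_direction_diff(s, b) for s, b in zip(sat_dir, buoy_dir)])

print(f"speed: bias={speed_bias:.3f} m/s, rmse={speed_rmse:.3f} m/s")
print(f"direction: bias={dir_bias:.2f} deg, rmse={dir_rmse:.2f} deg")
```

Wrapping matters for direction: without it, a 350° satellite wind against a 10° buoy wind would count as a 340° error instead of -20°.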
This study was carried out to explore the accuracy of near infrared spectroscopy (NIRS) for the prediction of moisture content and chemical parameters of winter annual forage crops. A population of 2,454 winter annual forages representing a wide range of chemical parameters was used in this study. Forage samples were scanned in intact fresh condition at 1 nm intervals over the wavelength range 680-2500 nm, and the optical data were recorded as log 1/Reflectance (log 1/R). The spectral data were regressed against a range of chemical parameters using partial least squares (PLS) multivariate analysis in conjunction with spectral math treatments to reduce the effect of extraneous noise. The optimum calibrations were selected based on the highest coefficients of determination in cross validation(
Accurate evaluation of sea-to-air
A numerical model that considers the shrinking core model and the elutriation and degradation of particles was developed to predict selective chlorination of ilmenite and carbo-chlorination of
The collapse of an Antarctic ice shelf and changes in its flow velocity have the potential to reduce the restraining stress on the seaward flow of the Antarctic Ice Sheet, which can cause sea level rise. In this study, variations in ice velocity from 2000 to 2017 on the Nansen Ice Shelf in East Antarctica, which experienced a large-scale collapse in April 2016, were analyzed using Landsat-7 Enhanced Thematic Mapper Plus (ETM+) and Landsat-8 Operational Land Imager (OLI) images. To extract ice velocity, image matching based on orientation correlation was applied to image pairs of the blue, green, red, near-infrared, and panchromatic bands and of the first principal component image of the Landsat multispectral data, and the results were combined. The Landsat multispectral image matching produced reliable ice velocities over an area of the Nansen Ice Shelf at least 14% wider than single-band (i.e., panchromatic) image matching. The ice velocities derived from the Landsat multispectral image matching have the error of
Persistent droughts due to climate change will intensify water shortage problems in Korea. According to the 1st National Water Management Plan, the shortage of domestic and industrial water is projected to be 0.07 billion m³/year under a 50-year drought event. Long-term prediction of water demand is essential for responding effectively to water shortage problems. Unlike industrial water, which has a relatively constant monthly usage, domestic water is analyzed on a monthly basis because of its apparent monthly usage patterns. We analyzed monthly water usage patterns using water usage data from 2017 to 2021 in Chungcheong, South Korea. The monthly water usage rate was calculated by dividing monthly water usage by annual water usage. We also calculated the water distribution rate, considering correlations between the water usage rate and climate variables. The division method, which divides the monthly water usage rate by the monthly average temperature, resulted in the smallest absolute error. Using the division method with average temperature, we calculated the water distribution rates for the Chungcheong region. We then predicted future water usage rates in the Chungcheong region by multiplying the average temperature of the SSP5-8.5 scenario by the water distribution rate. As a result, the average of the maximum water usage rate increased from 1.16 to 1.29 and the average of the minimum water usage rate decreased from 0.86 to 0.84, while the first quartile decreased from 0.95 to 0.93 and the third quartile increased from 1.04 to 1.06. Therefore, the variability in monthly water usage rates is expected to increase in the future.
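The arithmetic of the division method can be sketched in Python under several loudly stated assumptions. Because the reported usage rates lie near 1, the sketch assumes the rate is monthly usage relative to the mean monthly usage of the year; the abstract does not spell out this normalization. It also assumes strictly nonzero temperatures, since the abstract does not say how near-zero values are handled. All function names and numbers are hypothetical.

```python
def usage_rates(monthly_usage):
    """Monthly usage relative to the mean monthly usage (assumed normalization)."""
    mean_monthly = sum(monthly_usage) / len(monthly_usage)
    return [u / mean_monthly for u in monthly_usage]


def distribution_rates(rates, temperatures):
    """Division method: usage rate divided by the monthly average temperature."""
    if any(t == 0 for t in temperatures):
        raise ValueError("division method is undefined for a temperature of zero")
    return [r / t for r, t in zip(rates, temperatures)]


def project_rates(dist_rates, future_temperatures):
    """Future usage rate: scenario temperature times the distribution rate."""
    return [d * t for d, t in zip(dist_rates, future_temperatures)]


# Hypothetical values for four periods, for illustration only.
usage = [90.0, 100.0, 120.0, 90.0]          # water usage per period
temps_now = [5.0, 15.0, 25.0, 10.0]         # observed average temperature
temps_future = [6.0, 16.0, 27.0, 11.0]      # scenario average temperature

rates = usage_rates(usage)
dist = distribution_rates(rates, temps_now)
future = project_rates(dist, temps_future)

print("current rates:", [round(r, 3) for r in rates])
print("future rates: ", [round(r, 3) for r in future])
```

Under this formulation the projected rate is the current rate scaled by the ratio of future to current temperature, so periods with the largest warming show the largest change.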
Over a billion people around the world generate news minute by minute. Some news can be forecast, but most comes from unexpected events such as natural disasters, accidents, and crimes. People spend a great deal of time following the huge amount of news delivered by many media outlets because they want to understand what is happening now, predict what might happen in the near future, and share and discuss the news. People make better daily decisions by obtaining useful information from the news they see. However, it is difficult for people to choose news that suits them and to obtain useful information from it, because there are so many news media, such as portal sites and broadcasters, and most news articles consist of gossip and breaking news. User interest changes over time, and many people have no interest in outdated news. A personalized news service therefore needs to reflect users' recent interests, which means that it should manage user profiles dynamically. In this paper, a content-based news recommendation system is proposed to provide a personalized news service. A personalized service requires the user's personal information, and a social network service is used to extract it. The proposed system constructs a dynamic user profile based on recent user information from Facebook, one such social network service. User information contains personal information, recent articles, and Facebook Page information. Facebook Pages are used by businesses, organizations, and brands to share their content and connect with people. Facebook users can add a Facebook Page to indicate their interest in that Page. The proposed system uses this Page information to create the user profile and to match user preferences to news topics.
However, some Pages cannot be matched directly to a news topic, because a Page deals with an individual object and does not provide topic information suitable for news. Freebase, a large collaborative database of well-known people, places, and things, is used to match Pages to news topics through the hierarchy information of its objects. By using the recent Page information and articles of Facebook users, the proposed system can maintain a dynamic user profile. The generated user profile is used to measure user preferences for news. To generate a news profile, the news category predefined by the news media is used, and keywords of news articles are extracted after analyzing the news content, including title, category, and scripts. The TF-IDF technique, which reflects how important a word is to a document in a corpus, is used to identify the keywords of each news article. The user profile and the news profile use the same format so that the similarity between user preferences and news can be measured efficiently. The proposed system calculates the similarity values between all user profiles and news profiles. Existing methods of similarity calculation in the vector space model do not cover synonyms, hypernyms, and hyponyms, because they handle only the given words. The proposed system applies WordNet to the similarity calculation to overcome this limitation. The top-N news articles with the highest similarity values for a target user are recommended to that user. To evaluate the proposed news recommendation system, user profiles are generated from Facebook accounts with the participants' consent, and we implement a Web crawler to extract news information from PBS, a non-profit public broadcasting television network in the United States, and construct news profiles. We compare the performance of the proposed method with that of two benchmark algorithms. One is a traditional method based on TF-IDF. The other is the 6Sub-Vectors method, which divides the points to get keywords into six parts.
Experimental results demonstrate that the proposed system provides useful news to users by applying the user's social network information and WordNet functions, in terms of the prediction error of recommended news.
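The profile matching described above can be sketched with plain TF-IDF weights and cosine similarity. This minimal Python example corresponds to the traditional TF-IDF baseline only: the WordNet expansion and the Freebase mapping of the proposed system are not reproduced. The TF and IDF definitions (relative term frequency and natural-log inverse document frequency), the function names, the toy articles, and the user profile weights are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(documents):
    """documents: list of token lists. Returns one {term: weight} dict per document."""
    n = len(documents)
    df = Counter()
    for tokens in documents:
        df.update(set(tokens))          # document frequency of each term
    vectors = []
    for tokens in documents:
        tf = Counter(tokens)
        length = len(tokens) or 1       # guard against empty documents
        vectors.append({term: (count / length) * math.log(n / df[term])
                        for term, count in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def top_n(user_profile, news_vectors, n):
    """Indices of the n news profiles most similar to the user profile."""
    ranked = sorted(range(len(news_vectors)),
                    key=lambda i: cosine(user_profile, news_vectors[i]),
                    reverse=True)
    return ranked[:n]


# Hypothetical tokenized articles and user profile, for illustration only.
news = [
    ["election", "vote", "senate"],
    ["storm", "flood", "rescue"],
    ["vote", "poll", "budget"],
]
news_vectors = tf_idf_vectors(news)
user_profile = {"election": 1.0, "vote": 0.5}

recommended = top_n(user_profile, news_vectors, 2)
print("recommended article indices:", recommended)
```

Because the user profile and the news profiles share the same sparse term-weight format, one similarity function serves both sides; terms that never co-occur, such as synonyms, contribute nothing here, which is the limitation the WordNet step is meant to address.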
The wall shear stress in the vicinity of end-to-end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer (FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor in the formation of anastomotic neointimal fibrous hyperplasia (ANFH) and in graft failure. The present study suggests a correlation between regions of low wall shear stress and the development of ANFH in end-to-end anastomoses.
Air pressure decay (APD) rate and ultrafiltration rate (UFR) tests were performed on new and saline-rinsed dialyzers as well as those reused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline-rinsed and reused dialyzers showed a considerable amount of decay. C-DAK dialyzers had a larger APD(11.70