• Title/Summary/Keyword: ranking model

Search Result 395, Processing Time 0.025 seconds

The MeSH-Term Query Expansion Models using LDA Topic Models in Health Information Retrieval (MeSH 기반의 LDA 토픽 모델을 이용한 검색어 확장)

  • You, Sukjin
    • Journal of Korean Library and Information Science Society
    • /
    • v.52 no.1
    • /
    • pp.79-108
    • /
    • 2021
  • Information retrieval in the health field has several challenges. Health information terminology is difficult for consumers (laypeople) to understand. Formulating a query with professional terms is not easy for consumers because health-related terms are more familiar to health professionals. If health terms related to a query are automatically added, it would help consumers to find relevant information. The proposed query expansion (QE) models show how to expand a query using MeSH terms. The documents were represented by MeSH terms (i.e. Bag-of-MeSH), found in the full-text articles. And then the MeSH terms were used to generate LDA (Latent Dirichlet Analysis) topic models. A query and the top k retrieved documents were used to find MeSH terms as topic words related to the query. LDA topic words were filtered by threshold values of topic probability (TP) and word probability (WP). Threshold values were effective in an LDA model with a specific number of topics to increase IR performance in terms of infAP (inferred Average Precision) and infNDCG (inferred Normalized Discounted Cumulative Gain), which are common IR metrics for large data collections with incomplete judgments. The top k words were chosen by the word score based on (TP *WP) and retrieved document ranking in an LDA model with specific thresholds. The QE model with specific thresholds for TP and WP showed improved mean infAP and infNDCG scores in an LDA model, comparing with the baseline result.

A Study on the Asia Container Ports Clustering Using Hierarchical Clustering(Single, Complete, Average, Centroid Linkages) Methods with Empirical Verification of Clustering Using the Silhouette Method and the Second Stage(Type II) Cross-Efficiency Matrix Clustering Model (계층적 군집분석(최단, 최장, 평균, 중앙연결)방법에 의한 아시아 컨테이너 항만의 클러스터링 측정 및 실루엣방법과 2단계(Type II) 교차효율성 메트릭스 군집모형을 이용한 실증적 검증에 관한 연구)

  • Park, Ro-Kyung
    • Journal of Korea Port Economic Association
    • /
    • v.37 no.1
    • /
    • pp.31-70
    • /
    • 2021
  • The purpose of this paper is to measure the clustering change and analyze empirical results, and choose the clustering ports for Busan, Incheon, and Gwangyang ports by using Hierarchical clustering(single, complete, average, and centroid), Silhouette, and 2SCE[the Second Stage(Type II) cross-efficiency] matrix clustering models on Asian container ports over the period 2009-2018. The models have chosen number of cranes, depth, birth length, and total area as inputs and container TEU as output. The main empirical results are as follows. First, ranking order according to the efficiency increasing ratio during the 10 years analysis shows Silhouette(0.4052 up), Hierarchical clustering(0.3097 up), and 2SCE(0.1057 up). Second, according to empirical verification of the Silhouette and 2SCE models, 3 Korean ports should be clustered with ports like Busan Port[ Dubai, Hong Kong, and Tanjung Priok], and Incheon Port and Gwangyang Port are required to cluster with most ports. Third, in terms of the ASEAN, it would be good to cluster like Busan (Singapore), Incheon Port (Tanjung Priok, Tanjung Perak, Manila, Tanjung Pelpas, Leam Chanbang, and Bangkok), and Gwangyang Port(Tanjung Priok, Tanjung Perak, Port Kang, Tanjung Pelpas, Leam Chanbang, and Bangkok). Third, Wilcoxon's signed-ranks test of models shows that all P values are significant at an average level of 0.852. It means that the average efficiency figures and ranking orders of the models are matched each other. The policy implication is that port policy makers and port operation managers should select benchmarking ports by introducing the models used in this study into the clustering of ports, compare and analyze the port development and operation plans of their ports, and introduce and implement the parts which required benchmarking quickly.

A Study on the Model Development and Empirical Application for Measuring the Radial and Non-radial Efficiencies of Investment in Domestic Seaports (국내항만투자의 방사.비방사적 효율성 측정을 위한 모형개발 및 실증적 적용에 관한 연구)

  • Park, Ro-Kyung
    • Journal of Korea Port Economic Association
    • /
    • v.27 no.1
    • /
    • pp.185-212
    • /
    • 2011
  • The purpose of this paper is to show the empirical analysis way for measuring the seaport efficiency by using the previous radial model and the newly modified non-radial models( panel additive model, panel RAM model, and panel SBM model)with Spearman rank order correlation coefficient(SROCC) for 20 Korean ports during 11 years(1997-2007) for 1 inputs(port investment amount) and 4 outputs(Number of Ship Calls, Port Revenue, Customer Satisfaction Score for Port Service and Container Cargo Throughput). The main empirical results of this paper are as follows. First, consistency ratio of SROCC in terms of efficiency scores between radial and panel Additive model was over about 76% and overall consistency ratio was about 71.6%. Second, an efficiency of panel RAM model was higher than that of radial model with similarity. However, panel SBM model shows the very similar efficiency scores with panel radial model. Third, the slack size of radial model is smaller compared to non-radial model. Models' ranking orders in terms of efficiency scores, number of efficient ports are panel RAM model, panel SBM model, and radial model. The order from the minimum efficiency scores was the same order like just before. The policy implication to the Korean seaports and planner is that Korean seaports should introduce the new methods like non-radial models(panel additive model, panel RAM model, and panel SBM model) for measuring the port performance.

Automatic Recommendation of (IP)TV programs based on A Rank Model using Collaborative Filtering (협업 필터링을 이용한 순위 정렬 모델 기반 (IP)TV 프로그램 자동 추천)

  • Kim, Eun-Hui;Pyo, Shin-Jee;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.14 no.2
    • /
    • pp.238-252
    • /
    • 2009
  • Due to the rapid increase of available contents via the convergence of broadcasting and internet, the efficient access to personally preferred contents has become an important issue. In this paper, for recommendation scheme for TV programs using a collaborative filtering technique is studied. For recommendation of user preferred TV programs, our proposed recommendation scheme consists of offline and online computation. About offline computation, we propose reasoning implicitly each user's preference in TV programs in terms of program contents, genres and channels, and propose clustering users based on each user's preferences in terms of genres and channels by dynamic fuzzy clustering method. After an active user logs in, to recommend TV programs to the user with high accuracy, the online computation includes pulling similar users to an active user by similarity measure based on the standard preference list of active user and filtering-out of the watched TV programs of the similar users, which do not exist in EPG and ranking of the remaining TV programs by proposed rank model. Especially, in this paper, the BM (Best Match) algorithm is extended to make the recommended TV programs be ranked by taking into account user's preferences. The experimental results show that the proposed scheme with the extended BM model yields 62.1% of prediction accuracy in top five recommendations for the TV watching history of 2,441 people.

Analysis of Temperature and Probability Distribution Model of Frozen Storage Warehouses in South Korea (국내 식품냉동창고 온도분포 실태 및 확률분포모델 분석)

  • Park, Myoung-Su;Kim, Ga-Ram;Bahk, Gyung-Jin
    • Journal of Food Hygiene and Safety
    • /
    • v.34 no.2
    • /
    • pp.199-204
    • /
    • 2019
  • This study aimed to generate a probability distribution model based on temperature data of frozen food storage facility as input variables for microbial risk assessment (MRA). We visited 8 food-handling businesses to collect temperature data from their cold storage warehouses. The overall mean temperature inside the storage facilities was $-20.48{\pm}3.08^{\circ}C$, with 20.4% of the facilities having above $-18^{\circ}C$, with minimum and maximum temperature values of -10.3 and $-25.80^{\circ}C$ respectively. Temperature distributions by space locations of natural and forced convection were $-22.57{\pm}0.84$ and $-17.81{\pm}1.47^{\circ}C$, $-22.49{\pm}1.05$ and $-17.94{\pm}1.44^{\circ}C$, and $-22.68{\pm}1.03$ and $-18.08{\pm}1.42^{\circ}C$ in the upper (2.4~4 m), middle (1.5~2.4 m), and lower (0.7~1.5 m) shelves, respectively. Probability distributions from the temperature data were obtained using the program @RISK. Statistical ranking was determined using goodness of fit to determine the probability distribution model. Our results show that a log-normal distribution [5.9731, 3.3483, shift (-26.4281)] is most appropriate for relative MRA conduction.

A Study on the Effect of Network Centralities on Recommendation Performance (네트워크 중심성 척도가 추천 성능에 미치는 영향에 대한 연구)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.23-46
    • /
    • 2021
  • Collaborative filtering, which is often used in personalization recommendations, is recognized as a very useful technique to find similar customers and recommend products to them based on their purchase history. However, the traditional collaborative filtering technique has raised the question of having difficulty calculating the similarity for new customers or products due to the method of calculating similaritiesbased on direct connections and common features among customers. For this reason, a hybrid technique was designed to use content-based filtering techniques together. On the one hand, efforts have been made to solve these problems by applying the structural characteristics of social networks. This applies a method of indirectly calculating similarities through their similar customers placed between them. This means creating a customer's network based on purchasing data and calculating the similarity between the two based on the features of the network that indirectly connects the two customers within this network. Such similarity can be used as a measure to predict whether the target customer accepts recommendations. The centrality metrics of networks can be utilized for the calculation of these similarities. Different centrality metrics have important implications in that they may have different effects on recommended performance. In this study, furthermore, the effect of these centrality metrics on the performance of recommendation may vary depending on recommender algorithms. In addition, recommendation techniques using network analysis can be expected to contribute to increasing recommendation performance even if they apply not only to new customers or products but also to entire customers or products. By considering a customer's purchase of an item as a link generated between the customer and the item on the network, the prediction of user acceptance of recommendation is solved as a prediction of whether a new link will be created between them. As the classification models fit the purpose of solving the binary problem of whether the link is engaged or not, decision tree, k-nearest neighbors (KNN), logistic regression, artificial neural network, and support vector machine (SVM) are selected in the research. The data for performance evaluation used order data collected from an online shopping mall over four years and two months. Among them, the previous three years and eight months constitute social networks composed of and the experiment was conducted by organizing the data collected into the social network. The next four months' records were used to train and evaluate recommender models. Experiments with the centrality metrics applied to each model show that the recommendation acceptance rates of the centrality metrics are different for each algorithm at a meaningful level. In this work, we analyzed only four commonly used centrality metrics: degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality. Eigenvector centrality records the lowest performance in all models except support vector machines. Closeness centrality and betweenness centrality show similar performance across all models. Degree centrality ranking moderate across overall models while betweenness centrality always ranking higher than degree centrality. Finally, closeness centrality is characterized by distinct differences in performance according to the model. It ranks first in logistic regression, artificial neural network, and decision tree withnumerically high performance. However, it only records very low rankings in support vector machine and K-neighborhood with low-performance levels. As the experiment results reveal, in a classification model, network centrality metrics over a subnetwork that connects the two nodes can effectively predict the connectivity between two nodes in a social network. Furthermore, each metric has a different performance depending on the classification model type. This result implies that choosing appropriate metrics for each algorithm can lead to achieving higher recommendation performance. In general, betweenness centrality can guarantee a high level of performance in any model. It would be possible to consider the introduction of proximity centrality to obtain higher performance for certain models.

A Regression-Model-based Method for Combining Interestingness Measures of Association Rule Mining (연관상품 추천을 위한 회귀분석모형 기반 연관 규칙 척도 결합기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.127-141
    • /
    • 2017
  • Advances in Internet technologies and the proliferation of mobile devices enabled consumers to approach a wide range of goods and services, while causing an adverse effect that they have hard time reaching their congenial items even if they devote much time to searching for them. Accordingly, businesses are using the recommender systems to provide tools for consumers to find the desired items more easily. Association Rule Mining (ARM) technology is advantageous to recommender systems in that ARM provides intuitive form of a rule with interestingness measures (support, confidence, and lift) describing the relationship between items. Given an item, its relevant items can be distinguished with the help of the measures that show the strength of relationship between items. Based on the strength, the most pertinent items can be chosen among other items and exposed to a given item's web page. However, the diversity of the measures may confuse which items are more recommendable. Given two rules, for example, one rule's support and confidence may not be concurrently superior to the other rule's. Such discrepancy of the measures in distinguishing one rule's superiority from other rules may cause difficulty in selecting proper items for recommendation. In addition, in an online environment where a web page or mobile screen can provide a limited number of recommendations that attract consumer interest, the prudent selection of items to be included in the list of recommendations is very important. The exposure of items of little interest may lead consumers to ignore the recommendations. Then, such consumers will possibly not pay attention to other forms of marketing activities. Therefore, the measures should be aligned with the probability of consumer's acceptance of recommendations. For this reason, this study proposes a model-based approach to combine those measures into one unified measure that can consistently determine the ranking of recommended items. A regression model was designed to describe how well the measures (independent variables; i.e., support, confidence, and lift) explain consumer's acceptance of recommendations (dependent variables, hit rate of recommended items). The model is intuitive to understand and easy to use in that the equation consists of the commonly used measures for ARM and can be used in the estimation of hit rates. The experiment using transaction data from one of the Korea's largest online shopping malls was conducted to show that the proposed model can improve the hit rates of recommendations. From the top of the list to 13th place, recommended items in the higher rakings from the proposed model show the higher hit rates than those from the competitive model's. The result shows that the proposed model's performance is superior to the competitive model's in online recommendation environment. In a web page, consumers are provided around ten recommendations with which the proposed model outperforms. Moreover, a mobile device cannot expose many items simultaneously due to its limited screen size. Therefore, the result shows that the newly devised recommendation technique is suitable for the mobile recommender systems. While this study has been conducted to cover the cross-selling in online shopping malls that handle merchandise, the proposed method can be expected to be applied in various situations under which association rules apply. For example, this model can be applied to medical diagnostic systems that predict candidate diseases from a patient's symptoms. To increase the efficiency of the model, additional variables will need to be considered for the elaboration of the model in future studies. For example, price can be a good candidate for an explanatory variable because it has a major impact on consumer purchase decisions. If the prices of recommended items are much higher than the items in which a consumer is interested, the consumer may hesitate to accept the recommendations.

An Analysis on Correlation of General Manager's Leadership of Hotel Employees - Based on Seoul's 2nd Deluxe Hotel - (호텔 총지배인리더십의 종사원 관리에 관한 중요도 분석 - 서울 특 2급 호텔을 중심으로 -)

  • Nam, Taeg-Yeong
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.10
    • /
    • pp.287-300
    • /
    • 2008
  • The purpose of this study is to show the optimal leadership model based on general manager management that is related to hotel employees, which may then be applied to ideal hotel management in Korea. The variables analyzed in this study are employee-management relations from the lowest ranking employee to the highest level of management, with communication and job understanding held common among the subjects. This study looks in detail at four main concepts. Firstly, the employees are encouraged to put forth their best efforts through the standardization of the requirements and regulations of the job and ensuring product suitability. Secondly, the cultivation of new hotel culture empowers the employees. The third point taken into account is the encouragement of employees to take pride in the work they do, which is done through a kind of reward system and salary. The final point considered is the empowerment of employees, which results in employee commitment to the job.

A Study on the Evaluation of Foreign Science and Technology Serials with Reference to the Result of Interlibrary Loan Activity (상호대차 활성화에 따른 대학도서관 이.공계열 외국학술지의 평가에 관한 연구)

  • 이창수;김신영
    • Journal of the Korean Society for information Management
    • /
    • v.19 no.1
    • /
    • pp.71-88
    • /
    • 2002
  • The purpose of this use study is to evaluate foreign science and technology serials with reference to the result of interlibrary loan activity and to present the model of cooperative acquisition. This study was based upon an analysis of actual use data by library users and Interlibrary loan from March 1, 2000 to February 28, 2001. 603 titles of foreign serials which were composed of 459 subscription titles of I University Library and 144 cooperative acquisition titles in the fields of science and technology were analyzed. The study reveals that only 439 serials(72.8%) were used even more once and 164 serials were not used at all during 1 year interval. A relationship was found between rankings of serials as measured by use and JCR citation ranking. Based upon various aspects of use analysis, this study suggested effective techniques for managing academic serials.

Fuzzy TOPSIS Approach to Flood Vulnerability Assessment in Korea (우리나라 홍수 취약성 평가를 위한 Fuzzy TOPSIS 접근법)

  • Kim, Yeong-Kyu;Chung, Eun-Sung;Lee, Kil-Seong
    • Journal of Korea Water Resources Association
    • /
    • v.45 no.9
    • /
    • pp.901-913
    • /
    • 2012
  • This study will be a new attempt to quantify flood vulnerability taking into account uncertainty. Information obtained from the real world has lots of uncertainties. Therefore, this study developed an approach to quantify spatial flood vulnerability of Korea using Fuzzy TOPSIS approach. Also, Fuzzy TOPSIS were compared with TOPSIS and weighted sum method. As a result, rankings of some areas were changed dramatically due to the uncertainty. Spearman rank correlation analysis indicated that the rankings of TOPSIS and weighted sum method were almost similar, but quite different from ranking of Fuzzy TOPSIS. In other words, because applying Fuzzy concept in regional vulnerability assessment may cause a significant change in priorities, the model presented in this study may be a method of vulnerability assessment.