• Title/Summary/Keyword: document frequency

Search Result 302, Processing Time 0.019 seconds

A Folksonomy Ranking Framework: A Semantic Graph-based Approach (폭소노미 사이트를 위한 랭킹 프레임워크 설계: 시맨틱 그래프기반 접근)

  • Park, Hyun-Jung;Rho, Sang-Kyu
    • Asia pacific journal of information systems
    • /
    • v.21 no.2
    • /
    • pp.89-116
    • /
    • 2011
  • In collaborative tagging systems such as Delicious.com and Flickr.com, users assign keywords or tags to their uploaded resources, such as bookmarks and pictures, for their future use or sharing purposes. The collection of resources and tags generated by a user is called a personomy, and the collection of all personomies constitutes the folksonomy. The most significant need of the folksonomy users Is to efficiently find useful resources or experts on specific topics. An excellent ranking algorithm would assign higher ranking to more useful resources or experts. What resources are considered useful In a folksonomic system? Does a standard superior to frequency or freshness exist? The resource recommended by more users with mere expertise should be worthy of attention. This ranking paradigm can be implemented through a graph-based ranking algorithm. Two well-known representatives of such a paradigm are Page Rank by Google and HITS(Hypertext Induced Topic Selection) by Kleinberg. Both Page Rank and HITS assign a higher evaluation score to pages linked to more higher-scored pages. HITS differs from PageRank in that it utilizes two kinds of scores: authority and hub scores. The ranking objects of these pages are limited to Web pages, whereas the ranking objects of a folksonomic system are somewhat heterogeneous(i.e., users, resources, and tags). Therefore, uniform application of the voting notion of PageRank and HITS based on the links to a folksonomy would be unreasonable, In a folksonomic system, each link corresponding to a property can have an opposite direction, depending on whether the property is an active or a passive voice. The current research stems from the Idea that a graph-based ranking algorithm could be applied to the folksonomic system using the concept of mutual Interactions between entitles, rather than the voting notion of PageRank or HITS. The concept of mutual interactions, proposed for ranking the Semantic Web resources, enables the calculation of importance scores of various resources unaffected by link directions. The weights of a property representing the mutual interaction between classes are assigned depending on the relative significance of the property to the resource importance of each class. This class-oriented approach is based on the fact that, in the Semantic Web, there are many heterogeneous classes; thus, applying a different appraisal standard for each class is more reasonable. This is similar to the evaluation method of humans, where different items are assigned specific weights, which are then summed up to determine the weighted average. We can check for missing properties more easily with this approach than with other predicate-oriented approaches. A user of a tagging system usually assigns more than one tags to the same resource, and there can be more than one tags with the same subjectivity and objectivity. In the case that many users assign similar tags to the same resource, grading the users differently depending on the assignment order becomes necessary. This idea comes from the studies in psychology wherein expertise involves the ability to select the most relevant information for achieving a goal. An expert should be someone who not only has a large collection of documents annotated with a particular tag, but also tends to add documents of high quality to his/her collections. Such documents are identified by the number, as well as the expertise, of users who have the same documents in their collections. In other words, there is a relationship of mutual reinforcement between the expertise of a user and the quality of a document. In addition, there is a need to rank entities related more closely to a certain entity. Considering the property of social media that ensures the popularity of a topic is temporary, recent data should have more weight than old data. We propose a comprehensive folksonomy ranking framework in which all these considerations are dealt with and that can be easily customized to each folksonomy site for ranking purposes. To examine the validity of our ranking algorithm and show the mechanism of adjusting property, time, and expertise weights, we first use a dataset designed for analyzing the effect of each ranking factor independently. We then show the ranking results of a real folksonomy site, with the ranking factors combined. Because the ground truth of a given dataset is not known when it comes to ranking, we inject simulated data whose ranking results can be predicted into the real dataset and compare the ranking results of our algorithm with that of a previous HITS-based algorithm. Our semantic ranking algorithm based on the concept of mutual interaction seems to be preferable to the HITS-based algorithm as a flexible folksonomy ranking framework. Some concrete points of difference are as follows. First, with the time concept applied to the property weights, our algorithm shows superior performance in lowering the scores of older data and raising the scores of newer data. Second, applying the time concept to the expertise weights, as well as to the property weights, our algorithm controls the conflicting influence of expertise weights and enhances overall consistency of time-valued ranking. The expertise weights of the previous study can act as an obstacle to the time-valued ranking because the number of followers increases as time goes on. Third, many new properties and classes can be included in our framework. The previous HITS-based algorithm, based on the voting notion, loses ground in the situation where the domain consists of more than two classes, or where other important properties, such as "sent through twitter" or "registered as a friend," are added to the domain. Forth, there is a big difference in the calculation time and memory use between the two kinds of algorithms. While the matrix multiplication of two matrices, has to be executed twice for the previous HITS-based algorithm, this is unnecessary with our algorithm. In our ranking framework, various folksonomy ranking policies can be expressed with the ranking factors combined and our approach can work, even if the folksonomy site is not implemented with Semantic Web languages. Above all, the time weight proposed in this paper will be applicable to various domains, including social media, where time value is considered important.

A Study on Serum Lipid Levels of Elderly People in Wando Area -Based on Dietary Behaviors- (완도지역 중·장년층의 혈중지질 수준에 관한 연구 -식행동을 중심으로-)

  • Kim, Eun-Ju;Cha, Bok-Kyeong
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.36 no.9
    • /
    • pp.1148-1160
    • /
    • 2007
  • This study was performed to document the association between eating behaviors and major risk factors for cardiovascular diseases. Those who stated that they ate a daily breakfast for male and female were 88.6% and 96.0%, $1{\sim}2$ times a week were 9.1% and 2.0%, $3{\sim}4$ times a week were 2.3% and 2.0%, respectively. Those who stated that they were overeating of $0{\sim}1$ time a week for man and female were 80.7% and 89.9%, overeating of $2{\sim}3$ times a week were 19.3% and 10.1%, respectively. Those who said that they were regular of meal time for man and female were 38.6%, and 37.4%, sometimes irregular of meal time were 14.8% and 19.2%, irregular of meal time were 46.6% and 43.4%, respectively. Those who said that they were light of eating volume for man and female were 20.5% and 25.3%, moderate of eating volume were 69.3% and 61.6%, heavy of eating volume were 10.2% and 13.1%, respectively. Those who said that they were very fast of eating speed for man and female were 15.9% and 8.1%, fast of eating speed were 51.1% and 34.3%, moderate of eating speed were 4.5% and 20.2%, slow of eating speed were 17.0% and 14.1%, and very slow of eating speed were 11.4% and 23.2%, respectively. Higher frequency of breakfast a week resulted in higher serum total cholesterol and blood sugar for the daily eating group for both genders with women having high LDL-cholesterol and HDL-cholesterol. Both group had high HDL-cholesterol and low blood sugar with less number of overeating, with men having low triglyceride and LDL-cholesterol. With regular meal, both group had low triglyceride, total-cholesterol, atherogenic index, and blood sugar with women having low LDL-cholesterol. For both groups, the triglyceride, total-cholesterol, LDL-cholesterol, atherogenic index, and blood sugar had higher figures for overeating, with men having low HDL-cholesterol and women having high HDL-cholesterol. This study revealed that less number of overeating, regular mealtime, and less volume of food intake are effective in preventing and treating for the cardiovascular diseases.