• Title/Summary/Keyword: Keyword weight

Search Result 61, Processing Time 0.024 seconds

A Keyword Network Analysis on Obesity Research Trends in Korea: Focusing on keywords co-occured of 'Obesity' and 'Physical Education'

  • Kim, Woo-Kyung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.1
    • /
    • pp.151-158
    • /
    • 2019
  • This study aimed to analyze the research trend related on obesity in physical education in Korea through the keyword network analysis and to establish a basic database for effective design of prospective studies. To achieve it the study crawled co-occured keywords with 'obesity' and 'physical education' from RISS and analyzed the list from 1990 to 2018. They include 25 journal papers and 38 dissertations. The results are as follows. First, recent 30 years 63 papers published in Korea with 'Obesity' and 'Physical Education', and there were 144 related keywords. Second, analyzing journals which have 'Obesity' and 'Physical Education', co-occured keywords in 4 centrality were 24 keywords(student, Korea, prevention, effect, level, body, activation, actual condition, lesson, child, investigation, participation, book, cause, activity, normal, degree, nutrition, physical strength, weight, elementary, light, inquiry, health), and 37 keyword occurred in top 30. Lastly, by CONCOR analysis the result could be divided into 2 clusters. One consists of the object of obesity and its invervention, and the other consists of negative keywords of obesity and its preliminery dimenstion. Through the result, this study showed the research trend which involves the concept of obesity in physical education in Korea. Through the result, prospective obesity research in physical education in Korea would be promoted.

Understanding of Structural Changes of Keyword Networks in the Computer Engineering Field (컴퓨터공학 분야 키워드네트워크의 구조적 변화 이해)

  • Kwon, Yung-Keun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.3
    • /
    • pp.187-194
    • /
    • 2013
  • Recently, there have been many trials to analyze characteristics of research trends through a structural analysis of keyword networks in various fields. However, most previous studies have mainly focused on structural analysis harbored in some static networks and there is a lack of research on changes of such networks structure with time. In this paper, we constructed annual keyword networks by using a database of papers published in the international computer engineering-field journals from 2002 through 2011, and examined the changes of them. As a result, it was shown that most keywords in a network are preserved in the network of the next year, and their degree of connectivity and the average weight of the connections were higher and smaller, respectively, than those of the keywords which are not preserved. In addition, when a keyword network shifted to one of the next year, the connections between keywords were more likely to be removed than preserved, and the average weight of the removal connections was higher than that of the preserved ones. These results imply that the keywords are not changed over time but their connections are very likely to be changed; and there is apparent differences between the preserved and removal groups of keywords/connections with respect to degree and weights of connections. All these results are consistently observed over the ten-year datasets and they can be important principles in understanding the structural changes of the keyword networks.

A Design and Implementation of a Content_Based Image Retrieval System using Color Space and Keywords (칼라공간과 키워드를 이용한 내용기반 화상검색 시스템 설계 및 구현)

  • Kim, Cheol-Ueon;Choi, Ki-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.6
    • /
    • pp.1418-1432
    • /
    • 1997
  • Most general content_based image retrieval techniques use color and texture as retrieval indices. In color techniques, color histogram and color pair based color retrieval techniques suffer from a lack of spatial information and text. And This paper describes the design and implementation of content_based image retrieval system using color space and keywords. The preprocessor for image retrieval has used the coordinate system of the existing HSI(Hue, Saturation, Intensity) and preformed to split One image into chromatic region and achromatic region respectively, It is necessary to normalize the size of image for 200*N or N*200 and to convert true colors into 256 color. Two color histograms for background and object are used in order to decide on color selection in the color space. Spatial information is obtained using a maximum entropy discretization. It is possible to choose the class, color, shape, location and size of image by using keyword. An input color is limited by 15 kinds keyword of chromatic and achromatic colors of the Korea Industrial Standards. Image retrieval method is used as the key of retrieval properties in the similarity. The weight values of color space ${\alpha}(%)and\;keyword\;{\beta}(%)$ can be chosen by the user in inputting the query words, controlling the values according to the properties of image_contents. The result of retrieval in the test using extracted feature such as color space and keyword to the query image are lower that those of weight value. In the case of weight value, the average of te measuring parameters shows approximate Precision(0.858), Recall(0.936), RT(1), MT(0). The above results have proved higher retrieval effects than the content_based image retrieval by using color space of keywords.

  • PDF

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful for them. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide the users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role for document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask the authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could not benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle; manually assigning keywords to all documents is a daunting task, or even impractical in that it is extremely tedious and time-consuming requiring a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are mainly two approaches to achieving this aim: keyword assignment approach and keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given set of vocabulary, and the aim is to match them to the texts. In other words, the keywords assignment approach seeks to select the words from a controlled vocabulary that best describes a document. Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. On the other hand, in the latter approach, the aim is to extract keywords with respect to their relevance in the text without prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted based on supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems such as Extractor and Kea were developed using keyword extraction approach. Most indicative words in a document are selected as keywords for that document and as a result, keywords extraction is limited to terms that appear in the document. Therefore, keywords extraction cannot generate implicit keywords that are not included in a document. According to the experiment results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Inversely, it also means that 10% to 36% of the keywords assigned by the authors do not appear in the article, which cannot be generated through keyword extraction algorithms. Our preliminary experiment result also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment namely IVSM(Inverse Vector Space Model). The model is based on a vector space model. which is a conventional information retrieval model that represents documents and queries by vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: IVSM system for Web-based community service and stand-alone IVSM system. Firstly, the IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and, indeed, it has been tested through a number of academic papers including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to Web-based community service and academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows comparable performance to Extractor that is a representative system of keyword extraction approach developed by Turney. As electronic documents increase, we expect that IVSM proposed in this paper can be applied to many electronic documents in Web-based community and digital library.

Co-author.Keyword Network and its Two Culture Appearance in Health Policy Fields in Korea: Analysis of articles in the Korean Journal of Health Policy and Administration, 1991~2006 (국내 보건학 분야 학술활동의 군집화와 '두 문화' 현상 - 보건행정학회지(1991~2006) 게재논문의 공저자 네트워크 분석 -)

  • Jung, Min-Soo;Chung, Dong-Jun
    • Health Policy and Management
    • /
    • v.18 no.2
    • /
    • pp.86-106
    • /
    • 2008
  • This research analyzed. knowledge structure and its effect factor by analysis of co-author and keyword network in Korea's health policy and administration sector. The data was extracted from 339 articles listed in the Korean Journal of Health Policy and Administration, and was transformed into a co-author and keyword matrix. In this matrix the existence of a link was defined by impact factors which were calculated by the weight value of what the role was and the rate of how many authors contributed. We demonstrated that the research achievement was dependent on the author's status and network index. Analysis methods were neighborhood degree, correspondence analysis, multiple regression and the difference of weight distribution by research fields. Co-author networks were developed as closeness centrality as well as degree centrality by a few high productivity researchers. In particular, power law distribution was discovered in impact factor and research productivity. The effect of the author's role was significant in both the impact factor calculated by the participatory rate and the number of listed articles. Especially, this journal shared its major researchers who had a licensed physician with the Journal of Preventive Medicine and Public Health. Therefore, social scientists were likely to be small co-author network differently from natural scientists. It was so called 'two cultures' phenomenon. This study showed how can we verified academic research structure existed in the unit of journal like as citation networks. The co-author networks in the field of health policy and administration had more differentiated and clustered than preventive medicine and epidemiology fields.

Tendency and Network Analysis of Diet Using Big Data (빅데이터를 활용한 다이어트 현황 및 네트워크 분석)

  • Jung, Eun-Jin;Chang, Un-Jae
    • Journal of the Korean Dietetic Association
    • /
    • v.22 no.4
    • /
    • pp.310-319
    • /
    • 2016
  • Limitation of a questionnaire survey which is widely used is time and money, limited numbers of participants, biased confidence interval and unreliable results. To overcome these, we performed tendency and network analysis of diet using big Data in Koreans. The keyword on diet were collected from the portal site Naver from January 1, 2015 until December 31, 2015 and collected data were analyzed by simple frequency analysis, N-gram analysis, keyword network analysis and seasonality analysis. The results showed that diet menu appeared most frequently by N-gram analysis, even though exercise had the highest frequency by simple frequency analysis. In addition, keyword network analysis were categorized into four groups: diet group, exercise group, commercial diet program company group and commercial diet food group. The analysis of seasonality showed that subjects' interests in diet had increased steadily since February, 2015, although subjects were most interested indiet in July, these results suggest that the best strategies for weight loss are based on diet menu and starting diet before July. As people are especially sensitive to diet trends, researches are needed about annual analysis of big data.

Keyword Weight based Paragraph Extraction Algorithm (문단 가중치 분석 기반 본문 영역 선정 알고리즘)

  • Lee, Jongwon;Yu, Seongjong;Kim, Doan;Jung, Hoekyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.05a
    • /
    • pp.462-463
    • /
    • 2018
  • Traditional document analysis systems used word-based analysis using a morphological analyzer or TF-IDF technique. These systems have the advantage of being able to derive key keywords by calculating the weights of the keywords. On the other hand, it is not appropriate to analyze the contents of documents due to the structural limitations. To solve this problem, the proposed algorithm calculates the weights of the documents in the document and divides the paragraphs into areas. And we calculate the importance of the divided regions and let the user know the area with the most important paragraphs in the document. So, it is expected that the user will be provided with a service suitable for analyzing documents rather than using existing document analysis systems.

  • PDF

Analysis of Research Trends in Information Literacy Education Using Keyword Network Analysis and Topic Modeling (키워드 네트워크 분석과 토픽모델링을 활용한 정보활용교육 연구 동향 분석)

  • Jeong-Hoon, Lim
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.4
    • /
    • pp.23-48
    • /
    • 2022
  • The purpose of this study is to investigate the flow of domestic information literacy education research using keyword network analysis and topic modeling and to explore the direction of information literacy education in the future. For this reason, 306 academic papers related to information literacy education published in academic journals of the library and information science field in Korea were chosen. And through the preprocessing process for abstracts of the paper, total keyword appearance frequency, keyword appearance frequency by period, and keyword simultaneous occurrence frequency were analyzed. Subsequently, keyword network analysis analyzed the degree centrality, between centrality, and eigenvector centrality of keywords. Using structural topic modeling analysis, 15 topics -curriculum, information literacy effect, contents of information literacy education, school library education, information media literacy, information literacy ability evaluation index, library anxiety, public library program, health information literacy ability, digital divide, library assisted instruction improvement, research trend, information literacy model, and teacher role-were derived. In addition, the trend of topics by year was analyzed to confirm the change in relative weight by topic. Based on these results, the direction of information literacy education and the suggestions for follow-up research were presented.

Comparison and Analysis of Dieting Practices Using Big Data from 2010 and 2015 (빅데이터를 통한 2010년과 2015년의 다이어트 실태 비교 및 분석)

  • Jung, Eun-Jin;Chang, Un-Jae
    • Korean Journal of Community Nutrition
    • /
    • v.23 no.2
    • /
    • pp.128-136
    • /
    • 2018
  • Objectives: The purpose of this study was to compare and analyse dieting practices and tendencies in 2010 and 2015 using big data. Methods: Keywords related to diet were collected from the portal site Naver from January 1, 2010 until December 31, 2010 for 2010 data and from January 1, 2015 until December 31, 2015 for 2015 data. Collected data were analyzed by simple frequency analysis, N-gram analysis, keyword network analysis, and seasonality analysis. Results: The results show that exercise had the highest frequency in simple frequency analysis in both years. However, weight reduction in 2010 and diet menu in 2015 appeared most frequently in N-gram analysis. In addition, keyword network analysis was categorized into three groups in 2010 (diet group, exercise group, and commercial weight control group) and four groups in 2015 (diet group, exercise group, commercial program for weight control group, and commercial food for weight control group). Analysis of seasonality showed that subjects' interests in diets increased steadily from February to July, although subjects were most interested in diets in July in both years. Conclusions: In this study, the number of data in 2015 steadily increased compared with 2010, and diet grouping could be further subdivided. In addition, it can be confirmed that a similar pattern appeared over a one-year cycle in 2010 and 2015. Therefore, dietary method is reflected in society, and it changes according to trends.

A Study on the Decision Model Agent System based on the Customer기s Preference in Electronic Commerce (전자상거래에서 고객선호기반의 의사결정모델 에이전트 시스템에 관한 연구)

  • 황현숙;어윤양
    • The Journal of Information Systems
    • /
    • v.8 no.2
    • /
    • pp.91-110
    • /
    • 1999
  • Recently, searching agent systems to help purchase of products between business and customer have been actively studied in Electronic Commerce(EC). However, the most of comparative searching agent systems are only provided customers with searching results by the keyword-based search, and is not support the efficient decision models to be selected products considering the customer's requirements. This paper proposes the decision agent system applied decision model as well as searching functions based on the keyword-input to be selected useful products in EC. The proposed decision agent system is consist of the user interface, provider interface, decision model. Especially, as the example of the decision model, this paper is designed and implemented the prototype of decision agent system which is normalized the searching data and value of customer's preference weight as to each attribute, and orderly provided customers with computed results. This agent system is also carried out sensitive analysis according to the reflection ratio of the each attribute.

  • PDF