• Title/Summary/Keyword: Retrieval Relevance

Search Result 160, Processing Time 0.028 seconds

Performance Evaluation of Re-ranking and Query Expansion for Citation Metrics: Based on Citation Index Databases (인용 지표를 이용한 재순위화 및 질의 확장의 성능 평가 - 인용색인 데이터베이스를 기반으로 -)

  • HyeKyung Lee;Yong-Gu Lee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.3
    • /
    • pp.249-277
    • /
    • 2023
  • The purpose of this study is to explore the potential contribution of citation metrics to improving the search performance of citation index databases. To this end, the study generated ten queries in the field of library and information science and conducted experiments based on relevance assessments, using 3,467 documents retrieved from the Web of Science and 60,734 documents published in 85 SSCI journals in the field of library and information science from 2000 to 2021. The experiments included re-ranking of the top 100 search results using citation metrics and search methods, query expansion experiments using vector space model retrieval systems, and the construction of a citation-based re-ranking system. The results are as follows: 1) Re-ranking using citation metrics performed differently from the Web of Science ranking, with the citation metrics acting as independent metrics. 2) Combining query term frequencies and citation counts positively affected performance. 3) Query expansion generally improved performance compared to the vector space model baseline. 4) User-based query expansion outperformed system-based query expansion. 5) Combining citation counts with relevant documents affected the ranking within the top relevant documents.
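
The abstract does not give the scoring formula, so the following is only a minimal sketch of how query term frequency and citation counts might be combined to re-rank the top 100 results. The field names (`tf`, `citations`), the log scaling, and the `weight` parameter are all assumptions made for illustration, not the authors' method.

```python
import math


def rerank(results, weight=0.5, top_n=100):
    """Re-rank the first top_n results by a blend of term frequency and citations.

    results: list of dicts with hypothetical keys 'id', 'tf' and 'citations',
    given in the original retrieval order.
    """
    head, tail = results[:top_n], results[top_n:]
    # Normalise both signals to [0, 1]; "or 1" avoids division by zero.
    max_tf = max((r["tf"] for r in head), default=0) or 1
    max_cit = max((math.log1p(r["citations"]) for r in head), default=0) or 1

    def score(r):
        tf_part = r["tf"] / max_tf
        # log1p dampens very large citation counts (an assumed design choice).
        cit_part = math.log1p(r["citations"]) / max_cit
        return weight * tf_part + (1 - weight) * cit_part

    # sorted() is stable, so tied documents keep their original order.
    return sorted(head, key=score, reverse=True) + tail
```

Documents below the cut-off keep their original positions, which mirrors the idea of re-ranking only the top of the result list.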

Performance Evaluation of Recommendation Results through Optimization on Content Recommendation Algorithm Applying Personalization in Scientific Information Service Platform (과학 학술정보 서비스 플랫폼에서 개인화를 적용한 콘텐츠 추천 알고리즘 최적화를 통한 추천 결과의 성능 평가)

  • Park, Seong-Eun;Hwang, Yun-Young;Yoon, Jungsun
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.11
    • /
    • pp.183-191
    • /
    • 2017
  • In order to secure the convenience of information retrieval by users of scientific information service platforms and to reduce the time required to acquire the proper information, this study proposes an optimized content recommendation algorithm among the algorithms that currently provide service menus and content information for each service, and conducts comparative evaluation on the results. To enhance the recommendation accuracy, users' major items were added to the original algorithm, and performance evaluations on the recommendation results from the original and optimized algorithms were performed. As a result of this evaluation, we found that the relevance of the content provided to the users through the optimized algorithm was increased by 21.2%. This study proposes a method to shorten the information acquisition time and extend the life cycle of the results as valuable information by automatically computing and providing content suitable for users in the system for each service menu.

Standard Translation of Terms of Korean Medicine through Consideration of Chinese-Korean Collated Medical Classics - With focus on 『Eonhaeduchangjipyo』, 『Eonhaegugeupbang』 and 『Eonhaetaesanjipyo』 - (언해의서 비교고찰을 통한 한의학용어의 번역표준안 - 『언해두창집요』, 『언해구급방』, 『언해태산집요』를 중심으로)

  • Ku, Hyunhee;Kim, Hyunkoo;Lee, JungHyun;Oh, Junho;Kwon, Ohmin
    • Korean Journal of Oriental Medicine
    • /
    • v.18 no.3
    • /
    • pp.49-61
    • /
    • 2012
  • This article set out to develop an old Chinese - modern Korean collated terminology by analyzing and aligning Chinese-Korean translational terms relevant to Korean medicine, at the level of the minimum meaning unit, from "Eonhaeduchangjipyo", "Eonhaegugeupbang" and "Eonhaetaesanjipyo". These books are composed of original Chinese texts and their corresponding Korean translations. The article tries to make a list of translational standards for Korean medicine terms by classifying the cases of translational ambiguity in terms of disease, body position, thumbnail-pressing acupuncture method, and disease-curing method. The above-mentioned ancient books are medical classics written by Huh Jun, the representative medical physician, and published by the Joseon government. Thus, they are appropriate as historically legitimate medical documents from which to draw words and terms to form an old Chinese - modern Korean collation dictionary. This collation glossary will contribute to the increased relevance of data mining, or information retrieval, in database systems and search engines for massive Korean medical records, by providing a novel way of obtaining synchronized results between the original writings in old Chinese and the secondary translations in modern Korean. The glossary will promote the collective but consistent translation of numerous old archives of Korean medicine and of other related fields as well.

Survival Processing Advantage and Sex Differences in Location Memory (위치 기억에서의 생존 처리 이득과 성차)

  • Choi, Joon-Hyuk;Kim, Min-Shik
    • Korean Journal of Cognitive Science
    • /
    • v.21 no.4
    • /
    • pp.697-723
    • /
    • 2010
  • Recent studies report that, in terms of object memory, survival context has a mnemonic advantage over other context conditions (e.g., Nairne et al., 2007). The present experiments explored whether this effect can also affect task-irrelevant object location memory, and tested whether the context can change the gender difference in object location memory. Participants were asked to rate the relevance of pictures (experiment 1) or words (experiment 2) presented at random locations under a survival context or a moving context. After rating the pictures or words, they completed a recall test and a location retrieval test. The results revealed higher accuracy in memory for objects encoded under the survival context. Moreover, survival processing enhanced location memory, and the survival advantage in location memory emerged among women.

Construction of an Information Retrieval Test Collection and its Validation (정보검색 테스트 컬렉션 구축 및 유효성 평가)

  • Myaeng, Sung-Hyon;Jang, Dong-Hyun;Song, Sa-Kwang;Kim, Ji-Young;Lee, Seok-Hoon;Lee, Joon-Ho;Lee, Eung-Bong;Seo, Jeong-Hyun
    • Annual Conference on Human and Language Technology
    • /
    • 1999.10e
    • /
    • pp.20-27
    • /
    • 1999
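  • This paper describes in detail, from document collection through to evaluation, the techniques used to build a Korean document collection for evaluating information retrieval systems and to generate the list of relevant documents (relevance file). The document collection was built evenly across domains, with 40,000 documents each in the general, social science, and science and technology domains, and ten queries were assigned to each domain for a total of 30 queries. The queries were also written with user levels in mind, covering the general public, domain experts, and middle and high school students, so that the document and query sets would be independent of any particular domain or user. Using these queries, retrieval was run with a total of 38 methods on several search engines. From the results, a candidate result set of 500 documents was built for each query, and the relevance of these documents to each query was then judged. To show the validity of the resulting set of relevant documents, the possibility that relevant documents exist outside the candidate list was examined by observing how the number of relevant documents changes as the size of the candidate list increases. Research on the test collection is continuing with the aim of expanding the number of queries to 50. Through query exchange with NACSIS of Japan, the number of queries is being expanded, and a Korean-Japanese cross-language retrieval environment is being built in which Korean and Japanese documents can each be retrieved with either Japanese or Korean queries.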
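
The candidate-set construction and the validity check described above can be illustrated with a small sketch. The abstract does not say how the 38 result lists were merged, so the round-robin merge, the function names, and the `depth` and `step` parameters below are assumptions for illustration only.

```python
def build_pool(runs, depth, pool_size=500):
    """Merge ranked lists from several runs into one candidate pool.

    runs: list of ranked lists of document ids, one per retrieval method.
    depth: how many top-ranked documents to consider from each run.
    """
    pool = []
    seen = set()
    # Visit rank 1 of every run, then rank 2, and so on, skipping duplicates.
    for rank in range(depth):
        for run in runs:
            if rank < len(run) and run[rank] not in seen:
                seen.add(run[rank])
                pool.append(run[rank])
                if len(pool) == pool_size:
                    return pool
    return pool


def relevant_found_by_pool_size(pool, relevant, step):
    """Count judged-relevant documents within growing prefixes of the pool."""
    return [
        (n, sum(1 for doc in pool[:n] if doc in relevant))
        for n in range(step, len(pool) + 1, step)
    ]
```

If the count of relevant documents levels off as the prefix grows, few relevant documents are likely to remain outside the pool, which is the kind of trend the validity check looks for.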

A Comparative Study on Clustering Methods for Grouping Related Tags (연관 태그의 군집화를 위한 클러스터링 기법 비교 연구)

  • Han, Seung-Hee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.43 no.3
    • /
    • pp.399-416
    • /
    • 2009
  • This study discusses clustering methods for related tags as a way to improve search and exploration in the tag space. The experiments were performed on 10 Delicious tags and the strongly related tags extracted from 300 documents for each tag, and hierarchical and non-hierarchical clustering methods were applied based on tag co-occurrences. To evaluate the experimental results, cluster relevance was measured. The results showed that Ward's method with the cosine coefficient, which is known to perform well in term clustering, performed best and showed a consistent clustering tendency. Furthermore, the analysis showed that cluster membership among related tags is based on users' tagging purposes or interests and can disambiguate word senses. Therefore, tag clusters would be helpful for improving search and exploration in the tag space.

A Case Study on Implementation of the Shipping Market Information Service System (해운시황정보서비스시스템 구현 사례연구)

  • Lee, Seokyong;Jeong, Myounghwan
    • Journal of Korea Port Economic Association
    • /
    • v.29 no.3
    • /
    • pp.73-94
    • /
    • 2013
  • The necessity of shipping market information services has been on the rise, which emphasizes the relevance of transaction information and market information to parties both inside and outside the shipping industry. However, previous related research has been restricted to explorations limited by the offerings of existing shipping market information providers. Users today require effective information, an efficient contents management system, interfacing to help the information provider, graphing and spreadsheets to facilitate and present the analyzed information in diverse formats, and reliable web and mobile services to provide information effectively with limited human resources. As a first step, service information has to be defined so that it takes into account user utility, information retrieval and data development. Second, benchmark information and services must be obtained from leading shipbrokers and research institutes. Third, a review of the latest technical trends is required to identify the most suitable technologies for servicing shipping market information. Finally, analysis is required on the implementation of a system with the selected technologies, as well as on the development of channels to post information that has been analyzed by users. Such a process would enable the continual redefinition of the shipping market information that users actively need. The study applies an X-Internet based WCMS with a single-window dashboard that provides user-customized information and is used to obtain and manage processes, add spreadsheets to sustain calculations using the latest information, graph results, and input additional information following predefined rules. Access to data and use of the system would require agreement that the system will incorporate user data and user-analyzed information into the market report, web portal, and hybrid app in order to provide current shipping market information appropriately and accurately to service users.

Correlation of Hanwoo (Korean Native Cattle) Carcass Classification and Oocyte Donor for Blastocyst Production In Vitro (한우 육질등급이 난포란의 배반포 체외생산에 미치는 영향)

  • Kim, Kang-Sig;Lee, Hong-Chul;Park, Yong-Su;Kim, So-Sub;Park, Hum-Dai
    • Journal of Embryo Transfer
    • /
    • v.30 no.3
    • /
    • pp.161-170
    • /
    • 2015
  • These studies were conducted to establish a practical Hanwoo (Korean native cattle) improvement system by combining embryo transfer technology with a confirmed individual Hanwoo oocyte culture system, and to investigate the correlation between Hanwoo carcass classification (intramuscular marbling) of the oocyte donor and blastocyst production in vitro. In Hanwoo, carcass meat quality is divided into grades $1^{++}$, $1^{+}$, 1, 2, and 3 depending on the fat distribution of the longest muscle cross-sectional surface. The numbers of follicular oocytes collected from individual fundamentally-registered Hanwoo yielding $1^{++}$, $1^{+}$, 1, 2 and 3 meat quality were 9.5, 9.4, 8.5, 8.8 and 8.8 per ovary, respectively. The number of oocytes retrieved from follicles was significantly higher in cattle yielding $1^{++}$ and $1^{+}$ meat quality than in cattle yielding 1, 2 and 3 meat quality (p<0.05). The rates of blastocyst formation after in vitro maturation were 18.2, 21.3, 29.4, 30.9, and 31.5% in cattle yielding $1^{++}$, $1^{+}$, 1, 2 and 3 meat quality, respectively. The rate was significantly lower in cattle yielding $1^{++}$ and $1^{+}$ meat quality than in cattle yielding 1, 2 and 3 meat quality (p<0.05). To evaluate embryo quality, a TUNEL assay was conducted for each meat quality grade using blastocyst-stage embryos on day 8. The number of apoptotic cells tended to be higher in cattle yielding $1^{++}$ and $1^{+}$ meat quality (81 and 79, respectively) than in cattle yielding 1, 2 and 3 meat quality (51, 48 and 50, respectively), but the differences between groups were not statistically significant. After embryo transfer, the conception rate of recipients was 53.5 (23 out of 43), 52.1 (38 out of 73) and 58.0% (58 out of 100) for meat quality grades 1, $1^{+}$ and $1^{++}$, respectively. These results showed that the conception rate was significantly higher in the cattle yield 1 meat quality than in the cattle yield $1^{++}$, $1^{+}$, 2, and 3 meat quality (p<0.05). In summary, these results indicate that applying the confirmed individual Hanwoo oocyte culture system together with embryo transfer technology can make good use of genetic resources for the conservation and improvement of Hanwoo. The relationship between meat quality grade and the reproductive ability of Hanwoo carcasses may be an effective basis for research linking obesity and reproduction.

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some online documents are accompanied by a list of keywords specified by the authors in an effort to guide users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle: manually assigning keywords to all documents is a daunting or even impractical task, in that it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to achieving this aim: the keyword assignment approach and the keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document. 
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. On the other hand, in the latter approach, the aim is to extract keywords with respect to their relevance in the text without a prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted based on supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as keywords for that document, and as a result keyword extraction is limited to terms that appear in the document. Therefore, keyword extraction cannot generate implicit keywords that are not included in a document. According to the experimental results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Conversely, this means that 10% to 36% of the keywords assigned by the authors do not appear in the article and so cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, which is a conventional information retrieval model that represents documents and queries as vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. 
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The first IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and it has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to the Web-based community service and to academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows performance comparable to that of Extractor, a representative keyword extraction system developed by Turney. As electronic documents increase, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
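
Steps (1) to (5) and the precision measure can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not define how a keyword set is represented, so `keyword_profiles` (a mapping from each candidate keyword to weighted terms), the simple tokenizer, and the `top_k` cut-off are all assumptions.

```python
import math
import re
from collections import Counter


def vector_length(vec):
    """Euclidean length of a sparse term-weight vector."""
    return math.sqrt(sum(w * w for w in vec.values()))


def tokenize(text):
    """Very simple preprocessing: lowercase alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def assign_keywords(document, keyword_profiles, top_k=3):
    """Rank candidate keywords by cosine similarity to the document.

    keyword_profiles: hypothetical mapping keyword -> {term: weight}.
    """
    doc_vec = Counter(tokenize(document))           # step 2: parse the document
    doc_len = vector_length(doc_vec)                # step 3: document length
    scores = {}
    for keyword, profile in keyword_profiles.items():
        profile_len = vector_length(profile)        # step 1: keyword-set length
        if doc_len == 0 or profile_len == 0:
            scores[keyword] = 0.0
            continue
        dot = sum(w * doc_vec.get(term, 0) for term, w in profile.items())
        scores[keyword] = dot / (doc_len * profile_len)   # step 4: cosine
    ranked = sorted(scores, key=scores.get, reverse=True)
    return [k for k in ranked[:top_k] if scores[k] > 0]  # step 5: top matches


def precision(generated, author_assigned):
    """Share of generated keywords that match author-assigned keywords."""
    if not generated:
        return 0.0
    return len(set(generated) & set(author_assigned)) / len(generated)
```

Because the candidate keywords come from the profiles rather than from the document text, a keyword can be assigned even when it never appears in the document, which is the property the assignment approach is chosen for.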

Probabilistic Anatomical Labeling of Brain Structures Using Statistical Probabilistic Anatomical Maps (확률 뇌 지도를 이용한 뇌 영역의 위치 정보 추출)

  • Kim, Jin-Su;Lee, Dong-Soo;Lee, Byung-Il;Lee, Jae-Sung;Shin, Hee-Won;Chung, June-Key;Lee, Myung-Chul
    • The Korean Journal of Nuclear Medicine
    • /
    • v.36 no.6
    • /
    • pp.317-324
    • /
    • 2002
  • Purpose: The use of the statistical parametric mapping (SPM) program has increased for the analysis of brain PET and SPECT images. The Montreal Neurological Institute (MNI) coordinate is used in the SPM program as a standard anatomical framework. While most researchers look up the Talairach atlas to report the localization of the activations detected in the SPM program, there is significant disparity between MNI templates and the Talairach atlas. That disparity between Talairach and MNI coordinates makes the interpretation of SPM results time consuming, subjective and inaccurate. The purpose of this study was to develop a program to provide objective anatomical information for each x-y-z position in the ICBM coordinate. Materials and Methods: The program was designed to provide the anatomical information for a given x-y-z position in the MNI coordinate based on the Statistical Probabilistic Anatomical Map (SPAM) images of ICBM. When an x-y-z position was given to the program, the names of the anatomical structures with non-zero probability and the probabilities that the given position belongs to those structures were tabulated. The program was coded in IDL and Java for easy porting to any operating system or platform. The utility of this program was shown by comparing its results to those of the SPM program. A preliminary validation study was performed by applying this program to the analysis of a PET brain activation study of human memory in which the anatomical information on the activated areas was previously known. Results: Real-time retrieval of probabilistic information with 1 mm spatial resolution was achieved using the programs. The validation study showed the relevance of this program: the probability that the activated area for memory belonged to the hippocampal formation was more than 80%. Conclusion: These programs will be useful for interpreting the results of image analysis performed on the MNI coordinate, as done in the SPM program.
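
The lookup described in Materials and Methods can be sketched as below. The original program was written in IDL and Java and read SPAM image volumes. This Python version is only an illustration that assumes a sparse, hypothetical `spam_maps` structure, a 1 mm voxel grid, and a caller-supplied origin.

```python
def label_position(xyz_mm, spam_maps, origin_mm=(0, 0, 0), voxel_mm=1.0):
    """Tabulate structures with non-zero probability at an x-y-z position.

    spam_maps: hypothetical mapping structure name -> {(i, j, k): probability},
    a sparse stand-in for the probabilistic map volumes.
    """
    # Convert millimetre coordinates to the nearest voxel index.
    voxel = tuple(round((c - o) / voxel_mm) for c, o in zip(xyz_mm, origin_mm))
    rows = [
        (name, volume[voxel])
        for name, volume in spam_maps.items()
        if volume.get(voxel, 0.0) > 0.0
    ]
    # Most probable structure first.
    return sorted(rows, key=lambda row: row[1], reverse=True)
```

A call returns a small table of (structure, probability) pairs for the position, or an empty list when no map covers it.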