• Title/Summary/Keyword: Citation Matching

Search Result 10, Processing Time 0.019 seconds

Influence of Normalization and Types of Citation Fields on Citation Matching (인용 필드 정규화와 타입이 인용매칭에 미치는 영향)

  • Koo, Hee-Kwan;Jung, Han-Min;Sung, Won-Kyung
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.11
    • /
    • pp.395-403
    • /
    • 2008
  • In this paper, we present the analysis of the fact that normalization and types of citation fields have an effect to the citation matching. Citation matching indicates the series of grouping process for the citation records referring to the same paper. The citation matching combines the comparison results of citation fields, and determines which citation records are the same. For the citation field comparison in the citation matching phase, studies on the normalization and types of citation fields are needed. But they are relatively insufficient when compared with the studies on citation matching methods. In this research, we showed that the citation matching performance was affected by the normalization and types of citation fields. Additionally, we also analyzed the combination of normalized multiple fields. According to the experimental result, the citation field had the overall performance improvement through a normalization, and the performance mode differently showed up at the citation field type.

Evaluating an Influence of Individual Citation Field on Citation Matching (개별 인용 필드의 인용 매칭에 대한 영향력 평가)

  • Koo, HeeKwan;Kang, In-Su;Jung, Hanmin;Lee, Seung-Woo;Sung, Won-Kyung
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2007.11a
    • /
    • pp.414-417
    • /
    • 2007
  • Citation matching (CM) is a method for clustering citation records that refer to the same paper. Normally, CM is preceded by citation field segmentation (CFS) which divides a citation record into its fields such as author(s), a title, a title of publication, year, etc. Although many studies have attacked CFS and CM, the relationship between CFS and CM was not sufficiently explored. Among many aspects of the effect of CFS on CM, this study concentrates on what citation fields should identify for CM. As its first attempt, we compared CM performances over different sets of citation fields manually segmented, and confirmed that the use of more citation fields help CM to cluster citation records better.

  • PDF

Influence of Citation Field Segmentation on Citation Matching for Social Network Construction (사회연계망 구축을 위한 인용 매칭에서의 인용 필드 분해 영향 분석)

  • Koo, HeeKwan;Kang, In-Su;Jung, Hanmin;Lee, Seungwoo;Sung, Won-Kyung
    • Annual Conference on Human and Language Technology
    • /
    • 2007.10a
    • /
    • pp.194-201
    • /
    • 2007
  • 인용 매칭(Citation Matching, CM)은 동일한 논문을 지칭하는 인용레코드(Citation Record)를 군집화하는 것으로 인용 관계를 가진 사회연계망 구축시 필요한 기술의 하나이다. 인용 매칭의 전단계로써, 인용 레코드를 저자, 논문 제목, 게재지명, 발행연도 등의 필드로 구분하는 인용 필드 분해가 고려될 수 있다. 본 논문은 인용 필드 분해(Citation Field Segmentation, CFS)와 인용 매칭의 상관관계를 분석하고자 한다. 즉, 인용 필드 분해가 인용 매칭에 필수적인 단계인지를 밝히고 개별 인용 필드가 인용 매칭에 미치는 영향을 분석한다. 실험을 통해 인용 필드 분해를 한 인용 매칭(CFS-based CM)이 인용 필드 분해를 적용하지 않은 인용 매칭(CFS-free CM)에 비해 1% 내외의 성능의 차이를 보이므로, 인용매칭의 성능에 크게 영향을 미친다고 보기 어려웠다. 이는 인용 레코드의 서로 다른 필드들 사이에서 어휘 중복 비율이 크게 낮기 때문에 따로 필드를 구별하지 않아도 필드가 구별되는 특성때문이었다.

  • PDF

Research and Development of Citation Matcher for Reference Parsing and Cross-Reference Linking (참고문헌 자동파싱 및 참조링킹을 위한 Citation Matcher 연구 및 개발)

  • Lee, Sang-gi;Kim, Sun-tae;Lee, Yong-sik;Yi, Tae-seok
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2007.11a
    • /
    • pp.426-429
    • /
    • 2007
  • CrossRef operates a cross-publisher citation linking system based on the DOI(R) global identifier. The number of organization building a reference citations linking structure through CrossRef is increasing. This paper concentrates on developing a Citation Matcher Solution to effectively build the reference linking structure. Citation Matcher automatically builds and processes the reference citation and identifier mapping which used to be handled manually. After the copy & paste of the reference citation, analyzation is processed to parse the journal title, author name, volume, issue, and start pages from the free style text. CrossRef, PubMed, and YesKISTI's identifiers are collected by through a standardized method. Renovation of the building process for domestic scholastic resources' reference linking and matching will be made possible by using a Citation Matcher. The connection between resources and seamless access for the electronic full-text will enhance the usability.

  • PDF

Study of Multiple Topic Citation Analysis Service Method Using Citing and Cited Phrases (인용·피인용 구절을 이용한 다주제 인용 분석 서비스 방법 연구)

  • Jung, Hanmin;Kim, Taehong
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.10
    • /
    • pp.11-20
    • /
    • 2021
  • The analysis of citing and cited phrases provides an opportunity to enhance search-centric academic information services. However, most current studies focus only on citation analysis among academic associations, researchers, and articles, making it challenging to develop higher citation-based information services. This study proposes citation analysis service methods using citing and cited phrases. First, to verify the feasibility of suggested services, we have collected the most highly cited articles with specific domain terms and followed their citing relationship; after that, we found formal citation types and ratios in the original articles. And we conducted structural analysis, especially with three topics, "Deep Learning," "Green Energy," and "Aging," and then structurally illustrates the citation characteristics of related articles. Finally, we collected four most cited articles and all their citing ones for each subject from Google Scholar and analyzed the ratio of citation types and citation spread. We hope that various citation analysis studies and information services can be further developed based on our discussion for designing better information services.

The Reference Identifier Matching System for Developing Reference Linking Service (참조연계 서비스 구현을 위한 참고문헌 식별자 매칭 시스템)

  • Lee, Yong-Sik;Lee, Sang-Gi
    • Journal of Information Management
    • /
    • v.41 no.3
    • /
    • pp.191-209
    • /
    • 2010
  • A reference linking service that is connection of each other different information resource need to setup the reference database and to match identifier. CrossRef, PubMed and Web Of Science etc. the many overseas agencies developed reference linking service, that they used the automatic tools of Inera eXstyles, Parity Computings Reference Extractor etc. and setup in base DOI and PMID etc. Domestic the various agencies of KISTI(Korea Institute Science and Technology of Information), KRF(Korea Research Foundation) etc are construction reference database. But each research communities adopts a various reference bibliography writing format. As, the data base construction which is collect is confronting is many to being difficult. In this paper, We developed the Citation Matcher System. This system is automatic parsing the reference string to metadata and matching DOI, PMID and KOI as Identifier. It is improved the effectiveness of reference database setup.

Patent Citations and Localization of Knowledge Spillovers: Evidence from Korea (특허자료를 이용한 우리나라 지식전파의 지역화 분석)

  • Lee, Jihong;Nam, Yunmi
    • Economic Analysis
    • /
    • v.25 no.4
    • /
    • pp.25-57
    • /
    • 2019
  • This paper studies localization effects of knowledge spillovers in Korea using U.S. patents granted over the period 1996-2015. The "sample-matching" analysis initiated by Jaffe, Trajtenberg, and Henderson (1993) is adopted. We do not find evidence of positive localization effects in Korea. In particular, controlling for the existing geographic distribution of knowledge production, the frequency of domestic citations of Korean patents is no more than the citation frequency from overseas, and the difference is decreasing within the sample period. We also examine localization effects across regions and industries, and compare Korea with Taiwan and Japan.

Study on the Relation of Field Normalization with Citation Matching (인용 필드 정규화와 인용매칭의 관계 연구)

  • Koo, HeeKwan;Kang, In-Su;Jung, Hanmin;Sung, Won-Kyung
    • Annual Conference on Human and Language Technology
    • /
    • 2008.10a
    • /
    • pp.69-74
    • /
    • 2008
  • 본 논문은 인용필드 정규화와 인용매칭의 관계에 대한 분석을 제시한다. 인용매칭은 논문에서 수집된 인용레코드의 인용필드들 간의 비교 결과를 조합하여 동일 논문의 참조여부를 판별하여 인용레코드를 군집화한다. 따라서 인용매칭에 성능을 높일 수 있는 인용필드와 인용매칭 성능의 관계에 대한 연구가 필요하다. 본 논문에서는 인용필드 정규화 및 필드 별 결합에 의하여 인용매칭 성능이 변화하는 것을 보였다. 또한, 인용매칭 성능을 인용필드 유사도와의 관점에서 분석하였다. 앞으로, 인용필드 정규화 및 특성이 인용매칭에 미치는 영향에 대한 이해를 넓혀, 이를 인용매칭에 활용할 수 있으리라 여겨진다.

  • PDF

Current Research Status of National Health Insurance Database Studies in Korea Related to Parkinson's Disease and Future Research Proposals for Integrative Therapies (국민건강보험공단 청구자료를 활용한 파킨슨병과 관련된 코호트 연구 디자인 분석 및 향후 한의중재 관련 파킨슨 후향적 코호트 연구를 위한 제언)

  • Ye-Chae Hwang;Jungtae Leem
    • Journal of Society of Preventive Korean Medicine
    • /
    • v.28 no.1
    • /
    • pp.69-87
    • /
    • 2024
  • Objectives : This study is to investigate the current National Health Insurance Database cohort studies related to complications of Parkinson's Disease (PD) and suggest the design of Korean medical epidemiological studies of PD. Methods : Nationwide longitudinal studies of PD patients in South Korea were collected through Pubmed and the Korea Citation Index (KCI). We selected cohort studies that used the National Health Insurance Database in Korea and targeted Parkinson's disease patients. Studies published before February 2024 were categorized according to study designs. We examined variables and covariates, enroll dates and matching methods. Results : Of a total of 536 studies, 18 studies met the inclusion criteria. All studies used the National Health Insurance (NHI) Research Database and among them, 5 used sample data and one senior database. Studies can be classified into two types. 11 cohort studies were comparing PD patients and non-PD patients. Another type was 4 PD patients cohort studies. Most studies used two diagnostic codes (G20 and V124) for inclusion criteria. Enroll periods were from 2002 to 2017, and follow-up periods were from 7 to 14 years. 16 studies considered age and sex as covariates. 15 studies used the propensity score matching method to increase the level of causality. There was only one study related to the Korean medical treatment. Conclusion : In future cohort studies on Korean medical treatment, more attempts should be made to reveal the effect of the treatments on PD patients by defining inclusion criteria for patient groups, covariates, exposure variables, and assessment indicators more operatively.

A Study on the Analysis of Intellectual Structure of Korean Veterinary Sciences (국내 수의과학 분야의 지적 구조 분석에 관한 연구)

  • Cho, Hyun-Yang
    • Journal of Information Management
    • /
    • v.43 no.2
    • /
    • pp.43-66
    • /
    • 2012
  • The purpose of this study is to see the intellectual structure in the field of veterinary sciences in Korea, using author profiling analysis(APA), a bibliometric approach. Three journals are selected on the basis of citation data, exchanging most citations with Korean Journal of Veterinary. And then, 50 authors who published most articles at selected journals during the given period of time were chosen. The analysis of similarity and dissimilarity among authors by comparing co-word appearance patterns from article title, abstracts, and keywords was made. Authors can be grouped 11 minor clusters under 4 major clusters, depending on their interests in the area of veterinary sciences in Korea. The subjects for each cluster at the veterinary sciences are decided by the matching the keyword, representing author's research interest. As a result, it is possible to figure out the current research trends and the researcher network in the field of veterinary sciences.