• Title/Summary/Keyword: Record linkage

Search Result 35, Processing Time 0.021 seconds

A study on the probabilistic record linkage and its application (확률적 자료연계의 이론과 적용에 관한 연구)

  • Choi, Yeonok;Lee, Sangin
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.849-861
    • /
    • 2021
  • This paper aims to introduce the basic concept of probabilistic record linkage and its statistical framework, and describe the specific process and principle of performing it using a real example from Statistics Korea. First, we briefly describe the deterministic record linkage and compare it with probabilistic record linkage. We introduce the Fellegi-Sunter model framework for record linkage and the related paprameters: m-probability, u-probability, matched weight and decision rule. Finally, we show the detailed process of record linkage under Fellegi-Sunter model framework and evaluate the record linkage results, using sample data from the registered-based census and Population and Housing Census survey in Statistics Korea.

Advancement Plans for Linkage of National Archives Portal Service to Improve Accessibility and Usability of National Records (국가기록물 접근성 및 활용성 향상을 위한 국가기록포털 연계 개선방안)

  • Yoona, Kang;Young Jun, Jo;Minjung, Kim;Hyo-Jung, Oh
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.4
    • /
    • pp.99-125
    • /
    • 2022
  • In order to understand a record, not only the contents of the record but also the production background and work context of the record must be grasped. It also requires a function that makes it easy to find related records scattered across various departments and agencies. Accordingly, the 'linkage' of information in archival information services is becoming more important. NAK also emphasizes 'linkage' as a search service function of the archives management system, but some problems were identified at the National Archives Portal Service (NAPS) such as a lack of linkage with authority data, disruption of internal service, and absence of linkage with other related organizations. To solve the limitations of the NAPS, we selected and analyzed advanced record management institutions that have built an ideal linkage service; checked the overall linkage structure of these institutions; and identified characteristics that could not be seen by other institutions. Also, elements that can be adopted from the NAPS were derived. Next, the current status of the NAPS linkage structure was analyzed to identify the parts that were not linked and the items that need to be improved in the linkage method, and specific advancement plans were suggested to solve these problems. The purpose of this study is to increase users' satisfaction with search and to advance the accessibility and utilization of records and internal services through improved linkage services of NAPS.

Secure Blocking + Secure Matching = Secure Record Linkage

  • Karakasidis, Alexandros;Verykios, Vassilios S.
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.3
    • /
    • pp.223-235
    • /
    • 2011
  • Performing approximate data matching has always been an intriguing problem for both industry and academia. This task becomes even more challenging when the requirement of data privacy rises. In this paper, we propose a novel technique to address the problem of efficient privacy-preserving approximate record linkage. The secure framework we propose consists of two basic components. First, we utilize a secure blocking component based on phonetic algorithms statistically enhanced to improve security. Second, we use a secure matching component where actual approximate matching is performed using a novel private approach of the Levenshtein Distance algorithm. Our goal is to combine the speed of private blocking with the increased accuracy of approximate secure matching.

The Efficient Methods of Population-based Cancer Registration in Daegu City (대구지역 암등록사업의 효율적 수행방안)

  • Jin, Dae-Gu;Chun, Byung-Yeol;Ahn, Soon-Ki;Kim, Jong-Yeon;Kam, Sin
    • Journal of Preventive Medicine and Public Health
    • /
    • v.35 no.4
    • /
    • pp.322-330
    • /
    • 2002
  • Objective: This study was conducted to automatically improve the completeness and validity of the Daegu Cancer Registry, using cross record linkage of many data sources, and to develop a computerized patient enrollment system for efficient communication among cancer researchers via the internet. Method: We analyzed 10,229 cancer patients who were reported in the National Cancer Registry, and from pathological reports, health insurance cancer claims lists, cancer patient records at hospital information centers and death certificates from the Korea National Statistical Office. Result: We confirmed 4,624 cancer patients and found 897 of new cases from a review of medical chart. The new cases were detected efficiently using cross record linkage. We developed a computerized patient enrollment system, based on a client-sewer model, for the input of cancer patients, and then developed a web-based reporting homepage and patient enrollment system for the internet. Conclusion: This system could manage cancer databases systematically, and could be given to other researchers as a basic database.

Record Linkage를 통해 본 영아 사망 요인 분석

  • Lee, Han-Na;Lee, Jong-Tae
    • Proceedings of the Korean Environmental Health Society Conference
    • /
    • 2005.11a
    • /
    • pp.121-125
    • /
    • 2005
  • 우리나라 영아 사망은 계속 감소를 보이고 있으나 상대적으로 낮은 출산율이 최근 문제시되고 있다. 영아 사망률은 인구의 사회적 건강의 요인으로서 넓게 인식된다. 따라서 영아 사망률의 사인을 밝히는 것은 낮은 출생률에 대비하고 출생아가 건강한 성인으로 자라날 수 있는 토대를 마련하기 위해서 중요한 연구가 될 것이다. 이에 본 연구에서는 국내에서는 처음으로 Record linkage를 통해 2000년부터 2003년 까지의 출생 자료와 사망 자료를 통합하여 유아 사망에 영향을 미치는 요인을 분석하였다. 다중 로지스틱 회귀분석을 통해 관련 변수들을 보정한 상태에서 조산아의 유아 사망 위험비는 1.42(95%CI =1.25-1.63)로 나타났다. 그 외에 산모의 연령, 부모의 직업, 거주지역 등이 유의한 위험요인으로 나타났고 본 연구에서 저체중은 영아 사망의 위험 요인으로 나타나지 않았다.

  • PDF

A study on Improving Operation of the Records Disposition Schedule (기록물분류기준표의 운영과 과제)

  • Park, Yoo Jin
    • The Korean Journal of Archival Studies
    • /
    • no.8
    • /
    • pp.57-95
    • /
    • 2003
  • For a good record maintenance according to organization and functions in Korea, it is required to make better use of 'Records Disposition Schedule', which is originally developed as a computerized system that can control the whole records maintenance procedure and manage every record according to organization and functions. 'Records Disposition Schedule' is only a system that allows us to maintain every record according to organization and functions and manage every information about such organization and functions. Accordingly, a well-functioning Records Disposition Schedule requires the exact modification and operation of such schedule depending upon organizational or functional changes. If the Records Disposition Schedule is not reasonably modified and operated depending upon organizational or functional changes, we won't be able to maintain any records in linkage with organization and functions and control the whole works throughout record maintenance.

A Study on SRU & SRU Record Update Protocol for Openness and Linkage of Resources (정보자원의 개방과 연계를 위한 SRU, SRU Record Update 프로토콜 연구)

  • Lee, Ji-Won
    • Journal of Korean Library and Information Science Society
    • /
    • v.40 no.3
    • /
    • pp.317-336
    • /
    • 2009
  • Several protocols have been developed to efficiently utilize a great number of distributed resources. This paper investigated the background, operations and elements of SRU and SRU Record Update protocol, compared them with other protocols, and reviewed their implementation cases. The purpose of this pager is to broaden the understanding of the two new standards and to provide a practical guide to ensure their interoperability for libraries and information service centers which want to expose their own contents and to access to external resources.

  • PDF

Statin Intake and Gastric Cancer Risk: An Updated Subgroup Meta-analysis Considering Immortal Time Bias

  • Bae, Jong-Myon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.55 no.5
    • /
    • pp.424-427
    • /
    • 2022
  • A retrospective record-linkage study (RLS) based on medical records containing drug prescription histories involves immortal time bias (ITB). Thus, it is necessary to control for this bias in the research planning and analysis stages. Furthermore, a summary of a meta-analysis including RLSs that did not control for ITB showed that specific drugs had a preventive effect on the occurrence of the disease. Previous meta-analytic results of three systematic reviews evaluating the association between statin intake and gastric cancer risk showed that the summary hazard ratio (sHR) of the RLSs was lower than 1 and was statistically significant. We should consider the possibility of ITB in the sHR of RLSs and interpret the results carefully.

Combination Key Generation Scheme Robust to Updates of Personal Information (결합키 생성항목의 갱신에 강건한 결합키 생성 기법)

  • Jang, Hobin;Noh, Geontae;Jeong, Ik Rae;Chun, Ji Young
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.32 no.5
    • /
    • pp.915-932
    • /
    • 2022
  • According to the Personal Information Protection Act and Pseudonymization Guidelines, the mapping is processed to the hash value of the combination key generation items including Salt value when different combination applicants wish to combine. Example of combination key generation items may include personal information like name, phone number, date of birth, address, and so on. Also, due to the properties of the hash functions, when different applicants store their items in exactly the same form, the combination can proceed without any problems. However, this method is vulnerable to combination in scenarios such as address changing and renaming, which occur due to different database update times of combination applicants. Therefore, we propose a privacy preserving combination key generation scheme robust to updates of items used to generate combination key even in scenarios such as address changing and renaming, based on the thresholds through probabilistic record linkage, and it can contribute to the development of domestic Big Data and Artificial Intelligence business.

A study on Wikidata linkage methods for utilization of digital archive records of the National Debt Redemption Movement (국채보상운동 디지털 아카이브 기록물의 활용을 위한 위키데이터 연계 방안에 대한 연구)

  • Seulki Do;Heejin Park
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.23 no.2
    • /
    • pp.95-115
    • /
    • 2023
  • This study designed a data model linked to Wikidata and examined its applicability to increase the utilization of the digital archive records of the National Debt Redemption Movement, registered as World Memory Heritage, and implications were derived by analyzing the existing metadata, thesaurus, and semantic network graph. Through analysis of the original text of the National Debt Redemption Movement records, key data model classes for linking with Wikidata, such as record item, agent, time, place, and event, were derived. In addition, by identifying core properties for linking between classes and applying the designed data model to actual records, the possibility of acquiring abundant related information was confirmed through movement between classes centered on properties. Thus, this study's result showed that Wikidata's strengths could be utilized to increase data usage in local archives where the scale and management of data are relatively small. Therefore, it can be considered for application in a small-scale archive similar to the National Debt Redemption Movement digital archive.