• Title/Summary/Keyword: Data Curation

Search Result 92, Processing Time 0.02 seconds

A Study on the Improvement of 'Geospatial Information Open Platform' for Geospatial Information Convergence Industry

  • Song, Ki-Sung;Seok, Sang-Muk;Kwon, Hoe-Yun;Hwang, Jung-Rae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.7
    • /
    • pp.31-38
    • /
    • 2016
  • In this paper, we propose a direction for improving 'Geospatial information open platform' service to support the converged and integrated geospatial information. Since there can be a number of issues relating to the support for geospatial information convergence industry, two qualitative surveys were performed to collect opinions comprehensively and specifically. The responses from 165 experts from 5 areas that use geospatial information were used, and the requirements of demanders were divided into the aspect of policy, aspect of data development and distribution, and aspect of data utilization support in order to effectively analyze the survey results. As a result, a total of 26 major issues were derived and it was deemed that it is necessary to find a way to expand the role of 'Geospatial information open platform' from "Open-API Oriented Passive Spatial Information Open Platform" to "Platform that Comprehensively Provides Active Convergence Support Information" order to resolve the issues derived.

Risk Factors for Sarcopenia, Sarcopenic Obesity, and Sarcopenia Without Obesity in Older Adults

  • Kim, Seo-hyun;Yi, Chung-hwi;Lim, Jin-seok
    • Physical Therapy Korea
    • /
    • v.28 no.3
    • /
    • pp.177-185
    • /
    • 2021
  • Background: Muscle undergoes change continuously with aging. Sarcopenia, in which muscle mass decrease with aging, is associated with various diseases, the risk of falling, and the deterioration of quality of life. Obesity and sarcopenia also have a synergy effect on the disease of the older adults. Objects: This study examined the risk factors for sarcopenia, sarcopenic obesity, and sarcopenia without obesity and developed prediction models. Methods: This machine-learning study used the 2008-2011 Korea National Health and Nutrition Examination Surveys in the analysis. After data curation, 5,563 older participants were selected, of whom 1,169 had sarcopenia, 538 had sarcopenic obesity, and 631 had sarcopenia without obesity; the remaining 4,394 were normal. Decision tree and random forest models were used to identify risk factors. Results: The risk factors for sarcopenia chosen by both methods were body mass index (BMI) and duration of moderate physical activity; those for sarcopenic obesity were sex, BMI, and duration of moderate physical activity; and those for sarcopenia without obesity were BMI and sex. The areas under the receiver operating characteristic curves of all prediction models exceeded 0.75. BMI could predict sarcopenia-related disease. Conclusion: Risk factors for sarcopenia-related diseases should be identified and programs for sarcopenia-related disease prevention should be developed. Data-mining research using population data should be conducted to enhance the effectiveness of early treatment for people with sarcopenia-related diseases through predictive models.

Evaluation and Case Study of Geoscience Data Repositories Using re3data.org (re3data.org를 활용한 Geoscience 분야 데이터 리포지터리 평가 및 사례 연구)

  • Juseop Kim;Suntae Kim
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.58 no.3
    • /
    • pp.161-191
    • /
    • 2024
  • In order to share and reuse research data, data repositories are operated mainly by research institutes, communities, and countries. Currently, there are 3,236 data repositories registered on re3data.org, and the repositories are operated by each subject. The purpose of this study is to identify the operational status of data repositories in the Geoscience field and provide repository services in the field. As a result of the study, 634 out of 890 data repositories in the Geoscience field satisfied more than 4 out of 9 properties, confirming that approximately 74% of the repositories are generally well operated. In addition, as a result of checking the services in the repositories, it was found that they provide various implications in terms of data policy, curation, tools and APIs, and quality management. These research results will serve as a basis for reference when configuring data repository services in the Geoscience field in the future.

hpvPDB: An Online Proteome Reserve for Human Papillomavirus

  • Kumar, Satish;Jena, Lingaraja;Daf, Sangeeta;Mohod, Kanchan;Goyal, Peyush;Varma, Ashok K.
    • Genomics & Informatics
    • /
    • v.11 no.4
    • /
    • pp.289-291
    • /
    • 2013
  • Human papillomavirus (HPV) infection is the leading cause of cancer mortality among women worldwide. The molecular understanding of HPV proteins has significant connotation for understanding their intrusion in the host and designing novel protein vaccines and anti-viral agents, etc. Genomic, proteomic, structural, and disease-related information on HPV is available on the web; yet, with trivial annotations and more so, it is not well customized for data analysis, host-pathogen interaction, strain-disease association, drug designing, and sequence analysis, etc. We attempted to design an online reserve with comprehensive information on HPV for the end users desiring the same. The Human Papillomavirus Proteome Database (hpvPDB) domiciles proteomic and genomic information on 150 HPV strains sequenced to date. Simultaneous easy expandability and retrieval of the strain-specific data, with a provision for sequence analysis and exploration potential of predicted structures, and easy access for curation and annotation through a range of search options at one platform are a few of its important features. Affluent information in this reserve could be of help for researchers involved in structural virology, cancer research, drug discovery, and vaccine design.

LitCovid-AGAC: cellular and molecular level annotation data set based on COVID-19

  • Ouyang, Sizhuo;Wang, Yuxing;Zhou, Kaiyin;Xia, Jingbo
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.23.1-23.7
    • /
    • 2021
  • Currently, coronavirus disease 2019 (COVID-19) literature has been increasing dramatically, and the increased text amount make it possible to perform large scale text mining and knowledge discovery. Therefore, curation of these texts becomes a crucial issue for Bio-medical Natural Language Processing (BioNLP) community, so as to retrieve the important information about the mechanism of COVID-19. PubAnnotation is an aligned annotation system which provides an efficient platform for biological curators to upload their annotations or merge other external annotations. Inspired by the integration among multiple useful COVID-19 annotations, we merged three annotations resources to LitCovid data set, and constructed a cross-annotated corpus, LitCovid-AGAC. This corpus consists of 12 labels including Mutation, Species, Gene, Disease from PubTator, GO, CHEBI from OGER, Var, MPA, CPA, NegReg, PosReg, Reg from AGAC, upon 50,018 COVID-19 abstracts in LitCovid. Contain sufficient abundant information being possible to unveil the hidden knowledge in the pathological mechanism of COVID-19.

An Economic Ripple Effect Analysis of National Scientific Data Center Construction (국가 과학데이터센터 구축의 경제적 파급효과 분석)

  • Park, Sung-Uk;Hahn, Sun-Hwa
    • Journal of Information Management
    • /
    • v.42 no.3
    • /
    • pp.55-69
    • /
    • 2011
  • In the modern scientific R&D, the efficient acquisition, curation, analysis and visualization are core elements of the science development. The value of scientific data is very important in data intensive research. An output of scientific data is drastically increasing. However we have only each individual system of scientific data in now. Therefore We feel a lack of efficiency of scientific data. In this paper, We analyze an economic ripple effects in terms of production inducement effect, added value inducement effect, labor inducement effect and forward backward linkage effect of national scientific data center construction using an input-out analysis of the bank of Korea(2009). We also examine an economic propriety of national scientific data center construction.

A Study on the Perception of Fashion Platforms and Fashion Smart Factories using Big Data Analysis (빅데이터 분석을 이용한 패션 플랫폼과 패션 스마트 팩토리에 대한 인식 연구)

  • Song, Eun-young
    • Fashion & Textile Research Journal
    • /
    • v.23 no.6
    • /
    • pp.799-809
    • /
    • 2021
  • This study aimed to grasp the perceptions and trends in fashion platforms and fashion smart factories using big data analysis. As a research method, big data analysis, fashion platform, and smart factory were identified through literature and prior studies, and text mining analysis and network analysis were performed after collecting text from the web environment between April 2019 and April 2021. After data purification with Textom, the words of fashion platform (1,0591 pieces) and fashion smart factory (9750 pieces) were used for analysis. Key words were derived, the frequency of appearance was calculated, and the results were visualized in word cloud and N-gram. The top 70 words by frequency of appearance were used to generate a matrix, structural equivalence analysis was performed, and the results were displayed using network visualization and dendrograms. The collected data revealed that smart factory had high social issues, but consumer interest and academic research were insufficient, and the amount and frequency of related words on the fashion platform were both high. As a result of structural equalization analysis, it was found that fashion platforms with strong connectivity between clusters are creating new competitiveness with service platforms that add sharing, manufacturing, and curation functions, and fashion smart factories can expect future value to grow together, according to digital technology innovation and platforms. This study can serve as a foundation for future research topics related to fashion platforms and smart factories.

Design of the Curation Platform for User-participated Book Recommendation System of Selecting on Alternative Material for the Disabled (대체자료 선정을 위한 이용자 참여형 도서 추천 큐레이션 플랫폼 설계)

  • Cho, Hyun-Yang
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.54 no.3
    • /
    • pp.41-69
    • /
    • 2020
  • The purpose of this study is to design and develop a alternative material recommendation system using automatic classification, based on user preference. Details of usage data by users from DREAM was analysed in order to develop the way of a method on selecting proper alternative material, and then the data by user preference were allocated under each category of 10 KDC categories. The keyword, selected from the title of users' usage data from a certain period of time, were divided into 10 subject categories and ranked by the order of frequency of appearance. Books including high frequency of the keyword in title can be selected as a preferred target for producing alternative materials. Lastly, a dynamic linkage for sharing usage data among National Library for the Disabled and other libraries is proposed to produce more proper alternative materials, based on user preference.

How Can We Preserve Social Memories?: Exploration of Global Open Archives

  • Gang, Ju-Yeon;Kim, Geon;Oh, Hyo-Jung
    • Journal of Information Science Theory and Practice
    • /
    • v.7 no.3
    • /
    • pp.40-51
    • /
    • 2019
  • Until now, records re-enacting social memories have not been main targets for preservation and management in Korea. However, people have recently begun to focus on forming and maintaining their memories because these personalized records have started to be recognized as social and political issues. In this respect, this study aims to find out how to preserve social memories by comparing various global open archives. For achieving our research goal, we first established the definition of social memories and records and revealed their characteristics. After then, we selected representative open archives' websites to examine their collection polices and compare them according to several criteria. As a result, we distilled insights based on similarities and differences of each archive and discussed considerations in preserving social memories consisting of three phases: analyzing target social memories, establishing collection policies, and collecting actual records. This study has significance in that it examines the characteristics of social memories and records and also suggests preliminary findings for advanced research to develop practical tools for social records management and archives.

Comparative Study of Learning Platform for IT Developers (IT 개발자 대상 학습플랫폼 비교 연구)

  • Lee, Ji-Eun
    • Journal of Information Technology Services
    • /
    • v.20 no.5
    • /
    • pp.147-158
    • /
    • 2021
  • The digital transformation and COVID-19 are also causing major changes in teaching-learning methods. The biggest change is the spread of remote training and the emergence of various innovative learning platforms. Distance education has been criticized for not meeting technology trends and field demands..However, the problem of distance education is being solved through a system that supports various interactions and collaborations and supports customized learning paths. The researcher conducted a case study on domestic and foreign learning platforms that provide non-face-to-face ICT education. Based on the case study results, the researcher presented the functional characteristics of a learning platform that effectively supports non-face-to-face learning. In common, these sites faithfully supported the basic functions of the information system. In addition to learning progress check and learning guidance, some innovative learning platforms were providing differentiated functions in practice support, performance management, mentoring, learning data analysis, curation provision, and CDP support. Most learning platforms supported one-way, superficial interaction. If the platform effectively supports a variety of learning experiences and provides an integrated learning experience thanks to the development of IT technology, user satisfaction with the learning platform, intention to continue learning, and achievement will increase.