• Title/Summary/Keyword: 웹아카이빙 (web archiving)


A study on the enhanced filtering method of the deduplication for bulk harvest of web records (대규모 웹 기록물의 원격수집을 위한 콘텐츠 중복 필터링 개선 연구)

  • Lee, Yeon-Soo;Nam, Sung-un;Yoon, Dai-hyun
    • The Korean Journal of Archival Studies / no.35 / pp.133-160 / 2013
  • As networks and electronic devices have developed rapidly, the web's influence on daily life has grown. Information created on the web plays an increasingly essential role as records that reflect each era, so there is strong demand to archive it by a standardized method. One such method is the snapshot strategy, in which automated software crawls web content periodically. This strategy has two problems. First, it can harvest identical, duplicate content. Second, it can crawl meaningless or useless content because of the complex technologies implemented on the web. In this paper, we categorize the problems that can arise when crawling web content with the snapshot strategy and, drawing on crawls of public institutions' web content, present possible technical solutions.
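
The abstract does not say how the paper's filter works, so the following is only a minimal sketch of one common approach to the duplicate-content problem: hashing each harvested payload and skipping any payload whose digest has already been seen. The function names, the whitespace normalization, and the example URLs are all assumptions made for illustration.

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Return a SHA-256 digest of the payload with whitespace runs collapsed."""
    normalized = b" ".join(body.split())
    return hashlib.sha256(normalized).hexdigest()


def filter_duplicates(pages):
    """Yield (url, body) pairs whose content digest has not been seen before."""
    seen = set()
    for url, body in pages:
        digest = content_digest(body)
        if digest in seen:
            continue  # same content already harvested under another URL
        seen.add(digest)
        yield url, body


# Hypothetical crawl output: the second URL serves the same page as the first.
pages = [
    ("http://example.org/a", b"<p>notice  1</p>"),
    ("http://example.org/a?session=123", b"<p>notice 1</p>"),
    ("http://example.org/b", b"<p>notice 2</p>"),
]
unique = list(filter_duplicates(pages))
print([url for url, _ in unique])
```

An exact digest only catches byte-identical content after normalization; pages that differ by a timestamp or advertisement would need a near-duplicate technique instead.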

A Study on the Collection and Application Measures for Media Platform Based Materials (매체 플랫폼 기반 자료의 수집 및 적용 방안 연구)

  • Younghee Noh;Youngmi Jung;Aekyoung Son;Inho Chang;Hyunju Cha
    • Journal of Korean Library and Information Science Society / v.55 no.1 / pp.193-214 / 2024
  • This study proposes a method for collecting and applying media platform based materials at the National Library of Korea. First, we analyzed the current status and limitations of media platform based data collection in Korea, including at the National Library of Korea. Second, we reviewed the literature to investigate the current status and types of digital content on media platforms. Third, by examining cases from major overseas libraries, we identified the types of media platform based materials that are not currently covered by the National Library of Korea's online material collection guidelines. Fourth, we established collection criteria after reviewing technical and legal elements such as the definition of collection targets and scope for each new medium, and collection methods. Fifth, based on these results, we propose the following policies: 1) a clear legal basis for collecting media platform based materials should be established; 2) collection guidelines for these materials should be developed and published; 3) collection tools and infrastructure for these materials should be developed; 4) collection requires permission from the targeted social media organizations and cooperation with organizations that produce and serve extended reality content; 5) to promote use of these materials, their accessibility should be improved, the content extensibility and ease of use of the e-deposit system, including for extended reality content, should be enhanced, and spaces for reproducing extended reality content should be built and upgraded.

Design and Implementation of a Learning experience data xAPI-based University Club Linking & Archiving Metaverse Platform (학습경험데이터 xAPI 기반의 대학 동아리 매칭 및 아카이빙 메타버스 플랫폼 설계 및 구현)

  • Lee, Chanhee;Nah, Jeong-Eun
    • Proceedings of the Korea Information Processing Society Conference / 2022.11a / pp.928-930 / 2022
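  • As the recent rise in demand for small-group platforms based on tastes and interests suggests, there is a strong desire for high-quality networking across society, both inside and outside universities. Moreover, because variables such as educational environment, region of residence, and personal disposition greatly affect the formation of the human resources that serve as a major stepping stone for growth, there is a need for a service that lowers the barrier to entry. Existing platforms, meanwhile, handle university club and small-group activities in a fragmented way and fail to bring them together. This paper therefore introduces Clubverse, a metaverse web platform that addresses the shortcomings of existing platforms. It has the potential to improve communication among university clubs and their links with outside companies, and to distribute growth opportunities fairly so that users can prepare for change.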

The ASK_a Service Model for Public Library in Korea (우리나라 공공도서관의 ASK_a 서비스 모형 개발)

  • Nam, Young-Joon;Lee, Hyang-Sook
    • Journal of Information Management / v.37 no.1 / pp.57-81 / 2006
  • The ASK_a service model, a new service for Korean public libraries, suggests a new management practice for collaborative digital reference services. The model has three functions: input transaction, process transaction, and output transaction. The best form for input is the web form. The best form for processing is a hybrid model of public libraries that combines the hierarchical and lateral types. For output, the model suggests an archiving policy for gathering query-answer data. The core of the model is providing an advanced information service to users through cooperation with public libraries and external personnel.

A Study on Development of Subject-based Community Model by Link of Content -Focused on Life Science- (콘텐트 연계를 통한 주제기반 커뮤니티 모델 개발 연구 -생명과학 분야를 중심으로-)

  • Bu-Young Ahn;Seon-Heui Choi;Yong-Ju Shin;Soon-Young Kim
    • Proceedings of the Korea Information Processing Society Conference / 2008.11a / pp.607-610 / 2008
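  • Researchers in Korea and abroad carry out diverse and important research in their fields and produce research outputs. These outputs take many forms, including journal and conference papers, research reports, patents, research notes, seminar presentation materials, textbooks, and newspaper and magazine articles. For researchers in the same discipline and on the same subject to share and exchange these outputs, a community environment grounded in the free use of information is needed. To that end, we classify and reprocess by subject the literature content and factual content held by the Korea Institute of Science and Technology Information (KISTI), the national distributor of science and technology information, and aim to develop and provide a community model for specialist researchers in specific subject areas that applies the concepts of open archiving and open access. The community model was developed around research outputs in the life sciences, the field in which the most research is currently under way. To develop the model, we 1) surveyed the content held by KISTI, 2) analyzed the forms and characteristics of its life-science content, and 3) designed a web-based platform on which researchers can freely upload and download research outputs.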

A Hybrid Method of Storing XML Data Using RDBMS (RDBMS를 이용한 XML 데이터의 혼합형 저장 기법)

  • Jeon, Chan-Hoon;Kang, Hyun-Chul
    • The Journal of Society for e-Business Studies / v.14 no.1 / pp.57-79 / 2009
  • As Web-based e-Business prevails, the volume of XML data on the Web is growing larger than ever. Although much research has been done on decomposing and storing XML data in RDB, which is now the most popular storage for XML, and on processing XML queries through their SQL counterparts, little attention has been paid to how to alleviate the burden of storing a massive volume of XML data. In this paper, we propose a hybrid method of storing XML data in RDB, whereby the unit of storage can be an XML subtree as well as an XML node. A part of the XML data whose nodes were stored separately can be reformed into an XML subtree for storage when it comes to be rarely queried or becomes less valuable for reference as time goes by. With this method, we designed and implemented a hybrid XML storage and query processing system and compared it with the conventional system, in which an XML node is the only unit of storage. Through experiments, we compared storage efficiency and query processing performance, validating the effectiveness of the proposed system.

Global Impact of Institutional Repositories in South Korean University (국내 대학 리포지터리의 세계적 영향력에 관한 연구)

  • Shin, Eun-Ja
    • Journal of the Korean Society for Information Management / v.34 no.1 / pp.197-218 / 2017
  • This study attempts to measure the visibility and impact of university repositories in South Korea with the help of their web sites, OpenDOAR, ROAR, and RWR. To better understand the status of self-archiving in South Korea, the results were compared with the reputation and influence of university repositories in major Asian countries. The results showed that only nine institutional repositories of universities in South Korea were active. Only one South Korean university repository was in the RWR top 500, and all the others ranked near the bottom. By contrast, among Asian countries, Japan and Taiwan have established many institutional repositories, with 257 and 52 respectively. Encouragingly, some leading university repositories in South Korea have begun to activate self-archiving by linking to their own research output management systems. Similar efforts by other South Korean university repositories are expected to bring substantial quantitative growth in the near future.

A Study on Data Requirements and Quality Verification for Legal Deposit and Acquisition Tasks of Domestic Electronic Publications (국내 유통 전자출판물의 납본 및 수집을 위한 데이터 요구사항 및 품질 검증 연구)

  • Gyuhwan Kim;Soojung Kim;Daekeun Jeong
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.35 no.1 / pp.127-148 / 2024
  • This study proposes attributes to consider, and strategies for standardizing them, when the National Library of Korea collects data on electronic publications from domestic distributors. A survey and a Focus Group Interview (FGI) with the staff responsible for legal deposit and acquisition tasks at the National Library of Korea identified a total of 21 essential and optional attributes. The data quality verification process showed that additional attributes were needed, leading to the specification of essential and optional attributes for each type of material, including eBooks, audiobooks, webtoons, and web novels. Standardizing attribute values, which is essential for making electronic publications easier to identify and manage, included adherence to ISO 8601 rules for dates and times, clear designation of limited-range attribute values such as file format and adult content, and detailed description of title-related information. The study also highlights the need for standardized metadata requirements and for continuous data quality management and monitoring systems.
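
The standardization rules named in the abstract (ISO 8601 dates, limited-range values for file format and adult content) can be illustrated with a small validation sketch. The attribute names, the allowed file formats, and the choice of the `YYYY-MM-DD` calendar form are assumptions for illustration; the abstract does not give the study's actual attribute list or value sets.

```python
import re
from datetime import date

# Hypothetical controlled vocabulary; the study does not list the allowed values.
ALLOWED_FILE_FORMATS = {"EPUB", "PDF"}
_ISO_DATE = re.compile(r"([0-9]{4})-([0-9]{2})-([0-9]{2})")


def is_iso8601_date(value: str) -> bool:
    """Return True if value is a real calendar date written as YYYY-MM-DD."""
    match = _ISO_DATE.fullmatch(value)
    if match is None:
        return False
    year, month, day = (int(part) for part in match.groups())
    try:
        date(year, month, day)
    except ValueError:
        return False  # e.g. month 13 or 30 February
    return True


def validate_record(record: dict) -> list:
    """Return the names of attributes that fail the checks (hypothetical names)."""
    problems = []
    if not is_iso8601_date(str(record.get("publication_date", ""))):
        problems.append("publication_date")
    if record.get("file_format") not in ALLOWED_FILE_FORMATS:
        problems.append("file_format")
    if not isinstance(record.get("adult_content"), bool):
        problems.append("adult_content")
    return problems


good = {"publication_date": "2024-03-01", "file_format": "EPUB", "adult_content": False}
bad = {"publication_date": "2024.3.1", "file_format": "hwp", "adult_content": "N"}
print(validate_record(good))
print(validate_record(bad))
```

Returning the list of failing attribute names, rather than a single pass/fail flag, makes it easier to report back to a distributor which fields need correcting.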

Metadata extraction using AI and advanced metadata research for web services (AI를 활용한 메타데이터 추출 및 웹서비스용 메타데이터 고도화 연구)

  • Sung Hwan Park
    • The Journal of the Convergence on Culture Technology / v.10 no.2 / pp.499-503 / 2024
  • Broadcast programs are delivered not only through a broadcaster's own channels but also through other media such as Internet replay, OTT, and IPTV services. It is therefore very important to provide search keywords that represent the characteristics of the content well. Broadcasters mainly enter keywords manually during production and archiving. This approach does not yield enough core metadata and limits content recommendation and use in other media services. This study supports securing a large amount of metadata by using closed caption data pre-archived through the DTV closed captioning server developed at EBS. First, core metadata was extracted automatically by applying Google's natural language AI technology. Next, as its main contribution, the study proposes a method of finding core metadata that reflects priorities and content characteristics. To obtain differentiated metadata weights, importance was classified using the TF-IDF calculation method. The experiment successfully produced weight data. Combined with future work on string similarity measurement, the string metadata obtained in this study can become the basis for securing sophisticated content recommendation metadata for content services provided to other media.
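
The TF-IDF weighting mentioned in the abstract can be sketched as follows. This is a textbook variant (relative term frequency multiplied by the natural log of inverse document frequency), not necessarily the formula used in the study, and the tokenized captions are invented sample data.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(documents)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Invented caption keywords for three programs.
captions = [
    ["volcano", "eruption", "island", "volcano"],
    ["island", "travel", "food"],
    ["food", "recipe", "kimchi", "food"],
]
weights = tf_idf(captions)
top = max(weights[0], key=weights[0].get)
print(top)
```

A term that is frequent in one program's captions but rare across the collection receives the highest weight, which is the property that makes the score usable for ranking candidate keywords.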

A Study on Availability of AtoM for Recording Korean Wave Culture Contents : A Case of K-Food Contents (한류문화콘텐츠의 기록화를 위한 AtoM 활용 방안에 관한 연구 K-Food 콘텐츠를 중심으로)

  • Shim, Gab-yong;Yoo, Hyeon-Gyeong;Moon, Sang-Hoon;Lee, Youn-Yong;Lee, Jeong-Hyeon;Kim, Yong
    • The Korean Journal of Archival Studies / no.43 / pp.5-42 / 2015
  • Korean Wave 3.0 focuses on 'K-Culture', which takes in traditional culture and the arts as well as existing cultural content, and treats everything about Korean culture as material for Korean Wave cultural content. Because such content reflects contemporary society, it needs to be preserved as archives and records with important evidential value. Against this background, this study analyzes how these materials are currently managed and aims to implement an RMS based on AtoM that manages various kinds of Korean Wave cultural content. At present, organizations dealing with Korean culture such as K-Pop, K-Food, and K-Movie manage these materials individually. However, the lack of coordination among organizations makes it hard to accumulate information and reproduce high-quality content. To solve these problems, this study proposes an RMS based on the open source software Access to Memory (AtoM) for managing and recording Korean Wave cultural content. AtoM provides functions for managing records and archives such as accumulation, classification, description, and browsing. Furthermore, as open source software, AtoM is free and easy to implement and use. This study therefore implemented an AtoM-based RMS to manage Korean Wave cultural content methodically according to the functional requirements of an RMS. K-Food content was chosen as the material to collect, classify, and describe, and the ISAD(G) standard was selected for description.