• Title/Summary/Keyword: metadata research

Search Result 403, Processing Time 0.024 seconds

Research on the Development of Facets for Improvement in Searching Records: Focusing on Presidential Records (기록물의 검색 향상을 위한 패싯 개발에 관한 연구 - 대통령기록물을 중심으로 -)

  • Seong, Hyoju;Rieh, Hae-young
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.17 no.2
    • /
    • pp.165-188
    • /
    • 2017
  • As the recognition of the importance of user-oriented services is increasing, there has been a heightened attention for finding aids that could improve the effectiveness of searching. This study tried to draw various facet elements that can be applied to the presidential records retrieval system using presidential records as cases in analyzing various resources, considering the importance of facets in finding aids for the improvement of effectiveness in searching in the future and the importance of presidential records in Korea. In drawing facet elements based on the characteristics of presidential records, the websites of the National Archives (NARA) and Presidential (Prime Ministers') Archives as well as their search options were examined as cases. In addition, the morpheme of each title of presidential records were analyzed, as well as the terms entered by the users of the Presidential Archives Portal of Korea, the terms used in the request for information disclosure toward the Presidential Archives in Korea, the search options of the Presidential Archives Portal, and the elements of the description and metadata standards. The significance of this study lies on suggesting the methodology of developing various facets as main elements in finding aids using the presidential records as cases.

Construction of Component Repository for Supporting the CBD Process (CBD 프로세스 지원을 위한 컴포넌트 저장소의 구축)

  • Cha, Jung-Eun;Kim, Hang-Kon
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.7
    • /
    • pp.476-486
    • /
    • 2002
  • CBD(Component Based Development) has become the best strategical method for the business application. Because CBD is a new development paradigm which makes it possible to assemble the software components for application, it copes with the rapid challenge of business process and meets the increasing requirements for productivity. Since the business process is rapidly changing, CBD technology is the promising way to solve the productivity. Especially, the repository is the most important part for the development, distribution and reuse of components. In component repository, we can store and manage the related work-products produced at each step of component development as well as component itself. In this paper, we suggested a practical approach for repository construction to support and realize the CBD process and developed the CRMS(Component Repository Management System) as implementation product of the proposed techniques. CRMS can manage a variety of component products based on component architecture, and help software developers to search a candidate component for their project and to understand a variety of information for the component. In the paper, a practical approach for component repository was suggested, and a supporting environment was constructed to make CBD to be working efficiently. We expect this work wall be valuable research for component repository and the entire supporting Component Based Development Process.

A Study on the Improvement of the Descriptive Form of the Edition and Impression in the Korean Cataloguing Rule (한국목록규칙에서 판과 쇄의 서지요소 기술형식 개선안 연구)

  • Lee, Myounggyu
    • Journal of Korean Library and Information Science Society
    • /
    • v.52 no.3
    • /
    • pp.217-238
    • /
    • 2021
  • The purpose of this study is to identify the problems of describing bibliographic elements such as edition, impression, and year of publication in the current Cataloging Rules, and to present improvements to the KCR and the KORMARC Format. According to changes in the information technology environment in the library, the level and scope of the cataloging that can be provided to users should be adjusted. As data is diversified and descriptive method of metadata develops, the development of bibliographic elements and the change of the descriptive method are changing the framework of the existing standardization. As a research method, we will examine the concept of edition and impression, examine the transition process of description of edition and impression targeting KCR. And we would like to present the methods of bibliographic description to be added or modified through cataloging examples, and examine the format reflected in KORMARC. This study is intended to be limited to the descriptive form of books. The bibliographic elements of the entry in the library must be fully consistent with the bibliographic elements of the material to be catalogued. This study suggests an improvement to the cataloging rules. First, the edition of the cataloging rules shall describe the edition statement and the impression statement. In addition, the description of the year of publication describes the latest year, not the first year of publication of the edition of the material to be catalogued. The descriptive scope of the year of publication is extended to the month of publication. Second, in the KORMARC format, the description of impression in edition statement is given to the identifier in the 250 field, and is numerically described as the second impression. When presenting the month and year of publication, one identifier in the field 260 describes the year of publication and the other identifier describes the month of publication. The description for the month and year of publication is stated the latest month and year on the title page and the colophon of the publication.

Semi-automatic Construction of Learning Set and Integration of Automatic Classification for Academic Literature in Technical Sciences (기술과학 분야 학술문헌에 대한 학습집합 반자동 구축 및 자동 분류 통합 연구)

  • Kim, Seon-Wu;Ko, Gun-Woo;Choi, Won-Jun;Jeong, Hee-Seok;Yoon, Hwa-Mook;Choi, Sung-Pil
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.4
    • /
    • pp.141-164
    • /
    • 2018
  • Recently, as the amount of academic literature has increased rapidly and complex researches have been actively conducted, researchers have difficulty in analyzing trends in previous research. In order to solve this problem, it is necessary to classify information in units of academic papers. However, in Korea, there is no academic database in which such information is provided. In this paper, we propose an automatic classification system that can classify domestic academic literature into multiple classes. To this end, first, academic documents in the technical science field described in Korean were collected and mapped according to class 600 of the DDC by using K-Means clustering technique to construct a learning set capable of multiple classification. As a result of the construction of the training set, 63,915 documents in the Korean technical science field were established except for the values in which metadata does not exist. Using this training set, we implemented and learned the automatic classification engine of academic documents based on deep learning. Experimental results obtained by hand-built experimental set-up showed 78.32% accuracy and 72.45% F1 performance for multiple classification.

Multimodal Sentiment Analysis Using Review Data and Product Information (리뷰 데이터와 제품 정보를 이용한 멀티모달 감성분석)

  • Hwang, Hohyun;Lee, Kyeongchan;Yu, Jinyi;Lee, Younghoon
    • The Journal of Society for e-Business Studies
    • /
    • v.27 no.1
    • /
    • pp.15-28
    • /
    • 2022
  • Due to recent expansion of online market such as clothing, utilizing customer review has become a major marketing measure. User review has been used as a tool of analyzing sentiment of customers. Sentiment analysis can be largely classified with machine learning-based and lexicon-based method. Machine learning-based method is a learning classification model referring review and labels. As research of sentiment analysis has been developed, multi-modal models learned by images and video data in reviews has been studied. Characteristics of words in reviews are differentiated depending on products' and customers' categories. In this paper, sentiment is analyzed via considering review data and metadata of products and users. Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Self Attention-based Multi-head Attention models and Bidirectional Encoder Representation from Transformer (BERT) are used in this study. Same Multi-Layer Perceptron (MLP) model is used upon every products information. This paper suggests a multi-modal sentiment analysis model that simultaneously considers user reviews and product meta-information.

Degree Programs in Data Science at the School of Information in the States (미국 정보 대학의 데이터사이언스 학위 현황 연구)

  • Park, Hyoungjoo
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.2
    • /
    • pp.305-332
    • /
    • 2022
  • This preliminary study examined the degree programs in data science at the School of Information in the States. The focus of this study was the data science degrees offered at the School of Information awarded by the 64 Library and Information Science (LIS) programs accredited by the American Library Association (ALA) in 2022. In addition, this study examined the degrees, majors, minors, specialized tracks, and certificates in data science, as well as the potential careers after earning a data science degree. Overall, eight Schools of Information (iSchools) offered 12 data science degrees. Data science courses at the School of Information focus on topics such as introduction to data science, information retrieval, data mining, database, data and humanities, machine learning, metadata, research methods, data analysis and visualization, internship/capstone, ethics and security, user, policy, and curation and management. Most schools did not offer traditional LIS courses. After earning the data science degree in the School of Information, the potential careers included data scientists, data engineers and data analysts. The researcher hopes the findings of this study can be used as a starting point to discuss the directions of data science programs from the perspectives of the information field, specifically the degrees, majors, minors, specialized tracks and certificates in data science.

Study on the Proposal for Deposit Linkage Plan Based on the Survey of Online Material Identification System (온라인 자료 식별체계 실태조사를 기반으로 한 납본연계방안 제안 연구)

  • Younghee Noh;Aekyoung Son;Kyung Sun Lee;Inho Chang;Youngmi Jung;Hyunju Cha
    • Journal of the Korean Society for information Management
    • /
    • v.41 no.1
    • /
    • pp.133-162
    • /
    • 2024
  • The rapid digitalization has highlighted the importance of identifying and managing online resources. Especially, the need for a systematic identification system for the efficient distribution and preservation of digital content is growing. This study aims to respond to these contemporary demands by investigating the current state of identification systems for online resources and exploring more systematic management and utilization methods through linking these systems with legal deposit. To achieve this, the study surveyed the identification systems and their issuance status for online resources and analyzed prior research related to these online resources. Based on the analysis, the proposed strategies for linking with legal deposit can be summarized into three categories: First, to prioritize and enhance the utilization of legal deposit, strategies are required to strengthen the mutual complementarity of deposit and use, to assign priorities to certain deposits, and to increase the usability of deposited materials. Second, as strategies based on international standard numbers for linking with legal deposit, it is necessary to integrate ISBN and UCI in the deposit process, to link international standard resource numbers with deposit, to interconnect metadata between international standard numbers and UCI, to integrate UCI and ICN, and to introduce automation technology for upgrading the deposit system. Third, to effectively implement the aforementioned strategies, policy support is essential. This includes enhancing the role of the Korean Bibliographic Standards Center, strengthening cooperation with publishers, compensating for deposited materials, and increasing awareness and institutional compensation for the legal deposit system.

A Study on Database Design Model for Production System Record Management Module in DataSet Record Management (데이터세트 기록관리를 위한 생산시스템 기록관리 모듈의 DB 설계 모형연구)

  • Kim, Dongsu;Yim, Jinhee;Kang, Sung-hee
    • The Korean Journal of Archival Studies
    • /
    • no.78
    • /
    • pp.153-195
    • /
    • 2023
  • RDBMS is a widely used database system worldwide, and the term dataset refers to the vast amount of data produced in administrative information systems using RDBMS. Unlike business systems that mainly produce administrative documents, administrative information systems generate records centered around the unique tasks of organizations. These records differ from traditional approval documents and metadata, making it challenging to seamlessly transfer them to standard record management systems. With the 2022 revision of the 'Public Records Act Enforcement Decree,' dataset was included in the types of records for which only management authority is transferred. The core aspect of this revision is the need to manage the lifecycle of records within administrative information systems. However, there has been little exploration into how to manage dataset within administrative information systems. As a result, this research aims to design a database for a record management module that needs to be integrated into administrative information systems to manage the lifecycle of records. By modifying and supplementing ISO 16175-1:2020, we are designing an "human resource management system" and identifying and evaluating personnel management dataset. Through this, we aim to provide a concrete example of record management within administrative information systems. It's worth noting that the prototype system designed in this research has limitations in terms of data volume compared to systems currently in use within organizations, and it has not yet been validated by record researchers and IT developers in the field. However, this endeavor has allowed us to understand the nature of dataset and how they should be managed within administrative information systems. It has also affirmed the need for a record management module's database within administrative information systems. In the future, once a complete record management module is developed and standards are established by the National Archives, it is expected to become a necessary module for organizations to manage dataset effectively.

Research on Archive Opening and Sharing Projects of Korean Terrestrial Broadcasters and External Users of Shared Archives : Focusing on the Case of the 5.18 Footage Video Sharing Project 〈May Story(Owol-Iyagi)〉 Contest Organized by KBS (국내 지상파 방송사의 아카이브 개방·공유 사업과 아카이브 이용자 연구 KBS 5.18 아카이브 시민공유 프로젝트 <5월이야기> 공모전 사례를 중심으로)

  • Choi, Hyojin
    • The Korean Journal of Archival Studies
    • /
    • no.78
    • /
    • pp.197-249
    • /
    • 2023
  • This paper focus on the demand for broadcast and video archive contents by users outside broadcasters as the archive openness and sharing projects of terrestrial broadcasters have become more active in recent years. In the process of creating works using broadcasters' released video footage, the study examined the criteria by which video footage is selected and the methods and processes utilized for editing. To this end, the study analyzed the the case of the 5.18 footage video sharing project 〈May Story(Owol-Iyagi)〉 contest organized by KBS in 2022, in which KBS released its footage about the May 18 Democratic Uprising and invited external users to create new content using them. Analyzing the works that were selected as the winners of the contest, the research conducts in-depth interviews with the creators of each work. As a result, the following points are identified. Among the submitted works, many works deal with the direct or indirect experience of the May 18 Democratic Uprising and focus on the impact of this historical event on individuals and our current society. The study also examined the ways in which broadcasters' footage is used in secondary works. We found ways to use video as a means to share historical events, or to present video as evidence or metaphor. It is found that the need for broadcasters to provide a wider range of public video materials such as the May 18 Democratic Uprising, describing more metadata including copyright information before releasing selected footage, ensuring high-definition and high-fidelity videos that can be used for editing, and strengthening streaming or downloading functions for user friendliness. Through this, the study explores the future direction of broadcasters' video data openness and sharing business, and confirms that broadcasters' archival projects can be an alternative to fulfill public responsibilities such as strengthening social integration between regions, generations, and classes through moving images.

  • PDF

Product Community Analysis Using Opinion Mining and Network Analysis: Movie Performance Prediction Case (오피니언 마이닝과 네트워크 분석을 활용한 상품 커뮤니티 분석: 영화 흥행성과 예측 사례)

  • Jin, Yu;Kim, Jungsoo;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.49-65
    • /
    • 2014
  • Word of Mouth (WOM) is a behavior used by consumers to transfer or communicate their product or service experience to other consumers. Due to the popularity of social media such as Facebook, Twitter, blogs, and online communities, electronic WOM (e-WOM) has become important to the success of products or services. As a result, most enterprises pay close attention to e-WOM for their products or services. This is especially important for movies, as these are experiential products. This paper aims to identify the network factors of an online movie community that impact box office revenue using social network analysis. In addition to traditional WOM factors (volume and valence of WOM), network centrality measures of the online community are included as influential factors in box office revenue. Based on previous research results, we develop five hypotheses on the relationships between potential influential factors (WOM volume, WOM valence, degree centrality, betweenness centrality, closeness centrality) and box office revenue. The first hypothesis is that the accumulated volume of WOM in online product communities is positively related to the total revenue of movies. The second hypothesis is that the accumulated valence of WOM in online product communities is positively related to the total revenue of movies. The third hypothesis is that the average of degree centralities of reviewers in online product communities is positively related to the total revenue of movies. The fourth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. The fifth hypothesis is that the average of betweenness centralities of reviewers in online product communities is positively related to the total revenue of movies. To verify our research model, we collect movie review data from the Internet Movie Database (IMDb), which is a representative online movie community, and movie revenue data from the Box-Office-Mojo website. The movies in this analysis include weekly top-10 movies from September 1, 2012, to September 1, 2013, with in total. We collect movie metadata such as screening periods and user ratings; and community data in IMDb including reviewer identification, review content, review times, responder identification, reply content, reply times, and reply relationships. For the same period, the revenue data from Box-Office-Mojo is collected on a weekly basis. Movie community networks are constructed based on reply relationships between reviewers. Using a social network analysis tool, NodeXL, we calculate the averages of three centralities including degree, betweenness, and closeness centrality for each movie. Correlation analysis of focal variables and the dependent variable (final revenue) shows that three centrality measures are highly correlated, prompting us to perform multiple regressions separately with each centrality measure. Consistent with previous research results, our regression analysis results show that the volume and valence of WOM are positively related to the final box office revenue of movies. Moreover, the averages of betweenness centralities from initial community networks impact the final movie revenues. However, both of the averages of degree centralities and closeness centralities do not influence final movie performance. Based on the regression results, three hypotheses, 1, 2, and 4, are accepted, and two hypotheses, 3 and 5, are rejected. This study tries to link the network structure of e-WOM on online product communities with the product's performance. Based on the analysis of a real online movie community, the results show that online community network structures can work as a predictor of movie performance. The results show that the betweenness centralities of the reviewer community are critical for the prediction of movie performance. However, degree centralities and closeness centralities do not influence movie performance. As future research topics, similar analyses are required for other product categories such as electronic goods and online content to generalize the study results.