• Title/Summary/Keyword: Data Sharing and Reuse

Search Result 53, Processing Time 0.026 seconds

Scaling Reuse Detection in the Web through Two-way Boosting with Signatures and LSH

  • Kim, Jong Wook
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.6
    • /
    • pp.735-745
    • /
    • 2013
  • The emergence of Web 2.0 technologies, such as blogs and wiki, enable even naive users to easily create and share content on the Web using freely available content sharing tools. Wide availability of almost free data and promiscuous sharing of content through social networking platforms created a content borrowing phenomenon, where the same content appears (in many cases in the form of extensive quotations) in different outlets. An immediate side effect of this phenomenon is that identifying which content is re-used by whom is becoming a critical tool in social network analysis, including expert identification and analysis of information flow. Internet-scale reuse detection, however, poses extremely challenging scalability issues: considering the large size of user created data on the web, it is essential that the techniques developed for content-reuse detection should be fast and scalable. Thus, in this paper, we propose a $qSign_{lsh}$ algorithm, a mechanism for identifying multi-sentence content reuse among documents by efficiently combining sentence-level evidences. The experiment results show that $qSign_{lsh}$ significantly improves the reuse detection speed and provides high recall.

A Study of Software Product Line Engineering application for Data Link Software

  • Kim, Jin-Woo;Lee, Woo-Sin;Kim, Hack-Joon;Jin, So-Yeon;Jo, Se-Hyeon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.12
    • /
    • pp.65-72
    • /
    • 2018
  • In this paper, we have studied how to reuse common data link software by applying software product line engineering. Existing common data link software performed different stages of design, implementation, and testing without sharing the accumulated knowledge of different developers. In this situation, developers agreed that sharing the assets of each project and reusing the previously developed software would save human and time costs. Even with the initial difficulties, the common Data Link is a continually proposed project in the defense industry, so we decided to build a product line. The common data link software can be divided into two domains. Among them, the initial feature model for the GUI software was constructed, and the following procedure was studied. Through this, we propose a plan to build a product line for core assets and reuse them in newly developed projects.

The Effect of Information Quality and Self-efficacy on Car-sharing Usage Intention (정보품질과 자기효능감이 카셰어링 재이용의도에 미치는 영향)

  • Liu, Bo;Byun, Sookeun
    • Journal of Service Research and Studies
    • /
    • v.13 no.3
    • /
    • pp.20-38
    • /
    • 2023
  • Recently, car sharing has shown the most remarkable growth among sharing economy services. In the process of analyzing the intention to reuse the car sharing service, this study tried to reflect the unique characteristics of the service, which consists of non-face-to-face self-service, such as reservation, approval, handover, inspection, and return of the vehicle. Specifically, in addition to the perceived benefits and the perceived risks, we considered 'information quality' as a platform characteristic and 'self-efficacy' as a personal characteristic. To collect data, an online survey was conducted on adults with experience in car sharing, and a total of 320 responses were used for analysis. As a result of analyzing the structural equation model, it was found that information quality and self-efficacy increased the perceived benefits of services, and the higher the information quality, the higher the self-efficacy. On the other hand, the role of information quality and self-efficacy in lowering perceived risks was insignificant, and the intention to reuse services was more affected by perceived benefits than perceived risks. As a result of further analysis using Process Macro, it was found that the effect of self-efficacy on reuse intention was mediated by perceived benefits. It was analyzed that the indirect effects of information quality on reuse intention through perceived benefits or self-efficacy were all significant. These results suggest that providing timely, sufficient, and easy-to-understand information required by users on the platform improves self-efficacy and increases service reuse intention. In order to increase the number of service users, it is important for service providers not only to provide promotional activities such as offering attractive prices, but also to provide high-quality information so that users can use it more easily.

A Study on Factors Affecting the Reuse of Research Data by Academic Researchers in the Social Sciences (사회과학분야 학술 연구자의 연구데이터 재이용 영향요인 연구)

  • Bak, Ji Won;Chang, Woo Kwon
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.4
    • /
    • pp.199-230
    • /
    • 2021
  • This study is to present an analysis and activation plan for the effect of reuse of research data through investigation of researchers and reuse data on reuse of research data. To this end, 178 copies were analyzed based on the distribution and collection of surveys targeting academic researchers in the field of social science in Korea who have experience in calculating new research results by reusing research data. As a result, 1) Most researchers acquire reuse data through systems such as data repositories, data management systems, and research data DBs, and mainly reuse analysis data produced through experiments and observations. In addition, despite being a researcher who successfully reused research data, the awareness of research data sharing was low and did not share it in the face of various problems. 2) The reliability and validity of 10 factors derived through literature review and factor analysis (academic usefulness, research efficiency, researcher concerns, data vulnerability, direct effort, indirect effort, suitability for reuse, data completeness, data usefulness, and social conditions) were verified. 3) As a result of correlation analysis, research efficiency, social conditions showed a quantitative correlation with research data reuse intention, researcher concerns, data vulnerability, and direct effort showed a negative correlation with research data reuse intention. As a result of regression analysis, all of these factors had a significant effect on the intention to reuse research data, and in the order of research efficiency, social conditions, direct efforts, researchers' concerns, and data vulnerability. Based on this, a plan to revitalize the reuse of research data was proposed.

A Study on the Current Status of Research Data Management by Researchers in Each Academic Field: Focusing on Library and Information Science, Statistics, Ecology, and Korean Musicology (학문분야별 연구자들의 연구데이터 관리 현황에 관한 연구 - 문헌정보학, 통계학, 생태학 및 한국음악학을 중심으로 -)

  • Juseop, Kim;Suntae, Kim;Yeonjung, Han;Won-Jae, Youe;Paul, Jeon;Seong Jun, Yang
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.4
    • /
    • pp.229-247
    • /
    • 2022
  • As part of open science, the government adopts a national research data sharing and utilization strategy, forms a governance system at the national level, and promotes policies for research institutes to implement. Although the policy for managing research data is carried out centering on the academic world, it is insufficient compared to developed countries, and even this is a reality in which researchers do not have enough awareness to introduce it to the field. The purpose of this study is to understand the research data management status of researchers in each academic field. Academic fields consisted of four fields including Library and Information Science, Statistics, Ecology, and Korean Musicology, and the current status of data management was identified through a survey. The current status of research data management was analyzed from the perspective of research data production, sharing and management, saving, preservation and reuse. As a result of the study, it was found that there were differences by discipline in terms of data production, data sharing and management, data preservation, and data reuse, except for data savings.

The Implications of Current Practices Relating to the Sharing, Reuse, and Citation of Research Software for the Future of Research (연구소프트웨어의 공유, 재사용 및 인용과 관련된 현재 관행의 의미)

  • Park, Hyoungjoo;Wolfram, Dietmar
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.4
    • /
    • pp.65-82
    • /
    • 2021
  • The purpose of this research is to explore the phenomenon of the sharing, reuse, and citation of research software. These practices are playing an increasingly important role in scholarly communication. The researchers found that the citation and reuse of research software are currently uncommon or at least not reflected in the Data Citation Index (DCI). Such citation was observed, however, for the newer software in a number of prominent repositories. The repositories Comprehensive R Archive Network (CRAN) and Zenodo received the most formal software citations. The researchers observed both formal and informal forms of citation when researchers reused software. The latter form involves mentioning research software in passing in the main text of articles, while formal citations appear in the references section. In addition, our comparative analysis helps to explain the phenomenon of self-citation of research software.

A Study on Wikidata Utilization for Digital Archives (디지털 아카이브의 위키데이터 활용방안 연구)

  • Han, Sangeun;Park, Heejin
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.22 no.1
    • /
    • pp.201-217
    • /
    • 2022
  • This study aims to use Wikidata to increase access to archival resources and see ways to share and reuse constructed data. To this end, the status of digital archives locally and abroad was identified using the description standard of archival resources in the digital environment discussed in the archival science field and the research related to data management and sharing of digital archives. The structure and characteristics of Wikidata were identified, and cases of digital archives applying Wikidata were investigated. The case analysis suggested implications for data sharing and reuse as well as the considerations for applying Wikidata to digital archives. The results of this study can be used as preliminary data for developing services using Wikidata in the actual archive field by understanding the recent trends in data open and sharing related technologies in constructing digital archives.

Ontology-based Sensor Network Information Sharing

  • Lee, Jiapei;Lee, Hyun-chang;LIU, Xiao-wen;Yan, xuebo;Jin, Chan-Yong;Shin, Seong-yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.10a
    • /
    • pp.375-378
    • /
    • 2016
  • The difficulty of "information sharing", "information reusing" issues happening in Wireless Sensor Network is due to the heterogeneity of the application environment, data processing, communication protocol etc. Based on the introduction of the Ontology theory, though analyzing the sensor characteristic a general type of sensor ontology contains the definition of concept, frame structure and OWL design was proposed from the standpoint of sensor observation. The paper expounded a system framework of the domain ontology through the expansion of knowledge base on the general sensor could achieve the information sharing and reuse by semantic communication between the general sensor ontology and user. The research of this method would bring new idea to the semantic sensor network construction.

  • PDF

SOC Test Compression Scheme Sharing Free Variables in Embedded Deterministic Test Environment

  • Wang, Weizheng;Cai, Shuo;Xiang, Lingyun
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.15 no.3
    • /
    • pp.397-403
    • /
    • 2015
  • This paper presents a new SOC test compression scheme in Embedded Deterministic Test (EDT) compression environment. Compressed test data is brought over the TAM from the tester to the cores in SOC and decompressed in the cores. The proposed scheme allows cores tested at the same time to share some test channels. By sharing free variables in these channels across test cubes of different cores decompressed at the same time, high encoding efficiency is achieved. Moreover, no excess control data is required in this scheme. The ability to reuse excess free variables eliminates the need for high precision in matching the number of test channels with the number of care bits for every core. Experimental results obtained for some SOC designs illustrate effectiveness of the proposed test application scheme.

Development of CAE Data Translation Technique for a Virtual Reality Environment (가상현실 환경을 위한 해석데이터 변환 기술 개발)

  • Song, In-Ho;Yang, Jeong-Sam;Jo, Hyun-Jei;Choi, Sang-Su
    • Korean Journal of Computational Design and Engineering
    • /
    • v.13 no.5
    • /
    • pp.334-341
    • /
    • 2008
  • Computer-aided engineering (CAE) analysis is considered essential for product development because it decreases the simulation time, reduces the prototyping costs, and enhances the reusability of product parts. The reuse of quality-assured CAE data has been continually increasing due to the extension of product lifecycle management; PLM, which is widely used, shortens the product development cycle and improves the product quality. However, less attention has been paid to systematic research on the interoperability of CAE data because of the diversity of CAE data and because the structure of CAE data is more complex than that of CAD data. In this paper, we suggest a CAE data exchange method for the effective sharing of geometric and analysis data. The method relies on heterogeneous CAE systems, a virtual reality system, and our developed CAE middleware for CAE data exchange. We also designed a generic CAE kernel, which is a critical part of the CAE middleware. The kernel offers a way of storing analysis data from various CAE systems, and, with the aid of a script command, enabling the data to be translated for a different system. The reuse of CAE data is enhanced by the fact that the CAE middle-ware can be linked with a virtual reality system or a product data management system.