• Title/Summary/Keyword: Text Reuse

Search Result 22, Processing Time 0.024 seconds

Detecting Local Text Reuse in the Texts of East Asian Traditional Medicine (한의학 고문헌 텍스트에서의 인용문 추정과 탐색)

  • Oh, Junho
    • Journal of Korean Medical classics
    • /
    • v.34 no.1
    • /
    • pp.37-45
    • /
    • 2021
  • Objectives : The purpose of this paper was to examine quantitative methods for estimating and detecting local text reuse in the texts of East Asian Traditional Medicine. Methods : We introduce techniques that estimate the volume of local text reuse with n-gram and those that directly detect the reuse with the Smith-Waterman algorithm (SW algorithm). Based on this, the estimation and detection of local text reuse were carried out for 『Donguibogam』 and 『Huangdineijing·Suwen』. Results : Estimates with n-gram had more errors than methods with SW algorithms. SW algorithms detected suspected strings directly with local text reuse, resulting in more accurate results. Conclusions : Although n-gram does not accurately find local text reuse, its high speed makes it a preferable method for certain purposes, such as screening similar documents. On the other hand, SW algorithms have the advantage of being relatively good at finding similar phrases suspected as local text reuse even if the strings do not completely match. However, due to its excessive consumption of time and computing resources, its benefits are limited to cases where precise results are required.

Query Formulation for Heuristic Retrieval in Obfuscated and Translated Partially Derived Text

  • Kumar, Aarti;Das, Sujoy
    • Journal of Information Science Theory and Practice
    • /
    • v.3 no.1
    • /
    • pp.24-39
    • /
    • 2015
  • Pre-retrieval query formulation is an important step for identifying local text reuse. Local reuse with high obfuscation, paraphrasing, and translation poses a challenge of finding the reused text in a document. In this paper, three pre-retrieval query formulation strategies for heuristic retrieval in case of low obfuscated, high obfuscated, and translated text are studied. The strategies used are (a) Query formulation using proper nouns; (b) Query formulation using unique words (Hapax); and (c) Query formulation using most frequent words. Whereas in case of low and high obfuscation and simulated paraphrasing, keywords with Hapax proved to be slightly more efficient, initial results indicate that the simple strategy of query formulation using proper nouns gives promising results and may prove better in reducing the size of the corpus for post processing, for identifying local text reuse in case of obfuscated and translated text reuse.

The effects of OTTservice information system quality on reuse intention (OTT 서비스 정보시스템 품질이 재사용의도에 미치는 영향)

  • Eom, Ji Yeon;Lim, Yeong Woo;Kwahk, Kee-Young
    • The Journal of Information Systems
    • /
    • v.32 no.3
    • /
    • pp.63-83
    • /
    • 2023
  • Purpose With the continuous growth of the OTT services market, trust issues are becoming increasingly important, but research on this topic is still in its infancy. The purpose of this study is to identify the structural relationship between information system quality and reuse intention of OTT services and to analyze the impact of trust and user satisfaction. Design/methodology/approach This study proposed a research model based on the information system success model. In this study, a survey was conducted among 236 Korean users who have used OTT services within the last six months. Findings The results of the analysis showed that text quality and visual quality had a significant impact on trust in OTT services, with text quality having the largest impact. System quality and text quality also had a significant impact on trust in OTT service providers. However, visual quality did not have a statistically significant effect on trust in the service provider. Trust in the OTT service and the service provider was analyzed to have a significant impact on user satisfaction. However, it did not have a statistically significant impact on reuse intention. These findings have important implications for improving trust in OTT services to increase users' reuse intentions. It is also expected to contribute to further expanding the field of OTT service research.

The relationship between public acceptance of nuclear power generation and spent nuclear fuel reuse: Implications for promotion of spent nuclear fuel reuse and public engagement

  • Roh, Seungkook;Kim, Dongwook
    • Nuclear Engineering and Technology
    • /
    • v.54 no.6
    • /
    • pp.2062-2066
    • /
    • 2022
  • Nuclear energy sources are indispensable in cost effectively achieving carbon neutral economy, where public opinion is critical to adoption as the consequences of nuclear accident can be catastrophic. In this context, discussion on spent nuclear fuel is a prerequisite to expanding nuclear energy, as it leads to the issue of radioactive waste disposal. Given the dearth of study on spent nuclear fuel public acceptance, we use text mining and big data analysis on the news article and public comments data on Naver news portal to identify the Korean public opinion on spent nuclear fuel. We identify that the Korean public is more interested in the nuclear energy policy than spent nuclear fuel itself and that the alternative energy sources affect the position towards spent nuclear fuel. We recommend relating spent nuclear fuel issue with nuclear energy policy and environmental issues of alternative energy sources to further promote spent nuclear fuel.

A Study on the Factors Affecting the Characteristics of Mobile App for Disabled Libraries' Full-text Service on User's Satisfaction and Reuse Intention (장애인도서관 원문서비스 모바일 앱의 특성이 사용자의 만족도 및 재사용 의도에 미치는 영향요인 연구)

  • Jang, Bo-Seong
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.1
    • /
    • pp.329-347
    • /
    • 2020
  • This study analyzed the effect of app characteristics of the handicapped library on user satisfaction and intention to reuse using the technology acceptance model. As a result, the app's accessibility, convenience, innovation, and reliability have a significant effect on perceived usefulness, and instant accessibility, accuracy, and interactivity have no significant effect. The characteristics of all apps were analyzed to have a significant effect on perceived ease of use. The regression model for perceived ease of use and perceived usefulness was statistically significant. It was found that perceived ease of use and perceived usefulness had a positive effect on user satisfaction and satisfaction was intended to be reused.

The Implications of Current Practices Relating to the Sharing, Reuse, and Citation of Research Software for the Future of Research (연구소프트웨어의 공유, 재사용 및 인용과 관련된 현재 관행의 의미)

  • Park, Hyoungjoo;Wolfram, Dietmar
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.4
    • /
    • pp.65-82
    • /
    • 2021
  • The purpose of this research is to explore the phenomenon of the sharing, reuse, and citation of research software. These practices are playing an increasingly important role in scholarly communication. The researchers found that the citation and reuse of research software are currently uncommon or at least not reflected in the Data Citation Index (DCI). Such citation was observed, however, for the newer software in a number of prominent repositories. The repositories Comprehensive R Archive Network (CRAN) and Zenodo received the most formal software citations. The researchers observed both formal and informal forms of citation when researchers reused software. The latter form involves mentioning research software in passing in the main text of articles, while formal citations appear in the references section. In addition, our comparative analysis helps to explain the phenomenon of self-citation of research software.

The Analysis of Research Trends in Social Service Quality Using Text Mining and Topic Modeling (텍스트 마이닝과 토픽모델링 활용한 사회서비스 품질의 학술연구 동향 분석)

  • Lee, Hae-Jung;Youn, Ki-Hyok
    • Journal of Internet of Things and Convergence
    • /
    • v.8 no.3
    • /
    • pp.29-40
    • /
    • 2022
  • The aim of this study was to analyze research trends of social service quality from 2007 to 2020 based on text mining and topic modeling. Our focus was to provide foundational materials for social service improvement by discovering the latent meaning of relevant research papers. We collected 97 scholarly articles on social service, social welfare service, and quality from RISS, and implemented two segments of text mining analysis. Our results showed that the first section included 38 papers and the second 59, indicating 6.9 articles annually. Word frequency results demonstrated that the common keywords of both sections were 'service', 'quality', 'social service', 'satisfaction', 'users', 'quality control', 'reuse', 'policy', 'voucher', etc. TF-IDF suggested that 'social service', 'satisfaction', 'users', 'customer satisfaction', 'revisiting', 'voucher', 'quality', 'assisted living facility', 'quality control', 'community service investment business', etc., were represented in both categories. Lastly, topic modeling analysis revealed that the first segment displayed 'types of care services', 'service costs', 'reuse', 'users based', and 'job creation', whereas the second presented 'service quality', 'public value', 'management system of human resources', 'service provision system', and 'service satisfaction'. Future directions of social service quality were discussed based on the results.

Spatiotemporal Removal of Text in Image Sequences (비디오 영상에서 시공간적 문자영역 제거방법)

  • Lee, Chang-Woo;Kang, Hyun;Jung, Kee-Chul;Kim, Hang-Joon
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.2
    • /
    • pp.113-130
    • /
    • 2004
  • Most multimedia data contain text to emphasize the meaning of the data, to present additional explanations about the situation, or to translate different languages. But, the left makes it difficult to reuse the images, and distorts not only the original images but also their meanings. Accordingly, this paper proposes a support vector machines (SVMs) and spatiotemporal restoration-based approach for automatic text detection and removal in video sequences. Given two consecutive frames, first, text regions in the current frame are detected by an SVM-based texture classifier Second, two stages are performed for the restoration of the regions occluded by the detected text regions: temporal restoration in consecutive frames and spatial restoration in the current frame. Utilizing text motion and background difference, an input video sequence is classified and a different temporal restoration scheme is applied to the sequence. Such a combination of temporal restoration and spatial restoration shows great potential for automatic detection and removal of objects of interest in various kinds of video sequences, and is applicable to many applications such as translation of captions and replacement of indirect advertisements in videos.