• Title/Summary/Keyword: Similar Keyword

Search Result 114, Processing Time 0.023 seconds

A Study on Structuring Method of Study Data Supporting Efficient Keyword Search (효율적인 키워드 검색을 지원하는 학습자료의 구조화 방법 연구)

  • Kim, Eun-Kyung;Choi, Jin-Oh
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.1
    • /
    • pp.1063-1066
    • /
    • 2005
  • Most reading systems that supply various study data generally support keyword search. But the usual keyword matching techniques have a problem to require the exact keyword matching, and could not find similar field materials. Futhermore, testing materials have too little information to apply the keyword matching search. To solve these problems, this thesis proposes the method to extract the important keyword from study data and to construct the database automatically when the data are stored at the storage. And using prepared similar terminology database, we suggest the intelligent and efficient technique to find study materials.

  • PDF

A Study on the Intellectual Structure Analysis by Keyword Type Based on Profiling: Focusing on Overseas Open Access Field (프로파일링에 기초한 키워드 유형별 지적구조 분석에 관한 연구 - 국외 오픈액세스 분야를 중심으로 -)

  • Kim, Pan Jun
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.55 no.4
    • /
    • pp.115-140
    • /
    • 2021
  • This study divided the keyword sets searched from LISTA database focusing on the overseas open access fields into two types (controlled keywords and uncontrolled keywords), and examined the results of performing an intellectual structure analysis based on profiling for the each keyword type. In addition, these results were compared with those of an intellectual structural analysis based on co-word analysis. Through this, I tried to investigate whether similar results were derived from profiling, another method of intellectual structure analysis, and to examine the differences between co-word analysis and profiling results. As a result, there was a similar difference to the co-word analysis in the results of intellectual structure analysis based on profiling for each of the two keyword types. Also, there were also noticeable differences between the results of intellectual structural analysis based on profiling and co-word analysis. Therefore, intellectual structure analysis using keywords should consider the characteristics of each keyword type according to the research purpose, and better results can be expected to be used based on profiling than co-word analysis to more clearly understand research trends in a specific field.

Development of Similar Bibliographic Retrieval System based on Neighboring Words and Keyword Topic Information (인접한 단어와 키워드 주제어 정보에 기반한 유사 문헌 검색 시스템 개발)

  • Kim, Kwang-Young;Kwak, Seung-Jin
    • Journal of Korean Library and Information Science Society
    • /
    • v.40 no.3
    • /
    • pp.367-387
    • /
    • 2009
  • The similar bibliographic retrieval system follows whether it selects a thing of the extracted index term and or not the difference in which the similar document retrieval system There be many in the search result is generated. In this research, the method minimally making the error of the selection of the extracted candidate index term is provided In this research, the word information in which it is adjacent by using candidate index terms extracted from the similar literature and the keyword topic information were used. And by using the related author information and the reranking method of the search result, the similar bibliographic system in which an accuracy is high was developed. In this paper, we conducted experiments for similar bibliographic retrieval system on a collection of Korean journal articles of science and technology arena. The performance of similar bibliographic retrieval system was proved through an experiment and user evaluation.

  • PDF

A Study On the Healthcare Technology Trends through Patent Data Analysis (특허 데이터 분석을 통한 헬스케어 기술 트렌드 연구)

  • Han, Jeong-Hyeon;Hyun, Young-Geun;Chae, U-ri;Lee, Gi-Hyun;Lee, Joo-Yeoun
    • Journal of Digital Convergence
    • /
    • v.18 no.3
    • /
    • pp.179-187
    • /
    • 2020
  • In a social environment where population aging is rapidly progressing, the healthcare service market is growing fast with the increasing interest in health and quality of life based on rising income levels and the evolution of technology. In this study, after keywords were extracted from Korean and US patent data published on KIPRIS from 2000 to October 2019, frequency analysis, time series analysis, and keyword network analysis were performed. Through this, the change of technology trends were identified, which keywords related to healthcare was shifted from traditional medical words to ICT words. In addition, although the keywords in Korean patents are 55% similar to those in the US, they show an absolute gap in patent production volume. In the next study, we will analyze various data such as domestic and international research and can obtain meaningful implications in the global market on the identified keywords.

Similar Patent Search Service System using Latent Dirichlet Allocation (잠재 의미 분석을 적용한 유사 특허 검색 서비스 시스템)

  • Lim, HyunKeun;Kim, Jaeyoon;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.8
    • /
    • pp.1049-1054
    • /
    • 2018
  • Keyword searching used in the past as a method of finding similar patents, and automated classification by machine learning is using in recently. Keyword searching is a method of analyzing data that is formalized through data refinement. While the accuracy for short text is high, long one consisted of several words like as document that is not able to analyze the meaning contained in sentences. In semantic analysis level, the method of automatic classification is used to classify sentences composed of several words by unstructured data analysis. There was an attempt to find similar documents by combining the two methods. However, it have a problem in the algorithm w the methods of analysis are different ways to use simultaneous unstructured data and regular data. In this paper, we study the method of extracting keywords implied in the document and using the LDA(Latent Semantic Analysis) method to classify documents efficiently without human intervention and finding similar patents.

Analysis of CSR·CSV·ESG Research Trends - Based on Big Data Analysis - (CSR·CSV·ESG 연구 동향 분석 - 빅데이터 분석을 중심으로 -)

  • Lee, Eun Ji;Moon, Jaeyoung
    • Journal of Korean Society for Quality Management
    • /
    • v.50 no.4
    • /
    • pp.751-776
    • /
    • 2022
  • Purpose: The purpose of this paper is to present implications by analyzing research trends on CSR, CSV and ESG by text analysis and visual analysis(Comprehensive/ Fields / Years-based) which are big data analyses, by collecting data based on previous studies on CSR, CSV and ESG. Methods: For the collection of analysis data, deep learning was used in the integrated search on the Academic Research Information Service (www.riss.kr) to search for "CSR", "CSV" and "ESG" as search terms, and the Korean abstracts and keyword were scrapped out of the extracted paper and they are organize into EXCEL. For the final step, CSR 2,847 papers, CSV 395 papers, ESG 555 papers derived were analyzed using the Rx64 4.0.2 program and Rstudio using text mining, one of the big data analysis techniques, and Word Cloud for visualization. Results: The results of this study are as follows; CSR, CSV, and ESG studies showed that research slowed down somewhat before 2010, but research increased rapidly until recently in 2019. Research have been found to be heavily researched in the fields of social science, art and physical education, and engineering. As a result of the study, there were many keyword of 'corporate', 'social', and 'responsibility', which were similar in the word cloud analysis. Looking at the frequent keyword and word cloud analysis by field and year, overall keyword were derived similar to all keyword by year. However, some differences appeared in each field. Conclusion: Government support and expert support for CSR, CSV and ESG should be activated, and researches on technology-based strategies are needed. In the future, it is necessary to take various approaches to them. If researches are conducted in consideration of the environment or energy, it is judged that bigger implications can be presented.

Concealed Policy and Ciphertext Cryptography of Attributes with Keyword Searching for Searching and Filtering Encrypted Cloud Email

  • Alhumaidi, Hind;Alsuwat, Hatim
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.6
    • /
    • pp.212-222
    • /
    • 2022
  • There has been a rapid increase in the use of cloud email services. As a result, email encryption has become more commonplace as concerns about cloud privacy and security grow. Nevertheless, this increase in usage is creating the challenge of how to effectively be searching and filtering the encrypted emails. They are popular technologies of solving the issue of the encrypted emails searching through searchable public key encryption. However, the problem of encrypted email filtering remains to be solved. As a new approach to finding and filtering encrypted emails in the cloud, we propose a ciphertext-based encrypted policy attribute-based encryption scheme and keyword search procedure based on hidden policy ciphertext. This feature allows the user of searching using some encrypted emails keywords in the cloud as well as allowing the emails filter-based server toward filter the content of the encrypted emails, similar to the traditional email keyword filtering service. By utilizing composite order bilinear groups, a hidden policy system has been successfully demonstrated to be secure by our dual system encryption process. Proposed system can be used with other scenarios such as searching and filtering files as an applicable method.

A Study on the Demand Forecasting of Healthcare Technology from a Consumer Perspective : Using Social Data and ARIMA Model Approach (소셜데이터 및 ARIMA 분석을 활용한 소비자 관점의 헬스케어 기술수요 예측 연구)

  • Yang, Dong Won;Lee, Zoon Ky
    • Journal of Information Technology Services
    • /
    • v.19 no.4
    • /
    • pp.49-61
    • /
    • 2020
  • Prior studies on technology predictions attempted to predict the emergence and spread of emerging technologies through the analysis of correlations and changes between data using objective data such as patents and research papers. Most of the previous studies predicted future technologies only from the viewpoint of technology development. Therefore, this study intends to conduct technical forecasting from the perspective of the consumer by using keyword search frequency of search portals such as NAVER before and after the introduction of emerging technologies. In this study, we analyzed healthcare technologies into three types : measurement technology, platform technology, and remote service technology. And for the keyword analysis on the healthcare, we converted the classification of technology perspective into the keyword classification of consumer perspective. (Blood pressure and blood sugar, healthcare diagnosis, appointment and prescription, and remote diagnosis and prescription) Naver Trend is used to analyze keyword trends from a consumer perspective. We also used the ARIMA model as a technology prediction model. Analyzing the search frequency (Naver trend) over 44 months, the final ARIMA models that can predict three types of healthcare technology keyword trends were estimated as "ARIMA (1,2,1) (1,0,0)", "ARIMA (0,1,0) (1,0,0)", "ARIMA (1,1,0) (0,0,0)". In addition, it was confirmed that the values predicted by the time series prediction model and the actual values for 44 months were moving in almost similar patterns in all intervals. Therefore, we can confirm that this time series prediction model for healthcare technology is very suitable.

Structuring Risk Factors of Industrial Incidents Using Natural Language Process (자연어 처리 기법을 활용한 산업재해 위험요인 구조화)

  • Kang, Sungsik;Chang, Seong Rok;Lee, Jongbin;Suh, Yongyoon
    • Journal of the Korean Society of Safety
    • /
    • v.36 no.1
    • /
    • pp.56-63
    • /
    • 2021
  • The narrative texts of industrial accident reports help to identify accident risk factors. They relate the accident triggers to the sequence of events and the outcomes of an accident. Particularly, a set of related keywords in the context of the narrative can represent how the accident proceeded. Previous studies on text analytics for structuring accident reports have been limited to extracting individual keywords without context. We proposed a context-based analysis using a Natural Language Processing (NLP) algorithm to remedy this shortcoming. This study aims to apply Word2Vec of the NLP algorithm to extract adjacent keywords, known as word embedding, conducted by the neural network algorithm based on supervised learning. During processing, Word2Vec is conducted by adjacent keywords in narrative texts as inputs to achieve its supervised learning; keyword weights emerge as the vectors representing the degree of neighboring among keywords. Similar keyword weights mean that the keywords are closely arranged within sentences in the narrative text. Consequently, a set of keywords that have similar weights presents similar accidents. We extracted ten accident processes containing related keywords and used them to understand the risk factors determining how an accident proceeds. This information helps identify how a checklist for an accident report should be structured.

A Study of Keyword Spotting System Based on the Weight of Non-Keyword Model (비핵심어 모델의 가중치 기반 핵심어 검출 성능 향상에 관한 연구)

  • Kim, Hack-Jin;Kim, Soon-Hyub
    • The KIPS Transactions:PartB
    • /
    • v.10B no.4
    • /
    • pp.381-388
    • /
    • 2003
  • This paper presents a method of giving weights to garbage class clustering and Filler model to improve performance of keyword spotting system and a time-saving method of dialogue speech processing system for keyword spotting by calculating keyword transition probability through speech analysis of task domain users. The point of the method is grouping phonemes with phonetic similarities, which is effective in sensing similar phoneme groups rather than individual phonemes, and the paper aims to suggest five groups of phonemes obtained from the analysis of speech sentences in use in Korean morphology and in stock-trading speech processing system. Besides, task-subject Filler model weights are added to the phoneme groups, and keyword transition probability included in consecutive speech sentences is calculated and applied to the system in order to save time for system processing. To evaluate performance of the suggested system, corpus of 4,970 sentences was built to be used in task domains and a test was conducted with subjects of five people in their twenties and thirties. As a result, FOM with the weights on proposed five phoneme groups accounts for 85%, which has better performance than seven phoneme groups of Yapanel [1] with 88.5% and a little bit poorer performance than LVCSR with 89.8%. Even in calculation time, FOM reaches 0.70 seconds than 0.72 of seven phoneme groups. Lastly, it is also confirmed in a time-saving test that time is saved by 0.04 to 0.07 seconds when keyword transition probability is applied.