• Title/Summary/Keyword: word database

A Study on the Intellectual Structure of Metadata Research by Using Co-word Analysis (동시출현단어 분석에 기반한 메타데이터 분야의 지적구조에 관한 연구)

  • Choi, Ye-Jin;Chung, Yeon-Kyoung
    • Journal of the Korean Society for Information Management / v.33 no.3 / pp.63-83 / 2016
  • As the use of information resources produced in various media and forms has increased, metadata has become increasingly crucial as a tool of information organization for describing those resources. The purpose of this study is to analyze and demonstrate the intellectual structure of the metadata field through co-word analysis. The data set was collected from journals registered in the Core Collection of the Web of Science citation database for the period from January 1, 1998 to July 8, 2016. Bibliographic data for 727 journal articles were collected using a Topic search with the query word 'metadata'. From these 727 articles, 410 articles with author keywords were selected, and after data preprocessing, 1,137 author keywords were extracted. Finally, 37 keywords with a frequency of more than 6 were selected for analysis. To demonstrate the intellectual structure of the metadata field, network analysis was conducted. As a result, 2 domains and 9 clusters were derived, the intellectual relations among keywords in the metadata field were visualized, and keywords with high global and local centrality were identified. Six clusters from cluster analysis were shown on a multidimensional scaling map, and a knowledge structure was proposed based on the correlations among the keywords. The results of this study are expected to help in understanding the intellectual structure of the metadata field through visualization and to guide new approaches in metadata-related studies.
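
The counting step behind a co-word analysis of this kind can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function name `coword_counts`, the toy keyword lists, and the threshold of 2 are all assumptions made for the example (the study itself kept keywords with a frequency of more than 6).

```python
from collections import Counter
from itertools import combinations

def coword_counts(records, min_freq):
    """Return (kept keyword frequencies, co-occurrence counts among kept keywords).

    `records` is a list of author-keyword lists, one list per article.
    """
    # Frequency = number of articles in which a keyword appears.
    freq = Counter(kw for rec in records for kw in set(rec))
    kept = {kw: n for kw, n in freq.items() if n >= min_freq}

    pairs = Counter()
    for rec in records:
        # Sort so that each unordered pair always gets the same key.
        present = sorted(set(rec) & set(kept))
        pairs.update(combinations(present, 2))
    return kept, pairs

# Hypothetical toy data, not taken from the study.
records = [
    ["metadata", "dublin core", "interoperability"],
    ["metadata", "dublin core"],
    ["metadata", "ontology"],
    ["metadata", "interoperability"],
]
kept, pairs = coword_counts(records, min_freq=2)
```

The resulting pair counts form the co-occurrence matrix that network analysis, clustering, or multidimensional scaling would then take as input.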

Topic Model Augmentation and Extension Method using LDA and BERTopic (LDA와 BERTopic을 이용한 토픽모델링의 증강과 확장 기법 연구)

  • Kim, SeonWook;Yang, Kiduk
    • Journal of the Korean Society for Information Management / v.39 no.3 / pp.99-132 / 2022
  • The purpose of this study is to propose AET (Augmented and Extended Topics), a novel method of synthesizing LDA and BERTopic results, and to analyze recently published LIS articles as an experimental application. To this end, 55,442 abstracts from 85 LIS journals in the WoS database, spanning January 2001 to October 2021, were analyzed. AET first constructs a WORD2VEC-based cosine similarity matrix between LDA and BERTopic results, extracts AT (Augmented Topics) by repeating the matrix reordering and segmentation procedures as long as their semantic relations remain valid, and finally determines ET (Extended Topics) by removing any LDA-related residual subtopics from the matrix and ordering the rest by F1 (BERTopic topic size rank, inverse cosine similarity rank). Compared with the baseline LDA result, AET shows that AT effectively concretized the original LDA topic model and that ET discovered new meaningful topics that LDA did not. In the qualitative performance evaluation, AT performs better than LDA, while ET shows similar performance except in a few cases.
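
The first step described above, a cosine similarity matrix between two sets of topics, might look roughly like this. It is only a sketch under stated assumptions: each topic is assumed to already be a single vector (for instance an average of word embeddings of its top terms, which the abstract does not specify), the 2-dimensional vectors are made up, and the reordering, segmentation, and F1 ranking steps of AET are not reproduced.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def similarity_matrix(lda_topics, bert_topics):
    """Rows are LDA topics, columns are BERTopic topics."""
    return [[cosine(u, v) for v in bert_topics] for u in lda_topics]

# Hypothetical topic vectors, for illustration only.
lda_vecs = [[1.0, 0.0], [0.0, 1.0]]
bert_vecs = [[1.0, 0.0], [1.0, 1.0], [0.0, 2.0]]

sim = similarity_matrix(lda_vecs, bert_vecs)
# Index of the most similar BERTopic topic for each LDA topic.
best = [max(range(len(row)), key=row.__getitem__) for row in sim]
```

A BERTopic column with low similarity to every LDA row would be a candidate for a topic that LDA did not find, which is the intuition behind the ET step.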

A Model of Speech Database in Korean in consideration of its segmental phonology (국어 분절음 특성에 맞는 음성 데이터 베이스의 모형)

  • 김종미
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06c / pp.297-302 / 1994
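  • This paper proposes a model of a speech database suited to the segmental characteristics of Korean. A speech database should include 1) information on the inherent phonetic value of each sound, 2) information on adjacent sounds, and 3) probability information based on frequency. To meet these requirements, the model 1) labels the data by phonological unit and edits the inherent-sound and adjacent-sound information, and 2) builds Phoneme Balanced Words according to phonological rules and constraints, keeping permitted adjacent sounds and dropping those that are not permitted. Further, 3) in system evaluation, it gives greater credit to the preferential recognition and synthesis of high-frequency sounds and phoneme strings, and 4) in data accumulation, it offers economy, since the volume of data can be reduced by avoiding redundancy and bias in the phonological functions of the data.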

Modeling the Growth of Neurology Literature

  • Hadagali, Gururaj S.;Anandhalli, Gavisiddappa
    • Journal of Information Science Theory and Practice / v.3 no.3 / pp.45-63 / 2015
  • The word ‘growth’ represents an increase in actual size, implying a change of state. In science and technology, growth may imply an increase in the number of institutions, scientists, publications, etc. The present study examines the growth of neurology literature for the period 1961-2010. A total of 291,702 records covering these fifty years were extracted from the Science Direct Database. The Relative Growth Rate (RGR) and Doubling Time (Dt.) of neurology literature were calculated, and different growth patterns were fitted to check whether neurology literature follows an exponential, linear, or logistic model. The results indicate that the growth of literature in neurology follows neither the linear nor the logistic growth model, but closely follows the exponential growth model. The study concludes that there has been a consistent trend towards increased growth of literature in the field of neurology.
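
The abstract does not spell out how RGR and Dt. were computed. A minimal sketch, assuming the definitions commonly used in scientometrics (RGR as the change in the natural log of the cumulative publication count per unit time, and doubling time as ln 2 divided by RGR), is given below; the counts and years are hypothetical.

```python
import math

def relative_growth_rate(n1, n2, t1, t2):
    """Assumed definition: (ln n2 - ln n1) / (t2 - t1), with n = cumulative count."""
    return (math.log(n2) - math.log(n1)) / (t2 - t1)

def doubling_time(rgr):
    """Assumed definition: ln 2 / RGR, in the same time unit as RGR."""
    return math.log(2) / rgr

# Hypothetical example: the cumulative count doubles over ten years.
rgr = relative_growth_rate(1000, 2000, 2000, 2010)
dt = doubling_time(rgr)
```

Under exponential growth RGR stays roughly constant from period to period, so a steadily falling RGR (and rising doubling time) is one quick sign that a pure exponential model fits less well.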

open-japanese-mesh: assigning MeSH UIDs to Japanese medical terms via open Japanese-English glossaries

  • Yamada, Ryota;Tatieisi, Yuka
    • Genomics & Informatics / v.18 no.2 / pp.22.1-22.3 / 2020
  • The Medical Subject Headings (MeSH) thesaurus is a controlled vocabulary for indexing biomedical documents that is used for document retrieval and other natural language processing purposes. However, although the original English MeSH is freely available, its Japanese translation has a restricted license. We attempted to create an open alternative, and for this purpose we wrote a script for assigning MeSH UIDs to Japanese medical terms using Japanese-English glossaries. From the MeSpEn glossary and MEDUTX dictionary, we generated a 12,457-word Japanese-MeSH dictionary.
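
One way such an assignment could work is to join a Japanese-English glossary with an English-term-to-UID table on the English term. The sketch below is an assumption about the general approach, not the authors' script: `build_ja_mesh`, the case-insensitive exact matching, the three glossary pairs, and the placeholder UID strings (which are not real MeSH UIDs) are all hypothetical.

```python
def build_ja_mesh(ja_en_glossary, en_to_uid):
    """Map each Japanese term to the set of UIDs of its English equivalents.

    Matching is exact after trimming whitespace and lowercasing (an assumption).
    """
    norm = {en.strip().lower(): uid for en, uid in en_to_uid.items()}
    result = {}
    for ja, en in ja_en_glossary:
        uid = norm.get(en.strip().lower())
        if uid is not None:
            result.setdefault(ja, set()).add(uid)
    return result

# Hypothetical inputs; "UID-A" and "UID-B" are placeholders, not real MeSH UIDs.
glossary = [
    ("糖尿病", "Diabetes Mellitus"),
    ("高血圧", "hypertension "),
    ("頭痛", "Headache"),
]
mesh = {"Diabetes Mellitus": "UID-A", "Hypertension": "UID-B"}

ja_mesh = build_ja_mesh(glossary, mesh)
```

Terms whose English gloss has no entry in the UID table (here the third pair) are simply left out, so the size of the resulting dictionary depends on how well the glossaries overlap with the MeSH vocabulary.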

Study on Efficient Generation of Dictionary for Korean Vocabulary Recognition (한국어 음성인식을 위한 효율적인 사전 구성에 관한 연구)

  • Lee Sang-Bok;Choi Dae-Lim;Kim Chong-Kyo
    • Proceedings of the KSPS conference / 2002.11a / pp.41-44 / 2002
  • This paper concerns improving the speech recognition rate by using an enhanced pronunciation dictionary. Modern large-vocabulary continuous speech recognition systems have pronunciation dictionaries. A pronunciation dictionary provides pronunciation information for each word in the vocabulary in phonemic units, which are modeled in detail by the acoustic models. However, most speech recognition systems based on hidden Markov models disregard actual pronunciation variations. Without these variations, the phonetic transcriptions in the dictionary do not match the actual occurrences in the database. In this paper, we propose adding a devoicing rule for semivowels to the allophone rules of the pronunciation dictionary. Experimental results show that the speech recognition system performs better with this dictionary than with existing pronunciation dictionaries.

The structural comparison and analysis of the early and current status in biochip research using KDD (KDD 방법론을 이용한 Biochip 관련 분야들의 초기와 현재의 구조적 비교와 분석)

  • Oh, Hae-Young;Park, Gak-Ro;Park, Sang-Jin;Youn, Moon-Seob
    • Proceedings of the Korean Operations and Management Science Society Conference / 2004.05a / pp.155-158 / 2004
  • The aim of this study is to map the structure of the biochip research field and to analyse ex-ante feasibility at the R&D planning stage using co-word analysis. We used the SCI database as source data to analyze the intellectual structure of biochip research in two different periods (1994-1999 and 1994-2002).

Comparative Analysis of Box-office Related Statistics and Diffusion in Korea and US Film Markets (한국과 미국에 있어 영화 수익관련 통계량과 확산 현상의 비교분석)

  • Kim, Taegu;Hong, Jungsik
    • Korean Management Science Review / v.32 no.1 / pp.133-145 / 2015
  • The motion picture industry in Korea has grown steadily and has attracted various kinds of research attention. In particular, the introduction of an official box-office database service has prompted quantitative studies. However, approaches based on diffusion models have rarely been applied to the domestic film market. In addition to a fundamental statistical review of the Korean and US film markets, we applied a diffusion model to daily box-office revenue. In contrast to the conventional preference for the Gamma distribution in film markets, the estimation results showed that BMIC can also explain the trend of daily revenue successfully. The comparison with BMIC showed a distinctive difference in the diffusion patterns of the Korean and US film markets. In general, the word-of-mouth effect appeared more significant in Korea.

A Study of Security Level Conversion Scheme for Security Documents (보안 문서의 보안 수준 변환을 위한 기법 연구)

  • Cho, Do-Eun;Yeo, Sang-Soo
    • Journal of Advanced Navigation Technology / v.15 no.3 / pp.405-411 / 2011
  • As the value of information has become very high, a large body of research has addressed acquiring, managing, and using information. Companies and organizations classify their documents by managed security level and protect the secured documents. In this paper, we introduce essential technologies for inspecting documents securely and for replacing specific keywords with ordinary words when a higher-security-level document must be converted to a lower-security-level document.
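
The keyword-replacement idea can be illustrated with a simple substitution pass. This is only a sketch of the general notion and not the scheme proposed in the paper: the function `downgrade`, the replacement table, and the sample sentence are hypothetical, and a real system would also need the secure inspection step that the abstract mentions.

```python
import re

def downgrade(text, replacements):
    """Replace each sensitive keyword in `text` with its ordinary-word substitute."""
    if not replacements:
        return text
    # Try longer keywords first so a long phrase wins over a keyword it contains.
    keys = sorted(replacements, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    return pattern.sub(lambda m: replacements[m.group(0)], text)

# Hypothetical keyword table and document text.
table = {"Project Falcon": "the project", "Plant 7": "our facility"}
doc = "Project Falcon ships in Q3 from Plant 7."
public = downgrade(doc, table)
```

Compiling all keywords into one pattern means the text is scanned once, and a substituted word is never itself rescanned for further replacements.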

The Concept of Frailty: A Review of the Literature (노인허약에 대한 고찰)

  • Choi, Kyung-Won;Lee, In-Sook
    • The Korean Journal of Rehabilitation Nursing / v.11 no.2 / pp.67-73 / 2008
  • Purpose: The purpose of this study was to review and identify the meaning and components of the concept of frailty. Method: We conducted a literature review of studies published between 1980 and 2008 that included the word 'frail' or 'frailty', using the MEDLINE and CINAHL databases to select the articles. Results: Frailty is defined as a concept with multiple domains: physical, cognitive, psychological, and social. Critical characteristics of frailty include multidomain deficiency, combined accumulation, diminished ability to maintain independence in daily living, states beyond one's reserve capacity, dynamic relativity, proximity to adverse health outcomes, and aggregated symptoms. Frailty is caused by decreased physical activity, loss of sensory function, chronic symptoms or signs, the relationship with the caregiver, and social isolation. Moreover, frail elderly people are at risk of falls and institutionalization. Conclusion: Frailty is a very useful concept because it has the potential to identify the elderly population at risk of adverse health outcomes. Based on these results, an appropriate tool for screening frail elderly Koreans and nursing interventions for them need to be developed.
