• 제목/요약/키워드: Word record

Search Result 53, Processing Time 0.029 seconds

Research Trends in Record Management Using Unstructured Text Data Analysis (비정형 텍스트 데이터 분석을 활용한 기록관리 분야 연구동향)

  • Deokyong Hong;Junseok Heo
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.23 no.4
    • /
    • pp.73-89
    • /
    • 2023
  • This study aims to analyze the frequency of keywords used in Korean abstracts, which are unstructured text data in the domestic record management research field, using text mining techniques to identify domestic record management research trends through distance analysis between keywords. To this end, 1,157 keywords of 77,578 journals were visualized by extracting 1,157 articles from 7 journal types (28 types) searched by major category (complex study) and middle category (literature informatics) from the institutional statistics (registered site, candidate site) of the Korean Citation Index (KCI). Analysis of t-Distributed Stochastic Neighbor Embedding (t-SNE) and Scattertext using Word2vec was performed. As a result of the analysis, first, it was confirmed that keywords such as "record management" (889 times), "analysis" (888 times), "archive" (742 times), "record" (562 times), and "utilization" (449 times) were treated as significant topics by researchers. Second, Word2vec analysis generated vector representations between keywords, and similarity distances were investigated and visualized using t-SNE and Scattertext. In the visualization results, the research area for record management was divided into two groups, with keywords such as "archiving," "national record management," "standardization," "official documents," and "record management systems" occurring frequently in the first group (past). On the other hand, keywords such as "community," "data," "record information service," "online," and "digital archives" in the second group (current) were garnering substantial focus.

The Sentence Similarity Measure Using Deep-Learning and Char2Vec (딥러닝과 Char2Vec을 이용한 문장 유사도 판별)

  • Lim, Geun-Young;Cho, Young-Bok
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.10
    • /
    • pp.1300-1306
    • /
    • 2018
  • The purpose of this study is to see possibility of Char2Vec as alternative of Word2Vec that most famous word embedding model in Sentence Similarity Measure Problem by Deep-Learning. In experiment, we used the Siamese Ma-LSTM recurrent neural network architecture for measure similarity two random sentences. Siamese Ma-LSTM model was implemented with tensorflow. We train each model with 200 epoch on gpu environment and it took about 20 hours. Then we compared Word2Vec based model training result with Char2Vec based model training result. as a result, model of based with Char2Vec that initialized random weight record 75.1% validation dataset accuracy and model of based with Word2Vec that pretrained with 3 million words and phrase record 71.6% validation dataset accuracy. so Char2Vec is suitable alternate of Word2Vec to optimize high system memory requirements problem.

Improving the Access Service of National Designated Records in the National Archives of Korea: Focusing on Facet Directory Service (국가기록원의 국가지정기록물 웹 기반 기록정보서비스 개선방안 연구 - 패싯 기반 디렉토리 서비스를 중심으로 -)

  • Jung, Mi Ok;Choi, Sanghee
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.4
    • /
    • pp.217-234
    • /
    • 2019
  • National Records Designation System is designed to protect valuable civilian records from loss or damage. It also intends that government administrates important civilian records to raise public concerns civilian records and to foster archival culture in Korea. This study investigates the current states of service fo the designated record through the web page of National Archive of Korea. Major findings are as follows. First, the information of designated records is dispersed in two web pages by the National Archive of Korea, an introductive web page of every collection in the National Archive of Korea and a web page of designated record service. Second, the web page of designated record service provides information of designated records only at collection level, so it is not easy for users to understand the contents of the records. In order to improve the service for the designated record service of the National Archive of Korea, this study proposed the unification of dispersed web pages to provide information of the designated records consistently. It also suggested a facet based directory service and word cloud service to give access to the contents of each designated record collection. The facet based directory and word cloud service will help users to understand the designated records in more detail.

Digital Isolated Word Recognition System based on MFCC and DTW Algorithm (MFCC와 DTW에 알고리즘을 기반으로 한 디지털 고립단어 인식 시스템)

  • Zang, Xian;Chong, Kil-To
    • Proceedings of the KIEE Conference
    • /
    • 2008.10b
    • /
    • pp.290-291
    • /
    • 2008
  • The most popular speech feature used in speech recognition today is the Mel-Frequency Cepstral Coefficients (MFCC) algorithm, which could reflect the perception characteristics of the human ear more accurately than other parameters. This paper adopts MFCC and its first order difference, which could reflect the dynamic character of speech signal, as synthetical parametric representation. Furthermore, we quote Dynamic Time Warping (DTW) algorithm to search match paths in the pattern recognition process. We use the software "GoldWave" to record English digitals in the lab environments and the simulation results indicate the algorithm has higher recognition accuracy than others using LPCC, etc. as character parameters in the experiment for Digital Isolated Word Recognition (DIWR) system.

  • PDF

Avian research trends in Korea analyzed by text-mining and co-word analysis: based on articles of the Korean Journal of Ornithology (텍스트마이닝과 동시출현단어 분석을 이용한 국내 조류학 연구동향: 한국조류학회지 논문을 대상으로)

  • Jin, Chaelyeong;Eo, Soo Hyung
    • Korean Journal of Ornithology
    • /
    • v.25 no.2
    • /
    • pp.126-132
    • /
    • 2018
  • For balanced development of ornithological research in Korea, it is important to review what birds and what research topics have been studied so far. We quantitatively investigated the trends of domestic ornithological research using text-mining and co-word analysis. As a result of studying 372 articles published in the Korean Journal of Ornithology, which is the most representative ornithological journals, words related to research topics such as population and community monitoring, first record of species and breeding ecology, and heavy metal pollution in birds have been widely used in research articles. Except for subjects such as monitoring and first record of species, studies have not been conducted widely. It was also found that research were concentrated on specific birds such as Anas platyrhynchos, Calidris alpina, and Anas poecilorhyncha. The present study, which analyzed the research topics and avian taxa that were relatively active until now and those which were insufficient, suggests what we should do in the future for the balanced development of ornithological research in Korea.

A Study on the Development Process and Status of Korean Marathon (한국 마라톤의 발전과정과 현황에 관한 연구)

  • Nam, Sang-Nam;Eo, Kyung-Tae
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.4
    • /
    • pp.357-371
    • /
    • 2018
  • The purpose of this study is to provide basic data on the improvement and development of Korean marathon through an analysis of the records of Korean Marathon and the World Marathon. Based on the records of the Donga Marathon, the Chuncheon Marathon, and the JoongAng Marathon, which are the three major marathons of Korea, the results of comparison and analysis of the records of Korean Marathon and World Marathon are as follows. First, it is rare for the host country to win. The problem is that, recently, for more than 10 years, the record gap with foreign players is increasing. Second, Comparisons between Olympic men and women and world championships men and women had equally competed for a winning record until the 1990s, but have not been able to reach the past winning record they recorded 20 years ago, and have been keeping a big difference in the record with the world record.

A Study on Bai Su(背戍) (背戍의 硏究)

  • 김진구
    • The Research Journal of the Costume Culture
    • /
    • v.5 no.4
    • /
    • pp.1-5
    • /
    • 1997
  • This study is concerned with the bai su(背戍) of Koryo period which recorded in Kei Rim Yu Sa(鷄林類事). Results of this research can be summarized as follows : The record of Bai Su(背戍) in Kei Rim Yu Sa(鷄林類事) was correct. It was not a mistake in writing. Thus, this word(背戍) was used by the people of Koryo. The 背戍 of Koryo was related to Aramaic patash and Japanese byets or bats, バツ. It was found that 背戍 of Koryo was very similar to Aramaic patash, legging. It indicates that 背戍 was derived from Aramaic and it was a transliteration of patash. Thus, 背戍 was a borrowed word from Aramaic. Also it was found that 背戍 of Koryo and Japanese byets(ぺツ) or bats(バツ) showed a very close affininty with each other in phonetic value. These words had the same meanings of 襪 one another. It reveals that 背戍 of Koryo and Japanese byets of bats has the same origins. Japanese byets or bats were transliterations of 背戍 of Koryo and they were borrowed words from 背戍 of Koryo.

  • PDF

A Study on the Presidential Records of the Participatory Government : Focusing on the Records of Presidential Events (참여정부 대통령기록 연구 대통령 행사기록을 중심으로)

  • Yi, Kyoung Yong
    • The Korean Journal of Archival Studies
    • /
    • no.71
    • /
    • pp.131-167
    • /
    • 2022
  • This article analyzes the contents of the records surrounding the production process of the 'Word Record' produced by the Office of the Records Management Secretariat in relation to the presidential event among the 16th presidential records. Through this, it was suggested to properly understand the production context of the records of the President's events transferred to the Presidential Archives by the 16th President, and based on this, link and organize related records and actively utilize them.

The origin of the word of sunflower (해바라기(향일규(向日葵), 향일화(向日花))의 어원(語源)에 대하여)

  • Kim, Jong Dug;Koh, Byung Hee
    • The Journal of Korean Medical History
    • /
    • v.14 no.1
    • /
    • pp.31-47
    • /
    • 2001
  • According to the customary, naming is done after the subject is in existence. But the name "Haebaragi"(Hyangilgyu, Hyangilhwa) has been used as an alias of Hibiscus manihot L, long before Helianthus annuus L was brought in to Korea, and now the usage of the name has been conversed since then. Since the incorrect record of Gyugwak and Gyuhwa as Haebaragi in "Chosunesajun"(Dictionary of Chosun language) published under Chosunchongdokbu in 1920, the mistake has been carried on and this must be corrected from now on. Incorrect record of hollyhock(Chokgyuwha) as Haebaragi in "Mong-u" (1810) took a role in this incorrect trend.

  • PDF

The origin of the word of sunflower (해바라기(향일규(向日葵), 향일화(向日花))의 어원(語源)에 대하여)

  • Kim, Jong-Dug;Koh, Byung-Hee
    • Korean Journal of Oriental Medicine
    • /
    • v.7 no.1
    • /
    • pp.55-66
    • /
    • 2001
  • According to the customary, naming is done after the subject is in existence. But the name's Hae-ba-ra-gi(해바라기), 향일규(向日葵), 향일화(向日花))' has been used as an alias of Hibiscus manihot L.(닥풀) long before Helianthus annuus L.(sunflower) was brought in to Korea. And now the usage of the name has been conversed since them. Since the incorrect record of '葵藿' and '葵花'as '해바라기' in ${\ulcorner}$조선어사전(朝鮮語辭典)${\lrcorner}$(1920), the mistake has been carried on this must be corrected from now on. Incorrect record of hollyhock(蜀葵花) '해바라기' in ${\ulcorner}$몽유(蒙喩))${\lrcorner}$(1810) took a role in this incorrect trend.

  • PDF