• Title/Summary/Keyword: 동시단어분석

Search Result 186, Processing Time 0.027 seconds

A Research on the state of the utilization of the stock-information-retrieval-service (KT 증권정보 서비스 이용 실태 및 인식 결과 조사)

  • 최영재
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.63-66
    • /
    • 1998
  • 한국통신에서는 PC로 된 프로토타입 시스템을 이용하여 음성인식 증권정보 서비스를 1995년 11월부터 1998년 초까지 5채널에 대해 시험운용을 해왔으며, 상용서비스를 위해 120명이 동시에 서비스 받을 수 있는 시스템을 개발하였다. 개발된 시스템의 전반적인 문제점을 파악하기 위하여 개발된 시스템을 사용하여 1998년 3월 16일부터 30 채널규모로 일반인들에게 시험서비스를 제공하고 있다. 음성인식 전화정보 서비스를 현재보다 훨씬 더 활성화시키기 위해서, 서비스의 이용 형태에 대한 분석을 통해, 어느 부분이 어떻게 개선되어야 할지를 연구하여, 초보 사용자라도 이용하기 쉬운 형태로 서비스를 시나리오를 개선해 나가고 있다. 본 논문에서는 사용자 특히, 처음 사용자의 여러 가지 이용 실태 요인을 분석하였다. 또한, 음성인식 증권 정보 서비스가 정식으로 서비스되기 이전과 그 이후의 일시별 인식률을 통해 조사하고, 이용자가 동일 대상 단어를 연속으로 발음하는 경우, 동일 대상 단어에 대한 인식률을 조사하였다. 조사결과 문제점은 4가지로 분류될 수 있었으며, 드러난 문제점을 해결하기 위하여 노력하고 있다.

  • PDF

Exploring the Research Trends of Learning Strategies in Korean Language Education Using Co-word Analysis (동시출현단어 분석을 활용한 한국어교육에서의 학습전략 연구 동향 탐색)

  • Heo, Youngsoo;Park, Ji-Hong
    • Journal of the Korean Society for information Management
    • /
    • v.38 no.2
    • /
    • pp.65-86
    • /
    • 2021
  • In the foreign language education, learners are an important part of education, however in the Korean language education, the study of learners was insufficient compared to the contents of education, teaching methods and textbooks. Therefore, it is meaningful to analyze how learner research, especially learning strategy research, has been conducted and derive areas that need research for better education. In this study, co-word analysis was conducted on the titles of academic journals and dissertations in order to analyze the learning strategy research in Korean language education. I found it is about "reading" that the most studies related to Korean language learners' learning strategies were conducted and those studies' subjects mostly were 'Chinese international students' and 'marriage-immigrants'. In addition, the results of the subgroup analysis on the research topic show four major subgroups: a group related to 'reading for academic purposes', a group related to 'request, rejection, conversation, etc.', a group related to 'writing', and a group related to 'vocabulary, listening'. This shows that the researchers' major interests in studying Korean learner's strategies are "reading" and "speaking" and their studies have been concentrated in the specific areas. Therefore, it is necessary for researchers to study various functions and subjects in Korean language learner's learning strategies.

A Study on Ideological Orientation and the Construction of News about Korean News Media : Focused on a Semantic Network Analysis for Articles about 'Bernie Sanders' (국내 언론매체의 이념성향과 뉴스구성에 대한 연구 : 미 대선 후보 '버니 샌더스' 관련 보도의 의미연결망 분석을 중심으로)

  • Lee, Hye-Mi;Gim, Hye-Yeong;Ryu, Seoung-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.8
    • /
    • pp.180-191
    • /
    • 2016
  • This study utilized a semantic network analysis for Korean major newspaper articles concerning 'Bernie Sanders'. 'Bernie Sanders' promotes conservative values of 'Americana' as well as the progressive values of 'relieving inequality', and thus, perhaps he is a subject on which ideological differences between the press can be distinctively manifest. Upon comparison of the priority of frequency between the conservative press and progressive press, the conservative press frequently used the expressions, 'socialist' and 'black man', whereas the progressive press frequently used the expressions, 'inequality' and 'problem'. Both the conservative press and progressive press displayed particularly different semantic compositions with the term, 'Korea'. The progressive press aimed to express the criticism of social problems and established politics identified by Sanders in relation to the 'Korean' society, whereas the conservative press criticized the blunt expressions stating that a specifically named politician resembles Sanders, and the specific party and term of 'Korea'. A completely different disposition of reports from different perspectives and context was ascertained, regardless of the use of the same terms. Thus, it is demonstrated that the semantic composition of the press on a specific issue displays significant differences according to their ideological disposition.

An Analysis of Domestic and International Research Trends on Metaverse (메타버스 관련 국내외 연구동향 분석)

  • Hyunjung Kim
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.3
    • /
    • pp.351-379
    • /
    • 2023
  • The goal of this study is to investigate the domestic and international research trends on metaverse related researches. To achieve this goal, a set of 913 journal articles were collected from KCI (Korea Citation Index), 232 articles from WoS (Web of Science), and 277 articles from WoS-CPCI (Conference Proceeding Citation Index). A descriptive analysis shows the number of researches has been increased radically, and the mostly researched subject areas are interdisciplinary, computer science, and education in KCI, business and economics in WoS, and computer science in WoS-CPCI. The co-occurrence network analysis using author keywords revealed that technology related terms such as virtual reality and augmented reality showed high centrality measures in all of the databases, and the cluster analysis resulted in education and metaverse platform related keywords cluster from KCI, bibliometric analysis related keywords cluster from WoS, and all the metaverse technology related keywords cluster from WoS-CPCI.

Similar Patent Search Service System using Latent Dirichlet Allocation (잠재 의미 분석을 적용한 유사 특허 검색 서비스 시스템)

  • Lim, HyunKeun;Kim, Jaeyoon;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.8
    • /
    • pp.1049-1054
    • /
    • 2018
  • Keyword searching used in the past as a method of finding similar patents, and automated classification by machine learning is using in recently. Keyword searching is a method of analyzing data that is formalized through data refinement. While the accuracy for short text is high, long one consisted of several words like as document that is not able to analyze the meaning contained in sentences. In semantic analysis level, the method of automatic classification is used to classify sentences composed of several words by unstructured data analysis. There was an attempt to find similar documents by combining the two methods. However, it have a problem in the algorithm w the methods of analysis are different ways to use simultaneous unstructured data and regular data. In this paper, we study the method of extracting keywords implied in the document and using the LDA(Latent Semantic Analysis) method to classify documents efficiently without human intervention and finding similar patents.

Text Mining Driven Content Analysis of Social Perception on Schizophrenia Before and After the Revision of the Terminology (조현병과 정신분열병에 대한 뉴스 프레임 분석을 통해 본 사회적 인식의 변화)

  • Kim, Hyunji;Park, Seojeong;Song, Chaemin;Song, Min
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.4
    • /
    • pp.285-307
    • /
    • 2019
  • In 2011, the Korean Medical Association revised the name of schizophrenia to remove the social stigma for the sick. Although it has been about nine years since the revision of the terminology, no studies have quantitatively analyzed how much social awareness has changed. Thus, this study investigates the changes in social awareness of schizophrenia caused by the revision of the disease name by analyzing Naver news articles related to the disease. For text analysis, LDA topic modeling, TF-IDF, word co-occurrence, and sentiment analysis techniques were used. The results showed that social awareness of the disease was more negative after the revision of the terminology. In addition, social awareness of the former term among two terms used after the revision was more negative. In other words, the revision of the disease did not resolve the stigma.

Topic-Network based Topic Shift Detection on Twitter (트위터 데이터를 이용한 네트워크 기반 토픽 변화 추적 연구)

  • Jin, Seol A;Heo, Go Eun;Jeong, Yoo Kyung;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.1
    • /
    • pp.285-302
    • /
    • 2013
  • This study identified topic shifts and patterns over time by analyzing an enormous amount of Twitter data whose characteristics are high accessibility and briefness. First, we extracted keywords for a certain product and used them for representing the topic network allows for intuitive understanding of keywords associated with topics by nodes and edges by co-word analysis. We conducted temporal analysis of term co-occurrence as well as topic modeling to examine the results of network analysis. In addition, the results of comparing topic shifts on Twitter with the corresponding retrieval results from newspapers confirm that Twitter makes immediate responses to news media and spreads the negative issues out quickly. Our findings may suggest that companies utilize the proposed technique to identify public's negative opinions as quickly as possible and to apply for the timely decision making and effective responses to their customers.

An Investigation on Scientific Data for Data Journal and Data Paper (Scientific Data 학술지 분석을 통한 데이터 논문 현황에 관한 연구)

  • Chung, EunKyung
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.117-135
    • /
    • 2019
  • Data journals and data papers have grown and considered an important scholarly practice in the paradigm of open science in the context of data sharing and data reuse. This study investigates a total of 713 data papers published in Scientific Data in terms of author, citation, and subject areas. The findings of the study show that the subject areas of core authors are found as the areas of Biotechnology and Physics. An average number of co-authors is 12 and the patterns of co-authorship are recognized as several closed sub-networks. In terms of citation status, the subject areas of cited publications are highly similar to the areas of data paper authors. However, the citation analysis indicates that there are considerable citations on the journals specialized on methodology. The network with authors' keywords identifies more detailed areas such as marine ecology, cancer, genome, database, and temperature. This result indicates that biology oriented-subjects are primary areas in the journal although Scientific Data is categorized in multidisciplinary science in Web of Science database.

A Study on the Retrieval Effectiveness of KoreaMed using MeSH Search Filter and Word-Proximity Search (검색용 MeSH 필터와 단어인접탐색 기법을 활용한 KoreaMed 검색 효율성 향상 연구)

  • Jeong, So-Na;Jeong, Ji-Na
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.5
    • /
    • pp.596-607
    • /
    • 2017
  • This study examined the method for adding related to "stomach neoplasms" as filters to the Medical Subject Headings (MeSH) for search as well as a method for improving the search efficiency through a word-proximity search by measuring the distance of co-occurring terms. A total of 8,625 articles published between 2007 and 2016 with the major topic terms "stomach neoplasms" were downloaded from PubMed article titles. The vocabulary to be added to the MeSH for search were analyzed. The search efficiency was verified by 277 articles that had "Stomach Neoplasms" indexed as MEDLINE MeSH in KoreaMed. As a result, 973 terms were selected as the candidate vocabulary. "Gastric Cancer" (2,780 appearances) was the most frequent term and 7,376 compound words (88.51%) combined the histological terms of "stomach" and "neoplasm", such as "gastric adenocarcinoma" and "gastric MALT lymphoma". A total of 5,234 compounds words (70.95%), in which the co-occurring distance was two words, were found. The matching rate through the MEDLINE MeSH and KoreaMed MeSH Indexer was 209 articles (75.5%). The search efficiency improved to 263 articles (94.9%) when the search filters were added, and to 268 articles (96.7%) when the 13 word-proximity search technique of the co-occurring terms was applied. This study showed that the use of a thesaurus as a means of improving the search efficiency in a natural language search could maintain the advantages of controlled vocabulary. The search accuracy can be improved using the word-proximity search instead of a Boolean search.

User Reputation Evaluation Using Co-occurrence Feature and Collective Intelligence (동시출현 자질과 집단 지성을 이용한 지식검색 문서 사용자 명성 평가)

  • Lee, Hyun-Woo;Han, Yo-Sub;Kim, LaeHyun;Cha, Jeung-Won
    • Annual Conference on Human and Language Technology
    • /
    • 2008.10a
    • /
    • pp.79-84
    • /
    • 2008
  • 많은 사용자들의 참여로 구축된 집단 지성을 이용한 지식 검색 서비스에서 사용자가 원하는 답변을 빨리 찾고자 하는 요구가 증가하고 있다. 기존의 연구에서 조회 수, 추천 수, 답변 수와 같은 비텍스트 정보가 답변을 평가하는데 좋은 자질임이 증명되었고, 신뢰도를 추정할 수 있는 여러 종류의 단어 사전을 이용하여 답변의 좋고 나쁨을 평가할 수 있는 연구도 진행되었다. 하지만, 조회 수, 추천 수, 답변 수와 같은 비텍스트 정보는 사용자 조작이 간단하여 지속적으로 관리를 해야 하며, 신뢰도를 추정할 수 있는 단어는 지속적으로 보강되어야 한다. 본 논문에서는 이러한 문제점을 해결하고자 동시출현 자질을 이용한 질문과 답변의 유사성을 활용하여 집단 지성에서 사용자의 활동을 분석하여 사용자의 명성을 평가하는 방법을 제안한다. 사용자의 명성을 계산할 수 있다면 조회 수와 추천 수가 많지 않은 답변의 신뢰도도 비교적 정확하게 추정할 수 있다. 이를 위해 우리는 PageRank 알고리즘을 수정하여 사용자 명성을 계산한다. 네이버 지식iN의 문서로 실험한 결과, 기존 정답 선택률을 보완할 수 있는 결과를 보였다.

  • PDF