• Title/Summary/Keyword: 동시단어 분석

Search Result 186, Processing Time 0.024 seconds

A Method for Compound Noun Extraction to Improve Accuracy of Keyword Analysis of Social Big Data

  • Kim, Hyeon Gyu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.8
    • /
    • pp.55-63
    • /
    • 2021
  • Since social big data often includes new words or proper nouns, statistical morphological analysis methods have been widely used to process them properly which are based on the frequency of occurrence of each word. However, these methods do not properly recognize compound nouns, and thus have a problem in that the accuracy of keyword extraction is lowered. This paper presents a method to extract compound nouns in keyword analysis of social big data. The proposed method creates a candidate group of compound nouns by combining the words obtained through the morphological analysis step, and extracts compound nouns by examining their frequency of appearance in a given review. Two algorithms have been proposed according to the method of constructing the candidate group, and the performance of each algorithm is expressed and compared with formulas. The comparison result is verified through experiments on real data collected online, where the results also show that the proposed method is suitable for real-time processing.

Time Series Analysis of Intellectual Structure and Research Trend Changes in the Field of Library and Information Science: 2003 to 2017 (문헌정보학 분야의 지적구조 및 연구 동향 변화에 대한 시계열 분석: 2003년부터 2017년까지)

  • Choi, Hyung Wook;Choi, Ye-Jin;Nam, So-Yeon
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.2
    • /
    • pp.89-114
    • /
    • 2018
  • Research on changes in research trends in academic disciplines is a method that enables observation of not only the detailed research subject and structure of the field but also the state of change in the flow of time. Therefore, in this study, in order to observe the changes of research trend in library and information science field in Korea, co-word analysis was conducted with Korean author keywords from three types of journals which were listed in the Korea Citation Index(KCI) and have top citation impact factor were selected. For the time series analysis, the 15-year research period was accumulated in 5-years units, and divided into 2003~2007, 2003~2012, and 2003~2017. The keywords which limited to the frequency of appearance 10 or more, respectively, were analyzed and visualized. As a result of the analysis, during the period from 2003 to 2007, the intellectual structure composed with 25 keywords and 8 areas was confirmed, and during the period from 2003 to 2012, the structure composed by 3 areas 17 sub-areas with 76 keywords was confirmed. Also, the intellectual structure during the period from 2003 to 2017 was crowded into 6 areas 32 consisting of a total of 132 keywords. As a result of comprehensive period analysis, in the field of library and information science in Korea, over the past 15 years, new keywords have been added for each period, and detailed topics have also been subdivided and gradually segmented and expanded.

Detecting Research Trends in Korean Information Science Research, 2000-2011 (국내 정보학분야 연구동향 분석, 2000-2011)

  • Seo, Eun-Gyoung;Yu, So-Young
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.4
    • /
    • pp.215-239
    • /
    • 2013
  • Even though the overall scholarly community has recognized a dramatic growth and changes in the Information Science research in Korea over the last few decades, there are still only few studies that have identified the changes in terms of long-term and dynamic point of view. We have analyzed 1,007 IS-research articles from leading Korean journals in KCI (Korea Citation Index), published between 2000 and 2011. To discern the trendline of changes in research interests over time, we conducted a time-series analysis by developing grounded subject scheme from the article set and checking the growth rate of the number of published articles and title keywords. A comparative analysis was also conducted by constructing and comparing co-word maps over time to discover visible changes in research topics over this 12-year period of the IS-research in Korea. As a result, we identified some developments and transformations in major subject areas and knowledge structure of the IS-research in Korea over time. The major trend we discovered is that IS-studies over the 12-year period evolved from system-oriented research to library-application research. The changes are especially observed in knowledge management, Web-based system evaluation, and information retrieval areas. When compared to the results of other studies, the result of our study may serve as an evidence of the localization of Korean IS-studies in the first decade of the $21^{st}$ century.

A Content Analysis of Journal Articles Using the Language Network Analysis Methods (언어 네트워크 분석 방법을 활용한 학술논문의 내용분석)

  • Lee, Soo-Sang
    • Journal of the Korean Society for information Management
    • /
    • v.31 no.4
    • /
    • pp.49-68
    • /
    • 2014
  • The purpose of this study is to perform content analysis of research articles using the language network analysis method in Korea and catch the basic point of the language network analysis method. Six analytical categories are used for content analysis: types of language text, methods of keyword selection, methods of forming co-occurrence relation, methods of constructing network, network analytic tools and indexes. From the results of content analysis, this study found out various features as follows. The major types of language text are research articles and interview texts. The keywords were selected from words which are extracted from text content. To form co-occurrence relation between keywords, there use the co-occurrence count. The constructed networks are multiple-type networks rather than single-type ones. The network analytic tools such as NetMiner, UCINET/NetDraw, NodeXL, Pajek are used. The major analytic indexes are including density, centralities, sub-networks, etc. These features can be used to form the basis of the language network analysis method.

Comparative Analysis of Low Fertility Policy and the Public Perceptions using Text-Mining Methodology (텍스트 마이닝을 활용한 저출산 정책과 대중인식 비교)

  • Bae, Giryeon;Moon, HyunJeong;Lee, Jaeil;Park, Mina;Park, Arum
    • Journal of Digital Convergence
    • /
    • v.19 no.12
    • /
    • pp.29-42
    • /
    • 2021
  • As the low fertility intensifies in Korea, this study investigated fundamental differences between the government's low fertility policy and public perception of it. To this end, we selected four times 'Aging Society and Population Policy' documents and news comments for two weeks immediately after announcement of the third and fourth Policy as analysis targets. Then we conducted word frequency analysis, co-occurrence analysis and CONCOR analysis. As a result of analyses, first, direct childcare support during the first and second periods, and a social structural approach during third and fourth periods were noticeable. Second, it was revealed that both policies and comments aim for the work-family compatibility in 'parenting'. Lastly it was showed public interest in environment of raising children and the critical mind to effectiveness of the policy. This study is meaningful in that it confirmed the public perception using big data analysis, and it will help improve the direction for the future low fertility policy.

Semantic Network Analysis of Trends in Hyundai Motor's Corporate Cultural Marketing (언어 네트워크 분석을 통한 현대자동차의 기업 문화마케팅 변화 연구)

  • Kim, Junghyun;Lee, Jin Woo
    • Korean Association of Arts Management
    • /
    • no.51
    • /
    • pp.75-102
    • /
    • 2019
  • This study aims to figure out the progression of Hyundai motor's corporate cultural marketing by conducting semantic network analysis. Although the previous research has focused on conception, categorization, impact, and performance of cultural marketing, they hardly pay attention to changes in cultural marketing over time. To explore the identified gap, we collected 2,315 articles concerning Hyundai motor's cultural marketing on daily newspapers printed from 2001 to 2018. The 18-year time period was classified into four periods, and lists of words were extracted and analyzed by Korean language analysis program, Textom and social network analysis program, called 'UCINET'. The outcome of our analysis indicates that Hyundai Motor's cultural marketing has been developed from the strategy of merely increasing sales to the means of distinguishing their corporate and brand identity. In the early 2000s, the words 'customer', 'The Age of Great Paintings: Rembrandt and the 17th century Dutch paintings', and 'performances' were extracted with high frequency. It shows Hyundai Motor held performance-oriented events and provided benefits to specific consumer groups under the type of 'Cultural Promotion'. In addition, as the exhibition sponsored by Hyundai motor was reported in the media with high publicity effect, the concept of 'Cultural Support' is also emerged. In the late 2000s, the top exposures were 'Seoul Arts Center' and 'Seoul Metropolitan Symphony Orchestra'. Under the concept of 'Cultural Support', both organizations and cultural events were sponsored by Hyundai motor. Hyundai Motor has the tendency to cooperate with high profile parties who have already accomplished high publicities to attract social interests and issues. In the early 2010s, Hyundai Motor created cultural marketing brand and space ('Brilliant' and 'Hyundai Art Hall') that broadened the potential target groups, which represented both 'Cultural Support' and 'Cultural Enterprise'. In the middle and late of the 2010s, as shown by the high frequency of 'brand' and 'global', Hyundai Motor has focused on the global market and viewpoint has expanded to brand building focusing on the type of 'Cultural Enterprise'.

A Quantitative Analysis for An Efficient Memory Allocation (효과적인 메모리 할당을 위한 정량적 분석)

  • Hong, Yun-Shik
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.9
    • /
    • pp.2395-2403
    • /
    • 1998
  • Memory allocation problem has two independent goals: minimization of number of memories and minimization of number of registers in one memory Our concern is the ordering of the bindings during memory allocation. We formulate and analyze three different memory allocation algorithms b) changing their binding order. It is shown that when we combine these subtasks and solve them simultaneously by heuristic cost function significant savings (up to 20%) can be obtained in the total area of memories.

  • PDF

Emotion Recognition based on Short Text using Semantic Orientation Analysis (의미 지향성 분석을 통한 단문 텍스트 기반 감정인지)

  • Kim, Hyun-Woo;Lee, Sung-Young;Chung, Tae-Choong;Yoon, Suk-Hwan
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2012.06b
    • /
    • pp.375-377
    • /
    • 2012
  • 스마트폰과 같은 모바일 기기가 발전함에 따라 SNS, 모바일 메신저, SMS와 같은 단문 기반 메시지는 자신의 감정을 가장 잘 표현하는 매체이다. 그럼에도 불구하고 기존 연구는 주로 장문의 텍스트로부터 긍정, 부정 분류나 문서의 성향을 분석하는 것에 그치는 경우가 많다. 의미지향(Semantic Orientation)방법은 검색엔진을 통해 감정 키워드와 인지하고자 하는 단어의 동시 빈출 정도를 PMI로 계산한 것으로 WordNet과 같은 의미 사전이 존재하지 않는 한국어의 특성에서 적용 가능한 방법이다. 본 논문에서는 의미 지향성 및 다른 텍스트 기반 감정 분류 기술에 대해 비교하고 이들을 활용하여 한국어로 구성된 단문 텍스트에서 효율적인 감정 분류 기법을 제안하고자 한다.

A Bibliometric Analysis of Research Trends on Disaster in Korea (국내 재난 관련 연구 동향에 대한 계량정보학적 분석)

  • Lee, Jae Yun;Kim, Soojung
    • Journal of the Korean Society for information Management
    • /
    • v.33 no.4
    • /
    • pp.103-124
    • /
    • 2016
  • This study aims to investigate the research trends of disaster in Korea through a bibliometric analysis. To do that, it analyzed 772 scholarly articles published from 2002 to 2016, retrieved from KCI (Korean Citation Index) database. For analysis, discipline profiling analysis, journal profiling analysis, and co-word analysis methods were used. The study found that the number of scholarly articles on disaster has increased, especially after Sewol ferry disaster occurred in 2004. The major discipline areas were identified as 'policy sciences/public administration' area, 'engineering' area, 'GIS/telecommunication' area, and 'medical/humanities/social sciences' area. In terms of time series, the proportion of scholarly articles published in 'policy sciences/public administration' area has decreased since 2014 and at the same time, discipline areas have been diversified including law, medical, and journalism.

An Investigation on Characteristics and Intellectual Structure of Sociology by Analyzing Cited Data (사회학 분야의 연구데이터 특성과 지적구조 규명에 관한 연구)

  • Choi, Hyung Wook;Chung, EunKyung
    • Journal of the Korean Society for information Management
    • /
    • v.34 no.3
    • /
    • pp.109-124
    • /
    • 2017
  • Through a wide variety of disciplines, practices on data access and re-use have been increased recently. In fact, there has been an emerging phenomenon that researchers tend to use the data sets produced by other researchers and give scholarly credit as citation. With respect to this practice, in 2012, Thomson Reuters launched Data Citation Index (DCI). With the DCI, citation to research data published by researchers are collected and analyzed in a similar way for citation to journal articles. The purpose of this study is to identify the characteristics and intellectual structure of sociology field based on research data, which is one of actively data-citing fields. To accomplish this purpose, two data sets were collected and analyzed. First, from DCI, a total of 8,365 data were collected in the field of sociology. Second, a total of 12,132 data were collected from Web of Science with a topic search with 'Sociology'. As a result of the co-word analysis of author provided-keywords for both data sets, the intellectual structure of research data-based sociology was composed of two areas and 15 clusters and that of article-based sociology was composed with three areas and 17 clusters. More importantly, medical science area was found to be actively studied in research data-based sociology and public health and psychology are identified to be central areas from data citation.