• Title/Summary/Keyword: 데이터 개방.공유

Search Result 115, Processing Time 0.027 seconds

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.49-67
    • /
    • 2015
  • Recently, emerging the notion of big data and social media has led us to enter data's big bang. Social networking services are widely used by people around the world, and they have become a part of major communication tools for all ages. Over the last decade, as online social networking sites become increasingly popular, companies tend to focus on advanced social media analysis for their marketing strategies. In addition to social media analysis, companies are mainly concerned about propagating of negative opinions on social networking sites such as Facebook and Twitter, as well as e-commerce sites. The effect of online word of mouth (WOM) such as product rating, product review, and product recommendations is very influential, and negative opinions have significant impact on product sales. This trend has increased researchers' attention to a natural language processing, such as a sentiment analysis. A sentiment analysis, also refers to as an opinion mining, is a process of identifying the polarity of subjective information and has been applied to various research and practical fields. However, there are obstacles lies when Korean language (Hangul) is used in a natural language processing because it is an agglutinative language with rich morphology pose problems. Therefore, there is a lack of Korean natural language processing resources such as a sentiment lexicon, and this has resulted in significant limitations for researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon with collective intelligence, and provides API (Application Programming Interface) service to open and share a sentiment lexicon data with the public (www.openhangul.com). For the pre-processing, we have created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words. In order to classify them, we first identified stop words which often quite likely to play a negative role in sentiment analysis and excluded them from our sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, adverbs as they have sentimental expressions such as positive, neutral, and negative. On the other hands, non-sentiment words are interjection, determiner, numeral, postposition, etc. as they generally have no sentimental expressions. To build a reliable sentiment lexicon, we have adopted a concept of collective intelligence as a model for crowdsourcing. In addition, a concept of folksonomy has been implemented in the process of taxonomy to help collective intelligence. In order to make up for an inherent weakness of folksonomy, we have adopted a majority rule by building a voting system. Participants, as voters were offered three voting options to choose from positivity, negativity, and neutrality, and the voting have been conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been made by college students in Korea, and we keep this voting system open by maintaining the project as a perpetual study. Besides, any change in the sentiment score of words can be an important observation because it enables us to keep track of temporal changes in Korean language as a natural language. Lastly, our study offers a RESTful, JSON based API service through a web platform to make easier support for users such as researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon plays an important role as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining collective intelligence, and we also expect to give a new direction and a new start to the development of Korean natural language processing.

A study on the Standardization of Design Guidelines for Geographic Information Databases (지리정보 DB 설계 지침의 표준화 연구)

  • Lim, Duk-Sung;Moon, Sang-Ho;Si, Jong-Ik;Hong, Bong-Hee
    • Journal of Korea Spatial Information System Society
    • /
    • v.5 no.1 s.9
    • /
    • pp.49-63
    • /
    • 2003
  • Recently, two international standard organizations, ISO and OGC, have done the work of standardization for GIS. Current standardization work for providing interoperability among GIS DB focuses on the design of open interfaces. But, this work has not considered procedures and methods for designing GIS DB. Eventually, GIS DB has its own model. When we share the data by open interface among heterogeneous GIS DB, differences between models result in the loss of information. Our aim in this paper is to revise the design guidelines for geographic information databases in order to make consistent spatial data models, logical structures, and semantic structure of populated geographical databases. In details, we propose standard guidelines which convert ISO abstract schema into relation model, object-relation model, object-centered model, and geometry-centered model. Furthermore, we provide sample models for applying these guidelines in commercial GIS S/Ws. Building GIS DB based on design guidelines proposed in the paper has the following advantages: the interoperability among databases, the standardization of schema definitions, and the catalogue of GIS databases through.

  • PDF

An Relational Analysis between Humidity, Temperature and Fire Occurrence using Public Data (공공데이터를 이용한 습도 및 온도와 실화 발생 간의 관계분석)

  • Song, Dong-Woo;Kim, Ki-Sung;Lee, Su-Kyung
    • Fire Science and Engineering
    • /
    • v.28 no.2
    • /
    • pp.82-90
    • /
    • 2014
  • According to recent government's 3.0 operating paradigm for the opening and sharing of public information, relationship between humidity, temperature and fire occurrence were analyzed using the data in National Weather Service and National Emergency Management Agency. In order to analyze the relationships between humidity, temperature and fire occurrence, hourly frequency of fire occurrence compared with humidity and temperature ranges was suggested as an analysis method. Tendencies of fire occurrence frequencies were examined through this and characteristics of detailed attributes could be statistically identified. Results about hourly frequencies of fire occurrence by classifying the humidity ranges in each region showed increasing frequencies in all areas where the humidity is lower. Hourly frequencies of fire occurrence according to temperature ranges were identified to be similar in each area as well. In addition, characteristics of objects' attributes were analyzed including types of fire, igniting source of fire, initial complex, reasons of fire occurrence, and distinctive directions were suggested. Suggested method in this paper could be practically used when suggesting the frequency in each category in fire occurrence statistics of National Fire Information System.

e-Learning Contents Development as Social Negotiation Perspective: A Case Study of Program Development for the Public Sector Officials' Case Management (사회적 협상 관점의 e-Learning 콘텐츠 개발: 사례관리 담당 공무원을 위한 프로그램 개발 사례연구)

  • Kim, In-Sook;Jin, Sun-Mee
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.7
    • /
    • pp.519-527
    • /
    • 2011
  • The e-Learning program is a multimedia data program consisting of texts, images, animation, audio and video. The development of an e-Learning program requires time and is a complex process, requiring cooperation and open-communication between all parties involved, particularly in the event of a problem. This study will analyze the e-Learning contents development process from the Social Negotiation Perspective. An appropriate process for the development of the program and effective decision-making guidelines for those parties involved will be recommended. Participants' viewpoints regarding program development and guidelines were studied qualitatively, while the evaluation of developed content employed both qualitative and quantitative research. The study found the following results. First, the development of an e-Learning program requires a clear goal and purpose. Second, the target group must be clearly identified. Third, all parties involved must share in the development process and its outcomes. Fourth, the party requesting the program must allocate the appropriate time and budget for the development group. Finally, the project requires a strong, capable leadership for effective decision-making.

Effect of Multidisciplinary Emergency Consultation System for Drug Intoxicated Patients (응급실을 내원한 약물중독 환자에 대한 다학제 응급협진체계의 효과 검증)

  • Kang, Jino;Kim, Hye Ri;Min, Kyungjoon;Kim, Na Ryoung;Heo, Yoon Kyung;Kim, Sun Mi
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.27 no.2
    • /
    • pp.130-137
    • /
    • 2019
  • Objectives : When a patient who attempts suicide visits the emergency room, it is important that the departments of emergency medicine, internal medicine, and psychiatry communicate with each other and prioritize treatment. This study was conducted to verify the effectiveness of the multidisciplinary emergency consultation system (ECS) for drug intoxicated patients. Methods : We retrospectively analyzed the data from medical records prior to the ECS, from July 2017 to May 2018, and after the ECS, from July 2018 to May 2019, to verify the effectiveness of the system. Results : After the ECS, admission to open wards was significantly higher than to the intensive care units (χ2=8.567, p=0.014). In addition, the proportion of consultations to the department of psychiatry among patients admitted to other departments tended to increase (χ2=4.202, p=0.053), and the time required for consultation response decreased (Z=-2.031, p=0.042). As a result of the consultation, the proportion of the patients who had been transferred to the department of psychiatry was increased (χ2=4.692, p=0.043), and the time spent to transfer tended to decrease (Z=-1.941, p=0.052). Conclusions : After implementing the ECS for drug intoxicated patients, unnecessary intensive care unit admissions, consultation response time, and the time spent to transfer were reduced, and the rate of consultation referrals and transfer rates increased. This means that the multidisciplinary consultation system rapidly provided essential medical services to patients at lower medical costs.