• Title/Summary/Keyword: technology classification system

Search Result 1,460, Processing Time 0.031 seconds

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.187-204
    • /
    • 2016
  • Document classification based on emotional polarity has become a welcomed emerging task owing to the great explosion of data on the Web. In the big data age, there are too many information sources to refer to when making decisions. For example, when considering travel to a city, a person may search reviews from a search engine such as Google or social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide on whether or not to make a trip. Sentiment analysis of customer reviews has become an important research topic as datamining technology is widely accepted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques, such as the decision tree, neural networks, and support vector machines (SVMs). is used to determine the attitude, position, and sensibility of people who write articles about various topics that are published on the Web. Regardless of the polarity of customer reviews, emotional reviews are very helpful materials for analyzing the opinions of customers through their reviews. Sentiment analysis helps with understanding what customers really want instantly through the help of automated text mining techniques. Sensitivity analysis utilizes text mining techniques on text on the Web to extract subjective information in the text for text analysis. Sensitivity analysis is utilized to determine the attitudes or positions of the person who wrote the article and presented their opinion about a particular topic. In this study, we developed a model that selects a hot topic from user posts at China's online stock forum by using the k-means algorithm and self-organizing map (SOM). In addition, we developed a detecting model to predict a hot topic by using machine learning techniques such as logit, the decision tree, and SVM. We employed sensitivity analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sensitivity analysis calculates a sentimental value from a document based on contrast and classification according to the polarity sentimental dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movement by analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to utilize sentiment analysis. One hundred forty-four topics were selected among 21 categories at online forums about stock. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. The interest index was defined to select the hot topics, and the k-means algorithm and SOM presented equivalent results with this data. We developed a decision tree model to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel function by a grid search to detect the hot topics. The detection of hot topics by using sentiment analysis provides the latest trends and hot topics in the stock forum for investors so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful to rapidly determine customers' signals or attitudes towards government policy and firms' products and services.

A Study on the Effect of the Document Summarization Technique on the Fake News Detection Model (문서 요약 기법이 가짜 뉴스 탐지 모형에 미치는 영향에 관한 연구)

  • Shim, Jae-Seung;Won, Ha-Ram;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.201-220
    • /
    • 2019
  • Fake news has emerged as a significant issue over the last few years, igniting discussions and research on how to solve this problem. In particular, studies on automated fact-checking and fake news detection using artificial intelligence and text analysis techniques have drawn attention. Fake news detection research entails a form of document classification; thus, document classification techniques have been widely used in this type of research. However, document summarization techniques have been inconspicuous in this field. At the same time, automatic news summarization services have become popular, and a recent study found that the use of news summarized through abstractive summarization has strengthened the predictive performance of fake news detection models. Therefore, the need to study the integration of document summarization technology in the domestic news data environment has become evident. In order to examine the effect of extractive summarization on the fake news detection model, we first summarized news articles through extractive summarization. Second, we created a summarized news-based detection model. Finally, we compared our model with the full-text-based detection model. The study found that BPN(Back Propagation Neural Network) and SVM(Support Vector Machine) did not exhibit a large difference in performance; however, for DT(Decision Tree), the full-text-based model demonstrated a somewhat better performance. In the case of LR(Logistic Regression), our model exhibited the superior performance. Nonetheless, the results did not show a statistically significant difference between our model and the full-text-based model. Therefore, when the summary is applied, at least the core information of the fake news is preserved, and the LR-based model can confirm the possibility of performance improvement. This study features an experimental application of extractive summarization in fake news detection research by employing various machine-learning algorithms. The study's limitations are, essentially, the relatively small amount of data and the lack of comparison between various summarization technologies. Therefore, an in-depth analysis that applies various analytical techniques to a larger data volume would be helpful in the future.

Classification of Service Quality for HMR unmanned store business (HMR 무인매장 서비스 품질 분류에 관한 연구)

  • Jong Won Lee
    • Journal of Service Research and Studies
    • /
    • v.13 no.2
    • /
    • pp.41-61
    • /
    • 2023
  • The universal form of life in the era of the 4th industrial revolution can probably be summarized as the keyword "non-face-to-face". In particular, in terms of consumption activities, face-to-face contact is gradually changing to a system that minimizes, and offline stores are rapidly changing to non-contact services through kiosks and robots. The social structure is also changing with the passage of time, and most fundamentally, our dietary consumption patterns are changing. In particular, the increase in single-person households and the aging population are having a great impact on changes in the food service industry, which is closely related to dietary life. The HMR (Home Meal Replacement) market has grown significantly as the labor of cooking at home has decreased and the use of substitute foods has increased. As the size of the market has grown, the types of businesses that provide products have also diversified. The development of technology, non-face-to-face culture, and corporate management efficiency are intertwined, and unmanned stores are spreading recently. In this study, service quality attributes of HMR unmanned stores, where competition is gradually intensifying, are classified, and service quality classification using the Kano model and Timko's customer satisfaction coefficient are calculated to provide implications for service management based on customer satisfaction. As a result of the analysis, 'products with short cooking time' and 'variety of products (menu)' were classified as attractive qualities, and 'cleanliness inside/outside of the store' and 'products at reasonable prices' were classified as unified quality. In addition, 'convenience of self-checkout process' was classified as a natural quality, and 'convenience of in-store passage' was classified as an indifferent quality. Furthermore, when the service factor was satisfied within the HMR unmanned store, the factor with the highest satisfaction coefficient was 'product (menu) variety', and the factor with the highest dissatisfaction factor was 'convenience of self-checkout process'. Through the results of this study, it is intended to derive priorities in service quality management of HMR unmanned stores and provide strategic implications for related businesses.

A Study on Establishment of the Levee GIS Database Using LiDAR Data and WAMIS Information (LiDAR 자료와 WAMIS 정보를 활용한 제방 GIS 데이터베이스 구축에 관한 연구)

  • Choing, Yun-Jae
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.17 no.3
    • /
    • pp.104-115
    • /
    • 2014
  • A levee is defined as an man-made structure protecting the areas from temporary flooding. This paper suggests a methodology for establishing the levee GIS database using the airborne topographic LiDAR(Light Detection and Ranging) data taken in the Nakdong river basins and the WAMIS(WAter Management Information System) information. First, the National Levee Database(NLD) established by the USACE(United States Army Corps Engineers) and the levee information tables established by the WAMIS are compared and analyzed. For extracting the levee information from the LiDAR data, the DSM(Digital Surface Model) is generated from the LiDAR point clouds by using the interpolation method. Then, the slope map is generated by calculating the maximum rates of elevation difference between each pixel of the DSM and its neighboring pixels. The slope classification method is employed to extract the levee component polygons such as the levee crown polygons and the levee slope polygons from the slope map. Then, the levee information database is established by integrating the attributes extracted from the identified levee crown and slope polygons with the information provided by the WAMIS. Finally, this paper discusses the advantages and limitations of the levee GIS database established by only using the LiDAR data and suggests a future work for improving the quality of the database.

An Analysis of the Economic Effects on the Project to Construct Smart Network (스마트 네트워크 구축사업의 경제적 파급효과 분석)

  • Jeong, Woo-Soo;Kim, Sa-Hyuk
    • Journal of Internet Computing and Services
    • /
    • v.12 no.4
    • /
    • pp.61-71
    • /
    • 2011
  • The Smart Network Project is planned for achieving the Internet advanced country by adjusting the Government Future Internet Development as a national agenda. The future Internet is defined as diverse alternative technology and services that can provide optimal services for individual characteristic and situation in anywhere, anytime throughout convergence of communication, broadcasting, and computing to solve the current limitation of the Internet. This paper is to analyze the economic effects of the smart network build-up. For the economic effect analysis, we reclassified the smart network industry classification system and re-drew up 2011 Inter-industry Relations Table by using the Inter-industry Relations Table issued by the Bank of Korea and the RAS techniques. And we analyzed the economic effects that can be drawn from the investment of the smart network industry. As a result, the gross production inductive effect which appears with the economic effect of the smart network establishment project from 2011 to 2015 came out to be about 72 trillion 808.2 billion KW, added value inductive effect of 44 trillion 192.9 billion KW and the employment inductive effect of the job creation of about 412 thousands people. Afterward, it is anticipated that the smart network build-up project to contribute to the improvement of Koreans' daily life. Moreover, this research will be used as a valued basic material in the pursuit of the future network projects.

Inclusive Impact Index "Triple I" for Assessing Ocean Utilization Technologies (해양이용기술 평가를 위한 포괄적 영향지수 "트리플 I")

  • Otsuka, Koji
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.15 no.2
    • /
    • pp.118-125
    • /
    • 2012
  • World population has increased rapidly following the industrial revolution, reaching 7 billion in 2012. Several forecasts estimate that this number will rise to about 8 billion in 2025. Improvements of living standards in developing nations have also raised resource and energy demands worldwide. In consequences, human beings have faced many global and urgent problems, such as global warming, water and food shortages, resource and energy crises, and so on. Many ocean utilization technologies for avoiding or reducing such big problems have been developed, for examples $CO_2$ ocean sequestration, seawater desalination, artificial upwelling, deepwater mining, and ocean energies. It is important, however, to assess such technologies from the viewpoints of sustainability and public acceptancy, since the aims of those technologies are to develop sustainable social systems rather than conventional ones based on fossil resources. Inclusive Marine Pressure Assessment and Classification Technology Research Committee (generally called IMPACT Research Committee) of Japan Society of Naval Architects and Ocean Engineers, has proposed Inclusive Impact Index "Triple I" as an indicator, which can predict both environmental sustainability and economical feasibility, in order to assess the ocean utilization technologies from the viewpoints of sustainability and public acceptancy. This index was considered by combining Ecological Footprint and Environmental Risk Assessment. The Ecological Footprint and the Environmental Risk Assessment are introduced in the first part of this paper. Then the concept and the structure of the Triple I are explained in the second part of this paper. Finally, the economy-ecology conversion factor in Triple I accounting is considered.

Building Hierarchical Knowledge Base of Research Interests and Learning Topics for Social Computing Support (소셜 컴퓨팅을 위한 연구·학습 주제의 계층적 지식기반 구축)

  • Kim, Seonho;Kim, Kang-Hoe;Yeo, Woondong
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.12
    • /
    • pp.489-498
    • /
    • 2012
  • This paper consists of two parts: In the first part, we describe our work to build hierarchical knowledge base of digital library patron's research interests and learning topics in various scholarly areas through analyzing well classified Electronic Theses and Dissertations (ETDs) of NDLTD Union catalog. Journal articles from ACM Transactions and conference web sites of computing areas also are added in the analysis to specialize computing fields. This hierarchical knowledge base would be a useful tool for many social computing and information service applications, such as personalization, recommender system, text mining, technology opportunity mining, information visualization, and so on. In the second part, we compare four grouping algorithms to select best one for our data mining researches by testing each one with the hierarchical knowledge base we described in the first part. From these two studies, we intent to show traditional verification methods for social community miming researches, based on interviewing and answering questionnaires, which are expensive, slow, and privacy threatening, can be replaced with systematic, consistent, fast, and privacy protecting methods by using our suggested hierarchical knowledge base.

Fire Service Policy Theory for Establishing Fire Science (소방학의 학문적 정체성 확립을 위한 소방정책론 정립방안)

  • Ryu, Sang-Il;Lee, Jae-Eun
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.2
    • /
    • pp.421-432
    • /
    • 2010
  • In Korea, about 80 academic departments related to fire science are in operation throughout the country, but fire science is not included as a branch science in the science classification system acknowledged by the Ministry of Education, Science and Technology and its position as a science is not solid yet. In response to this problem, research is being made actively to establish fire science recently. This study aims at composing the fire service policy theory which is one of sub areas of fire science. First, the concept of fire service policy should be established, and fire service policies should be divided into different types. In addition, it is necessary to examine the direction of the development of fire service policies in Korea, and the unique characteristics of fire service policies should be described. Next, we will mention fire service policy making and participants, theories on the determinants of fire service policies, the execution of fire service policies, and the evaluation of fire service policies. Particularly, based on the peculiarity of fire service, it is necessary to explain policies on fire prevention and precaution, fire investigation, rescue and first aid, public campaigns for safety and prevention, fire insurance, etc. Finally, we suggest the future directions of fire service policies according to the change of environment in the future.

The Identification Framework for source code author using Authorship Analysis and CNN (작성자 분석과 CNN을 적용한 소스 코드 작성자 식별 프레임워크)

  • Shin, Gun-Yoon;Kim, Dong-Wook;Hong, Sung-sam;Han, Myung-Mook
    • Journal of Internet Computing and Services
    • /
    • v.19 no.5
    • /
    • pp.33-41
    • /
    • 2018
  • Recently, Internet technology has developed, various programs are being created and therefore various codes are being made through many authors. On this aspect, some author deceive a program or code written by other particular author as they make it themselves and use other writers' code indiscriminately, or not indicating the exact code which has been used. Due to this makes it more and more difficult to protect the code. In this paper, we propose author identification framework using Authorship Analysis theory and Natural Language Processing(NLP) based on Convolutional Neural Network(CNN). We apply Authorship Analysis theory to extract features for author identification in the source code, and combine them with the features being used text mining to perform author identification using machine learning. In addition, applying CNN based natural language processing method to source code for code author classification. Therefore, we propose a framework for the identification of authors using the Authorship Analysis theory and the CNN. In order to identify the author, we need special features for identifying the authors only, and the NLP method based on the CNN is able to apply language with a special system such as source code and identify the author. identification accuracy based on Authorship Analysis theory is 95.1% and identification accuracy applied to CNN is 98%.

Expressed sequence tag analysis of Meretrix lusoria (Veneridae) in Korea (한국산 백합 (Meretrix lusoria) 의 전사체 분석)

  • Kang, Jung-Ha;Jeong, Ji Eun;Kim, Bong Seok;An, Chel-Min;Kang, Hyun-Sook;Kang, Se-Won;Hwang, Hee Ju;Han, Yeon Soo;Chae, Sung-Hwa;Ko, Hyun-Sook;Lee, Jun-Sang;Lee, Yong Seok
    • The Korean Journal of Malacology
    • /
    • v.28 no.4
    • /
    • pp.377-384
    • /
    • 2012
  • The importance of biological resources has been gradually increasing, and mollusks have been utilized as main fishery resources in terrestrial ecosystems. But little is known about genomic and transcriptional analysis in mollusks. This is the first report on the transcriptomic profile of Meretrix lusoria. In this study, we constructed cDNA library and determined 542 of distinct EST sequences composed of 284 singletons and 95 contigs. At first, we identified 180 of EST sequences that have significant hits on protein sequences of the exclusive Mollusks database through BLASTX program and 343 of EST sequences that have significant hits on NCBI NR database. We also found that 211 of putative sequences through local BLAST (blastx, E < e-10) search against KOG database were classified into 16 functional categories. Some kinds of immune response related genes encoding allograft inflammatory factor 1 (AIF-1), B-cell translocation gene 1 (BTG1), C-type lectin A, thioester-containing protein and 26S proteasome regulatory complex were identified. To determine phylogenetic relationship, we identified partial sequences of four genes (COX1, COX2, 12S rRNA and NADH dehydrogenase) that significantly matched with the mitochondrial genomes of 3 species-Ml (Meretrix lusoria), Mp (Meretrix petechialis) and Mm (Meretrix meretrix). As a result, we found that there was a little bit of a difference between sequences of Korean isolates and other known isolates. This study will be useful to develop breeding technology and might also be helpful to establish a classification system.