• 제목/요약/키워드: Machine learning algorithm

검색결과 1,480건 처리시간 0.031초

A Sentiment Classification Approach of Sentences Clustering in Webcast Barrages

  • Li, Jun;Huang, Guimin;Zhou, Ya
    • Journal of Information Processing Systems
    • /
    • 제16권3호
    • /
    • pp.718-732
    • /
    • 2020
  • Conducting sentiment analysis and opinion mining are challenging tasks in natural language processing. Many of the sentiment analysis and opinion mining applications focus on product reviews, social media reviews, forums and microblogs whose reviews are topic-similar and opinion-rich. In this paper, we try to analyze the sentiments of sentences from online webcast reviews that scroll across the screen, which we call live barrages. Contrary to social media comments or product reviews, the topics in live barrages are more fragmented, and there are plenty of invalid comments that we must remove in the preprocessing phase. To extract evaluative sentiment sentences, we proposed a novel approach that clusters the barrages from the same commenter to solve the problem of scattering the information for each barrage. The method developed in this paper contains two subtasks: in the data preprocessing phase, we cluster the sentences from the same commenter and remove unavailable sentences; and we use a semi-supervised machine learning approach, the naïve Bayes algorithm, to analyze the sentiment of the barrage. According to our experimental results, this method shows that it performs well in analyzing the sentiment of online webcast barrages.

신문기사로부터 추출한 최근동향에 대한 트위터 감성분석 (Twitter Sentiment Analysis for the Recent Trend Extracted from the Newspaper Article)

  • 이경호;이공주
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제2권10호
    • /
    • pp.731-738
    • /
    • 2013
  • 본 논문은 사회의 최근 동향에 대한 여론의 반응을 관찰하기 위한 방법을 나타낸다. 최근 동향을 나타내는 키워드를 신문기사로부터 추출하고, 추출된 키워드를 이용하여 수집된 트윗의 감성 분석을 통해 최근 동향에 대한 여론을 분석한다. 수집된 신문기사를 k-means알고리즘을 이용하여 군집화하고, 군집내의 단어의 출현 빈도를 이용하여 토픽 키워드를 선정하였다. 각 토픽에 대하여 수집된 트윗은 그 토픽 대한 트윗이라는 가정하에 기계학습 방법을 이용하여 긍/부정을 판별하여 감성을 판단하게 하였다. 그리고 이와 같은 가정에 대한 타당성을 검증해 보았다.

의사결정트리를 이용한 개별 공시지가 비교표준지의 자동 선정 (An Automatic Method for Selecting Comparative Standard Land Parcels in Land Price Appraisal Using a Decision Tree)

  • 김종윤;박수홍
    • 한국지리정보학회지
    • /
    • 제7권1호
    • /
    • pp.9-19
    • /
    • 2004
  • 개별 공시지가 산정에 있어 비교 표준지의 선정은 가장 중요한 작업으로서, 최대한 객관적이고 합리적으로 이루어져야 한다. 그러나 현재 비교표준지를 선정하는 작업은 담당 공무원의 수작업에 의해 이루어지기 때문에 효율성이나 객관성을 보장하기가 어렵다. 본 연구에서는 현행 비교표준지 선정방식을 분석하여 문제를 정의하고 비교표준지 선정 업무의 자동화에 적용가능한 기계학습 알고리즘으로 의사결정트리를 선정하고 비교표준지를 선정하여 규칙을 주제지향적인 데이터베이스를 기반으로 학습하였다. 이렇게 학습된 규칙을 이용하여 비교표준지를 선정하고 그 결과를 평가 분석하여 새로운 비교표준지 선정 방법을 제안하였다.

  • PDF

Saliency Map 다중 채널을 기반으로 한 개선된 객체 추출 방법 (Enhanced Object Extraction Method Based on Multi-channel Saliency Map)

  • 최영진;퀴런;김광락;김형중
    • 전자공학회논문지
    • /
    • 제53권2호
    • /
    • pp.53-61
    • /
    • 2016
  • 영상으로부터 중요 객체를 구하는 Saliency Map은 현재 영상처리 분야에서 가장 활발한 연구 분야이다. 이와 관련한 여러 연구가 진행되어가고 있으나 Saliency Map의 객체를 추출하는 것이 어려운 상황이다. 본 논문에서는 제안하는 SLIC와 색상차, 영역간 거리, texture 정보를 이용하여 객체 추출하는 방법으로 Saliency Map을 개선하고자 한다. 실험결과는 본 논문에서 제안하는 방법을 통해 영상의 모든 영역이 아닌 중앙에 있는 영역을 중점으로 주요 객체를 추출하는 결과를 보였다.

A Context-aware Task Offloading Scheme in Collaborative Vehicular Edge Computing Systems

  • Jin, Zilong;Zhang, Chengbo;Zhao, Guanzhe;Jin, Yuanfeng;Zhang, Lejun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권2호
    • /
    • pp.383-403
    • /
    • 2021
  • With the development of mobile edge computing (MEC), some late-model application technologies, such as self-driving, augmented reality (AR) and traffic perception, emerge as the times require. Nevertheless, the high-latency and low-reliability of the traditional cloud computing solutions are difficult to meet the requirement of growing smart cars (SCs) with computing-intensive applications. Hence, this paper studies an efficient offloading decision and resource allocation scheme in collaborative vehicular edge computing networks with multiple SCs and multiple MEC servers to reduce latency. To solve this problem with effect, we propose a context-aware offloading strategy based on differential evolution algorithm (DE) by considering vehicle mobility, roadside units (RSUs) coverage, vehicle priority. On this basis, an autoregressive integrated moving average (ARIMA) model is employed to predict idle computing resources according to the base station traffic in different periods. Simulation results demonstrate that the practical performance of the context-aware vehicular task offloading (CAVTO) optimization scheme could reduce the system delay significantly.

Fault Diagnosis Method based on Feature Residual Values for Industrial Rotor Machines

  • Kim, Donghwan;Kim, Younhwan;Jung, Joon-Ha;Sohn, Seokman
    • KEPCO Journal on Electric Power and Energy
    • /
    • 제4권2호
    • /
    • pp.89-99
    • /
    • 2018
  • Downtime and malfunction of industrial rotor machines represents a crucial cost burden and productivity loss. Fault diagnosis of this equipment has recently been carried out to detect their fault(s) and cause(s) by using fault classification methods. However, these methods are of limited use in detecting rotor faults because of their hypersensitivity to unexpected and different equipment conditions individually. These limitations tend to affect the accuracy of fault classification since fault-related features calculated from vibration signal are moved to other regions or changed. To improve the limited diagnosis accuracy of existing methods, we propose a new approach for fault diagnosis of rotor machines based on the model generated by supervised learning. Our work is based on feature residual values from vibration signals as fault indices. Our diagnostic model is a robust and flexible process that, once learned from historical data only one time, allows it to apply to different target systems without optimization of algorithms. The performance of the proposed method was evaluated by comparing its results with conventional methods for fault diagnosis of rotor machines. The experimental results show that the proposed method can be used to achieve better fault diagnosis, even when applied to systems with different normal-state signals, scales, and structures, without tuning or the use of a complementary algorithm. The effectiveness of the method was assessed by simulation using various rotor machine models.

A Study on Design of Real-time Big Data Collection and Analysis System based on OPC-UA for Smart Manufacturing of Machine Working

  • Kim, Jaepyo;Kim, Youngjoo;Kim, Seungcheon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제13권4호
    • /
    • pp.121-128
    • /
    • 2021
  • In order to design a real time big data collection and analysis system of manufacturing data in a smart factory, it is important to establish an appropriate wired/wireless communication system and protocol. This paper introduces the latest communication protocol, OPC-UA (Open Platform Communication Unified Architecture) based client/server function, applied user interface technology to configure a network for real-time data collection through IoT Integration. Then, Database is designed in MES (Manufacturing Execution System) based on the analysis table that reflects the user's requirements among the data extracted from the new cutting process automation process, bush inner diameter indentation measurement system and tool monitoring/inspection system. In summary, big data analysis system introduced in this paper performs SPC (statistical Process Control) analysis and visualization analysis with interface of OPC-UA-based wired/wireless communication. Through AI learning modeling with XGBoost (eXtream Gradient Boosting) and LR (Linear Regression) algorithm, quality and visualization analysis is carried out the storage and connection to the cloud.

가상현실 기반 사용자 참여형 타공패널 파사드 설계 방법론 (User-Participated Design Method for Perforated Metal Facades using Virtual Reality)

  • 장도진;김성준;김성아
    • 대한건축학회논문집:계획계
    • /
    • 제36권4호
    • /
    • pp.103-111
    • /
    • 2020
  • Perforated metal sheets are used as panels of facades for controlling environmental factors while ensuring user's visibility. Despite their functional potentials, only a specific direction of facades or an orientation of a building was considered in the relevant studies. This study proposed a design methodology for the perforated panel facades that reflects the location on the facades and the user's requirements. The optimization of quantitative and qualitative performance is achieved through communication between designers and users in a VR system. In optimizing quantitative performances, designers use machine learning techniques such as clustering and genetic algorithm to allocate optimal panels on the facades. In optimizing qualitative performances, through the VR system, users intervene in evaluating performances whose preferences are depending on them. The experiment using the office project showed that designers were able to make decisions based on clustering using GMM to optimize multiple quantitative performances. The gap between the target and final performance could be narrowed by limiting the types of perforated panels considering mass customization. In assessing visibility as a qualitative performance, users were able to participate in the design process using the VR system.

An Extended Work Architecture for Online Threat Prediction in Tweeter Dataset

  • Sheoran, Savita Kumari;Yadav, Partibha
    • International Journal of Computer Science & Network Security
    • /
    • 제21권1호
    • /
    • pp.97-106
    • /
    • 2021
  • Social networking platforms have become a smart way for people to interact and meet on internet. It provides a way to keep in touch with friends, families, colleagues, business partners, and many more. Among the various social networking sites, Twitter is one of the fastest-growing sites where users can read the news, share ideas, discuss issues etc. Due to its vast popularity, the accounts of legitimate users are vulnerable to the large number of threats. Spam and Malware are some of the most affecting threats found on Twitter. Therefore, in order to enjoy seamless services it is required to secure Twitter against malicious users by fixing them in advance. Various researches have used many Machine Learning (ML) based approaches to detect spammers on Twitter. This research aims to devise a secure system based on Hybrid Similarity Cosine and Soft Cosine measured in combination with Genetic Algorithm (GA) and Artificial Neural Network (ANN) to secure Twitter network against spammers. The similarity among tweets is determined using Cosine with Soft Cosine which has been applied on the Twitter dataset. GA has been utilized to enhance training with minimum training error by selecting the best suitable features according to the designed fitness function. The tweets have been classified as spammer and non-spammer based on ANN structure along with the voting rule. The True Positive Rate (TPR), False Positive Rate (FPR) and Classification Accuracy are considered as the evaluation parameter to evaluate the performance of system designed in this research. The simulation results reveals that our proposed model outperform the existing state-of-arts.

A System Engineering Approach to Predict the Critical Heat Flux Using Artificial Neural Network (ANN)

  • Wazif, Muhammad;Diab, Aya
    • 시스템엔지니어링학술지
    • /
    • 제16권2호
    • /
    • pp.38-46
    • /
    • 2020
  • The accurate measurement of critical heat flux (CHF) in flow boiling is important for the safety requirement of the nuclear power plant to prevent sharp degradation of the convective heat transfer between the surface of the fuel rod cladding and the reactor coolant. In this paper, a System Engineering approach is used to develop a model that predicts the CHF using machine learning. The model is built using artificial neural network (ANN). The model is then trained, tested and validated using pre-existing database for different flow conditions. The Talos library is used to tune the model by optimizing the hyper parameters and selecting the best network architecture. Once developed, the ANN model can predict the CHF based solely on a set of input parameters (pressure, mass flux, quality and hydraulic diameter) without resorting to any physics-based model. It is intended to use the developed model to predict the DNBR under a large break loss of coolant accident (LBLOCA) in APR1400. The System Engineering approach proved very helpful in facilitating the planning and management of the current work both efficiently and effectively.