• Title/Summary/Keyword: 사전 기반 모델

Search Result 850, Processing Time 0.029 seconds

Predicting the Direction of the Stock Index by Using a Domain-Specific Sentiment Dictionary (주가지수 방향성 예측을 위한 주제지향 감성사전 구축 방안)

  • Yu, Eunji;Kim, Yoosin;Kim, Namgyu;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.1
    • /
    • pp.95-110
    • /
    • 2013
  • Recently, the amount of unstructured data being generated through a variety of social media has been increasing rapidly, resulting in the increasing need to collect, store, search for, analyze, and visualize this data. This kind of data cannot be handled appropriately by using the traditional methodologies usually used for analyzing structured data because of its vast volume and unstructured nature. In this situation, many attempts are being made to analyze unstructured data such as text files and log files through various commercial or noncommercial analytical tools. Among the various contemporary issues dealt with in the literature of unstructured text data analysis, the concepts and techniques of opinion mining have been attracting much attention from pioneer researchers and business practitioners. Opinion mining or sentiment analysis refers to a series of processes that analyze participants' opinions, sentiments, evaluations, attitudes, and emotions about selected products, services, organizations, social issues, and so on. In other words, many attempts based on various opinion mining techniques are being made to resolve complicated issues that could not have otherwise been solved by existing traditional approaches. One of the most representative attempts using the opinion mining technique may be the recent research that proposed an intelligent model for predicting the direction of the stock index. This model works mainly on the basis of opinions extracted from an overwhelming number of economic news repots. News content published on various media is obviously a traditional example of unstructured text data. Every day, a large volume of new content is created, digitalized, and subsequently distributed to us via online or offline channels. Many studies have revealed that we make better decisions on political, economic, and social issues by analyzing news and other related information. In this sense, we expect to predict the fluctuation of stock markets partly by analyzing the relationship between economic news reports and the pattern of stock prices. So far, in the literature on opinion mining, most studies including ours have utilized a sentiment dictionary to elicit sentiment polarity or sentiment value from a large number of documents. A sentiment dictionary consists of pairs of selected words and their sentiment values. Sentiment classifiers refer to the dictionary to formulate the sentiment polarity of words, sentences in a document, and the whole document. However, most traditional approaches have common limitations in that they do not consider the flexibility of sentiment polarity, that is, the sentiment polarity or sentiment value of a word is fixed and cannot be changed in a traditional sentiment dictionary. In the real world, however, the sentiment polarity of a word can vary depending on the time, situation, and purpose of the analysis. It can also be contradictory in nature. The flexibility of sentiment polarity motivated us to conduct this study. In this paper, we have stated that sentiment polarity should be assigned, not merely on the basis of the inherent meaning of a word but on the basis of its ad hoc meaning within a particular context. To implement our idea, we presented an intelligent investment decision-support model based on opinion mining that performs the scrapping and parsing of massive volumes of economic news on the web, tags sentiment words, classifies sentiment polarity of the news, and finally predicts the direction of the next day's stock index. In addition, we applied a domain-specific sentiment dictionary instead of a general purpose one to classify each piece of news as either positive or negative. For the purpose of performance evaluation, we performed intensive experiments and investigated the prediction accuracy of our model. For the experiments to predict the direction of the stock index, we gathered and analyzed 1,072 articles about stock markets published by "M" and "E" media between July 2011 and September 2011.

Preliminary Environmental Impact Assessments on Fish Compositions and the Ecological Health of Jeokbyeok River on the Road Construction of Muju-Geumsan Region (무주-금산간 도로건설에 따른 적벽강의 어류 종 조성 분석 및 생태건강도 사전환경성평가)

  • Lee, Sang-Jae;Park, Hee-Sung;An, Kwang-Guk
    • Journal of Environmental Impact Assessment
    • /
    • v.26 no.1
    • /
    • pp.27-43
    • /
    • 2017
  • The objectives of the study were to evaluate fish compositions, endangered species, community structure, physical habitat, and general water quality for a preliminary environmental impact assessment of Jeokbyeok River on the road construction between two regions. Total number of species and total number of individuals, based on CPUE, were 23 and 1186, respectively. The endangered species (I, II) as the legal protection species were Pseudopungtungia nigra (79 samples) Gobiobotia brevibarba) (5) Gobiobotia macrocephala (2), indicating a requiring of endangered species conservation. In the meantime, exotic species and ecological disturbing species such as Micropterus salmoides and Lepomis macrochirus, were not present, indicating a well conserved area. According to fish community analysis, values of species diversity index were high (range: 0.788 - 1.030), and the dominance index were low (range: 0.097 - 0.183), indicating that the fish community in this area was maintained well without high dominacne by specific species. Also, fish analysis on tolerance guilds and trophic guilds showed that the proportions of sensitive species were largely exceeded the proportions of the tolerant species, while the proportions of insectivore species were largely exceeded the proportions of the omnivore species. This outcome suggests that the ecosystem was well maintained in terms of tolerance and trophic compositions (food chain). Ecological health, based on the multi-metric fish model of Fish Assessment Integrity (FAI), reflected those fish conditions. In other words, values of FAI model averaged 82.4, which means a "good condition" in the criteria of ecological health by the Minstry of Environment, Korea. In addition, general water quality and physical habitat analyses showed that the system was in good condition. Under these conditions, if the road constructions between the two regions happen in the future, inorganic suspended solids may increase in the waterbody, and this may result in indirect or direct influences on the physical habitats and food chain as well as fish compositions, so the ecological protections and prevention strategy from the soil erosion are required in the system.

Reduction effects of N-acetyl-L-cysteine, L-glutathione, and indole-3-acetic acid on phytotoxicity generated by methyl bromide fumigation- in a model plant Arabidopsis thaliana (모델식물 애기장대에 대한 훈증제 메틸브로마이드의 약해발생 및 N-acetyl-L-cysteine, L-glutathione, indole-3-acetic acid의 약해억제 효과)

  • Kim, Kyeongnam;Kim, Chaeeun;Park, Jungeun;Yoo, Jinsung;Kim, Woosung;Jeon, Hwang-Ju;Kim, Jun-Ran;Lee, Sung-Eun
    • Korean Journal of Environmental Biology
    • /
    • v.39 no.3
    • /
    • pp.354-361
    • /
    • 2021
  • Understanding the phytotoxic mechanism of methyl bromide (MB), an essential fumigant during the quarantine and pre-shipment process, is urgently needed to ensure its proper use and reduce international economic losses. In a previous study, two main MB-induced toxic mechanisms such as reactive oxygen species (ROS) and auxin distribution were selected by analyzing transcriptomic analysis. In the study, a 3-week-old A. thaliana was supplied with 1 mM ROS scavengers [N-acetyl-L-cysteine (NAC) or L-glutathione (GSH)] and 1µM indole-3-acetic acid(IAA) three times every 12 h, and visual and gene expression assessments were performed to evaluate the reduction in phytotoxicity by supplements. Phytotoxic effects on the MB-4h exposed group were decreased with GSH application compared to the other single supplements and a combination of supplements at 7 days post fumigation. Among these supplements, GSH at a concentration of 1, 2, and 5mM was suppled to A. thaliana with MB-fumigation. During a long-term observation of 2 weeks after the fumigation, 5 mM GSH application was the most effective in minimizing MB-induced phytotoxic effects with up-regulation of HSP70 expression and increase in main stem length. These results indicated that ROS was a main key factor of MB-induced phytotoxicity and that GSH can be used as a supplement to reduce the phytotoxicity of MB.

A Study on the Revitalization of BIM in the Field of Architecture Using AHP Method (AHP 기법을 이용한 건축분야 BIM 활성화 방안 연구)

  • Kim, Jin-Ho;Hwang, Chan-Gyu;Kim, Ji-Hyung
    • Journal of the Korea Institute of Building Construction
    • /
    • v.22 no.5
    • /
    • pp.473-483
    • /
    • 2022
  • BIM(Building Information Modeling) is a technology that can manage information throughout the entire life cycle of the construction industry and serves as a platform for improving productivity and integrating the entire construction industry. Currently, BIM is actively applied in developed countries, and its use at various overseas construction sites is increasing This is unclear. due to air shortening and budget savings. However, there is still a lack of institutional basis and technical limitations in the domestic construction sector, which have led to the lack of utilization of BIM. Various activation measures and institutional frameworks will need to be established for the early establishment of these productive BIMs in Korea. Therefore, as part of the research for the domestic settlement and revitalization of BIM, this study derived a number of key factors necessary for the development of the construction industry through brainstorming and expert surveys using AHP techniques and analyzed the relative importance of each factor. In addition, prior surveys by a group of experts resulted in 1, 3 items in level, 2, 9 items in level, and 3, 27 items in level, and priorities analysis was performed through pairwise comparisons. As a result of the AHP analysis, it was found that the relative importance weight of policy aspects was highest in level 1, and the policy factors in level 2 and the cost-based and incentive system introduction factors were considered most important in level 3. These findings show that the importance of the policy guidance or institutions underlying the activation of BIM rather than research and development or corporate innovation is relatively high, and that the preparation of policy plans by public institutions should be the first priority. Therefore, it is considered that the development of a policy system or guideline must be prioritized before it can be advanced to the next activation stage. The use of BIM technologies will not only contribute to improving the productivity of the construction industry, but also to the overall development of the industry and the growth of the construction industry. It is expected that the results of this study can provide as useful information when establishing policies for activating BIM in central government, relevant local governments, and related public institutions.

Basic Study for Selection of Factors Constituents of User Satisfaction for Micro Electric Vehicles (초소형전기차 사용자만족도 구성요인 선정을 위한 기반연구)

  • Jin, Eunju;Seo, Imki;Kim, Jongmin;Park, Jejin
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.41 no.5
    • /
    • pp.581-589
    • /
    • 2021
  • With the recent increase in the introduction of micro-electric vehicles in Korea, interest in micro-electric vehicle user satisfaction is increasing to revitalize related markets. In this paper, a basic study was conducted on the development of public services using micro-electric vehicle based on the constituent factors of user satisfaction. The survey includes: ① 'Analytic Hierarchy Process (AHP) for selecting the priority factors of user satisfaction of micro-electric vehicles', ② 'A survey of micro-electric vehicles image' to collect data in advance for providing users' preferences and transportation services for micro-electric vehicles, ③ In order to investigate the user satisfaction level of users who actually operated micro-electric vehicles, the order of 'user satisfaction survey of micro-electric vehicle drivers' was conducted. In the Analytic Hierarchy Process (AHP) analysis, it was found that users regarded as important in the order of 'user utilization data', 'vehicle movement data', and 'charging service data'. In the micro-electric vehicle image survey, users perceived micro-electric vehicles more positively in terms of "safety", 'durability', 'Ride comfort', 'design', 'MOOE (Maintenance and other operating expense)', and 'environment-friendly' when comparing micro-electric vehicles with electric motorcycles. In the survey on the user satisfaction of micro-electric vehicle drivers, the use of micro-electric vehicle did not directly affect work performance efficiency, and there was an experience of being disadvantaged on the road due to the size of the micro-electric vehicle, and driving in a cluster of micro-electric vehicle for outdoor advertisements. The city's public relations effect was great, but it was concerned about safety. In the future, based on the results of this study, we plan to build a user satisfaction structural equation model, preemptively discover feedback R&D for micro-electric vehicle utilization services in the public field, and actively seek to discover new public mobility support services.

Improvement of Mid-Wave Infrared Image Visibility Using Edge Information of KOMPSAT-3A Panchromatic Image (KOMPSAT-3A 전정색 영상의 윤곽 정보를 이용한 중적외선 영상 시인성 개선)

  • Jinmin Lee;Taeheon Kim;Hanul Kim;Hongtak Lee;Youkyung Han
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_1
    • /
    • pp.1283-1297
    • /
    • 2023
  • Mid-wave infrared (MWIR) imagery, due to its ability to capture the temperature of land cover and objects, serves as a crucial data source in various fields including environmental monitoring and defense. The KOMPSAT-3A satellite acquires MWIR imagery with high spatial resolution compared to other satellites. However, the limited spatial resolution of MWIR imagery, in comparison to electro-optical (EO) imagery, constrains the optimal utilization of the KOMPSAT-3A data. This study aims to create a highly visible MWIR fusion image by leveraging the edge information from the KOMPSAT-3A panchromatic (PAN) image. Preprocessing is implemented to mitigate the relative geometric errors between the PAN and MWIR images. Subsequently, we employ a pre-trained pixel difference network (PiDiNet), a deep learning-based edge information extraction technique, to extract the boundaries of objects from the preprocessed PAN images. The MWIR fusion imagery is then generated by emphasizing the brightness value corresponding to the edge information of the PAN image. To evaluate the proposed method, the MWIR fusion images were generated in three different sites. As a result, the boundaries of terrain and objects in the MWIR fusion images were emphasized to provide detailed thermal information of the interest area. Especially, the MWIR fusion image provided the thermal information of objects such as airplanes and ships which are hard to detect in the original MWIR images. This study demonstrated that the proposed method could generate a single image that combines visible details from an EO image and thermal information from an MWIR image, which contributes to increasing the usage of MWIR imagery.

Transfer Learning using Multiple ConvNet Layers Activation Features with Principal Component Analysis for Image Classification (전이학습 기반 다중 컨볼류션 신경망 레이어의 활성화 특징과 주성분 분석을 이용한 이미지 분류 방법)

  • Byambajav, Batkhuu;Alikhanov, Jumabek;Fang, Yang;Ko, Seunghyun;Jo, Geun Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.205-225
    • /
    • 2018
  • Convolutional Neural Network (ConvNet) is one class of the powerful Deep Neural Network that can analyze and learn hierarchies of visual features. Originally, first neural network (Neocognitron) was introduced in the 80s. At that time, the neural network was not broadly used in both industry and academic field by cause of large-scale dataset shortage and low computational power. However, after a few decades later in 2012, Krizhevsky made a breakthrough on ILSVRC-12 visual recognition competition using Convolutional Neural Network. That breakthrough revived people interest in the neural network. The success of Convolutional Neural Network is achieved with two main factors. First of them is the emergence of advanced hardware (GPUs) for sufficient parallel computation. Second is the availability of large-scale datasets such as ImageNet (ILSVRC) dataset for training. Unfortunately, many new domains are bottlenecked by these factors. For most domains, it is difficult and requires lots of effort to gather large-scale dataset to train a ConvNet. Moreover, even if we have a large-scale dataset, training ConvNet from scratch is required expensive resource and time-consuming. These two obstacles can be solved by using transfer learning. Transfer learning is a method for transferring the knowledge from a source domain to new domain. There are two major Transfer learning cases. First one is ConvNet as fixed feature extractor, and the second one is Fine-tune the ConvNet on a new dataset. In the first case, using pre-trained ConvNet (such as on ImageNet) to compute feed-forward activations of the image into the ConvNet and extract activation features from specific layers. In the second case, replacing and retraining the ConvNet classifier on the new dataset, then fine-tune the weights of the pre-trained network with the backpropagation. In this paper, we focus on using multiple ConvNet layers as a fixed feature extractor only. However, applying features with high dimensional complexity that is directly extracted from multiple ConvNet layers is still a challenging problem. We observe that features extracted from multiple ConvNet layers address the different characteristics of the image which means better representation could be obtained by finding the optimal combination of multiple ConvNet layers. Based on that observation, we propose to employ multiple ConvNet layer representations for transfer learning instead of a single ConvNet layer representation. Overall, our primary pipeline has three steps. Firstly, images from target task are given as input to ConvNet, then that image will be feed-forwarded into pre-trained AlexNet, and the activation features from three fully connected convolutional layers are extracted. Secondly, activation features of three ConvNet layers are concatenated to obtain multiple ConvNet layers representation because it will gain more information about an image. When three fully connected layer features concatenated, the occurring image representation would have 9192 (4096+4096+1000) dimension features. However, features extracted from multiple ConvNet layers are redundant and noisy since they are extracted from the same ConvNet. Thus, a third step, we will use Principal Component Analysis (PCA) to select salient features before the training phase. When salient features are obtained, the classifier can classify image more accurately, and the performance of transfer learning can be improved. To evaluate proposed method, experiments are conducted in three standard datasets (Caltech-256, VOC07, and SUN397) to compare multiple ConvNet layer representations against single ConvNet layer representation by using PCA for feature selection and dimension reduction. Our experiments demonstrated the importance of feature selection for multiple ConvNet layer representation. Moreover, our proposed approach achieved 75.6% accuracy compared to 73.9% accuracy achieved by FC7 layer on the Caltech-256 dataset, 73.1% accuracy compared to 69.2% accuracy achieved by FC8 layer on the VOC07 dataset, 52.2% accuracy compared to 48.7% accuracy achieved by FC7 layer on the SUN397 dataset. We also showed that our proposed approach achieved superior performance, 2.8%, 2.1% and 3.1% accuracy improvement on Caltech-256, VOC07, and SUN397 dataset respectively compare to existing work.

A Machine Learning-based Total Production Time Prediction Method for Customized-Manufacturing Companies (주문생산 기업을 위한 기계학습 기반 총생산시간 예측 기법)

  • Park, Do-Myung;Choi, HyungRim;Park, Byung-Kwon
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.177-190
    • /
    • 2021
  • Due to the development of the fourth industrial revolution technology, efforts are being made to improve areas that humans cannot handle by utilizing artificial intelligence techniques such as machine learning. Although on-demand production companies also want to reduce corporate risks such as delays in delivery by predicting total production time for orders, they are having difficulty predicting this because the total production time is all different for each order. The Theory of Constraints (TOC) theory was developed to find the least efficient areas to increase order throughput and reduce order total cost, but failed to provide a forecast of total production time. Order production varies from order to order due to various customer needs, so the total production time of individual orders can be measured postmortem, but it is difficult to predict in advance. The total measured production time of existing orders is also different, which has limitations that cannot be used as standard time. As a result, experienced managers rely on persimmons rather than on the use of the system, while inexperienced managers use simple management indicators (e.g., 60 days total production time for raw materials, 90 days total production time for steel plates, etc.). Too fast work instructions based on imperfections or indicators cause congestion, which leads to productivity degradation, and too late leads to increased production costs or failure to meet delivery dates due to emergency processing. Failure to meet the deadline will result in compensation for delayed compensation or adversely affect business and collection sectors. In this study, to address these problems, an entity that operates an order production system seeks to find a machine learning model that estimates the total production time of new orders. It uses orders, production, and process performance for materials used for machine learning. We compared and analyzed OLS, GLM Gamma, Extra Trees, and Random Forest algorithms as the best algorithms for estimating total production time and present the results.

Development of the Accident Prediction Model for Enlisted Men through an Integrated Approach to Datamining and Textmining (데이터 마이닝과 텍스트 마이닝의 통합적 접근을 통한 병사 사고예측 모델 개발)

  • Yoon, Seungjin;Kim, Suhwan;Shin, Kyungshik
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.1-17
    • /
    • 2015
  • In this paper, we report what we have observed with regards to a prediction model for the military based on enlisted men's internal(cumulative records) and external data(SNS data). This work is significant in the military's efforts to supervise them. In spite of their effort, many commanders have failed to prevent accidents by their subordinates. One of the important duties of officers' work is to take care of their subordinates in prevention unexpected accidents. However, it is hard to prevent accidents so we must attempt to determine a proper method. Our motivation for presenting this paper is to mate it possible to predict accidents using enlisted men's internal and external data. The biggest issue facing the military is the occurrence of accidents by enlisted men related to maladjustment and the relaxation of military discipline. The core method of preventing accidents by soldiers is to identify problems and manage them quickly. Commanders predict accidents by interviewing their soldiers and observing their surroundings. It requires considerable time and effort and results in a significant difference depending on the capabilities of the commanders. In this paper, we seek to predict accidents with objective data which can easily be obtained. Recently, records of enlisted men as well as SNS communication between commanders and soldiers, make it possible to predict and prevent accidents. This paper concerns the application of data mining to identify their interests, predict accidents and make use of internal and external data (SNS). We propose both a topic analysis and decision tree method. The study is conducted in two steps. First, topic analysis is conducted through the SNS of enlisted men. Second, the decision tree method is used to analyze the internal data with the results of the first analysis. The dependent variable for these analysis is the presence of any accidents. In order to analyze their SNS, we require tools such as text mining and topic analysis. We used SAS Enterprise Miner 12.1, which provides a text miner module. Our approach for finding their interests is composed of three main phases; collecting, topic analysis, and converting topic analysis results into points for using independent variables. In the first phase, we collect enlisted men's SNS data by commender's ID. After gathering unstructured SNS data, the topic analysis phase extracts issues from them. For simplicity, 5 topics(vacation, friends, stress, training, and sports) are extracted from 20,000 articles. In the third phase, using these 5 topics, we quantify them as personal points. After quantifying their topic, we include these results in independent variables which are composed of 15 internal data sets. Then, we make two decision trees. The first tree is composed of their internal data only. The second tree is composed of their external data(SNS) as well as their internal data. After that, we compare the results of misclassification from SAS E-miner. The first model's misclassification is 12.1%. On the other hand, second model's misclassification is 7.8%. This method predicts accidents with an accuracy of approximately 92%. The gap of the two models is 4.3%. Finally, we test if the difference between them is meaningful or not, using the McNemar test. The result of test is considered relevant.(p-value : 0.0003) This study has two limitations. First, the results of the experiments cannot be generalized, mainly because the experiment is limited to a small number of enlisted men's data. Additionally, various independent variables used in the decision tree model are used as categorical variables instead of continuous variables. So it suffers a loss of information. In spite of extensive efforts to provide prediction models for the military, commanders' predictions are accurate only when they have sufficient data about their subordinates. Our proposed methodology can provide support to decision-making in the military. This study is expected to contribute to the prevention of accidents in the military based on scientific analysis of enlisted men and proper management of them.

Development of a complex failure prediction system using Hierarchical Attention Network (Hierarchical Attention Network를 이용한 복합 장애 발생 예측 시스템 개발)

  • Park, Youngchan;An, Sangjun;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.127-148
    • /
    • 2020
  • The data center is a physical environment facility for accommodating computer systems and related components, and is an essential foundation technology for next-generation core industries such as big data, smart factories, wearables, and smart homes. In particular, with the growth of cloud computing, the proportional expansion of the data center infrastructure is inevitable. Monitoring the health of these data center facilities is a way to maintain and manage the system and prevent failure. If a failure occurs in some elements of the facility, it may affect not only the relevant equipment but also other connected equipment, and may cause enormous damage. In particular, IT facilities are irregular due to interdependence and it is difficult to know the cause. In the previous study predicting failure in data center, failure was predicted by looking at a single server as a single state without assuming that the devices were mixed. Therefore, in this study, data center failures were classified into failures occurring inside the server (Outage A) and failures occurring outside the server (Outage B), and focused on analyzing complex failures occurring within the server. Server external failures include power, cooling, user errors, etc. Since such failures can be prevented in the early stages of data center facility construction, various solutions are being developed. On the other hand, the cause of the failure occurring in the server is difficult to determine, and adequate prevention has not yet been achieved. In particular, this is the reason why server failures do not occur singularly, cause other server failures, or receive something that causes failures from other servers. In other words, while the existing studies assumed that it was a single server that did not affect the servers and analyzed the failure, in this study, the failure occurred on the assumption that it had an effect between servers. In order to define the complex failure situation in the data center, failure history data for each equipment existing in the data center was used. There are four major failures considered in this study: Network Node Down, Server Down, Windows Activation Services Down, and Database Management System Service Down. The failures that occur for each device are sorted in chronological order, and when a failure occurs in a specific equipment, if a failure occurs in a specific equipment within 5 minutes from the time of occurrence, it is defined that the failure occurs simultaneously. After configuring the sequence for the devices that have failed at the same time, 5 devices that frequently occur simultaneously within the configured sequence were selected, and the case where the selected devices failed at the same time was confirmed through visualization. Since the server resource information collected for failure analysis is in units of time series and has flow, we used Long Short-term Memory (LSTM), a deep learning algorithm that can predict the next state through the previous state. In addition, unlike a single server, the Hierarchical Attention Network deep learning model structure was used in consideration of the fact that the level of multiple failures for each server is different. This algorithm is a method of increasing the prediction accuracy by giving weight to the server as the impact on the failure increases. The study began with defining the type of failure and selecting the analysis target. In the first experiment, the same collected data was assumed as a single server state and a multiple server state, and compared and analyzed. The second experiment improved the prediction accuracy in the case of a complex server by optimizing each server threshold. In the first experiment, which assumed each of a single server and multiple servers, in the case of a single server, it was predicted that three of the five servers did not have a failure even though the actual failure occurred. However, assuming multiple servers, all five servers were predicted to have failed. As a result of the experiment, the hypothesis that there is an effect between servers is proven. As a result of this study, it was confirmed that the prediction performance was superior when the multiple servers were assumed than when the single server was assumed. In particular, applying the Hierarchical Attention Network algorithm, assuming that the effects of each server will be different, played a role in improving the analysis effect. In addition, by applying a different threshold for each server, the prediction accuracy could be improved. This study showed that failures that are difficult to determine the cause can be predicted through historical data, and a model that can predict failures occurring in servers in data centers is presented. It is expected that the occurrence of disability can be prevented in advance using the results of this study.