• Title/Summary/Keyword: network log analysis

Search Result 130, Processing Time 0.027 seconds

Analysis of the Construction Cost Prediction Performance according to Feature Scaling and Log Conversion of Target Variable (피처 스케일링과 타겟변수 로그변환에 따른 건축 공사비 예측 성능 분석)

  • Kang, Yoon-Ho;Yun, Seok-Heon
    • Journal of the Korea Institute of Building Construction
    • /
    • v.22 no.3
    • /
    • pp.317-326
    • /
    • 2022
  • With the development of various technologies in the area of artificial intelligence, a number of studies to application of artificial intelligence technology in the construction field are underway. Diverse technologies have been applied to the task of predicting construction costs, and construction cost prediction technologies applying artificial intelligence technologies have recently been developed. However, it is difficult to secure the vast amount of construction cost data required for machine learning, which has not yet been practically used. In this study, to predict the construction cost, the latest artificial neural network(ANN) method is used to propose a method to improve the construction cost prediction performance. In particular, to improve predictive performance, a log conversion method of target variables and a feature scaling method to eliminate the difference in the relative influence of each column data are applied, and their performance in predicting construction cost is compared and analyzed.

Customer Behavior Data Model using User Profile Analysis

  • Jung, Yong Gyu;Lee, Agatha;Lee, Jeong Chan;Lee, Young Dae
    • International Journal of Advanced Culture Technology
    • /
    • v.1 no.2
    • /
    • pp.13-17
    • /
    • 2013
  • Today, most of the companies have numerous issues to take advantage of the data within the organization. Modeling techniques could be described using profile and historical log data as a tool of data mining techniques. It is covered increasingly with data entry, research, processing, modeling and reporting components of the icon in the form of easy-to-use in many datamining tools. Visual data mining process can create a data stream. In this paper, customer behavior is predicted in pages or products, using the history profile analysis and the navigation items are necessary to predict unknown features.

  • PDF

Quantum Chemical Studies of Some Sulphanilamide Schiff Bases Inhibitor Activity Using QSAR Methods

  • Baher, Elham;Darzi, Naser;Morsali, Ali;Beyramabadi, Safar Ali
    • Journal of the Korean Chemical Society
    • /
    • v.59 no.6
    • /
    • pp.483-487
    • /
    • 2015
  • The different calculated quantum chemical descriptors by DFT method were used for prediction of some sulphanilamide Schiff bases inhibitor activity as a binding constant (log K). Multiple linear regression (MLR) and artificial neural network (ANN) were employed for developing the useful quantitative structure activity relationship (QSAR) model. The obtained results presented superiority of ANN model over the MLR one. The offering QSAR model is very easy to computation and Physico-Chemically interpretable. Sensitivity analysis was used to determine the relative importance of each descriptor in ANN model. The order of importance of each descriptor according to this analysis is: molecular volume, molecular weight and dipole moment, respectively. These descriptors appear good information related to different structure of sulphanilamide Schiff bases can participate in their inhibitor activity.

Malware Application Classification based on Feature Extraction and Machine Learning for Malicious Behavior Analysis in Android Platform (안드로이드 플랫폼에서 악성 행위 분석을 통한 특징 추출과 머신러닝 기반 악성 어플리케이션 분류)

  • Kim, Dong-Wook;Na, Kyung-Gi;Han, Myung-Mook;Kim, Mijoo;Go, Woong;Park, Jun Hyung
    • Journal of Internet Computing and Services
    • /
    • v.19 no.1
    • /
    • pp.27-35
    • /
    • 2018
  • This paper is a study to classify malicious applications in Android environment. And studying the threat and behavioral analysis of malicious Android applications. In addition, malicious apps classified by machine learning were performed as experiments. Android behavior analysis can use dynamic analysis tools. Through this tool, API Calls, Runtime Log, System Resource, and Network information for the application can be extracted. We redefined the properties extracted for machine learning and evaluated the results of machine learning classification by verifying between the overall features and the main features. The results show that key features have been improved by 1~4% over the full feature set. Especially, SVM classifier improved by 10%. From these results, we found that the application of the key features as a key feature was more effective in the performance of the classification algorithm than in the use of the overall features. It was also identified as important to select meaningful features from the data sets.

Long-Range Dependence and 1/f Noise in a Wide Area Network Traffic (광역 네트워크 트래픽의 장거리 상관관계와 1/f 노이즈)

  • Lee, Chang-Yong
    • Journal of KIISE:Information Networking
    • /
    • v.37 no.1
    • /
    • pp.27-34
    • /
    • 2010
  • In this paper, we examine a long-range dependence in an active measurement of a network traffic which has been a well known characteristic from analyses of a passive network traffic measurement. To this end, we utilize RTT(Round Trip Time), which is a typical active measurement measured by PingER project, and perform a relevant analysis to a time series of both RTT and its volatilities. The RTT time series exhibits a long-range dependence or a 1/f noise. The volatilities, defined as a higher-order variation, follow a log-normal distribution. Furthermore, volatilities show a long-range dependence in relatively short time intervals, and a long-range dependence and/or 1/f noise in long time intervals. From this study, we find that the long-range dependence is a characteristic of not only a passive traffic measurement but also an active measurement of network traffic such as RTT. From these findings, we can infer that the long-range dependence is a characteristic of network traffic independent of a type of measurements. In particular, an active measurement exhibits a 1/f noise which cannot be usually found in a passive measurement.

Preliminary Research for Korean Twitter User Analysis Focusing on Extreme Heavy User's Twitter Log (국내 트위터 유저 분석을 위한 예비연구 )

  • Jung, Hye-Lan;Ji, Sook-Young;Lee, Joong-Seek
    • Journal of the HCI Society of Korea
    • /
    • v.5 no.1
    • /
    • pp.37-43
    • /
    • 2010
  • Twitter has been continuously growing since October, 2006. Especially, not only the users and the number of messages have been increasing but also a new concept in social networking called 'micro blogging' has diffused. Within Korea, service such as 'me2day' has already been introduced and the improvement of internet accessibility within mobile devices is expected to expand the 'micro blogs'. In this point, this research is executed to study the new medium, 'micro blog'. To do so, we collected and analyzed Twitter logs of Korean users. Especially, we were curious about the extreme heavy users using Twitter, despite of the linguistic and cultural barrier of the foreign service. Who they are, why and how they use the 'micro blog'. First, we reviewed the general aspect of followers and messages by collecting a certain number of random samples. Using the Lorenz curve we found out that there was the imbalance within the users and based on this phenomenon we deducted an extreme heavy user group. In order to perform further analysis, log analysis was performed on the extreme heavy users. As the result, the users used multiple mobile and desktop 'Twitter' clients. The usage pattern was similar to that of internet usage time but was used during their "micro" time. The users using 'Twitter' not only to spread messages about important information, special events and emotions, but also as a habitual 'chatting tool' to express ordinary personal chats similar to SMS and IM services. In this research, it is proved that 68% of the total messages were ordinary personal chats. Also, with 24% of the total messages were retweets, we were able to find out that virtually connected 'people' and 'relationships' acted as the dominant trigger of their articulation.

  • PDF

Relationships between Topological Structures of Traffic Flows on the Subway Networks and Land Use Patterns in the Metropolitan Seoul (수도권 지하철망 상 통행흐름의 위상학적 구조와 토지이용의 관계)

  • Lee, Keum-Sook;Hong, Ji-Yeon;Min, Hee-Hwa;Park, Jong-Soo
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.10 no.4
    • /
    • pp.427-443
    • /
    • 2007
  • The purpose of this study is to investigate spacio-temporal structures of traffic flows on the subway network in the Metropolitan Seoul, and the relationships between topological structures of traffic flows and land use patterns. In particular we analyze in the topological structures of traffic flows on the subway network in time dimension as well as in spatial dimension. For the purpose, this study utilizes data mining techniques to the one day T-card transaction data of the last four years, which has developed for exploring the characteristics of traffic flows from large scale trip-transaction databases. The topological structures of traffic flows on the subway network has changed considerably during the last four years. The volumes of traffic flows, the travel time and stops per trip have increased until 2006 and decreased again in 2007. The results are visualized by utilizing GIS and analyzed, and thus the spatial patterns of traffic flows are analyzed. The spatial distribution patterns of trip origins and destinations show substantial differences among time zones during a day. We analyze the relationships between traffic flows at subway stops and the geographical variables reflecting land use around them. We obtain 6 log-linear functions from stepwise multiple regression analysis. We test multicollinearity among the variables and autocollelation for the residuals.

  • PDF

Generation of Pseudo Porosity Logs from Seismic Data Using a Polynomial Neural Network Method (다항식 신경망 기법을 이용한 탄성파 탐사 자료로부터의 유사공극률 검층자료 생성)

  • Choi, Jae-Won;Byun, Joong-Moo;Seol, Soon-Jee
    • Journal of the Korean earth science society
    • /
    • v.32 no.6
    • /
    • pp.665-673
    • /
    • 2011
  • In order to estimate the hydrocarbon reserves, the porosity of the reservoir must be determined. The porosity of the area without a well is generally calculated by extrapolating the porosity logs measured at wells. However, if not only well logs but also seismic data exist on the same site, the more accurate pseudo porosity log can be obtained through artificial neural network technique by extracting the relations between the seismic data and well logs at the site. In this study, we have developed a module which creates pseudo porosity logs by using the polynomial neural network method. In order to obtain more accurate pseudo porosity logs, we selected the seismic attributes which have high correlation values in the correlation analysis between the seismic attributes and the porosity logs. Through the training procedure between selected seismic attributes and well logs, our module produces the correlation weights which can be used to generate the pseudo porosity log in the well free area. To verify the reliability and the applicability of the developed module, we have applied the module to the field data acquired from F3 Block in the North Sea and compared the results to those from the probabilistic neural network method in a commercial program. We could confirm the reliability of our module because both results showed similar trend. Moreover, since the pseudo porosity logs from polynomial neural network method are closer to the true porosity logs at the wells than those from probabilistic method, we concluded that the polynomial neural network method is effective for the data sets with insufficient wells such as F3 Block in the North Sea.

Methodology Using Text Analysis for Packaging R&D Information Services on Pending National Issues (텍스트 분석을 활용한 국가 현안 대응 R&D 정보 패키징 방법론)

  • Hyun, Yoonjin;Han, Heejun;Choi, Heeseok;Park, Junhyung;Lee, Kyuha;Kwahk, Kee-Young;Kim, Namgyu
    • Journal of Information Technology Applications and Management
    • /
    • v.20 no.3_spc
    • /
    • pp.231-257
    • /
    • 2013
  • The recent rise in the unstructured data generated by social media has resulted in an increasing need to collect, store, search, analyze, and visualize it. These data cannot be managed effectively by using traditional data analysis methodologies because of their vast volume and unstructured nature. Therefore, many attempts are being made to analyze these unstructured data (e.g., text files and log files) by using commercial and noncommercial analytical tools. Especially, the attempt to discover meaningful knowledge by using text mining is being made in business and other areas such as politics, economics, and cultural studies. For instance, several studies have examined pending national issues by analyzing large volumes of texts on various social issues. However, it is difficult to create satisfactory information services that can identify R&D documents on specific national issues from among the various R&D resources. In other words, although users specify some words related to pending national issues as search keywords, they usually fail to retrieve the R&D information they are looking for. This is usually because of the discrepancy between the terms defining pending national issues and the corresponding terms used in R&D documents. We need a mediating logic to overcome this discrep 'ancy so that we can identify and package appropriate R&D information on specific pending national issues. In this paper, we use association analysis and social network analysis to devise a mediator for bridging the gap between the keywords defining pending national issues and those used in R&D documents. Further, we propose a methodology for packaging R&D information services for pending national issues by using the devised mediator. Finally, in order to evaluate the practical applicability of the proposed methodology, we apply it to the NTIS(National Science & Technology Information Service) system, and summarize the results in the case study section.

Analysis of promising countries for export using parametric and non-parametric methods based on ERGM: Focusing on the case of information communication and home appliance industries (ERGM 기반의 모수적 및 비모수적 방법을 활용한 수출 유망국가 분석: 정보통신 및 가전 산업 사례를 중심으로)

  • Jun, Seung-pyo;Seo, Jinny;Yoo, Jae-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.175-196
    • /
    • 2022
  • Information and communication and home appliance industries, which were one of South Korea's main industries, are gradually losing their export share as their export competitiveness is weakening. This study objectively analyzed export competitiveness and suggested export-promising countries in order to help South Korea's information communication and home appliance industries improve exports. In this study, network properties, centrality, and structural hole analysis were performed during network analysis to evaluate export competitiveness. In order to select promising export countries, we proposed a new variable that can take into account the characteristics of an already established International Trade Network (ITN), that is, the Global Value Chain (GVC), in addition to the existing economic factors. The conditional log-odds for individual links derived from the Exponential Random Graph Model (ERGM) in the analysis of the cross-border trade network were assumed as a proxy variable that can indicate the export potential. In consideration of the possibility of ERGM linkage, a parametric approach and a non-parametric approach were used to recommend export-promising countries, respectively. In the parametric method, a regression analysis model was developed to predict the export value of the information and communication and home appliance industries in South Korea by additionally considering the link-specific characteristics of the network derived from the ERGM to the existing economic factors. Also, in the non-parametric approach, an abnormality detection algorithm based on the clustering method was used, and a promising export country was proposed as a method of finding outliers that deviate from two peers. According to the research results, the structural characteristic of the export network of the industry was a network with high transferability. Also, according to the centrality analysis result, South Korea's influence on exports was weak compared to its size, and the structural hole analysis result showed that export efficiency was weak. According to the model for recommending promising exporting countries proposed by this study, in parametric analysis, Iran, Ireland, North Macedonia, Angola, and Pakistan were promising exporting countries, and in nonparametric analysis, Qatar, Luxembourg, Ireland, North Macedonia and Pakistan were analyzed as promising exporting countries. There were differences in some countries in the two models. The results of this study revealed that the export competitiveness of South Korea's information and communication and home appliance industries in GVC was not high compared to the size of exports, and thus showed that exports could be further reduced. In addition, this study is meaningful in that it proposed a method to find promising export countries by considering GVC networks with other countries as a way to increase export competitiveness. This study showed that, from a policy point of view, the international trade network of the information communication and home appliance industries has an important mutual relationship, and although transferability is high, it may not be easily expanded to a three-party relationship. In addition, it was confirmed that South Korea's export competitiveness or status was lower than the export size ranking. This paper suggested that in order to improve the low out-degree centrality, it is necessary to increase exports to Italy or Poland, which had significantly higher in-degrees. In addition, we argued that in order to improve the centrality of out-closeness, it is necessary to increase exports to countries with particularly high in-closeness. In particular, it was analyzed that Morocco, UAE, Argentina, Russia, and Canada should pay attention as export countries. This study also provided practical implications for companies expecting to expand exports. The results of this study argue that companies expecting export expansion need to pay attention to countries with a relatively high potential for export expansion compared to the existing export volume by country. In particular, for companies that export daily necessities, countries that should pay attention to the population are presented, and for companies that export high-end or durable products, countries with high GDP, or purchasing power, relatively low exports are presented. Since the process and results of this study can be easily extended and applied to other industries, it is also expected to develop services that utilize the results of this study in the public sector.