• Title/Summary/Keyword: increasing mapping

Search Result 269, Processing Time 0.026 seconds

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.57-73
    • /
    • 2021
  • Maintenance and prevention of failure through anomaly detection of ICT infrastructure is becoming important. System monitoring data is multidimensional time series data. When we deal with multidimensional time series data, we have difficulty in considering both characteristics of multidimensional data and characteristics of time series data. When dealing with multidimensional data, correlation between variables should be considered. Existing methods such as probability and linear base, distance base, etc. are degraded due to limitations called the curse of dimensions. In addition, time series data is preprocessed by applying sliding window technique and time series decomposition for self-correlation analysis. These techniques are the cause of increasing the dimension of data, so it is necessary to supplement them. The anomaly detection field is an old research field, and statistical methods and regression analysis were used in the early days. Currently, there are active studies to apply machine learning and artificial neural network technology to this field. Statistically based methods are difficult to apply when data is non-homogeneous, and do not detect local outliers well. The regression analysis method compares the predictive value and the actual value after learning the regression formula based on the parametric statistics and it detects abnormality. Anomaly detection using regression analysis has the disadvantage that the performance is lowered when the model is not solid and the noise or outliers of the data are included. There is a restriction that learning data with noise or outliers should be used. The autoencoder using artificial neural networks is learned to output as similar as possible to input data. It has many advantages compared to existing probability and linear model, cluster analysis, and map learning. It can be applied to data that does not satisfy probability distribution or linear assumption. In addition, it is possible to learn non-mapping without label data for teaching. However, there is a limitation of local outlier identification of multidimensional data in anomaly detection, and there is a problem that the dimension of data is greatly increased due to the characteristics of time series data. In this study, we propose a CMAE (Conditional Multimodal Autoencoder) that enhances the performance of anomaly detection by considering local outliers and time series characteristics. First, we applied Multimodal Autoencoder (MAE) to improve the limitations of local outlier identification of multidimensional data. Multimodals are commonly used to learn different types of inputs, such as voice and image. The different modal shares the bottleneck effect of Autoencoder and it learns correlation. In addition, CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of data. In general, conditional input mainly uses category variables, but in this study, time was used as a condition to learn periodicity. The CMAE model proposed in this paper was verified by comparing with the Unimodal Autoencoder (UAE) and Multi-modal Autoencoder (MAE). The restoration performance of Autoencoder for 41 variables was confirmed in the proposed model and the comparison model. The restoration performance is different by variables, and the restoration is normally well operated because the loss value is small for Memory, Disk, and Network modals in all three Autoencoder models. The process modal did not show a significant difference in all three models, and the CPU modal showed excellent performance in CMAE. ROC curve was prepared for the evaluation of anomaly detection performance in the proposed model and the comparison model, and AUC, accuracy, precision, recall, and F1-score were compared. In all indicators, the performance was shown in the order of CMAE, MAE, and AE. Especially, the reproduction rate was 0.9828 for CMAE, which can be confirmed to detect almost most of the abnormalities. The accuracy of the model was also improved and 87.12%, and the F1-score was 0.8883, which is considered to be suitable for anomaly detection. In practical aspect, the proposed model has an additional advantage in addition to performance improvement. The use of techniques such as time series decomposition and sliding windows has the disadvantage of managing unnecessary procedures; and their dimensional increase can cause a decrease in the computational speed in inference.The proposed model has characteristics that are easy to apply to practical tasks such as inference speed and model management.

The research on enhance the reinforcement of marine crime and accident using geographical profiling (지리적 프로파일링을 활용한 해양 범죄 및 해양사고 대응력 강화에 관한 연구)

  • Soon, Gil-Tae
    • Korean Security Journal
    • /
    • no.48
    • /
    • pp.147-176
    • /
    • 2016
  • Korean Peninsula is surrounded by ocean on three sides. Because of this geographical quality over 97% of export and import volumes are exchange by sea. Foreign ship and international passenger vessels carries foreign tourist and globalization and internationalization increases this trends. Leisure population grows with national income increase and interest of ocean. And accidents and incidents rates are also increases. Korea Coast Guard's jurisdiction area is 4.5 times bigger than our country. The length of coastline is 14,963km including islands. One patrol vessel is responsible for 24,068km and one coast guard substation is responsible for 94km. Efficient patrol activities can not be provided. This research focus on this problem. Analyze the status and trends of maritime crime and suggest efficient patrol activities. To deal with increasing maritime crime rate this study suggest to use geographical profile method which developed early 1900s in USA. This geographical profile analyse the spatial characteristic and mapping this result. With this result potential crime zone can be predicted. One of the result is hot spot management which gives data about habitual crime zone. In Korea National Police Agency adopt this method in 2008 and apply on patrol and crime prevention activity by analysis of different criteria. Korea National Police Agency analyse the crime rate with crime type, crime zone and potential crime zone, and hourly, regionally criteria. Korea Coast Guard need to adopt this method and apply on maritime to make maritime crime map, which shows type of crime with regional, periodical result. With this geographical profiling we can set a Criminal Point which shows the place where the crime often occurs. The Criminal Points are set with the data of numerous rates such as homicide, robbery, burglary, missing, collision which happened in ocean. Set this crime as the major crime and manage the data more thoroughly. I expect to enhance the reinforcement of marine crime using this Criminal Points. Because this points will give us efficient way to prevent the maritime crime by placing the patrol vessel where they needed most.

  • PDF

International Research Trend on Mountainous Sediment-related Disasters Induced by Earthquakes (지진 유발 산지토사재해 관련 국외 연구동향 분석)

  • Lee, Sang-In;Seo, Jung-Il;Kim, Jin-Hak;Ryu, Dong-Seop;Seo, Jun-Pyo;Kim, Dong-Yeob;Lee, Chang-Woo
    • Journal of Korean Society of Forest Science
    • /
    • v.106 no.4
    • /
    • pp.431-440
    • /
    • 2017
  • The 2016 Gyeongju Earthquake ($M_L$ 5.8) (occurred on September 12, 2016) and the 2017 Pohang Earthquake ($M_L$ 5.4) (occurred on November 15, 2017) caused unprecedented damages in South Korea. It is necessary to establish basic data related to earthquake-induced mountainous sediment-related disasters over worldwide. In this study, we analyzed previous international studies on the earthquake-induced mountainous sediment-related disasters, then classified research areas according to research themes using text-mining and co-word analysis in VOSviewer program, and finally examined spatio-temporal research trends by research area. The result showed that the related-researches have been rapidly increased since 2005, which seems to be affected by recent large-scale earthquakes occurred in China, Taiwan and Japan. In addition, the research area related to mountainous sediment-related disasters induced by earthquakes was classified into four subjects: (i) mechanisms of disaster occurrence; (ii) rainfall parameters controlling disaster occurrence; (iii) prediction of potential disaster area using aerial and satellite photographs; and (iv) disaster risk mapping through the modeling of disaster occurrence. These research areas are considered to have a strong correlation with each other. On the threshold year (i.e., 2012-2013), when cumulative number of research papers was reached 50% of total research papers published since 1987, proportions per unit year of all research areas should increase. Especially, the proportion of the research areas related to prediction of potential disaster area using aerial and satellite photographs is highly increased compared to other three research areas. These trends are responsible for the rapidly increasing research papers with study sites in China, and the research papers examined in Taiwan, Japan, and the United States have also contributed to increases in all research areas. The results are could be used as basic data to present future research direction related to mountainous sediment-related disasters induced by earthquakes in South Korea.

A Research in Applying Big Data and Artificial Intelligence on Defense Metadata using Multi Repository Meta-Data Management (MRMM) (국방 빅데이터/인공지능 활성화를 위한 다중메타데이터 저장소 관리시스템(MRMM) 기술 연구)

  • Shin, Philip Wootaek;Lee, Jinhee;Kim, Jeongwoo;Shin, Dongsun;Lee, Youngsang;Hwang, Seung Ho
    • Journal of Internet Computing and Services
    • /
    • v.21 no.1
    • /
    • pp.169-178
    • /
    • 2020
  • The reductions of troops/human resources, and improvement in combat power have made Korean Department of Defense actively adapt 4th Industrial Revolution technology (Artificial Intelligence, Big Data). The defense information system has been developed in various ways according to the task and the uniqueness of each military. In order to take full advantage of the 4th Industrial Revolution technology, it is necessary to improve the closed defense datamanagement system.However, the establishment and usage of data standards in all information systems for the utilization of defense big data and artificial intelligence has limitations due to security issues, business characteristics of each military, anddifficulty in standardizing large-scale systems. Based on the interworking requirements of each system, data sharing is limited through direct linkage through interoperability agreement between systems. In order to implement smart defense using the 4th Industrial Revolution technology, it is urgent to prepare a system that can share defense data and make good use of it. To technically support the defense, it is critical to develop Multi Repository Meta-Data Management (MRMM) that supports systematic standard management of defense data that manages enterprise standard and standard mapping for each system and promotes data interoperability through linkage between standards which obeys the Defense Interoperability Management Development Guidelines. We introduced MRMM, and implemented by using vocabulary similarity using machine learning and statistical approach. Based on MRMM, We expect to simplify the standardization integration of all military databases using artificial intelligence and bigdata. This will lead to huge reduction of defense budget while increasing combat power for implementing smart defense.

Petrological Study on the Spherulitic Rhyolite in the Jangsan Area, Busan (부산 장산 지역의 구과상(球課狀) 유문암에 대한 암석학적 연구)

  • Park, Sumi;Yun, Sung-Hyo
    • The Journal of the Petrological Society of Korea
    • /
    • v.22 no.3
    • /
    • pp.219-233
    • /
    • 2013
  • Spherulitic rhyolite occur as part of ring dyke which showing a vertical flowage of $60^{\circ}{\sim}90^{\circ}$, of the Jangsan cauldron was studied. The spherulites range in diameter from a few millimeters to 2.8 centimeters or more, and average 5~10 millimeters. It belongs to radiated simple spherulite type. They consist of a core of moderate brown dense material encased by a thin crust, a few millimeters thick at most of white grey material. The spherulites frequently have a radiating fibrous structure, which are thought to have formed as a consequence of rapid mineral growth caused by very fast cooling of the dykes in shallow depth near the surface. EPMA examination of the concentric-zoned core of spherulites show that they are mainly composed of cryptocrystalline-fibrous intergrowth of silica minerals and alkali feldspars which have $SiO_2$ 82% or more, $Al_2O_3$ 7~10%, $Na_2O+K_2O$ less than 8%. The feldspar compositions of the spherulites lie essentially within the sanidine field. XRD examination show that spherulites are mainly composed of quartz, sanidine, albite with minor mica, kaolinite and chlorite. According to X-ray mapping, the spherulites are enriched in $SiO_2$ in the core and partly enriched $Na_2O$ or $K_2O$, $Al_2O_3$ in the shell that reflect in compositional zoning with increasing spherulitic devitrification. The feathery and non-equant crystal shapes of spherulites from rhyolite dyke of Jangsan cauldron suggest that they may have formed during the rapid cooling of dyke under the static state, or faster velocity of devitrification from glassy materials than movement velocity of the magma intrusion. The spherulitic rhyolite originated from high-silica(75.4~75.7 wt.%) rhyolite magma.

Agronomic and Genetic Evaluation on a Dull Mutant Line Derived from the Sodium Azide Treated 'Namil', a Non-Glutinous Japonica Rice (남일벼 돌연변이 유래 중간찰 계통의 작물학적 특성 및 배유특성 지배유전자위 표지)

  • Chun, Jae-Buhm;Jeung, Ji-Ung;Cho, Seong-Woo;Kim, Woo-Jae;Ha, Ki-Young;Kang, Kyung-Ho;Ko, Jae-Kwon;Kim, Hyun-Soon;Kim, Bo-Kyeong
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.60 no.4
    • /
    • pp.448-457
    • /
    • 2015
  • Developing rice lines with various amylose contents is necessary to diverse usages of rice in terms of raw materials for processed food production, and thereby to promote rice consumption in Korea. A rice mutant line, 'Namil(SA)-dull1' was established through sodium azide mutagenesis on 'Namil', a non-glutinous Korean Japonica rice cultivar. Namil(SA)-dull1' had dull endosperm characteristics and the evaluated amylose content was 12.2%. A total of 94 F2 progenies from a cross between 'Namil(SA)-dull1' and 'Milyang23', a non-glutinous Tongil-type rice cultivar, was used for genetic studies on the endosperm amylose content. Association analyses, between marker genotypes of 53 SSR anchor markers and evaluated amylose contents of each 94 F2:3 seeds, initially localized rice chromosome 6 as the harboring place for the modified allele(s) directing low amylose content of 'Namil(SA)-dull1'. By increasing SSR marker density on the putative chromosomal region followed by association analyses, the target region was narrowed down 0.94 Mbp segment, expanding from 28.95 Mbp to 29.89 Mbp, on rice chromosome 6 pseudomolecule. Among the SSR loci, RM7555 explained 84.2% of total variation of amylose contents in the $F_2$ population. Further physical mapping on the target region directing low amylose content of 'Namil(SA)-dull1' would increase the breeding efficiency in developing promising rice cultivars with various endosperm characteristics.

Development of Topic Trend Analysis Model for Industrial Intelligence using Public Data (텍스트마이닝을 활용한 공개데이터 기반 기업 및 산업 토픽추이분석 모델 제안)

  • Park, Sunyoung;Lee, Gene Moo;Kim, You-Eil;Seo, Jinny
    • Journal of Technology Innovation
    • /
    • v.26 no.4
    • /
    • pp.199-232
    • /
    • 2018
  • There are increasing needs for understanding and fathoming of business management environment through big data analysis at industrial and corporative level. The research using the company disclosure information, which is comprehensively covering the business performance and the future plan of the company, is getting attention. However, there is limited research on developing applicable analytical models leveraging such corporate disclosure data due to its unstructured nature. This study proposes a text-mining-based analytical model for industrial and firm level analyses using publicly available company disclousre data. Specifically, we apply LDA topic model and word2vec word embedding model on the U.S. SEC data from the publicly listed firms and analyze the trends of business topics at the industrial and corporate levels. Using LDA topic modeling based on SEC EDGAR 10-K document, whole industrial management topics are figured out. For comparison of different pattern of industries' topic trend, software and hardware industries are compared in recent 20 years. Also, the changes of management subject at firm level are observed with comparison of two companies in software industry. The changes of topic trends provides lens for identifying decreasing and growing management subjects at industrial and firm level. Mapping companies and products(or services) based on dimension reduction after using word2vec word embedding model and principal component analysis of 10-K document at firm level in software industry, companies and products(services) that have similar management subjects are identified and also their changes in decades. For suggesting methodology to develop analysis model based on public management data at industrial and corporate level, there may be contributions in terms of making ground of practical methodology to identifying changes of managements subjects. However, there are required further researches to provide microscopic analytical model with regard to relation of technology management strategy between management performance in case of related to various pattern of management topics as of frequent changes of management subject or their momentum. Also more studies are needed for developing competitive context analysis model with product(service)-portfolios between firms.

A Study on the Efficient Utilization of Spatial Data for Heat Mapping with Remote Sensing and Simulation (원격탐사 및 시뮬레이션의 열지도 구축을 위한 공간정보 활용 효율화 연구)

  • Cho, Young-Il;Yoon, Donghyeon;Lim, Youngshin;Lee, Moung-Jin
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.6_1
    • /
    • pp.1421-1434
    • /
    • 2020
  • The frequency and intensity of heatwaves have been increasing due to climate change. Since urban areas are more severely damaged by heatwaves as they act in combination with the urban heat island phenomenon, every possible preparation for such heat threats is required. Many overseas local governments build heat maps using a variety of spatial information to prepare for and counteract heatwaves, and prepare heatwave measures suitable for each region with different spatial characteristics within a relevant city. Building a heat map is a first and important step to prepare for heatwaves. The cases of heat map construction and thermal environment analysis involve various area distributions from urban units with a large area to local units with a small area. The method of constructing a heat map varies from a method utilizing remote sensing to a method using simulation, but there is no standard for using differentiated spatial information according to spatial scale, so each researcher constructs a heat map and analyzes the thermal environment based on different methods. For the above reason, spatial information standards required for building a heat map according to the analysis scale should be established. To this end, this study examined spatial information, analysis methodology, and final findings related to Korean and oversea analysis studies of heatwaves and urban thermal environments to suggest ways to improve the utilization efficiency of spatial information used to build urban heat maps. As a result of the analysis, it was found that spatial, temporal, and spectral resolutions, as basic resolutions, are necessary to construct a heat map using remote sensing in the use of spatial information. In the use of simulations, it was found that the type of weather data and spatial resolution, which are input condition information for simulation implementation, differ according to the size of analysis target areas. Therefore, when constructing a heat map using remote sensing, spatial, spectral, and temporal resolution should be considered; and in the case of using simulations, the spatial resolution, which is an input condition for simulation implementation, and the conditions of weather information to be inputted, should be considered in advance. As a result of understanding the types of monitoring elements for heatwave analysis, 19 types of elements were identified such as land cover, urban spatial characteristics, buildings, topography, vegetation, and shadows, and it was found that there are differences in the types of the elements by spatial scale. This study is expected to help give direction to relevant studies in terms of the use of spatial information suitable for the size of target areas, and setting monitoring elements, when analyzing heatwaves.

A stratified random sampling design for paddy fields: Optimized stratification and sample allocation for effective spatial modeling and mapping of the impact of climate changes on agricultural system in Korea (농지 공간격자 자료의 층화랜덤샘플링: 농업시스템 기후변화 영향 공간모델링을 위한 국내 농지 최적 층화 및 샘플 수 최적화 연구)

  • Minyoung Lee;Yongeun Kim;Jinsol Hong;Kijong Cho
    • Korean Journal of Environmental Biology
    • /
    • v.39 no.4
    • /
    • pp.526-535
    • /
    • 2021
  • Spatial sampling design plays an important role in GIS-based modeling studies because it increases modeling efficiency while reducing the cost of sampling. In the field of agricultural systems, research demand for high-resolution spatial databased modeling to predict and evaluate climate change impacts is growing rapidly. Accordingly, the need and importance of spatial sampling design are increasing. The purpose of this study was to design spatial sampling of paddy fields (11,386 grids with 1 km spatial resolution) in Korea for use in agricultural spatial modeling. A stratified random sampling design was developed and applied in 2030s, 2050s, and 2080s under two RCP scenarios of 4.5 and 8.5. Twenty-five weather and four soil characteristics were used as stratification variables. Stratification and sample allocation were optimized to ensure minimum sample size under given precision constraints for 16 target variables such as crop yield, greenhouse gas emission, and pest distribution. Precision and accuracy of the sampling were evaluated through sampling simulations based on coefficient of variation (CV) and relative bias, respectively. As a result, the paddy field could be optimized in the range of 5 to 21 strata and 46 to 69 samples. Evaluation results showed that target variables were within precision constraints (CV<0.05 except for crop yield) with low bias values (below 3%). These results can contribute to reducing sampling cost and computation time while having high predictive power. It is expected to be widely used as a representative sample grid in various agriculture spatial modeling studies.