• 제목/요약/키워드: Large Tree

검색결과 929건 처리시간 0.027초

확률적 프로세스 트리 생성을 위한 타부 검색 -유전자 프로세스 마이닝 알고리즘 (Tabu Search-Genetic Process Mining Algorithm for Discovering Stochastic Process Tree)

  • 주우민;최진영
    • 산업경영시스템학회지
    • /
    • 제42권4호
    • /
    • pp.183-193
    • /
    • 2019
  • Process mining is an analytical technique aimed at obtaining useful information about a process by extracting a process model from events log. However, most existing process models are deterministic because they do not include stochastic elements such as the occurrence probabilities or execution times of activities. Therefore, available information is limited, resulting in the limitations on analyzing and understanding the process. Furthermore, it is also important to develop an efficient methodology to discover the process model. Although genetic process mining algorithm is one of the methods that can handle data with noises, it has a limitation of large computation time when it is applied to data with large capacity. To resolve these issues, in this paper, we define a stochastic process tree and propose a tabu search-genetic process mining (TS-GPM) algorithm for a stochastic process tree. Specifically, we define a two-dimensional array as a chromosome to represent a stochastic process tree, fitness function, a procedure for generating stochastic process tree and a model trace as a string of activities generated from the process tree. Furthermore, by storing and comparing model traces with low fitness values in the tabu list, we can prevent duplicated searches for process trees with low fitness value being performed. In order to verify the performance of the proposed algorithm, we performed a numerical experiment by using two kinds of event log data used in the previous research. The results showed that the suggested TS-GPM algorithm outperformed the GPM algorithm in terms of fitness and computation time.

LiDAR 데이터의 Quad Tree 구조 표현과 압축에 관한 연구 (Quad Tree Representation and Compression for LiDAR Data)

  • 이효종;우승용;조기성
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2008년도 하계종합학술대회
    • /
    • pp.753-754
    • /
    • 2008
  • LiDAR data are acknowledged as very useful method to represent 3-D geographical information. In this paper aquad tree has been utilized to represent the 3-D spatial information. Compression algorithm is implemented based on a given threshold. The efficiency of compress is very high with large threshold values.

  • PDF

A Framework for Wide-area Monitoring of Tree-related High Impedance Faults in Medium-voltage Networks

  • Bahador, Nooshin;Matinfar, Hamid Reza;Namdari, Farhad
    • Journal of Electrical Engineering and Technology
    • /
    • 제13권1호
    • /
    • pp.1-10
    • /
    • 2018
  • Wide-area monitoring of tree-related high impedance fault (THIF) efficiently contributes to increase reliability of large-scaled network, since the failure to early location of them may results in critical lines tripping and consequently large blackouts. In the first place, this wide-area monitoring of THIF requires managing the placement of sensors across large power grid network according to THIF detection objective. For this purpose, current paper presents a framework in which sensors are distributed according to a predetermined risk map. The proposed risk map determines the possibility of THIF occurrence on every branch in a power network, based on electrical conductivity of trees and their positions to power lines which extracted from spectral data. The obtained possibility value can be considered as a weight coefficient assigned to each branch in sensor placement problem. The next step after sensors deployment is to on-line monitor based on moving data window. In this on-line process, the received data window is evaluated for obtaining a correlation between low frequency and high frequency components of signal. If obtained correlation follows a specified pattern, received signal is considered as a THIF. Thereafter, if several faulted section candidates are found by deployed sensors, the most likely location is chosen from the list of candidates based on predetermined THIF risk map.

도시 주차장내 수목그늘의 경제적 이익 연구 -미국 캘리포니아 데이비스 대학 주차장을 사례로- (A Study on the Economic Benefit of Urban Parking Lot Tree Shading -In the Case of University of California Davis Parking Lot-)

  • 장동수
    • 한국조경학회지
    • /
    • 제33권6호
    • /
    • pp.98-108
    • /
    • 2006
  • The climate of urban area is an unstable type with considerable seasonal variation in precipitation wind speed, and temperature and it grows worse. Besides, ozone is a serious air pollutant in most of large cities. So worldwide, some of large cities are investing in forestry options to offset their climate problems, but lack of information has hindered comparisons of urban un cost effectiveness to other options. This research intends to study the economic benefits of tree shading of 19 parking lots in UCD campus. The economic benefits of tree shading are air conditioning savings, air quality, stormwater run-off, and other benefits. Especially, this study focuses how much the economic benefit of parking lot shading has been increased from 1995 to 2003 year by aerophoto. Some data on dimensions of parking lots and the number, size, tree species, and location of trees around each parking lot was inventoried. Two aerophotos(1995,2003) were used in order to analyze the increasement of tree canopy in 19 parking lots for 8 years. However, increasing coverage of trees and managing them for healthy growth would not be sufficient for avoiding adverse impacts by future climate change. Additional measures should be followed such as an increase of energy use efficiency and development of substitute energy. For example, coverage of trees help to save cooling energy by blocking solar radiation reaching parking cars and building structures through shading, and creating cool micro-climates through evapotranspiration. They also reduce heating demand by decreasing air infiltration and heat conduction out of the interior of buildings. Proper arrangement of vegetation over the parking lots can reduce cooling and heating costs. So proper planting design around hard space paving including species selection and location can significantly save cooling and heating energy. And a reduction in car and building's heating and cooling costs results in the reduction in energy demand which causes to emissions of air pollutants. Total increased tree canopy from 1995 to 2003 is $8,470.45m^2$ and the economic benefits is US$ 5,282.10. The economic benefit of one tree has been US$ 7.21 for 8 years. And an annually increased benefit is US$ 0.9 per a tree. If this kind of study is applied to studying the economic benefits of tree canopy in parking lots of Korea, it could result in guidelines of tree planting of parking lots. Because the trees selected for planting in parking lots were not suitable for an environment, the guidelines should contain a recommended list of trees. The guidelines should propose the shading percentage of parking lot when we plan a parking lot and contain the maintenance of trees in order to maximize the economic benefits of tree canopy.

1H*-tree: 데이터 스트림의 다차원 분석을 위한 개선된 데이터 큐브 구조 (1H*-tree: An Improved Data Cube Structure for Multi-dimensional Analysis of Data Streams)

  • 심상예;정우상;이연;신승선;이동욱;배혜영
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2008년도 추계학술발표대회
    • /
    • pp.332-335
    • /
    • 2008
  • In this paper, based on H-tree, which is proposed as the basic data cube structure for multi-dimensional data stream analysis, we have done some analysis. We find there are a lot of redundant nodes in H-tree, and the tree-build method can be improved for saving not only memory, but also time used for inserting tuples. Also, to facilitate more fast and large amount of data stream analysis, which is very important for stream research, H*-tree is designed and developed. Our performance study compare the proposed H*-tree and H-tree, identify that H*-tree can save more memory and time during inserting data stream tuples.

A comparison of three design tree based search algorithms for the detection of engineering parts constructed with CATIA V5 in large databases

  • Roj, Robin
    • Journal of Computational Design and Engineering
    • /
    • 제1권3호
    • /
    • pp.161-172
    • /
    • 2014
  • This paper presents three different search engines for the detection of CAD-parts in large databases. The analysis of the contained information is performed by the export of the data that is stored in the structure trees of the CAD-models. A preparation program generates one XML-file for every model, which in addition to including the data of the structure tree, also owns certain physical properties of each part. The first search engine is specializes in the discovery of standard parts, like screws or washers. The second program uses certain user input as search parameters, and therefore has the ability to perform personalized queries. The third one compares one given reference part with all parts in the database, and locates files that are identical, or similar to, the reference part. All approaches run automatically, and have the analysis of the structure tree in common. Files constructed with CATIA V5, and search engines written with Python have been used for the implementation. The paper also includes a short comparison of the advantages and disadvantages of each program, as well as a performance test.

다중 인공 신경망의 Federated Architecture와 그 응용-선박 중앙단면 형상 설계를 중심으로 (Federated Architecture of Multiple Neural Networks : A Case Study on the Configuration Design of Midship Structure)

  • 이경호;연윤석
    • 한국CDE학회논문집
    • /
    • 제2권2호
    • /
    • pp.77-84
    • /
    • 1997
  • This paper is concerning the development of multiple neural networks system of problem domains where the complete input space can be decomposed into several different regions, and these are known prior to training neural networks. We will adopt oblique decision tree to represent the divided input space and sel ect an appropriate subnetworks, each of which is trained over a different region of input space. The overall architecture of multiple neural networks system, called the federated architecture, consists of a facilitator, normal subnetworks, and tile networks. The role of a facilitator is to choose the subnetwork that is suitable for the given input data using information obtained from decision tree. However, if input data is close enough to the boundaries of regions, there is a large possibility of selecting the invalid subnetwork due to the incorrect prediction of decision tree. When such a situation is encountered, the facilitator selects a tile network that is trained closely to the boundaries of partitioned input space, instead of a normal subnetwork. In this way, it is possible to reduce the large error of neural networks at zones close to borders of regions. The validation of our approach is examined and verified by applying the federated neural networks system to the configuration design of a midship structure.

  • PDF

Content-Based Indexing and Retrieval in Large Image Databases

  • Cha, Guang-Ho;Chung, Chin-Wan
    • Journal of Electrical Engineering and information Science
    • /
    • 제1권2호
    • /
    • pp.134-144
    • /
    • 1996
  • In this paper, we propose a new access method, called the HG-tree, to support indexing and retrieval by image content in large image databases. Image content is represented by a point in a multidimensional feature space. The types of queries considered are the range query and the nearest-neighbor query, both in a multidimensional space. Our goals are twofold: increasing the storage utilization and decreasing the area covered by the directory regions of the index tree. The high storage utilization and the small directory area reduce the number of nodes that have to be touched during the query processing. The first goal is achieved by absorbing splitting if possible, and when splitting is necessary, converting two nodes to three. The second goal is achieved by maintaining the area occupied by the directory region minimally on the directory nodes. We note that there is a trade-off between the two design goals, but the HG-tree is so flexible that it can control the trade-off. We present the design of our access method and associated algorithms. In addition, we report the results of a series of tests, comparing the proposed access method with the buddy-tree, which is one of the most successful point access methods for a multidimensional space. The results show the superiority of our method.

  • PDF

FP-tree와 DHP 연관 규칙 탐사 알고리즘의 실험적 성능 비교 (Performance Evaluation of the FP-tree and the DHP Algorithms for Association Rule Mining)

  • 이형봉;김진호
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제35권3호
    • /
    • pp.199-207
    • /
    • 2008
  • FP-tree(Frequency Pattern Tree) 연관 규칙 탐사 알고리즘은 DB 스캔에 대한 부담을 획기적으로 절감시킴으로써 전체적인 성능을 향상시키고자 제안되었고, 따라서 다른 기법에 기반하는 알고리즘보다 성능이 매우 우수한 것으로 알려져 있다. 그러나, FP-tree 알고리즘은 기본적으로 DB에 저장된 거래 내용 중 빈발 항목을 포함하는 모든 거래를 트리에 저장해야 하기 때문에 그만큼 많은 메모리를 필요로 한다. 이 논문에서는 범용 운영체제인 유닉스 시스템 환경에서 FP-tree 알고리즘을 구현하여 소요 메모리와 실행시간 등 두 가지 성능 관점에서 해시 트리 및 직접 해시 테이블을 사용하는 DHP(Direct Hashing and Pruning) 알고리즘과 비교한다. 그 결과로서 알려진 바와는 크게 다르게 시스템 메모리가 충분한 상황에서도 대형 편의점 수준의 규모에 적용 가능한 거래 건수 100K, 전체 항목 개수 $1K{\sim}7K$, 평균 거래 길이 $5{\sim}10$, 평균 빈발 항목 집합 크기 $2{\sim}12$인 데이타에 대해서 FP-tree 알고리즘이 DHP 알고리즘보다 열등한 경우가 존재함을 보인다.

R-tree에서 Seeded 클러스터링을 이용한 다량 삽입 (Bulk Insertion Method for R-tree using Seeded Clustering)

  • 이태원;문봉기;이석호
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제31권1호
    • /
    • pp.30-38
    • /
    • 2004
  • 지구 관측 시스템(EOSDIS)나 많은 수의 클라이언트를 추적하는 이동전화 서비스 등 많은 응용에서는 지속적으로 생겨나는 대량의 복잡한 데이타들을 보관하고 인덱싱하는 것이 매우 어려운 일이다. 다차원 데이타를 효과적으로 관리하기 위해 R-tree에 기반 한 인덱스 구조가 널리 사용되어 왔다. 본 논문에서는 빠른 데이타 생성 속도를 따라잡으면서 대량 삽입을 통해 R-tree를 관리할 수 있는 seeded clustering이라는 확장성 있는 기법을 제안한다. 이 기법에서는 삽입할 대상 R-tree의 상위 k레벨의 구조를 활용하여 시드 트리를 만들어 삽입 데이타를 분류해 클러스터를 생성한다. 그리고 각 클러스터로부터 삽입 R-tree를 생성하고 이를 대상 R-tree에 한 번에 하나씩 삽입한다. 논문에서는 자세한 알고리즘과 함에 다양한 실험 결과를 보여준다. 실험 결과를 통해 seeded clustering을 이용한 대량 삽입이 기존의 대량 삽입 기법들과 비교해 삽입이나 질의 처리 모두에서 우수함을 알 수 있다.