• Title/Summary/Keyword: index clustering


An Efficient Video Clip Matching Algorithm Using the Cauchy Function (커쉬함수를 이용한 효율적인 비디오 클립 정합 알고리즘)

  • Kim Sang-Hyul
    • Journal of the Institute of Convergence Signal Processing / v.5 no.4 / pp.294-300 / 2004
  • With the development of digital media technologies, various algorithms have been proposed to match video sequences efficiently. Many video search methods have focused on frame-wise queries, whereas relatively few algorithms have been presented for video clip matching or video shot matching. In this paper, we propose an efficient algorithm to index video sequences and to retrieve them for a video clip query. To improve the accuracy and performance of video sequence matching, we employ the Cauchy function as a similarity measure between histograms of consecutive frames, which yields higher performance than conventional measures. The key frames extracted from segmented video shots can be used not only for video shot clustering but also for video sequence matching or browsing, where a key frame is defined as a frame that is significantly different from the previous frames. Experimental results with color video sequences show that the proposed method yields high matching performance and accuracy with a low computational load compared with conventional algorithms.

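
The abstract above does not give the exact form of the Cauchy-based measure, so the following is only a minimal sketch under stated assumptions: histograms are normalized lists of equal length, the per-bin score is the Cauchy function 1 / (1 + (d / scale)^2) of the bin difference d, and a frame becomes a key frame when its score against the latest key frame falls below a threshold. The names `cauchy_similarity`, `select_key_frames`, `scale`, and `threshold` are hypothetical and not taken from the paper.

```python
def cauchy_similarity(hist_a, hist_b, scale=0.1):
    """Average Cauchy-function score over histogram bins (1.0 means identical)."""
    if len(hist_a) != len(hist_b):
        raise ValueError("histograms must have the same number of bins")
    # Cauchy function 1 / (1 + (d / scale)^2) applied to each bin difference d.
    scores = [1.0 / (1.0 + ((a - b) / scale) ** 2) for a, b in zip(hist_a, hist_b)]
    return sum(scores) / len(scores)


def select_key_frames(histograms, threshold=0.8, scale=0.1):
    """Return indices of frames that differ markedly from the latest key frame."""
    if not histograms:
        return []
    key_frames = [0]  # assumption: the first frame always starts a key frame
    for index in range(1, len(histograms)):
        reference = histograms[key_frames[-1]]
        if cauchy_similarity(reference, histograms[index], scale) < threshold:
            key_frames.append(index)
    return key_frames


frames = [[0.5, 0.5], [0.5, 0.5], [0.9, 0.1]]
print(select_key_frames(frames))  # prints [0, 2]
```

Because the Cauchy function decays smoothly with the bin difference, small fluctuations between frames barely lower the score, while a large change drives it toward zero; the `scale` value controls where that transition happens.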

Classification and Retrieval of Object-Oriented Reuse Components with HACM (HACM을 사용한 객체지향 재사용 부품의 분류와 검색)

  • Bae, Je-Min;Kim, Sang-Geun;Lee, Kyung-Whan
    • The Transactions of the Korea Information Processing Society / v.4 no.7 / pp.1733-1748 / 1997
  • In this paper, we propose a classification scheme and retrieval mechanism that can be applied to many application domains in order to construct a software reuse library. The classification scheme, which is the core of accessibility in reuse, is defined as a hierarchical structure built from agglomerative clusters. An agglomerative cluster is a group of reuse components related by function. Functional relationships are measured by HACM, a representation method for software components that is used to calculate the similarities among the classes in a particular domain. Clustering information is added to the library structure, which determines the functionality and accuracy of the retrieval system. The system stores the classification results, such as the weighted index information, the similarity matrix, and the hierarchical structure. Users can therefore retrieve software components using natural-language queries. This study focuses on the findability of software components in the reuse library. As a result, part of the construction process of the reuse library was automated, and an object-oriented reuse library can be constructed with extensibility and with relationships among the reuse components. In addition, our process is visualized through the browse hierarchy of the retrieval environment, and the retrieval system is integrated into the reuse system CARS 2.1.


Changes in the Ammonia-Oxidizing Bacteria Community in Response to Operational Parameters During the Treatment of Anaerobic Sludge Digester Supernatant

  • Cydzik-Kwiatkowska, Agnieszka;Zielinska, Magdalena;Bernat, Katarzyna;Kulikowska, Dorota;Wojnowska-Baryla, Irena
    • Journal of Microbiology and Biotechnology / v.22 no.7 / pp.1005-1014 / 2012
  • The understanding of the relationship between ammonia-oxidizing bacteria (AOB) communities in activated sludge and the operational treatment parameters supports the control of the treatment of ammonia-rich wastewater. The modifications of treatment parameters by alteration of the number and length of aerobic and anaerobic stages in the sequencing batch reactor (SBR) working cycle may influence the efficiency of ammonium oxidation and induce changes in the AOB community. Therefore, in the research, the impact of an SBR cycle mode with alternating aeration/mixing conditions (7 h/1 h vs. 4 h/5.5 h) and volumetric exchange rate (n) on AOB abundance and diversity in activated sludge during the treatment of anaerobic sludge digester supernatant at limited oxygen concentration in the aeration stage (0.7 mg $O_2/l$) was assessed. AOB diversity expressed by the Shannon-Wiener index (H') was determined by the cycle mode. At aeration/mixing stage lengths of 7 h/1 h, H' averaged $2.48{\pm}0.17$, while at 4 h/5.5 h it was $2.35{\pm}0.16$. At the given mode, AOB diversity decreased with increasing n. The cycle mode did not affect AOB abundance; however, a higher AOB abundance in activated sludge was promoted by decreasing the volumetric exchange rate. The sequences clustering with Nitrosospira sp. NpAV revealed the uniqueness of the AOB community and the simultaneously lower ability of adaptation of Nitrosospira sp. to the operational parameters applied in comparison with Nitrosomonas sp.
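
The Shannon-Wiener index used above is conventionally defined as $H' = -\sum_i p_i \ln p_i$, where $p_i$ is the relative abundance of the $i$-th taxon. The sketch below shows that calculation. The natural logarithm and the form of the input (a list of non-negative abundances) are assumptions, since the abstract does not state them, and the function name is hypothetical.

```python
import math


def shannon_wiener(abundances):
    """Shannon-Wiener diversity H' = -sum(p_i * ln(p_i)) over non-zero abundances."""
    total = sum(abundances)
    if total <= 0:
        raise ValueError("at least one abundance must be positive")
    # Zero abundances contribute nothing, so they are skipped to avoid log(0).
    proportions = [a / total for a in abundances if a > 0]
    return -sum(p * math.log(p) for p in proportions)


# Two equally abundant taxa give H' = ln(2); a single taxon gives H' = 0.
print(shannon_wiener([5, 5]))
print(shannon_wiener([10]))
```

Higher values indicate a community that is both richer and more evenly distributed, which is how differences such as those reported between the two cycle modes would be read.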

Development of Drought Map Based on Three-dimensional Spatio-temporal Analysis of Drought (가뭄사상에 대한 3차원적 시공간 분석을 통한 가뭄지도 개발)

  • Yoo, Jiyoung;So, Byung-Jin;Kwon, Hyun-Han;Kim, Tae-Woong
    • KSCE Journal of Civil and Environmental Engineering Research / v.40 no.1 / pp.25-33 / 2020
  • A drought event is characterized by its duration, severity, and affected area. In general, after a drought index is calculated from the hydro-meteorological time series at a station, a drought event is defined using run theory to identify its beginning and end times. However, this one-dimensional analysis is limited in its ability to capture the spatio-temporal occurrence characteristics and movement paths of drought. Therefore, this study defines a three-dimensional drought event using a simple clustering algorithm and develops a drought map that can be used to understand drought severity as the drought expands in space and time. Compared with two-dimensional monitoring information that shows only the spatial distribution of a drought index, the proposed drought map shows three-dimensional drought characteristics, including drought duration, spatial cumulative severity, and the centroid of the drought. The analysis of the drought map indicated that on occasion there was a drought event with an affected area of less than 10 %, while 11 drought events (44 %) had an affected area of more than 90 % of the total area. This means that it is important to understand the relationship between the spatial variation of the drought-affected area and the severity corresponding to various drought durations. A drought map based on three-dimensional drought analysis is useful for analyzing the spatio-temporal occurrence characteristics and propagation patterns of regional drought, and it can be utilized in developing mitigation measures for future extreme droughts.
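
The one-dimensional, run-theory step mentioned above can be sketched as follows. This is only an illustration: the threshold of -1.0, the definition of severity as the cumulative deficit below the threshold, and the name `drought_events` are assumptions, not values taken from the study.

```python
def drought_events(index_series, threshold=-1.0):
    """Split a drought-index series into runs below the threshold (run theory)."""
    events = []
    start = None
    severity = 0.0
    for t, value in enumerate(index_series):
        if value < threshold:
            if start is None:  # a new run begins at time step t
                start = t
                severity = 0.0
            severity += threshold - value  # assumed severity: deficit below threshold
        elif start is not None:  # the run ended at the previous time step
            events.append({"start": start, "end": t - 1,
                           "duration": t - start, "severity": severity})
            start = None
    if start is not None:  # the series ends while still in drought
        events.append({"start": start, "end": len(index_series) - 1,
                       "duration": len(index_series) - start, "severity": severity})
    return events


for event in drought_events([0.5, -1.2, -1.5, 0.1, -1.1]):
    print(event)
```

Each event here has only a start, an end, a duration, and a severity at one station; the three-dimensional approach described in the abstract additionally groups such conditions across neighboring locations.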

Alleviating Semantic Term Mismatches in Korean Information Retrieval (한국어 정보 검색에서 의미적 용어 불일치 완화 방안)

  • Yun, Bo-Hyun;Park, Sung-Jin;Kang, Hyun-Kyu
    • The Transactions of the Korea Information Processing Society / v.7 no.12 / pp.3874-3884 / 2000
  • An information retrieval system has to retrieve all and only the documents that are relevant to a user query, even if index terms and query terms do not match exactly. However, term mismatches between index terms and query terms have been a serious obstacle to the enhancement of retrieval performance. In this paper, we discuss automatic term normalization between words in text corpora and its application to a Korean information retrieval system. We perform two types of term normalization to alleviate semantic term mismatches: equivalence classes and co-occurrence clusters. First, transliterations, spelling errors, and synonyms are normalized into equivalence classes by using contextual similarity. Second, context-based terms are normalized by using a combination of mutual information and word context to establish word similarities. Next, unsupervised clustering is performed using the K-means algorithm, and co-occurrence clusters are identified. These normalized term products are used in query expansion to alleviate semantic term mismatches. In other words, we utilize two kinds of term normalization, equivalence classes and co-occurrence clusters, to expand users' queries with new terms, in an attempt to make the queries more comprehensive (adding transliterations) or more specific (adding specializations). For query expansion, we employ two complementary methods: term suggestion and term relevance feedback. The experimental results show that our proposed system can alleviate semantic term mismatches and can also provide appropriate similarity measurements. As a result, our system can improve the retrieval efficiency of the information retrieval system.

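
Query expansion with equivalence classes, as described in the abstract above, might look roughly like the sketch below. The function name, the data layout (a list of sets), and the sample terms are all hypothetical; how the paper actually builds and weights its classes is not stated here.

```python
def expand_query(query_terms, equivalence_classes):
    """Add every member of a term's equivalence class to the query, without duplicates."""
    expanded = []
    seen = set()
    for term in query_terms:
        variants = [term]
        for members in equivalence_classes:
            if term in members:
                # Sorted so that the expansion order is deterministic.
                variants.extend(sorted(members))
        for variant in variants:
            if variant not in seen:
                seen.add(variant)
                expanded.append(variant)
    return expanded


# Hypothetical classes grouping a transliteration, an abbreviation, and a synonym pair.
classes = [{"database", "데이터베이스", "DB"}, {"search", "retrieval"}]
print(expand_query(["database", "index"], classes))
# prints ['database', 'DB', '데이터베이스', 'index']
```

Terms with no equivalence class, such as `index` in this example, pass through unchanged, so the expansion only broadens the query where a normalization is known.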

General and abdominal obesity and risk of cardiometabolic factors in the community dwelling women (순환대사위험요인의 관련성에서 비만지표인자인 허리둘레와 체질량지수의 비교)

  • Shin, Sohee;So, Wi-Young;Kim, Hyun Soo
    • Journal of the Korea Convergence Society / v.9 no.1 / pp.233-240 / 2018
  • The aim of this study was to investigate the cardiometabolic risk factors (CRF) of community-dwelling women based on a combination of body mass index (BMI) and waist circumference (WC). This cross-sectional study was based on 1,447 subjects between 30 and 60 years of age. Subjects were categorized into 4 groups by BMI and WC (group 1, BMI<$25kg/m^2$ and WC<85 cm; group 2, BMI<$25kg/m^2$ and WC>85 cm; group 3, BMI>$25kg/m^2$ and WC<85 cm; and group 4, BMI>$25kg/m^2$ and WC>85 cm). Logistic regression analyses showed that subjects in group 2 had a 1.75 times higher risk of clustering of 2 or more CRFs compared with subjects in group 1 (p<0.001). In conclusion, early detection of people with normal weight but high waist circumference, followed by a lifestyle intervention consisting of regular exercise and healthy eating, may prevent their condition from worsening. In addition, further studies on appropriate exercise programs for this group are needed.
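
The four-group categorization above can be expressed as a small function. This is a sketch only: the abstract uses strict inequalities on both sides, so the handling of values exactly at the cutoffs (here assigned to the higher group) is an assumption, and the function names are hypothetical.

```python
def body_mass_index(weight_kg, height_m):
    """BMI in kg/m^2."""
    return weight_kg / (height_m ** 2)


def obesity_group(bmi, waist_cm, bmi_cutoff=25.0, waist_cutoff=85.0):
    """Return group 1-4 from the BMI and waist-circumference cutoffs."""
    # Assumption: values exactly at a cutoff count as "high".
    high_bmi = bmi >= bmi_cutoff
    high_wc = waist_cm >= waist_cutoff
    if not high_bmi and not high_wc:
        return 1  # normal BMI, normal WC
    if not high_bmi and high_wc:
        return 2  # normal BMI, high WC
    if high_bmi and not high_wc:
        return 3  # high BMI, normal WC
    return 4      # high BMI, high WC


print(obesity_group(body_mass_index(58.0, 1.62), 88.0))  # prints 2
```

Group 2 is the case the abstract highlights: a BMI in the normal range combined with a high waist circumference.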

A study on image segmentation for depth map generation (깊이정보 생성을 위한 영상 분할에 관한 연구)

  • Lim, Jae Sung
    • Journal of the Korea Academia-Industrial cooperation Society / v.18 no.10 / pp.707-716 / 2017
  • Advances in image display devices necessitate display images suited to the user's purpose. Display devices should be able to provide object-based image information when a depth map is required. In this paper, we present an algorithm that uses a histogram-based image segmentation method for depth map generation. In the conventional K-means clustering algorithm, the number of centroids is a fixed parameter, so existing K-means algorithms cannot adaptively determine the number of clusters. Furthermore, the K-means algorithm tends to fall into local minima, which causes over-segmentation. In contrast, the proposed algorithm selects centroids adaptively and builds on a histogram-based approach to limit computational complexity. It is designed to produce object-based results by avoiding the local minima into which the existing algorithm falls. Finally, we remove over-segmented components using a connected-component labeling algorithm. The proposed algorithm produces object-based results and improves on the benchmark method by 0.017 and 0.051 in terms of the Probabilistic Rand Index (PRI) and Segmentation Covering (SC), respectively.
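
The final clean-up step relies on connected-component labeling. The sketch below shows a generic 4-connected labeling of a segment map using breadth-first search. It is not the paper's implementation: the function name and the 4-connectivity choice are assumptions, and the abstract does not say how small components are then merged or removed.

```python
from collections import deque


def label_components(segments):
    """Label 4-connected regions of equal segment id; return (labels, sizes)."""
    rows, cols = len(segments), len(segments[0])
    labels = [[0] * cols for _ in range(rows)]  # 0 means "not yet labeled"
    sizes = {}
    next_label = 0
    for r in range(rows):
        for c in range(cols):
            if labels[r][c]:
                continue
            next_label += 1
            labels[r][c] = next_label
            queue = deque([(r, c)])
            size = 0
            while queue:
                y, x = queue.popleft()
                size += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not labels[ny][nx]
                            and segments[ny][nx] == segments[y][x]):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
            sizes[next_label] = size
    return labels, sizes


labels, sizes = label_components([[0, 0, 1], [0, 1, 1], [2, 0, 0]])
print(labels)  # prints [[1, 1, 2], [1, 2, 2], [3, 4, 4]]
print(sizes)   # prints {1: 3, 2: 3, 3: 1, 4: 2}
```

Components whose size falls below some chosen limit, such as the single-pixel component 3 here, are the natural candidates for treatment as over-segmentation.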

Improvement of MODIS land cover classification over the Asia-Oceania region (아시아-오세아니아 지역의 MODIS 지면피복분류 개선)

  • Park, Ji-Yeol;Suh, Myoung-Seok
    • Korean Journal of Remote Sensing / v.31 no.2 / pp.51-64 / 2015
  • We improved the MODerate resolution Imaging Spectroradiometer (MODIS) land cover map over the Asia-Oceania region by reclassifying the misclassified pixels. A pixel is defined as misclassified when the number of land cover types assigned to it over the 12 years of the MODIS land cover map is greater than 3. The ratio of misclassified pixels in this region amounts to 17.53%. The MODIS Normalized Difference Vegetation Index (NDVI) time series over the correctly classified pixels showed continuous variation with time without noise. However, the NDVI time series for the misclassified pixels contained many unreasonable fluctuations. To improve the quality of the input data for the reclassification, we corrected the MODIS NDVI using Correction based on Spatial and Temporal Continuity (CSaTC), developed by Cho and Suh (2013). Iterative Self-Organizing Data Analysis (ISODATA) was used to cluster the NDVI data over the misclassified pixels, and land cover types were determined based on the seasonal variation pattern of NDVI. The final land cover map was generated by merging the correctly classified MODIS land cover map and the reclassified land cover map. Validation using 138 ground truth data points showed that the overall classification accuracy improved from 68% for the original MODIS land cover map to 74% for the reclassified land cover map.
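
The misclassification rule stated above (more than 3 distinct land cover types over the 12-year record) is simple to express in code. The sketch below assumes each pixel is represented as a list of yearly land cover type codes; the function names and the sample codes are hypothetical.

```python
def is_misclassified(yearly_types, max_types=3):
    """True when a pixel was assigned more than max_types distinct land cover types."""
    return len(set(yearly_types)) > max_types


def misclassified_ratio(pixels, max_types=3):
    """Fraction of pixels flagged as misclassified (0.0 for an empty input)."""
    if not pixels:
        return 0.0
    flagged = sum(1 for series in pixels if is_misclassified(series, max_types))
    return flagged / len(pixels)


stable = [1] * 12                             # same type every year
noisy = [1, 2, 3, 4, 1, 2, 3, 4, 1, 2, 3, 4]  # four different types in 12 years
print(misclassified_ratio([stable, noisy]))   # prints 0.5
```

Only the flagged pixels would then go through the NDVI correction and clustering steps, while the remaining pixels keep their original MODIS class.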

Delineation of Provenance Regions of Forests Based on Climate Factors in Korea (기상인자(氣象因子)에 의한 우리 나라 산림(山林)의 산지구분(産地區分))

  • Choi, Wan Yong;Tak, Woo Sik;Yim, Kyong Bin;Jang, Suk Seong
    • Journal of Korean Society of Forest Science / v.88 no.3 / pp.379-388 / 1999
  • As a first step toward delineating the provenance regions of forest trees in Korea, horizontal zones were deduced primarily from various climatic factors: annual mean temperature, extremely low temperature, relative humidity, annual sum of possible growing days, duration of sunshine, and dry index. The delineation of the provenance regions was based on ecological regions, which was likely to be more practical than basing it on typical provenance regions at the species level. The primary classification of the regions was based on the forest zones (sub-tropical, warm-temperate, mid-temperate, and cool-temperate) as broad geographic regions. Further classification was carried out using cluster analyses among the basic regions within each forest zone. On the basis of the clustering, a total of 19 regions were horizontally delineated: 3 from the sub-tropical zone, 6 from the warm-temperate zone, 8 from the mid-temperate zone, and 2 from the cool-temperate zone. Of the mean values of the 6 climate factors at the broad geographic region level, three factors (annual mean temperature, extremely low temperature, and annual growing days) showed directional tendencies from sub-tropical to cool-temperate, while the others did not. The values of relative humidity, duration of sunshine, and dry index varied among the provenance regions within each forest zone. These three factors might be more sensitive to micro-environmental conditions than to macro-environmental conditions. The present study aimed to delineate the primary provenance regions for tentative application to forest practices. These regions will be revised stepwise as accumulated genecological data become available.


Assessing Water Quality of Siheung Stream in Shihwa Industrial Complex Using Both Principal Component Analysis and Multi-Dimensional Scaling Analysis of Korean Water Quality Index and Microbial Community Data (Principal Component Analysis와 Multi-Dimensional Scaling 분석을 이용한 시화공단 시흥천의 수질지표 및 미생물 군집 분포 연구)

  • Seo, Kyeong-Jin;Kim, Ju-Mi;Kim, Min-Jung;Kim, Seong-Keun;Lee, Ji-Eun;Kim, In-Young;Zoh, Kyung-Duk;Ko, Gwang-Pyo
    • Journal of Environmental Health Sciences / v.35 no.6 / pp.517-525 / 2009
  • The water quality of Lake Shihwa had been rapidly deteriorating since 1994 due to wastewater input from the watersheds, limited water circulation, and the lack of a wastewater treatment policy. In 2000, the government decided to open the tidal embankment and draw up a comprehensive management plan to improve the water quality, especially that of the inflowing stream water around the Shihwa and Banwol industrial complexes. However, the water quality and microbial community have not yet been fully evaluated. The purpose of this study is to investigate the influent water quality around the industrial area based on chemical and biological analysis. Surface water samples were collected from the Siheung Stream, from upstream to downstream through the industrial complex, in July 2009. The results show that the downstream site near the industrial complex had higher concentrations of heavy metals (Cu, Mn, Fe, Mg, and Zn) and organic matter than the upstream sites. A combination of DGGE (Denaturing Gradient Gel Electrophoresis) gels, K-WQI (Korean Water Quality Index) lists, cluster analysis, MDS (Multi-Dimensional Scaling), and PCA (Principal Component Analysis) demonstrated clear clustering of Siheung Stream 3 and 4 with high similarity, and detected metal-reducing bacteria (Shewanella spp.) and biodegrading bacteria (Acinetobacter spp.). These results suggest that using both chemical and microbiological markers would be useful for fully evaluating water quality.