• Title/Summary/Keyword: clustering method


Segmentation of Multispectral MRI Using Fuzzy Clustering (퍼지 클러스터링을 이용한 다중 스펙트럼 자기공명영상의 분할)

  • 윤옥경;김현순;곽동민;김범수;김동휘;변우목;박길흠
    • Journal of Biomedical Engineering Research / v.21 no.4 / pp.333-338 / 2000
  • In this paper, an automated segmentation algorithm is proposed for MR brain images that uses T1-weighted, T2-weighted, and PD images complementarily. The proposed algorithm consists of three steps. In the first step, cerebrum images are extracted by applying a cerebrum mask to the three input images. In the second step, outstanding clusters that represent the inner tissues of the cerebrum are chosen among 3-dimensional (3D) clusters. The 3D clusters are determined by intersecting the densely distributed parts of the 2D histograms in the 3D space formed with three optimal scale images. Each optimal scale image is obtained by applying scale-space filtering to a 2D histogram and searching its graph structure; it best describes the shape of the densely distributed parts of pixels in the 2D histogram. In the final step, cerebrum images are segmented using the FCM algorithm, with the centroids of the outstanding clusters as its initial centroid values. The proposed algorithm determines the cluster centroids accurately, and multispectral analysis yields better segmentation results than single-spectral analysis.
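The final step described above, fuzzy c-means (FCM) started from externally chosen centroids, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `fuzzy_c_means`, the fuzzifier `m = 2`, the stopping tolerance, and the toy pixel values are all hypothetical, and the histogram-based selection of the initial centroids is not reproduced.

```python
import math

def fuzzy_c_means(points, centroids, m=2.0, max_iter=100, tol=1e-6):
    """Run fuzzy c-means (FCM) from caller-supplied initial centroids.

    points    : list of feature vectors, e.g. (T1, T2, PD) intensities per pixel
    centroids : list of initial cluster centres (same dimensionality)
    m         : fuzzifier (> 1); m = 2 is an assumed default
    Returns (centroids, memberships), with memberships[i][k] in [0, 1].
    """
    centroids = [list(c) for c in centroids]
    dim = len(centroids[0])
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(max_iter):
        # Membership update: u_ik = 1 / sum_j (d_ik / d_ij)^(2/(m-1)).
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centroids]
            if min(dists) == 0.0:
                # Point coincides with a centre: give it full membership there.
                hit = dists.index(0.0)
                row = [1.0 if k == hit else 0.0 for k in range(len(centroids))]
            else:
                row = [1.0 / sum((dk / dj) ** exponent for dj in dists)
                       for dk in dists]
            memberships.append(row)
        # Centroid update: mean of the points weighted by u_ik^m.
        new_centroids = []
        for k in range(len(centroids)):
            weights = [row[k] ** m for row in memberships]
            total = sum(weights)
            new_centroids.append([
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dim)
            ])
        shift = max(math.dist(a, b) for a, b in zip(centroids, new_centroids))
        centroids = new_centroids
        if shift < tol:
            break
    return centroids, memberships

# Hypothetical (T1, T2, PD) intensities for six pixels from two tissue types.
pixels = [(10, 12, 11), (11, 10, 12), (12, 11, 10),
          (50, 52, 51), (51, 50, 52), (52, 51, 50)]
# Stand-in for the centroids of the "outstanding clusters".
initial = [(15, 15, 15), (45, 45, 45)]

centres, u = fuzzy_c_means(pixels, initial)
labels = [row.index(max(row)) for row in u]  # hard label = highest membership
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Starting FCM from centroids that already sit near dense regions of the feature space mainly affects which local optimum is reached and how many iterations are needed; the update rules themselves are the standard ones.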


Classifying a Strength of Dependency between classes by using Software Metrics and Machine Learning in Object-Oriented System (기계학습과 품질 메트릭을 활용한 객체간 링크결합강도 분류에 관한 연구)

  • Jung, Sungkyun;Ahn, Jaegyoon;Yeu, Yunku;Park, Sanghyun
    • KIPS Transactions on Software and Data Engineering / v.2 no.10 / pp.651-660 / 2013
  • Object-oriented design has improved productivity and software quality by adopting concepts such as inheritance and encapsulation. However, both the number of classes and the number of object couplings increase as software grows larger. Object coupling between classes is closely related to software complexity, and high complexity lowers software quality. To address the object coupling issue, researchers have adopted component-based development and software quality metrics. Component-based development requires an explicit representation of the dependencies between classes, and software quality metrics evaluate the quality of software. As part of this research, we aim to obtain basic data that can be used for decomposing software. Whereas previous studies evaluated and accumulated the qualities of individual classes, we focus on the properties of the linkage between classes. Our method uses machine learning techniques to analyze the properties of the linkage and predict the strength of dependency between classes, offering a new perspective on analyzing software properties.

Use of the Quantitatively Transformed Field Soil Structure Description of the US National Pedon Characterization Database to Improve Soil Pedotransfer Function

  • Yoon, Sung-Won;Gimenez, Daniel;Nemes, Attila;Chun, Hyen-Chung;Zhang, Yong-Seon;Sonn, Yeon-Kyu;Kang, Seong-Soo;Kim, Myung-Sook;Kim, Yoo-Hak;Ha, Sang-Keun
    • Korean Journal of Soil Science and Fertilizer / v.44 no.5 / pp.944-958 / 2011
  • Soil hydraulic properties such as hydraulic conductivity or water retention, which are costly to measure, can be estimated indirectly by a soil pedotransfer function (PTF) using easily obtainable soil data. The field soil structure description, which is routinely recorded, could also be used as a PTF input to reduce uncertainty. The purposes of this study were to incorporate qualitative morphological soil structure descriptions and a soil structural index into PTFs and to evaluate their contribution to the prediction of soil hydraulic properties. We transformed categorical morphological descriptions of soil structure into quantitative values using categorical principal component analysis (CATPCA). This approach was tested with a large data set from the US National Pedon Characterization database with the aid of a categorical regression tree analysis. Six different PTFs were used to predict the saturated hydraulic conductivity, and those results were averaged to quantify the uncertainty. The quantified morphological description was subsequently used in a multiple linear regression approach to predict the averaged ensemble saturated conductivity. The selected stepwise regression model, with only the transformed morphological variables and structural index as predictors, predicted $K_{sat}$ with $r^2$ = 0.48 (p = 0.018), indicating the feasibility of the CATPCA approach. In the regression tree analysis, soil structure index and soil texture turned out to be important factors in the prediction of the hydraulic properties. Among the structural descriptions, size class turned out to be an important grouping parameter in the regression tree. Bulk density, clay content, W33, and structural index explained the clusters selected by a two-step clustering technique, implying that the morphologically described soil structural features are closely related to soil physical as well as hydraulic properties. Although this study provided a relatively new method that relates soil structure description to soil structure index, the same approach should be tested using a dataset containing actual measurements of hydraulic properties. More insight into the predictive power of the soil structure index for estimating hydraulic properties would be gained by considering the measured saturated hydraulic conductivity and soil water retention.

K-means clustering analysis and differential protection policy according to 3D NAND flash memory error rate to improve SSD reliability

  • Son, Seung-Woo;Kim, Jae-Ho
    • Journal of the Korea Society of Computer and Information / v.26 no.11 / pp.1-9 / 2021
  • 3D NAND flash memory provides high capacity per unit area by stacking 2D NAND cells that have a planar structure. However, due to the nature of the stacking process, the frequency of errors may vary with the layer or the physical cell location. This phenomenon becomes more pronounced as the number of program/erase (P/E) operations on the flash memory increases. Most flash-based storage devices such as SSDs use ECC for error correction. Because this method provides a fixed strength of data protection for all flash memory pages, it has limitations in 3D NAND flash memory, where the error rate varies with the physical location. Therefore, in this paper, pages and layers with different error rates are classified into clusters by the K-means machine learning algorithm, and a differentiated data protection strength is applied to each cluster. We classify pages and layers based on the number of errors measured after an endurance test, where the error rate varies significantly for each page and layer, and add parity data to stripes for areas vulnerable to errors to provide differentiated data protection strength. We show that this differentiated data protection policy can contribute to improving the reliability and lifespan of 3D NAND flash memory compared with protection techniques that use RAID-like schemes or ECC alone.
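The classification step described above can be sketched as one-dimensional K-means over per-page error counts, followed by a mapping from cluster rank to a protection level. This is an illustrative sketch only: the error counts, the choice of three clusters, the seeding rule, and the "extra parity chunks per stripe" mapping are assumptions, not values or policies from the paper.

```python
def kmeans_1d(values, k, max_iter=100):
    """Lloyd's k-means on scalar values with deterministic, evenly spread seeds."""
    ordered = sorted(values)
    # Seed centres at evenly spaced positions in the sorted data (an assumption;
    # the abstract does not say how centres were initialised).
    centres = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(max_iter):
        # Assignment step: each value goes to its nearest centre.
        labels = [min(range(k), key=lambda c: abs(v - centres[c])) for v in values]
        # Update step: each centre moves to the mean of its members.
        new_centres = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            # Keep the old centre if a cluster ends up empty.
            new_centres.append(sum(members) / len(members) if members else centres[c])
        if new_centres == centres:
            break
        centres = new_centres
    return centres, labels

# Hypothetical bit-error counts per page after an endurance test.
errors_per_page = [3, 4, 5, 4, 40, 42, 38, 120, 115, 125]
k = 3
centres, labels = kmeans_1d(errors_per_page, k)

# Rank clusters by mean error count; more error-prone clusters receive more
# parity (the 0/1/2 scale is a made-up protection policy for illustration).
order = sorted(range(k), key=lambda c: centres[c])
rank = {c: r for r, c in enumerate(order)}
extra_parity = [rank[lab] for lab in labels]
print(extra_parity)  # [0, 0, 0, 0, 1, 1, 1, 2, 2, 2]
```

In a real device the features would likely include layer and page position as well as error counts, which turns this into multi-dimensional K-means; the assignment and update steps stay the same.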

Genetic Diversity and Identification of Korean Grapevine Cultivars using SSR Markers (SSR마커를 이용한 국내육성 포도 품종의 다양성과 품종 판별)

  • Cho, Kang-Hee;Bae, Kyung-Mi;Noh, Jung Ho;Shin, Il Sheob;Kim, Se Hee;Kim, Jeong-Hee;Kim, Dae-Hyun;Hwang, Hae-Sung
    • Korean Journal of Breeding Science / v.43 no.5 / pp.422-429 / 2011
  • This study was conducted to investigate genetic diversity and to develop a technique for cultivar identification using SSR markers in grapevine. Thirty Korean-bred and introduced grapevine cultivars were evaluated with 28 SSR markers. A total of 143 alleles were detected, ranging from 2 to 8 per locus with an average of 5.1. Polymorphic information content (PIC) ranged from 0.666 (VVIp02) to 0.975 (VVIn33 and VVIn62) with an average of 0.882. UPGMA (unweighted pair-group method with arithmetic average) clustering analysis based on genetic distances using the 143 alleles classified the 30 grapevine cultivars into 7 clusters at a similarity index of 0.685. Similarity values among the tested grapevine cultivars ranged from 0.575 to 1.00, and the average similarity value was 0.661. The similarity index was highest (1.00) between 'Jinok' and 'Campbell Early' and lowest (0.575) between 'Alden' and 'Narsha'. The genetic relationships among the 30 studied grapevine cultivars were basically consistent with the known pedigree. A set of three SSR markers (VVIn61, VVIt60, and VVIu20) selected from the 28 primers differentiated all grapevine cultivars except 'Jinok' and 'Campbell Early'. Five cultivars ('Narsha', 'Alden', 'Dutchess', 'Pione', and 'Muscat Hamburg') were identified by VVIn61 at the first step. Then 21 cultivars including 'Hongsodam' were identified by VVIt60 at the second step, and 2 cultivars ('Heukbosuck' and 'Suok') by VVIu20 at the third step. These markers could be used as a reliable tool for the identification of Korean grapevine cultivars.
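The UPGMA step described above, with clusters read off at a similarity threshold, can be sketched in a few lines. The sketch assumes that distance is taken as 1 minus similarity and that clusters are formed by stopping the merging once the closest pair falls below the 0.685 similarity cut; the cultivar names A to D and their similarity values are invented for illustration and are not data from the study.

```python
def upgma_clusters(names, dist, cutoff):
    """Agglomerate with UPGMA (average linkage) until the closest pair of
    clusters is farther apart than `cutoff`; return the remaining clusters."""
    clusters = {i: [n] for i, n in enumerate(names)}
    # Pairwise distances keyed by (smaller id, larger id).
    d = {(i, j): dist[i][j] for i in clusters for j in clusters if i < j}
    next_id = len(names)
    while len(clusters) > 1:
        (a, b), best = min(d.items(), key=lambda kv: kv[1])
        if best > cutoff:
            break
        na, nb = len(clusters[a]), len(clusters[b])
        merged = clusters.pop(a) + clusters.pop(b)
        # UPGMA update: distance to the merged cluster is the size-weighted
        # mean of the distances to its two parts.
        new_d = {}
        for c in clusters:
            dac = d[(min(a, c), max(a, c))]
            dbc = d[(min(b, c), max(b, c))]
            new_d[(c, next_id)] = (na * dac + nb * dbc) / (na + nb)
        d = {key: v for key, v in d.items() if a not in key and b not in key}
        d.update(new_d)
        clusters[next_id] = merged
        next_id += 1
    return sorted(sorted(c) for c in clusters.values())

# Hypothetical pairwise similarity indices for four placeholder cultivars.
names = ["A", "B", "C", "D"]
similarity = [
    [1.00, 1.00, 0.70, 0.60],
    [1.00, 1.00, 0.70, 0.60],
    [0.70, 0.70, 1.00, 0.65],
    [0.60, 0.60, 0.65, 1.00],
]
distance = [[1.0 - s for s in row] for row in similarity]

groups = upgma_clusters(names, distance, cutoff=1.0 - 0.685)
print(groups)  # [['A', 'B', 'C'], ['D']]
```

Two samples with similarity 1.00, like A and B here, merge at distance zero and can never be separated by the clustering, which mirrors the abstract's remark that one pair of cultivars could not be distinguished by the marker set.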

A Study On Clusters and Ecosystem In Distribution Industry Using Big Data Analysis (빅데이타 분석을 통한 유통산업 클러스터의 형성과 생태계 연구)

  • Jung, Jaeheon
    • The Journal of the Korea Contents Association / v.19 no.7 / pp.360-375 / 2019
  • This paper studies the ecosystem of the distribution industry by constructing a network of continuing transactions from 2015 data on more than 50 thousand firms provided by Korea Enterprise Data (KED). After applying a clustering method, one of the tools of social network analysis, we find that the firms in the network group into 732 clusters, which account for about 80% of all distribution industry sales in the KED data. The firms in a cluster conduct most of their transactions with other firms in the same cluster. However, compared with the manufacturing industry, the clusters contain fewer firms, and the biggest firms in the industry account for a smaller share of sales. The input-output analysis for the biggest distribution firms shows that small and medium-sized enterprises (SMEs) have a very high sales dependency on a main firm in some clusters. This implies that fair transaction policies could be applied more efficiently within the clusters. In addition, a small number of big distribution firms have very high backward production linkage effects on SMEs or on the 10th or 31st group, which have a high share of SME employment. These firms should be considered important in SME growth and employment policies.

Comparison of genome-wide association and genomic prediction methods for milk production traits in Korean Holstein cattle

  • Lee, SeokHyun;Dang, ChangGwon;Choy, YunHo;Do, ChangHee;Cho, Kwanghyun;Kim, Jongjoo;Kim, Yousam;Lee, Jungjae
    • Asian-Australasian Journal of Animal Sciences / v.32 no.7 / pp.913-921 / 2019
  • Objective: The objectives of this study were to compare the informative regions identified through two genome-wide association study (GWAS) approaches and to determine the accuracy and bias of the direct genomic value (DGV) for milk production traits in Korean Holstein cattle, using two genomic prediction approaches: single-step genomic best linear unbiased prediction (ss-GBLUP) and Bayesian Bayes-B. Methods: Records on production traits such as adjusted 305-day milk (MY305), fat (FY305), and protein (PY305) yields were collected from 265,271 first-parity cows. After quality control, 50,765 single-nucleotide polymorphic genotypes were available for analysis. In GWAS for ss-GBLUP (ssGWAS) and Bayes-B (BayesGWAS), the proportion of genetic variance for each 1-Mb genomic window was calculated and used to identify informative genomic regions. Accuracy of the DGV was estimated by five-fold cross-validation with random clustering. As a measure of accuracy for DGV, we also assessed the correlation between DGV and deregressed estimated breeding value (DEBV). The bias of DGV for each method was obtained by determining regression coefficients. Results: A total of nine and five significant windows (1 Mb) were identified for MY305 using ssGWAS and BayesGWAS, respectively. Using ssGWAS and BayesGWAS, we also detected multiple significant regions for FY305 (12 and 7) and PY305 (14 and 2), respectively. Both single-step DGV and Bayes DGV showed moderate accuracy for the MY305 (0.32 to 0.34), FY305 (0.37 to 0.39), and PY305 (0.35 to 0.36) traits. The mean biases of DGVs determined using the single-step and Bayesian methods were $1.50{\pm}0.21$ and $1.18{\pm}0.26$ for MY305, $1.75{\pm}0.33$ and $1.14{\pm}0.20$ for FY305, and $1.59{\pm}0.20$ and $1.14{\pm}0.15$ for PY305, respectively. Conclusion: From the bias perspective, we believe that genomic selection based on Bayesian approaches would be more suitable than ss-GBLUP in Korean Holstein populations.
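The evaluation described in the Methods can be illustrated with a small sketch: animals are split at random into five folds, and for a validation fold the accuracy is taken as the Pearson correlation between DGV and DEBV and the bias as a regression coefficient. The sketch assumes the bias is the slope of DEBV regressed on DGV (a common convention; the abstract does not state the direction), and the helper names and numbers are hypothetical. The genomic prediction models themselves are not reproduced.

```python
import random

def random_folds(ids, k=5, seed=0):
    """Shuffle the ids and deal them into k roughly equal validation folds."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]

def accuracy_and_bias(dgv, debv):
    """Return (Pearson correlation of DGV with DEBV, slope of DEBV on DGV)."""
    n = len(dgv)
    mean_x = sum(dgv) / n
    mean_y = sum(debv) / n
    sxx = sum((x - mean_x) ** 2 for x in dgv)
    syy = sum((y - mean_y) ** 2 for y in debv)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dgv, debv))
    correlation = sxy / (sxx * syy) ** 0.5
    slope = sxy / sxx  # 1.0 would mean DGV and DEBV are on the same scale
    return correlation, slope

# Ten hypothetical animal ids split into five validation folds of two each.
folds = random_folds(range(10), k=5)

# Hypothetical predictions and targets for one validation fold.
dgv = [1.0, 2.0, 3.0, 4.0]
debv = [1.5, 3.0, 4.5, 6.0]
acc, bias = accuracy_and_bias(dgv, debv)
print(round(acc, 3), round(bias, 3))  # 1.0 1.5
```

Under this convention a slope above 1, as reported for both methods in the Results, means the DGVs are compressed relative to the DEBVs, and the value closer to 1 indicates less bias.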

Development of LiDAR-Based MRM Algorithm for LKS System (LKS 시스템을 위한 라이다 기반 MRM 알고리즘 개발)

  • Son, Weon Il;Oh, Tae Young;Park, Kihong
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.20 no.1 / pp.174-192 / 2021
  • The LiDAR sensor, which provides higher perception performance than cameras and radar, has been difficult to apply to ADAS or autonomous driving because of its high price. However, as the price is decreasing rapidly, expectations are rising that existing autonomous driving functions can be improved by taking advantage of the LiDAR sensor. In level 3 autonomous vehicles, when a dangerous situation occurs in the perception module due to a sensor defect or sensor limit, the driver must take control of the vehicle for manual driving. If the driver does not respond to the request, the system must automatically intervene and perform a minimum risk maneuver (MRM) to keep the risk within a tolerable level. Against this background, this study developed a LiDAR-based MRM algorithm for the lane keeping system (LKS) for cases where normal operation of the LKS is not possible because of problems in the perception system. From point cloud data collected by LiDAR, the algorithm generates the trajectory of the vehicle in front through object clustering and converts it to target waypoints for the ego vehicle. Hence, if the camera-based LKS is not operating normally, LiDAR-based path tracking control is performed as the MRM. The HAZOP method was used to identify the risk sources in the LKS perception systems, and based on this, test scenarios were derived and used in validation by simulation. The simulation results indicated that the LiDAR-based LKS MRM algorithm of this study prevents lane departure in dangerous situations caused by various problems in the LKS perception systems and could prevent possible traffic accidents.

A Study on the Prediction of Yard Tractors Required by Vessels Arriving at Container Terminal (컨테이너터미널 입항 선박별 야드 트랙터 소요량 예측에 관한 연구)

  • Cho, Hyun-Jun;Shin, Jae-Young
    • Journal of Korea Port Economic Association / v.37 no.4 / pp.33-40 / 2021
  • Currently, the shipping and port industries are implementing strategies to improve port processing capabilities through the expansion and efficient operation of port logistics resources in order to survive fierce competition amid rapidly changing trends. A port's processing capacity is determined by the loading and unloading equipment installed at the dock, and it can be improved through various methods, such as additional deployment of logistics resources or more efficient operation of the resources in use. However, it is difficult to expect an improvement in a short period of time from additional deployment of logistics resources, because such deployment is clearly limited by the time it takes. Therefore, finding an efficient operation method for the resources already in use is a feasible way to improve processing capacity. Domestic ports are also actively promoting informatization and digitalization with the development of 4th industrial revolution technologies. However, the calculation of the number of yard tractor (Y/T) assignments in the current unloading process depends on expert experience, and related previous studies also focus on the allocation of Y/Ts or on calculating the total number of Y/Ts required. Therefore, this study analyzed the factors affecting the number of Y/T allocations using the loading and unloading information of incoming ships, and based on this, cluster analysis, regression analysis, and a deep neural network (DNN) model were applied.

Analysis of Stomach Contents of Marine Organisms in Gwangyang Bay and Yeosu Fish Market Using DNA Metabarcoding (DNA 메타바코딩을 이용한 광양만 및 어시장 해양 생물 위 내용물 분석)

  • Gun Hee Oh;Yong Jun Kim;Won-Seok Kim;Cheol Hong;Chang Woo Ji;Ihn-Sil Kwak
    • Korean Journal of Ecology and Environment / v.55 no.4 / pp.368-375 / 2022
  • Gut contents analysis is essential for predicting how changes in food sources caused by variations in the habitat environment affect organisms. Previous gut content studies have used traditional methods, such as visual observation. However, these methods are limited in identifying food sources because of the digestive process in the gut. DNA metabarcoding is a useful method for analyzing food sources that overcomes these limitations. We sampled the marine organisms Pennahia argentata, Larimichthys polyactis, Crangon affinis, Loligo beka, and Sepia officinalis from Gwangyang Bay and the Yeosu fisheries market and analyzed their gut contents by DNA metabarcoding. The 18S rRNA V9 primer was used to analyze food sources by DNA metabarcoding. Network and two-way clustering analyses characterized the relationship between organisms and food sources. A comparison of the gut-content metabarcoding of P. argentata sampled from Gwangyang Bay and from the fisheries market identified fish and Copepoda as common food sources. In addition, Decapoda and Copepoda were identified as common food sources for L. polyactis and C. affinis, respectively. Copepoda was identified as the primary food source for L. beka and S. officinalis. These results demonstrate that gut contents analysis using DNA metabarcoding reflects diverse and detailed information on biological food sources in the aquatic environment. In addition, applying it to research on food webs in the ecosystem will make it possible to provide biological information from the gut that identifies key food sources.