• Title/Summary/Keyword: Cluster System

Discovering Association Rules using Item Clustering on Frequent Pattern Network (빈발 패턴 네트워크에서 아이템 클러스터링을 통한 연관규칙 발견)

  • Oh, Kyeong-Jin;Jung, Jin-Guk;Ha, In-Ay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.14 no.1
    • /
    • pp.1-17
    • /
    • 2008
  • Data mining is defined as the process of discovering meaningful and useful patterns in large volumes of data. In particular, finding association rules between items in a database of customer transactions has become an important task. Since the Apriori algorithm, several data structures and algorithms have been proposed that store meaningful information compressed from the original database in order to find frequent itemsets. Although existing methods find all association rules, analyzing them takes considerable effort because there are too many rules. In this paper, we propose a new data structure, called a Frequent Pattern Network (FPN), which represents items as vertices and 2-itemsets as edges of the network. We construct the FPN from item frequencies. We then use a clustering method to group the vertices of the network into clusters so that intracluster similarity is maximized and intercluster similarity is minimized, and we generate association rules based on the clusters. Our experiments measured the accuracy of clustering items on the network using confidence, correlation, and edge-weight similarity methods. We also generated association rules using the clusters and compared our method with the traditional one. The results show that confidence similarity had a stronger influence than the other measures on the frequent pattern network, and that the FPN was flexible with respect to the minimum support value.
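
The network described in this abstract (items as vertices, 2-itemsets as edges) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `build_fpn` and `confidence`, the choice of raw co-occurrence counts as edge weights, and the toy transactions are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def build_fpn(transactions, min_support):
    """Build a toy frequent pattern network (hypothetical helper).

    Vertices are items whose count reaches min_support; edges are
    2-itemsets whose co-occurrence count reaches min_support.
    """
    item_count = Counter()
    pair_count = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # de-duplicate, fix pair order
        item_count.update(items)
        pair_count.update(combinations(items, 2))
    vertices = {i: c for i, c in item_count.items() if c >= min_support}
    edges = {p: c for p, c in pair_count.items() if c >= min_support}
    return vertices, edges


def confidence(vertices, edges, a, b):
    """confidence(a -> b) = support(a, b) / support(a)."""
    pair = tuple(sorted((a, b)))
    return edges.get(pair, 0) / vertices[a]


# Toy data, invented for this example.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["bread", "butter"],
    ["milk", "eggs"],
]
vertices, edges = build_fpn(transactions, min_support=2)
print(vertices)
print(edges)
print(confidence(vertices, edges, "butter", "bread"))
```

A confidence value computed this way is one possible similarity between two vertices; the clustering step mentioned in the abstract would then group vertices using such a measure.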

Spark based Scalable RDFS Ontology Reasoning over Big Triples with Confidence Values (신뢰값 기반 대용량 트리플 처리를 위한 스파크 환경에서의 RDFS 온톨로지 추론)

  • Park, Hyun-Kyu;Lee, Wan-Gon;Jagvaral, Batselem;Park, Young-Tack
    • Journal of KIISE
    • /
    • v.43 no.1
    • /
    • pp.87-95
    • /
    • 2016
  • Recently, due to the development of the Internet and electronic devices, there has been an enormous increase in the amount of available knowledge and information. As this growth has proceeded, studies on large-scale ontological reasoning have been actively carried out. In general, a machine learning program or knowledge engineer measures and provides a degree of confidence for each triple in a large ontology. Yet the collected ontology data contain uncertainty, and reasoning over such data can produce vague results. To address this uncertainty, we propose an RDFS reasoning approach that uses confidence values indicating the degree of uncertainty in the collected data. Unlike conventional reasoning approaches, which do not take data uncertainty into account, our approach uses the in-memory cluster computing framework Spark and applies uncertainty estimation methods to compute confidence values for the data inferred through RDFS-based reasoning. As a result, the computed confidence values represent the uncertainty in the inferred data. To evaluate our approach, ontology reasoning was carried out over the LUBM standard benchmark data set with arbitrary confidence values added to the ontology triples. Experimental results indicated that the proposed system can process the largest data set, LUBM3000, in 1179 seconds, inferring 350K triples.
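
As a rough illustration of attaching confidence values to inferred triples, the sketch below applies the standard RDFS subclass rule (rdfs9: if x has type C and C is a subclass of D, then x has type D) to a small in-memory set of triples. The abstract does not state how confidences are combined, so two assumptions are made here: the product of the premises' confidences is used for a derived triple, and the maximum is kept when a triple can be derived in several ways. It is plain Python rather than Spark, and the function name `infer_types` and the sample triples are hypothetical.

```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"


def infer_types(triples):
    """Apply rdfs9 to a fixpoint, propagating confidence values.

    triples maps (subject, predicate, object) to a confidence in [0, 1].
    Assumption: derived confidence = product of premise confidences,
    keeping the highest value when a triple has several derivations.
    """
    known = dict(triples)
    changed = True
    while changed:
        changed = False
        subclass = [(s, o, c) for (s, p, o), c in known.items() if p == SUBCLASS_OF]
        typed = [(s, o, c) for (s, p, o), c in known.items() if p == RDF_TYPE]
        for instance, cls, c_type in typed:
            for sub, sup, c_sub in subclass:
                if cls != sub:
                    continue
                derived = (instance, RDF_TYPE, sup)
                conf = c_type * c_sub
                if conf > known.get(derived, 0.0):
                    known[derived] = conf
                    changed = True
    return known


# Sample triples, invented for this example.
triples = {
    ("alice", RDF_TYPE, "GraduateStudent"): 0.5,
    ("GraduateStudent", SUBCLASS_OF, "Student"): 1.0,
    ("Student", SUBCLASS_OF, "Person"): 0.5,
}
inferred = infer_types(triples)
print(inferred[("alice", RDF_TYPE, "Person")])
```

Other combination rules (for example, taking the minimum of the premises) would fit the same loop by changing the single line that computes `conf`.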

Broadcasting and Caching Schemes for Location-dependent Queries in Urban Areas (도심환경에서 위치의존 질의를 위한 방송과 캐싱 기법)

  • Jung Il-dong;Yu Young-ho;Lee Jong-hwan;Kim Kyongsok
    • Journal of KIISE:Databases
    • /
    • v.32 no.1
    • /
    • pp.56-70
    • /
    • 2005
  • The results of location-dependent queries (LDQs) generally depend on the current locations of the query issuers. Many mechanisms specialized for LDQs, e.g. broadcast schemes, hoarding, and caching policies, have been developed to improve system performance and provide better services. Given the geographical adjacency of data and the characteristics of the target area, the caching policy and the broadcast scheme affect the overall performance of LDQs. For this reason, we propose a caching policy and a broadcast scheme that reflect these features. Based on the adjacency of data in LDQs, our broadcast scheme uses a Hilbert curve to cluster data. Moreover, to develop a caching policy suitable for LDQs in urban areas, we apply the moving distance of an MH (Mobile Host) to our caching policy. In our experiments, we evaluate the performance of the caching policy by measuring the workload of MHs and the correctness of LDQ results, and the performance of the broadcast scheme by measuring the average setup time of MHs. Finally, we expect that our caching policy provides more correct answers when executing LDQs in the local cache and leads to a significant improvement in the performance of MHs. It also seems quite probable that our broadcast scheme improves the battery life of the MH.
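
To show why a Hilbert curve helps cluster geographically adjacent data, the sketch below maps grid cells to their position along the curve using the widely known iterative conversion, then sorts some data items by that position. The abstract gives no implementation details, so the grid size, the function name `hilbert_index`, and the sample items are assumptions for illustration only.

```python
def hilbert_index(n, x, y):
    """Return the position of cell (x, y) along a Hilbert curve.

    n is the grid side length and must be a power of two;
    0 <= x, y < n. This is the standard iterative conversion.
    """
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate or flip the quadrant so the sub-curve is oriented correctly.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s //= 2
    return d


# Hypothetical data items placed on a 4 x 4 grid.
items = {"cafe": (0, 0), "bank": (3, 3), "park": (0, 1), "shop": (1, 1)}
broadcast_order = sorted(items, key=lambda name: hilbert_index(4, *items[name]))
print(broadcast_order)
```

Because consecutive positions on the curve are always neighboring cells, items that are close on the map tend to end up close together in the sorted order, which is the property a broadcast schedule can exploit.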

Molecular Characterization of Small-Spored Alternaria Species (소형의 포자를 형성하는 Alternaria 균류의 분자생물학적 특징)

  • Kim, Byung-Ryun;Park, Myung-Soo;Cho, Hye-Sun;Yu, Seung-Hun
    • Research in Plant Disease
    • /
    • v.11 no.1
    • /
    • pp.56-65
    • /
    • 2005
  • To establish a taxonomic system for morphologically similar species of small-spored Alternaria, phylogenetic analysis of internal transcribed spacer (ITS 1, ITS 2 and 5.8S rDNA) and mitochondrial small subunit (mt SSU) rDNA sequences and URP-PCR fingerprinting analysis were performed on 11 species of Alternaria. Phylogenetic analysis of ITS and mt SSU rDNA sequences revealed that 10 of the 11 small-spored Alternaria species were phylogenetically identical, with a bootstrap value of 100%. Only A. infectoria was phylogenetically differentiated from the other species. The results suggest that the 10 small-spored Alternaria species are very closely related evolutionarily and that these markers cannot be used to differentiate them. URP-PCR fingerprinting analysis of the eleven small-spored Alternaria species using 10 URP primers showed that it was possible to differentiate the species, although genetic similarities were found among them. The Alternaria sp. from common pokeweed could be distinguished from the other species by URP-PCR analysis and was considered a new species. A. infectoria could be easily distinguished from the other 10 species by both the phylogenetic analysis of ITS and mt SSU rDNA sequences and the URP-PCR fingerprinting analysis.

Clay Mineral Composition of the Soils Derived from Residuum and Colluvium (잔적 및 붕적모재 토양의 점토광물 특성구명)

  • Zhang, Yong-Seon;Sonn, Yeon-Kyu;Jung, Sug-Jae;Lee, Gye-Jun;Kim, Myung-Sook;Kim, Sun-Kwan;Lee, Ju-Young;Pyun, In-Hwan
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.39 no.5
    • /
    • pp.245-252
    • /
    • 2006
  • This experiment was conducted to investigate the distribution and composition of clay minerals and to supplement the soil classification system in Korea. Soil layer samples were collected from 26 residuum and colluvium soil series out of 390 soil series in Korea, and then analyzed for soil physical and chemical characteristics and for the mineral and chemical composition of clay in B horizon soils. The major clay minerals of residuum and colluvium were illite and chlorite in soils originating from sedimentary rocks such as limestone, shale, sandstone, and conglomerate; quartz and kaolin in soils originating from rhyolite, Neogene deposits, porphyry, and tuff; and kaolin and quartz in soils originating from granite, granite gneiss, and anorthosite. Clay minerals in Korean soils were divided into 4 groups: a mixed mineral group (MIX) consisting mainly of illite, kaolin, and vermiculite; a kaolin group (KA) with kaolin and illite; a chlorite group (CH) with chlorite and illite; and a smectite group (SM) with kaolin, illite, and smectite. The most predominant clay mineral groups were the kaolin group (KA), with kaolin and illite, and the mixed mineral group (MIX), with illite, kaolin, and vermiculite. The cation exchange capacity (CEC) of clay was low in soils composed mainly of the MIX and KA groups, and the silica-alumina molar ratio of clay was high in soils composed of the SM group.

What Kinds of Rearing Stress Do the Mothers of the Gifted Have?: Using a Concept Mapping Approach (영재 자녀를 둔 어머니들의 양육 스트레스 분석: 개념도 기법을 활용하여)

  • Han, Ki-Soon;Lee, Young-Mi
    • Journal of Gifted/Talented Education
    • /
    • v.22 no.4
    • /
    • pp.893-916
    • /
    • 2012
  • This research investigates the rearing stress of mothers of gifted students using the concept mapping method. For this purpose, 12 mothers of gifted students solicited, gathered, and analyzed related statements, and multidimensional scaling and hierarchical cluster analysis were then performed. The stress value was .273, which was appropriate for the two level concept mapping study. In addition, 101 mothers of gifted students rated the rearing stresses they experience. The results were as follows. First, 79 concrete statements were solicited and, as a result of concept mapping, were categorized as 'burden and conflict as mothers of the gifted', 'possible negative characteristics due to the giftedness', and 'self-esteem and pressure by the title of the gifted'. In particular, the following items showed relatively high averages: worrying about how to give the child specific help for his/her career (M=4.65); worrying that she might be intervening too much in her child's behaviors (M=4.60); feeling pressured to support the child's continued involvement in the gifted education system (M=4.46); worrying that her child is not developing his/her talent enough due to a lack of time and money (M=4.44); and being concerned that her high expectations might be putting her child under too much pressure (M=4.43). Implications of the study for gifted education practices are discussed.

Creative Cultural Localization Ways and IT Market of the EU to Converge the Creative Industries (창조융합시장을 위한 유럽 연합 (EU)의 시장과문화적 지역특화방안)

  • Seo, Dae-Sung
    • Journal of Distribution Science
    • /
    • v.13 no.1
    • /
    • pp.27-33
    • /
    • 2015
  • Purpose - The ICT market in the EU is lagging behind that of the US; however, algorithm and software development within the EU have grown steadily, and they involve focusing on the creative cultural convergence conceptualized as part of Horizon 2020 and connecting neighboring markets in the EE and the Mediterranean region. It is essential to study the requirements to market the EU's creative ICT development in emerging industrial countries after examining its applicability in these countries. Research design, data, and methodology - This study deals with data pertaining to the EU's creative industry and competitive edge. The global cultural expansion of the EU facilitates a new concept involving not only low-cost IT products to enhance local cultural artifacts through R&D and the construction of efficient infrastructure services, but also information exchange with a realistic commercialization of the technology that can be applied for creative cultural localization. In the European industry, research on algorithms has been applied for the benefit of consumers. We investigated how the process is conducted in the EU. Results - Europe needs to adjust its economic structure to the local culture as part of IT distribution convergence. The convergence has been converted into a production algorithm with IT in the form of low-cost production. This is because there is an attempt to improve the quality of transport infrastructure, workforce availability, and the distribution of the distance to the local industries and consumers, using IT algorithms. Integrated into the manufacturing industry, based on the ICT infrastructure and solutions, smart localized regional clusters are formed with the help of grafting. Europe has its own strategy to increase the number of hub-and-spoke cities. Europe is now becoming integrated, with an EPC system for regional cooperation rather than national competition in ICT technology. Europe has also been recognized in this study as changing the step-by-step paradigm for global competitiveness through new creative culture industries. Conclusions - As a result, there are several ways of converging with others through EU R&D intensity; therefore, the EU can be seen as successfully increasing marginal value, which is useful in developing a special industrial cluster or local cultural cities that create converged development by connecting people and objects with IT. In fact, when compared to the US, Europe has a strong culture and the car industries have a tendency to overshadow the IT industries with integration of services in IT distribution. Considering the rapid environmental changes, the convergence of IT services is likely to take place in Europe, similar to the pharmaceutical industry and the automotive industry. This requires a focus on human resources and automated systems management. The trend is to move away from low-wage industries, switching to key personnel centers of the local university-industry. The EU emphasizes the creation of IT market demand in Europe involving local cultural convergence for marketing as the second step to strengthen the economic hub-and-spoke areas.

Exploring the Transformative Regional Innovation Policy and Applying Local Energy Transition: The Case Studies of Gussing, Austria and Esbjerg, Denmark (전환적 지역혁신론의 탐색과 지역에너지 전환의 적용: 오스트리아 귀씽과 덴마크 에스비아르 사례를 중심으로)

  • HAN, Jae kak;LEE, Jung-pil;HA, Vara;SONG, Wichin
    • Journal of Science and Technology Studies
    • /
    • v.19 no.3
    • /
    • pp.291-333
    • /
    • 2019
  • Regional innovation policies have so far been separated from the social problems facing local communities. Treating the region as a location for business, they have focused on invigorating business innovation activities. However, the recent emergence of a new paradigm of innovation policy aimed at sustainability, 'transformative innovation policy,' has led to a search for regional innovation policies that begin with solving local social problems. This paper deals with regional innovation theory that starts from the search for solutions to, and system transformation for, social problems such as the climate crisis and energy problems. The objective is to present a new framework called 'transformative regional innovation policy' by combining the results of studies on transformative innovation policy and regional innovation policy, and to improve its content through case studies. In particular, the contribution of this paper is to analyze and discuss the concept of the transition platform, which aims to solve local social problems, through the case studies of Gussing, Austria and Esbjerg, Denmark. Lastly, it discusses the implications of these cases for Korean society.

Optimal Parameter Analysis and Evaluation of Change Detection for SLIC-based Superpixel Techniques Using KOMPSAT Data (KOMPSAT 영상을 활용한 SLIC 계열 Superpixel 기법의 최적 파라미터 분석 및 변화 탐지 성능 비교)

  • Chung, Minkyung;Han, Youkyung;Choi, Jaewan;Kim, Yongil
    • Korean Journal of Remote Sensing
    • /
    • v.34 no.6_3
    • /
    • pp.1427-1443
    • /
    • 2018
  • Object-based image analysis (OBIA) allows higher computational efficiency and better use of the information inherent in an image, as it reduces the complexity of the image while maintaining its properties. Superpixel methods oversegment the image into units smaller than an ordinary object segment and preserve the edges of the image well. SLIC (simple linear iterative clustering) is known to outperform earlier superpixel methods, with high image segmentation quality. Although the input parameter for SLIC, the number of superpixels, has considerable influence on image segmentation results, the impact of this parameter has not been sufficiently investigated. In this study, we performed optimal parameter analysis and evaluation of change detection for SLIC-based superpixel techniques using KOMPSAT data. For superpixel generation, three superpixel methods (SLIC; SLIC0, the zero-parameter version of SLIC; and SNIC, simple non-iterative clustering) were used with superpixel sizes ranging from $5 \times 5$ pixels to $50 \times 50$ pixels. Then, the image segmentation results were analyzed for how well they preserve the edges of the change detection reference data. Based on the optimal parameter analysis, image segmentation boundaries were obtained from the difference image of the bi-temporal images. Then, DBSCAN (density-based spatial clustering of applications with noise) was applied to cluster the superpixels into objects of a certain size for change detection. Changes in features were detected for each superpixel and compared with the reference data for evaluation. The change detection results showed that better change detection can be achieved even with a larger superpixel size if the superpixels are generated with high regularity of size and shape.

The complete genome sequence of a marine sponge-associated bacteria, Bacillus safensis KCTC 12796BP, which produces the anti-allergic compounds (해양 해면체로부터 분리한 세균으로 항알러지성물질을 생산하는 Bacillus safensis KCTC 12796BP의 유전체 해독)

  • Hanh, Nguyen Phan Kieu;Kim, Soo Hee;Kim, Geum Jin;Choi, Hyukjae;Nam, Doo Hyun
    • Korean Journal of Microbiology
    • /
    • v.54 no.4
    • /
    • pp.448-452
    • /
    • 2018
  • The full genome sequence of Bacillus safensis KCTC 12796BP, which had been isolated from a marine sponge in the seawater of Jeju Island, was determined with the Pac-Bio next-generation sequencing system. A circular chromosome of 3,935,874 bp was obtained, in addition to a circular plasmid of 36,690 bp. The G + C content of the chromosome was 41.4%, and that of the plasmid was 37.3%. The number of deduced CDSs in the chromosome was 3,980, whereas 36 CDS regions were determined in the plasmid. Among the deduced CDSs in the chromosome, 81 tRNA genes and 24 rRNA genes, in addition to one tmRNA, were allocated. More than 30 CDSs for sporulation, 16 CDSs for spore coat, and 20 CDSs for germination were also assigned in the chromosome. Several genes for capsular polysaccharide biosynthesis and for flagella biosynthesis and chemotaxis, in addition to genes for osmotic tolerance through the glycine-choline betaine pathway, were also identified. Above all, the biosynthetic gene cluster for the anti-allergic compounds seongsanamides was found among two non-ribosomal peptide synthetase (NRPS) gene clusters for secondary metabolites.
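
The G + C figures quoted in this abstract are simple base-composition ratios. The sketch below shows how such a ratio is computed for a sequence and how the reported chromosome and plasmid values can be combined into a length-weighted, genome-wide figure. The function name `gc_content` and the short test sequence are illustrative assumptions; only the replicon lengths and percentages come from the abstract.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    sequence = sequence.upper()
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# Made-up sequence, used only to exercise the function.
print(gc_content("GGCCAATT"))

# (length in bp, reported G + C fraction) for the chromosome and the plasmid.
replicons = [(3_935_874, 0.414), (36_690, 0.373)]
total_length = sum(length for length, _ in replicons)
genome_gc = sum(length * gc for length, gc in replicons) / total_length
print(total_length, round(genome_gc, 3))
```

Because the plasmid is less than 1% of the total length, the weighted value stays very close to the chromosome's own G + C content.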