• Title/Summary/Keyword: Clustering Effect

Search Result 299, Processing Time 0.028 seconds

Bayesian analysis of finite mixture model with cluster-specific random effects (군집 특정 변량효과를 포함한 유한 혼합 모형의 베이지안 분석)

  • Lee, Hyejin;Kyung, Minjung
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.57-68
    • /
    • 2017
  • Clustering algorithms attempt to find a partition of a finite set of objects in to a potentially predetermined number of nonempty subsets. Gibbs sampling of a normal mixture of linear mixed regressions with a Dirichlet prior distribution calculates posterior probabilities when the number of clusters was known. Our approach provides simultaneous partitioning and parameter estimation with the computation of classification probabilities. A Monte Carlo study of curve estimation results showed that the model was useful for function estimation. Examples are given to show how these models perform on real data.

Separating nanocluster Si formation and Er activation in nanocluster-Si sensitized Er luminescence

  • Kim, In-Yong;Sin, Jung-Hun;Kim, Gyeong-Jung
    • Proceedings of the Korean Vacuum Society Conference
    • /
    • 2010.02a
    • /
    • pp.109-109
    • /
    • 2010
  • $Er^{3+}$ ion shows a stable and efficient luminescence at 1.54mm due to its $^4I_{13/2}\;{\rightarrow}\;^4I_{15/2}$ intra-4f transition. As this corresponds to the low-loss window of silica-based optical fibers, Er-based light sources have become a mainstay of the long-distance telecom. In most telecom applications, $Er^{3+}$ ions are excited via resonant optical pumping. However, if nanocluster-Si (nc-Si) are co-doped with $Er^{3+}$, $Er^{3+}$ can be excited via energy transfer from excited electrical carriers in the nc-Si as well. This combines the broad, strong absorption band of nc-Si with narrow, stable emission spectra of $Er^{3+}$ to allow top-pumping with off-resonant, low-cost broadband light sources as well as electrical pumping. A widely used method to achieve nc-Si sensitization of $Er^{3+}$ is high-temperature annealing of Er-doped, non-stoichiometric amorphous thin film with excess Si (e.g.,silicon-rich silicon oxide(SRSO)) to precipitate nc-Si and optically activate $Er^{3+}$ at the same time. Unfortunately, such precipitation and growth of nc-Si into Er-doped oxide matrix can lead to $Er^{3+}$ clustering away from nc-Si at anneal temperatures much lower than ${\sim}1000^{\circ}C$ that is necessary for full optical activation of $Er^{3+}$ in $SiO_2$. Recently, silicon-rich silicon nitride (SRSN) was reported to be a promising alternative to SRSO that can overcome this problem of Er clustering. But as nc-Si formation and optical activation $Er^{3+}$ remain linked in Er-doped SRSN, it is not clear which mechanism is responsible for the observed improvement. In this paper, we report on investigating the effect of separating the nc-Si formation and $Er^{3+}$ activation by using hetero-multilayers that consist of nm-thin SRSO or SRSN sensitizing layers with Er-doped $SiO_2$ or $Si_3N_4$ luminescing layers.

  • PDF

Sensor Selection Strategies for Activity Recognition in a Smart Environment (스마트 환경에서 행위 인식을 위한 센서 선정 기법)

  • Gu, Sungdo;Sohn, Kyung-Ah
    • Journal of KIISE
    • /
    • v.42 no.8
    • /
    • pp.1031-1038
    • /
    • 2015
  • The recent emergence of smart phones, wearable devices, and even the IoT concept made it possible for various objects to interact one another anytime and anywhere. Among many of such smart services, a smart home service typically requires a large number of sensors to recognize the residents' activities. For this reason, the ideas on activity recognition using the data obtained from those sensors are actively discussed and studied these days. Furthermore, plenty of sensors are installed in order to recognize activities and analyze their patterns via data mining techniques. However, if many of these sensors should be installed for IoT smart home service, it raises the issue of cost and energy consumption. In this paper, we proposed a new method for reducing the number of sensors for activity recognition in a smart environment, which utilizes the principal component analysis and clustering techniques, and also show the effect of improvement in terms of the activity recognition by the proposed method.

Spatial Typification based on Heat Balance for Improving Thermal Environment in Seoul (열수지를 활용한 서울시 열환경 개선을 위한 공간 유형화)

  • Kwon, You Jin;Ahn, Saekyul;Lee, Dong Kun;Yoon, Eun Joo;Sung, Sunyong;Lee, Kiseung
    • Journal of Korea Planning Association
    • /
    • v.53 no.7
    • /
    • pp.109-126
    • /
    • 2018
  • The purpose of this study is to identify the spatial types for thermal environment improvement considering heat flux and its spatial context through empirical orthodox formulas. First, k-means clustering was used to classify values of three kinds of heat flux - latent, sensible and storage heat. Next, from the k-means clustering, we defined a type of thermal environment (type LHL) where improvement is needed for more comfortable and pleasant thermal environment in the city, among the eight types. Lastly, we compared and analyzed the characteristics of each classified thermal environmental types based on land cover types. From the study, we found that the ratio of impervious surfaces, roads, and buildings of the type LHL is higher than those of the type HLH (relatively thermal comfort environment). In order to improve the thermal environment, the following contents are proposed to urban planners and designers depending on the results of the study. a) Increase the green zone rate by 10% to reduce sensible heat; b) Reduce the percentage of impermeable surfaces and roads by 10% ; c) Latent heat increases when water and green spaces are expanded. This study will help to establish a minimum criterion for a land cover rate for the improvement of the urban thermal environment and a standard index for the thermal environmental improvement can be derived.

Building Energy Time Series Data Mining for Behavior Analytics and Forecasting Energy consumption

  • Balachander, K;Paulraj, D
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.6
    • /
    • pp.1957-1980
    • /
    • 2021
  • The significant aim of this research has always been to evaluate the mechanism for efficient and inherently aware usage of vitality in-home devices, thus improving the information of smart metering systems with regard to the usage of selected homes and the time of use. Advances in information processing are commonly used to quantify gigantic building activity data steps to boost the activity efficiency of the building energy systems. Here, some smart data mining models are offered to measure, and predict the time series for energy in order to expose different ephemeral principles for using energy. Such considerations illustrate the use of machines in relation to time, such as day hour, time of day, week, month and year relationships within a family unit, which are key components in gathering and separating the effect of consumers behaviors in the use of energy and their pattern of energy prediction. It is necessary to determine the multiple relations through the usage of different appliances from simultaneous information flows. In comparison, specific relations among interval-based instances where multiple appliances use continue for certain duration are difficult to determine. In order to resolve these difficulties, an unsupervised energy time-series data clustering and a frequent pattern mining study as well as a deep learning technique for estimating energy use were presented. A broad test using true data sets that are rich in smart meter data were conducted. The exact results of the appliance designs that were recognized by the proposed model were filled out by Deep Convolutional Neural Networks (CNN) and Recurrent Neural Networks (LSTM and GRU) at each stage, with consolidated accuracy of 94.79%, 97.99%, 99.61%, for 25%, 50%, and 75%, respectively.

Prediction of ship power based on variation in deep feed-forward neural network

  • Lee, June-Beom;Roh, Myung-Il;Kim, Ki-Su
    • International Journal of Naval Architecture and Ocean Engineering
    • /
    • v.13 no.1
    • /
    • pp.641-649
    • /
    • 2021
  • Fuel oil consumption (FOC) must be minimized to determine the economic route of a ship; hence, the ship power must be predicted prior to route planning. For this purpose, a numerical method using test results of a model has been widely used. However, predicting ship power using this method is challenging owing to the uncertainty of the model test. An onboard test should be conducted to solve this problem; however, it requires considerable resources and time. Therefore, in this study, a deep feed-forward neural network (DFN) is used to predict ship power using deep learning methods that involve data pattern recognition. To use data in the DFN, the input data and a label (output of prediction) should be configured. In this study, the input data are configured using ocean environmental data (wave height, wave period, wave direction, wind speed, wind direction, and sea surface temperature) and the ship's operational data (draft, speed, and heading). The ship power is selected as the label. In addition, various treatments have been used to improve the prediction accuracy. First, ocean environmental data related to wind and waves are preprocessed using values relative to the ship's velocity. Second, the structure of the DFN is changed based on the characteristics of the input data. Third, the prediction accuracy is analyzed using a combination comprising five hyperparameters (number of hidden layers, number of hidden nodes, learning rate, dropout, and gradient optimizer). Finally, k-means clustering is performed to analyze the effect of the sea state and ship operational status by categorizing it into several models. The performances of various prediction models are compared and analyzed using the DFN in this study.

Distinguishing Aroma Profile of Highly-Marbled Beef according to Quality Grade using Electronic Nose Sensors Data and Chemometrics Approach

  • Utama, Dicky Tri;Jang, Aera;Kim, Gur Yoo;Kang, Sun-Moon;Lee, Sung Ki
    • Food Science of Animal Resources
    • /
    • v.42 no.2
    • /
    • pp.240-251
    • /
    • 2022
  • Fat deposition in animal muscles differs according to the genetics and muscle anatomical locations. Moreover, different fat to lean muscle ratios (quality grade, QG) might contribute to aroma development in highly marbled beef. Scientific evidence is required to determine whether the abundance of aroma volatiles is positively correlated with the amount of fat in highly marbled beef. Therefore, this study aims to investigate the effect of QG on beef aroma profile using electronic nose data and a chemometric approach. An electronic nose with metal oxide semiconductors was used, and discrimination was performed using multivariate analysis, including principal component analysis and hierarchical clustering. The M. longissimus lumborum (striploin) of QG 1++, 1+, 1, and 2 of Hanwoo steers (n=6), finished under identical feeding systems on similar farms, were used. In contrast to the proportion of monounsaturated fatty acids (MUFAs), the abundance of volatile compounds and the proportion of polyunsaturated fatty acids (PUFAs) decreased as the QG increased. The aroma profile of striploin from carcasses of different QGs was well-discriminated. QG1++ was close to QG1+, while QG1 and QG2 were within a cluster. In conclusion, aroma development in beef is strongly influenced by fat deposition, particularly the fat-to-lean muscle ratio with regard to the proportion of PUFA. As MUFA slows down the oxidation and release of volatile compounds, leaner beef containing a higher proportion of PUFA produces more volatile compounds than beef with a higher amount of intramuscular fat.

A Study on Vertiport Location and Corridor Selections using GIS Analysis in Busan Area (GIS 분석을 활용한 부산권 버티포트 위치 및 회랑 선정에 관한 연구)

  • ChanHee Moon;HaYoung Shi;TaeWan Ku;BeomSoo Kang
    • Journal of Aerospace System Engineering
    • /
    • v.17 no.6
    • /
    • pp.46-53
    • /
    • 2023
  • As urban traffic congestion and environmental pollution are becoming significant issues in major cities, Urban Air Mobility (UAM) is gaining attention as an efficient solution. In this study, we conducted a geographic information system (GIS)-based spatial analysis and clustering algorithm considering the actual data of the terrain and infrastructure in the Busan area, through which we were able to select the location of vertiports and corridors (flight routes) for the UAM operation. Based on the Gimhae International Airport, which is expected to be the center of the UAM infrastructure system in the Busan region, we judged that three vertiport locations in the target area were suitable. Subsequently, we used the A* (A-star) algorithm considering Ground Risk to select a flight path that minimized both risk and distance. Through this, we confirmed a risk reduction effect of 80.168% compared to the minimum distance route.

A streamlined pipeline based on HmmUFOtu for microbial community profiling using 16S rRNA amplicon sequencing

  • Hyeonwoo Kim;Jiwon Kim;Ji Won Cho;Kwang-Sung Ahn;Dong-Il Park;Sangsoo Kim
    • Genomics & Informatics
    • /
    • v.21 no.3
    • /
    • pp.40.1-40.11
    • /
    • 2023
  • Microbial community profiling using 16S rRNA amplicon sequencing allows for taxonomic characterization of diverse microorganisms. While amplicon sequence variant (ASV) methods are increasingly favored for their fine-grained resolution of sequence variants, they often discard substantial portions of sequencing reads during quality control, particularly in datasets with large number samples. We present a streamlined pipeline that integrates FastP for read trimming, HmmUFOtu for operational taxonomic units (OTU) clustering, Vsearch for chimera checking, and Kraken2 for taxonomic assignment. To assess the pipeline's performance, we reprocessed two published stool datasets of normal Korean populations: one with 890 and the other with 1,462 independent samples. In the first dataset, HmmUFOtu retained 93.2% of over 104 million read pairs after quality trimming, discarding chimeric or unclassifiable reads, while DADA2, a commonly used ASV method, retained only 44.6% of the reads. Nonetheless, both methods yielded qualitatively similar β-diversity plots. For the second dataset, HmmUFOtu retained 89.2% of read pairs, while DADA2 retained a mere 18.4% of the reads. HmmUFOtu, being a closed-reference clustering method, facilitates merging separately processed datasets, with shared OTUs between the two datasets exhibiting a correlation coefficient of 0.92 in total abundance (log scale). While the first two dimensions of the β-diversity plot exhibited a cohesive mixture of the two datasets, the third dimension revealed the presence of a batch effect. Our comparative evaluation of ASV and OTU methods within this streamlined pipeline provides valuable insights into their performance when processing large-scale microbial 16S rRNA amplicon sequencing data. The strengths of HmmUFOtu and its potential for dataset merging are highlighted.

A Study on the Influence of Commercial Facility Diversity on the Formation of Consumption Centre: Application of Spatial Regression Models (상업시설의 다양성이 소비중심지 형성에 미치는 영향에 관한 연구: 공간회귀모형의 적용)

  • Sul-Hee Kim;Heung-Soon Kim
    • Land and Housing Review
    • /
    • v.15 no.1
    • /
    • pp.57-75
    • /
    • 2024
  • To create dynamic and bustling urban environments, a diverse array of commercial facilities is indispensable. These facilities are recognised as pivotal in attracting and accommodating a larger floating population, thereby suggesting that a greater diversity of commercial establishments fosters heightened consumer expenditure. With this premise, our study endeavours to explore the influence of commercial facility diversity on the Consumer Centre Index. Focused on the temporal context of 2021 and the spatial context of Seoul, our analysis utilizes the Consumer Centre Index, derived from Kernel Density analysis, as the dependent variable. Independent variables encompass factors reflecting commercial attributes and urban characteristics. Employing spatial regression analysis at the administrative district level, we discern that the clustering of similar industries exerts a more pronounced positive effect on consumer activation compared to the clustering of disparate industries. Additionally, the findings underscore the importance of concentrating industries that bolster consumer activation. Anticipated outcomes of this study include insights beneficial for optimizing commercial facility location policies within the consumer market.