• 제목/요약/키워드: Multivariate statistical models

검색결과 128건 처리시간 0.025초

인테리어 내장재의 고급감에 관한 시각 및 촉각변수의 수량화 모형 개발 (Development of Quantification Models on Visual and Tactile Design Characteristics for the Luxuriousness of Interior Covering Materials)

  • 반상우;윤명환
    • 대한산업공학회지
    • /
    • 제33권4호
    • /
    • pp.393-401
    • /
    • 2007
  • Affective aspects of design attributes such as color, Pattern, and texture are important to the overall impression and the success of interior products. Among all the interior materials, wallpapers and flooring materials take up largest construction area and they are main components in creating affective impression for customers. This study aims to investigate the relationship between luxuriousness and related affective variables and design elements of wallpapers and flooring materials. The approach consists of 3 steps: (1) selecting related affective features and product design attributes through a literature survey, opinion of expert panel, and focus group interview, (2) conducting evaluation experiments, and (3) developing Kansei models using multivariate statistical analysis and analyzing critical attributes. Evaluation experiment was conducted using a questionnaire made up of 7-point scale and 100-point scale and 30 housewives and 20 interior designers participated in the evaluation experiment. The result of evaluation was analyzed through principal component regression and quantification I analysis. As a result of analyzing the survey data, the relationship between luxuriousness and related affective features and product design attributes was identified, moreover a optimal combination of the design component was identified. Consequently, it is expected that the results of the study would be a basis of the concept of emotion-based design by giving insights about how customers perceive the luxuriousness and suggesting the optimal combination, and providing specific quantitative design guidelines.

Using machine learning to forecast and assess the uncertainty in the response of a typical PWR undergoing a steam generator tube rupture accident

  • Tran Canh Hai Nguyen ;Aya Diab
    • Nuclear Engineering and Technology
    • /
    • 제55권9호
    • /
    • pp.3423-3440
    • /
    • 2023
  • In this work, a multivariate time-series machine learning meta-model is developed to predict the transient response of a typical nuclear power plant (NPP) undergoing a steam generator tube rupture (SGTR). The model employs Recurrent Neural Networks (RNNs), including the Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and a hybrid CNN-LSTM model. To address the uncertainty inherent in such predictions, a Bayesian Neural Network (BNN) was implemented. The models were trained using a database generated by the Best Estimate Plus Uncertainty (BEPU) methodology; coupling the thermal hydraulics code, RELAP5/SCDAP/MOD3.4 to the statistical tool, DAKOTA, to predict the variation in system response under various operational and phenomenological uncertainties. The RNN models successfully captures the underlying characteristics of the data with reasonable accuracy, and the BNN-LSTM approach offers an additional layer of insight into the level of uncertainty associated with the predictions. The results demonstrate that LSTM outperforms GRU, while the hybrid CNN-LSTM model is computationally the most efficient. This study aims to gain a better understanding of the capabilities and limitations of machine learning models in the context of nuclear safety. By expanding the application of ML models to more severe accident scenarios, where operators are under extreme stress and prone to errors, ML models can provide valuable support and act as expert systems to assist in decision-making while minimizing the chances of human error.

주성분 회귀분석 및 인공신경망을 이용한 AE변수와 응력확대계수와의 상관관계 해석 (Analysis on Correlation between AE Parameters and Stress Intensity Factor using Principal Component Regression and Artificial Neural Network)

  • 김기복;윤동진;정중채;박휘립;이승석
    • 비파괴검사학회지
    • /
    • 제21권1호
    • /
    • pp.80-90
    • /
    • 2001
  • AE 신호와 재료의 기계적 물성과의 관계를 정량적으로 제시할 수 있는 방법을 개발하였다. 재료의 여러 가지 기계적 성질들 중 피로균열 거동에 관련된 응력확대계수를 중심으로 AE 신호와 같은 다변량 데이터의 처리에 많이 사용되고 있는 주성분 회귀분석과 비선형적 문제 해결에 적합한 신경회로망 기법을 이용하였다. 이를 위하여 강교량 부재인 SWS490B 강에 대한 피로균열전파 실험을 수행하였으며 표준 CT 시편에 대한 피로균열진전 시 발생하는 AE 신호의 각 변수와 응력확대계수와의 관계를 고찰하였다. 통계분석 방법인 변수선택법을 적용한 결과 AE 카운트(RC), 에너지(EN), 신호지속시간(ED)의 각각에 대한 유의성이 높은 것으로 나타났으나 전반적으로 전체 AE 변수를 모두 이용할 경우 통계적 유의성이 높은 것으로 나타났다. 부재의 반복하중 시 발생하는 피로균열진전을 정량적으로 도출할 수 있는 응력확대계수 추정모델을 개발하고 평가하였다. 미지 시료에 대하여 개발된 모델의 응력확대계수 예측 성능을 분석한 결과 주성분 회귀모델과 인공신경망 모델 모두 우수한 예측성능을 나타내었으나 전반적으로 인공신경망 모델이 주성분 회귀모델보다 다소 양호한 것으로 분석되었다.

  • PDF

데이터 기반 이상진단법을 위한 화학공정의 조업모드 판별 (Operation Modes Classification of Chemical Processes for History Data-Based Fault Diagnosis Methods)

  • 이창준;고재욱;이기백
    • Korean Chemical Engineering Research
    • /
    • 제46권2호
    • /
    • pp.383-388
    • /
    • 2008
  • 화학공정의 안전하고 효율적인 운전에 관심이 커지면서 공정이상의 원인을 조기에 진단하기 위한 다양한 이상진단방법이 연구되어 왔다. 최근에는 통계적 모델 등 정량적 데이터에 기반한 이상진단방법이 많이 연구되고 있으나, 특정 조업영역에서 얻어진 통계적 모델을 다른 조업영역에 적용하면 오진단이 많아지게 된다. 따라서 공정특성상 다양한 조업영역이 존재하는 화학공정에 데이터기반 방법론을 적용하기에는 어려움이 있어 화학공정의 조업영역 판별법이 요구되고 있다. 이 연구에서는 유클리드 거리(Euclidean distance), FDA(Fisher's discriminant analysis), PCA(principal component analysis)의 통계모델과 이 모델들에 공정변수의 동특성을 반영한 모델을 제안하였다. 6개의 조업모드를 가진 TE(tennessee eastman) 공정에 대한 사례연구를 통해 동특성을 반영한 PCA 모델의 성능이 가장 우수함을 확인하였다.

Plant breeding in the 21st century: Molecular breeding and high throughput phenotyping

  • Sorrells, Mark E.
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2017년도 9th Asian Crop Science Association conference
    • /
    • pp.14-14
    • /
    • 2017
  • The discipline of plant breeding is experiencing a renaissance impacting crop improvement as a result of new technologies, however fundamental questions remain for predicting the phenotype and how the environment and genetics shape it. Inexpensive DNA sequencing, genotyping, new statistical methods, high throughput phenotyping and gene-editing are revolutionizing breeding methods and strategies for improving both quantitative and qualitative traits. Genomic selection (GS) models use genome-wide markers to predict performance for both phenotyped and non-phenotyped individuals. Aerial and ground imaging systems generate data on correlated traits such as canopy temperature and normalized difference vegetative index that can be combined with genotypes in multivariate models to further increase prediction accuracy and reduce the cost of advanced trials with limited replication in time and space. Design of a GS training population is crucial to the accuracy of prediction models and can be affected by many factors including population structure and composition. Prediction models can incorporate performance over multiple environments and assess GxE effects to identify a highly predictive subset of environments. We have developed a methodology for analyzing unbalanced datasets using genome-wide marker effects to group environments and identify outlier environments. Environmental covariates can be identified using a crop model and used in a GS model to predict GxE in unobserved environments and to predict performance in climate change scenarios. These new tools and knowledge challenge the plant breeder to ask the right questions and choose the tools that are appropriate for their crop and target traits. Contemporary plant breeding requires teams of people with expertise in genetics, phenotyping and statistics to improve efficiency and increase prediction accuracy in terms of genotypes, experimental design and environment sampling.

  • PDF

Scoring systems for the management of oncological hepato-pancreato-biliary patients

  • Alexander W. Coombs;Chloe Jordan;Sabba A. Hussain;Omar Ghandour
    • 한국간담췌외과학회지
    • /
    • 제26권1호
    • /
    • pp.17-30
    • /
    • 2022
  • Oncological scoring systems in surgery are used as evidence-based decision aids to best support management through assessing prognosis, effectiveness and recurrence. Currently, the use of scoring systems in the hepato-pancreato-biliary (HPB) field is limited as concerns over precision and applicability prevent their widespread clinical implementation. The aim of this review was to discuss clinically useful oncological scoring systems for surgical management of HPB patients. A narrative review was conducted to appraise oncological HPB scoring systems. Original research articles of established and novel scoring systems were searched using Google Scholar, PubMed, Cochrane, and Ovid Medline. Selected models were determined by authors. This review discusses nine scoring systems in cancers of the liver (CLIP, BCLC, ALBI Grade, RETREAT, Fong's score), pancreas (Genç's score, mGPS), and biliary tract (TMHSS, MEGNA). Eight models used exclusively objective measurements to compute their scores while one used a mixture of both subjective and objective inputs. Seven models evaluated their scoring performance in external populations, with reported discriminatory c-statistic ranging from 0.58 to 0.82. Selection of model variables was most frequently determined using a combination of univariate and multivariate analysis. Calibration, another determinant of model accuracy, was poorly reported amongst nine scoring systems. A diverse range of HPB surgical scoring systems may facilitate evidence-based decisions on patient management and treatment. Future scoring systems need to be developed using heterogenous patient cohorts with improved stratification, with future trends integrating machine learning and genetics to improve outcome prediction.

국내 MIS 연구에서 구조방정식모형 활용에 관한 메타분석 (A Meta Analysis of Using Structural Equation Model on the Korean MIS Research)

  • 김종기;전진환
    • Asia pacific journal of information systems
    • /
    • 제19권4호
    • /
    • pp.47-75
    • /
    • 2009
  • Recently, researches on Management Information Systems (MIS) have laid out theoretical foundation and academic paradigms by introducing diverse theories, themes, and methodologies. Especially, academic paradigms of MIS encourage a user-friendly approach by developing the technologies from the users' perspectives, which reflects the existence of strong causal relationships between information systems and user's behavior. As in other areas in social science the use of structural equation modeling (SEM) has rapidly increased in recent years especially in the MIS area. The SEM technique is important because it provides powerful ways to address key IS research problems. It also has a unique ability to simultaneously examine a series of casual relationships while analyzing multiple independent and dependent variables all at the same time. In spite of providing many benefits to the MIS researchers, there are some potential pitfalls with the analytical technique. The research objective of this study is to provide some guidelines for an appropriate use of SEM based on the assessment of current practice of using SEM in the MIS research. This study focuses on several statistical issues related to the use of SEM in the MIS research. Selected articles are assessed in three parts through the meta analysis. The first part is related to the initial specification of theoretical model of interest. The second is about data screening prior to model estimation and testing. And the last part concerns estimation and testing of theoretical models based on empirical data. This study reviewed the use of SEM in 164 empirical research articles published in four major MIS journals in Korea (APJIS, ISR, JIS and JITAM) from 1991 to 2007. APJIS, ISR, JIS and JITAM accounted for 73, 17, 58, and 16 of the total number of applications, respectively. The number of published applications has been increased over time. LISREL was the most frequently used SEM software among MIS researchers (97 studies (59.15%)), followed by AMOS (45 studies (27.44%)). In the first part, regarding issues related to the initial specification of theoretical model of interest, all of the studies have used cross-sectional data. The studies that use cross-sectional data may be able to better explain their structural model as a set of relationships. Most of SEM studies, meanwhile, have employed. confirmatory-type analysis (146 articles (89%)). For the model specification issue about model formulation, 159 (96.9%) of the studies were the full structural equation model. For only 5 researches, SEM was used for the measurement model with a set of observed variables. The average sample size for all models was 365.41, with some models retaining a sample as small as 50 and as large as 500. The second part of the issue is related to data screening prior to model estimation and testing. Data screening is important for researchers particularly in defining how they deal with missing values. Overall, discussion of data screening was reported in 118 (71.95%) of the studies while there was no study discussing evidence of multivariate normality for the models. On the third part, issues related to the estimation and testing of theoretical models on empirical data, assessing model fit is one of most important issues because it provides adequate statistical power for research models. There were multiple fit indices used in the SEM applications. The test was reported in the most of studies (146 (89%)), whereas normed-test was reported less frequently (65 studies (39.64%)). It is important that normed- of 3 or lower is required for adequate model fit. The most popular model fit indices were GFI (109 (66.46%)), AGFI (84 (51.22%)), NFI (44 (47.56%)), RMR (42 (25.61%)), CFI (59 (35.98%)), RMSEA (62 (37.80)), and NNFI (48 (29.27%)). Regarding the test of construct validity, convergent validity has been examined in 109 studies (66.46%) and discriminant validity in 98 (59.76%). 81 studies (49.39%) have reported the average variance extracted (AVE). However, there was little discussion of direct (47 (28.66%)), indirect, and total effect in the SEM models. Based on these findings, we suggest general guidelines for the use of SEM and propose some recommendations on concerning issues of latent variables models, raw data, sample size, data screening, reporting parameter estimated, model fit statistics, multivariate normality, confirmatory factor analysis, reliabilities and the decomposition of effects.

로그선형모델을 이용한 팔당호 유입지류 수질의 연속성 시뮬레이션과 경향 분석 (Continuity Simulation and Trend Analysis of Water Qualities in Incoming Flows to Lake Paldang by Log Linear Models)

  • 나은혜;박석순
    • 생태와환경
    • /
    • 제36권3호통권104호
    • /
    • pp.336-343
    • /
    • 2003
  • 본 연구에서는 남, 북한강 그리고 경안천으로부터 팔당호에 유입되는 유기물 및 영양물질농도의 연속성 예측을 위하여 단순로그선형모델과 다변수 로그선형모델이 함께 적용되었으며, F-검정과 결정계수에 기초하여 산정된 모델의 유의성과 유효성이 검토되었다. 검토 결과 단순로그선형모델은 산정된 9개의 모델 중 4개 모델만이 통계적으로 유의한 반면 다변수 로그선형모델의 경우에는 9개 모델 모두 통계적으로 유의한 것으로 나타났다. 모델의 유효성을 평가하는 결정계수 또한 다변수 로그선형모델의 경우에 더 높게 예측되었다. 즉 팔당호 유입 수질 농도의 연속성 예측과 경향성 파악을 위해서는 다변수 로그선형모델의 적용이 더 적합한 것으로 판단되었다. 다변수 로그선형모델 결과에 기초하여 팔당호 유입수질의 유량 의존성, 경향성, 계절성을 분석하였다. 분석결과 모든 지류에서 유량이 증가함에 따라 팔당호로 유입되는 BOD 농도는 감소하는 것으로 나타났으며, TN과 TP의 경우에는 BOD와 달리 유량이 증가하더라도 농포는 증가 또는 감소하지 않는 것으로 나타났다. 따라서 3개 지류 유역에서 유기물을 배출하는 주요 오염원은 점오염원인 반면 영양물질의 경우에는 점오염원 뿐만 아니라 비점오염원 역시 주요 배출원으로 작용하고 있는 것으로 판단된다. 경향성을 분석한 결과 1995턴부터2000년까지 모든 지류에서 팔당호로 유입되는 BOD농도의 증감 경향은 보이지 않았다. 남한강과 북한강으로부터 팔당호로 유입되는 TP의 경우 1988년부터 1994년까지 점진적인 증가 추세를 보이는 것으로 보고된 바 있으나 본 연구의 대상 기간인 1995년 이후에는 이러한 증가 추세는 관찰되지 않았으며, 반면 경안천으로부터 유입되는 TP농도는 연간 10%정도 증가하고 있는 것으로 예측되었다. 한편 북한강으로부터 팔당호로 유입되는 TN농도는 연간 10%정도 감소하는 반면 남한강과 경안천으로부터의 유입 농도는 각각 연간 3%와 7%씩 증가하는 경향을 보였다. 수질 농도의 계절별 변화 경향을 분석한 결과 팔당호로 유입되는 3개 지류의 유기물 및 영양물질 농도는 모두 계절성을 갖는 것으로 분석되었으며, 이 중 가장 작은 변동폭을 갖는 수질항목은 총 질소인 것으로 나타났다.

함수적 변동성 fGARCH(1, 1)모형을 통한 초고빈도 시계열 변동성 (The fGARCH(1, 1) as a functional volatility measure of ultra high frequency time series)

  • 윤재은;김종민;황선영
    • 응용통계연구
    • /
    • 제31권5호
    • /
    • pp.667-675
    • /
    • 2018
  • 초고빈도(ultra high frequency; UHF)시계열의 함수적 변동성 측정을 위한 최신 기법인 함수적 변동성 functional GARCH : fGARCH(1, 1) 모형을 소개하고 설명하였다. 실증분석을 위해 R-code fGARCH(1, 1) 프로그램을 KOSPI/현대차 초고빈도 수익률 자료에 적합하여 예시하였다.

다변량 다수준 이항자료에 대한 일반화선형혼합모형 (Generalized Linear Mixed Model for Multivariate Multilevel Binomial Data)

  • 임화경;송석헌;송주원;전수영
    • 응용통계연구
    • /
    • 제21권6호
    • /
    • pp.923-932
    • /
    • 2008
  • 우리는 자명하지 않은 상관 구조를 갖는 복잡한 다변량 자료에 직면하는 경우가 있다. 예를 들어 군집 구조 자료의 경우 생략된 변수들이 한 개 이상의 관측값에 동시적으로 영향을 줄 수 있기 때문에 결과들 간에 상관 구조를 모형화하는 것은 추정량의 효율성과 정확한 표준오차의 계산 등의 타당한 추론을 위해서 중요하다 관측값들 간에 종속성을 두는 표준 방법으로는 관측 값들이 관찰되지 않은 어떤 변수를 공유한다고 가정하는 것인데, 이러한 가정에 대해 본 연구에서는 다수준 모형을 고려한 상관된 임의효과 모형을 적합시켰다. 추정은 준모수적 접근방법으로 임의계수 분포에 대한 모수적 가정 없이 유한혼합 EM-알고리즘을 통하여 수행되었다.