• 제목/요약/키워드: Interpretability

검색결과 90건 처리시간 0.027초

비대칭 다차원척도법의 시각화 (Visualizations of Asymmetric Multidimensional Scaling)

  • 이수기;최용석;이보희
    • 응용통계연구
    • /
    • 제27권4호
    • /
    • pp.619-627
    • /
    • 2014
  • 다차원척도법(MDS)에서는 대게 개체간의 거리나 유사성이 대칭성을 따른다. 따라서 비대칭 거리를 다루기는 쉽지 않다. 통용되고 있는 비대칭 다차원척도법도 여전히 결과를 해석하는데 어려움이 있다. 본 연구는 비대칭행렬의 순서 통계량을 활용하여 더 간단한 비대칭 대차원척도법을 제안한다. 제안된 웹(Web) 방법은 개체간의 영향력을 사용자들이 해석을 쉽게 하도록 화살표의 방향크기와 모양에 따라 시각화하여 보여준다.

다차원척도법에 의한 서울주민의 교통수단선호 분석 (Multidimensional Scaling of User Preferences for the Transportation Modes in Seoul.)

  • 허우선
    • 대한교통학회지
    • /
    • 제4권1호
    • /
    • pp.12-27
    • /
    • 1986
  • This study examined user preferences toward transportation modes in Seoul. Two multidimensional scaling models, the ideal point and vector models, were applied to data on mode preferences of 114 adults in the metropolitan area. While both models produced fairly similar results, the vector model performed slightly better than the other in terms of interpretability of the results. The transport attributes elicited are comfort, flexibility, travel cost, travel time, privacy, and safety; among which comfort is salient most. The comfort variable is a multi-faceted attribute in nature. The variations of attribute preferences are most significant between the gender groups as well as worker/nonworker groups. In particular, male workers, female workers and female nonworkers form three distinctive market segments. An unidimensional scaling of the preference data reveals that subway, auto-driver, and subscription bus modes are preferred most, whereas motorcycle and bicycle least. The other modes of express bus, taxt, auto-passenger, bus and walk rank intermediately. An examination of how preference orders vary among modal groups hints that users align their stated attitudes to their choice in order to reduce cognitive dissonance.

  • PDF

Regression by Least Absolute Value Method with L1-constraint on Parameters

  • 고영현;전치혁
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회/대한산업공학회 2003년도 춘계공동학술대회
    • /
    • pp.151-157
    • /
    • 2003
  • OLS로 알려진 기존의 주절 방법은 변수수의 증가에 따라 다중공선성(Multicollinearity)의 문제와 더불어 해석력(Interpretability)이 떨어지는 문제를 가지게 된다. 본 연구에서는 파라미터의 절대값의 크기(L1-Norm)에 제약을 줌으로써 이와 같은 OLS의 문제를 해결할 수 있는 동시에, 잔차의 제곱합대신 절대오차를 사용하는 Least Absolute Value(LAV) 방법을 사용함으로써 이상치에 로버스트한 결과를 주는 방법론을 제안한다. 또한. 본 연구에서 제안하는 방법이 선형계획법에 의해 모델처럼 될 수 있는 특성으로 인해 제약조건이 있는 이차 형태의 최적화 문제보다 수행 속도면에서 뛰어난 결과를 주는 것을 수치예제을 통해 보인다.

  • PDF

기계 조립품 정보의 표현을 위한 XML기반 공용문서 구조 (Development of Common Document Structure based on XML for Representing Mechanical Part and Assembly Information)

  • 정태형;박승현;윤성원
    • 한국정밀공학회지
    • /
    • 제20권9호
    • /
    • pp.180-187
    • /
    • 2003
  • In engineering design environment it is hard to link design data and systems because the types of them are disparate. Therefore, the importance of metadata has increased. Some researches have been executed to develop metadata. But they cannot interact with other metadata and are difficult to extend. The purpose of this paper is to develop a common document structure which represents the general information of mechanical part assembly using XML, and to use it as base documents in order to integrate design data and systems. It is composed of part, assembly and user documents. Part document represents the information of a part independently to part type. Assembly document represents the location of constituent part documents. User document represents user's information. Common documents can be used as a broker between design data and systems, and it can improve the interpretability and reusability of document. We applied the developed common document structure to 2-stage spur gear drive.

Generalized Partially Linear Additive Models for Credit Scoring

  • Shim, Ju-Hyun;Lee, Young-K.
    • 응용통계연구
    • /
    • 제24권4호
    • /
    • pp.587-595
    • /
    • 2011
  • Credit scoring is an objective and automatic system to assess the credit risk of each customer. The logistic regression model is one of the popular methods of credit scoring to predict the default probability; however, it may not detect possible nonlinear features of predictors despite the advantages of interpretability and low computation cost. In this paper, we propose to use a generalized partially linear model as an alternative to logistic regression. We also introduce modern ensemble technologies such as bagging, boosting and random forests. We compare these methods via a simulation study and illustrate them through a German credit dataset.

Regression Models for Haplotype-Based Association Studies

  • Oh, So-Hee;NamKung, Jung-Hyun;Park, Tae-Sung
    • Genomics & Informatics
    • /
    • 제5권1호
    • /
    • pp.19-23
    • /
    • 2007
  • In this paper, we provide an overview of statistical models for haplotype-based association studies, and summarize their features based on the design matrix. We classify the design matrix into the two types: direct and indirect. For these two kinds of matrices, we present and compare characteristics using a simple hypothetical example, and a real data set. The motivation behind this study was to provide practitioners with an improved understanding, to facilitate the informed selection of the appropriate haplotype-based model and to improve the interpretability of the models.

CADICA: Diagnosis of Coronary Artery Disease Using the Imperialist Competitive Algorithm

  • Mahmoodabadi, Zahra;Abadeh, Mohammad Saniee
    • Journal of Computing Science and Engineering
    • /
    • 제8권2호
    • /
    • pp.87-93
    • /
    • 2014
  • Coronary artery disease (CAD) is currently a prevalent disease from which many people suffer. Early detection and treatment could reduce the risk of heart attack. Currently, the golden standard for the diagnosis of CAD is angiography, which is an invasive procedure. In this article, we propose an algorithm that uses data mining techniques, a fuzzy expert system, and the imperialist competitive algorithm (ICA), to make CAD diagnosis by a non-invasive procedure. The ICA is used to adjust the fuzzy membership functions. The proposed method has been evaluated with the Cleveland and Hungarian datasets. The advantage of this method, compared with others, is the interpretability. The accuracy of the proposed method is 94.92% by 11 rules, and the average length of 4. To compare the colonial competitive algorithm with other metaheuristic algorithms, the proposed method has been implemented with the particle swarm optimization (PSO) algorithm. The results indicate that the colonial competition algorithm is more efficient than the PSO algorithm.

Multidimensional Scaling of Asymmetric Distance Matrices

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • 응용통계연구
    • /
    • 제25권4호
    • /
    • pp.613-620
    • /
    • 2012
  • In most cases of multidimensional scaling(MDS), the distances or dissimilarities among units are assumed to be symmetric. Thus, it is not an easy task to deal with asymmetric distances. Asymmetric MDS developed so far face difficulties in the interpretation of results. This study proposes a much simpler asymmetric MDS, that utilizes the notion of "altitude". The analogy arises in mountaineering: It is easier (more difficult) to move from the higher (lower) point to the lower (higher). The idea is formulated as a quantification problem, in which the disparity of distances is maximally related to the altitude difference. The proposed method is demonstrated in three examples, in which the altitudes are visualized by rainbow colors to ease the interpretability of users.

Building a Fuzzy Model with Transparent Membership Functions through Constrained Evolutionary Optimization

  • Kim, Min-Soeng;Kim, Chang-Hyun;Lee, Ju-Jang
    • International Journal of Control, Automation, and Systems
    • /
    • 제2권3호
    • /
    • pp.298-309
    • /
    • 2004
  • In this paper, a new evolutionary scheme to design a TSK fuzzy model from relevant data is proposed. The identification of the antecedent rule parameters is performed via the evolutionary algorithm with the unique fitness function and the various evolutionary operators, while the identification of the consequent parameters is done using the least square method. The occurrence of the multiple overlapping membership functions, which is a typical feature of unconstrained optimization, is resolved with the help of the proposed fitness function. The proposed algorithm can generate a fuzzy model with transparent membership functions. Through simulations on various problems, the proposed algorithm found a TSK fuzzy model with better accuracy than those found in previous works with transparent partition of input space.

기계 조립품 정보의 표현을 위한 XML 기반 공용문서 구조 개발 (Development of Common Document Structure based on XML for Representing Mechanical Part Assembly Information)

  • 정태형;박승현;윤성원
    • 한국공작기계학회:학술대회논문집
    • /
    • 한국공작기계학회 2002년도 추계학술대회 논문집
    • /
    • pp.359-364
    • /
    • 2002
  • In engineering design environment it is hard to link design data and system because the types of them are disparate. Therefore, the importance of metadata has increased. Some researches have been executed to develop metadata. But they cannot interact with other metadata and are difficult to extend. The purpose of this paper is to develop a common metadata structure which represents the general information of mechanical part assembly using XML, and to use it as base documents in order to integrate design data and systems. It is composed of part and assembly documents. Part document represents the information of a part independently to part type. Assembly document represents the location of part documents which compose an assembly. Common documents can be used as a broker between design data and systems and improve interpretability and reusability of document. We applied the developed common document structure to 2-stage spur gear drive.

  • PDF