• 제목/요약/키워드: Mixed Type of Data

검색결과 434건 처리시간 0.033초

The local influence of LIU type estimator in linear mixed model

  • Zhang, Lili;Baek, Jangsun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권2호
    • /
    • pp.465-474
    • /
    • 2015
  • In this paper, we study the local influence analysis of LIU type estimator in the linear mixed models. Using the method proposed by Shi (1997), the local influence of LIU type estimator in three disturbance models are investigated respectively. Furthermore, we give the generalized Cook's distance to assess the influence, and illustrate the efficiency of the proposed method by example.

Parallel Coordinate Plots of Mixed-Type Data

  • Kwak, Il-Youp;Huh, Myung-Hoe
    • Communications for Statistical Applications and Methods
    • /
    • 제15권4호
    • /
    • pp.587-595
    • /
    • 2008
  • Parallel coordinate plot of Inselberg (1985) is useful for visualizing dozens of variables, but so far the plot's applicability is limited to the variables of numerical type. The aim of this study is to extend the parallel coordinate plot so that it can accommodate both numerical and categorical variables. We combine Hayashi's (1950, 1952) quantification method of categorical variables and Hurley's (2004) endlink algorithm of ordering variables for the parallel coordinate plot. In line with our former study (Kwak and Huh, 2008), we develop Andrews' type modification of conventional straight-lines parallel coordinate plot to visualize the mixed-type data.

Rheological Approaches to Classify the Mixed Gel Network of $\kappa$-Carrageenan/Agar

  • Lee, Seung-Ju;Kim, Young-Ho;Hwang, Jae-Kwan
    • Food Science and Biotechnology
    • /
    • 제16권2호
    • /
    • pp.183-186
    • /
    • 2007
  • The type of mixed gel network of $\kappa$-carrageenan/agar was determined by applying rheological principles. Apparent Young's modulus of the mixed gels was mathematically analyzed with (a) simply adding the moduli of two component gels, (b) phase-separated type's upper and lower bound models, (c) interpenetrating type's logarithmic model. The experimental data fitted the estimates from the operation (a). Whereas, as for the models (b), the experimental values in the agar-rich region fitted the estimates of the upper bound model, but in the $\kappa$-carrageenan-rich region slightly deviated from those of the lower bound model. It reflected an evidence of a phase-separated type, although it was not typical, that there must be data good-fit in the agar-rich and $\kappa$-carrageenan-rich regions with the upper and lower bound models, respectively. Experimental values disagreed with estimates of the model (c). Gel time was analyzed to evince the phase-separated type. As agar concentrations increased at a fixed amount of $\kappa$-carrageenan, gel time gradually decreased and then sharply increased and decreased again. The pattern of such change in gel time also represented a typical behavior of phase-separated type's mixed gels.

혼합형태 심볼릭 데이터의 군집분석방법 (A Divisive Clustering for Mixed Feature-Type Symbolic Data)

  • 김재직
    • 응용통계연구
    • /
    • 제28권6호
    • /
    • pp.1147-1161
    • /
    • 2015
  • 오늘날 데이터는 p-차원의 공간에서 점들로써 표현되는 전통적인 형태를 벗어나 시그널(signal), 함수, 이미지(image), 모양(shape) 등과 같은 다양한 형태의 자료들이 데이터로써 고려되고 분석되고있다. 그러한 종류의 새로운 종류의 데이터 중 하나로 심볼릭 데이터(symbolic data)를 고려할 수 있다. 심볼릭 데이터는 구간(interval), 히스토그램(histogram), 목록(list), 통계표, 분포, 또는 모형 등과 같은 다양한 형태들을 가질 수 있다. 지금까지의 연구가 주로 심볼릭 데이터의 각각의 형태별 자료를 고려했다면, 본 연구에서는 이를 확장하여 수집된 히스토그램과 멀티모달의 혼합된 형태로 이루어진 자료에 대한 계층 분할적 군집분석방법을 소개하고 이를 업종별 산업재해자료의 분석을 위해 이용한다.

과도상태의 회전형 흡수기에서 혼합기체 중 이산화탄소 흡수량 계산 모델 (A Mathematical Model on the Absorption Rate of Carbon-Dioxide in Mixed Gas During the Transient State of Rotary Type Absorbers)

  • 백현종
    • 대한기계학회논문집B
    • /
    • 제26권12호
    • /
    • pp.1729-1737
    • /
    • 2002
  • A mathematical model for the prediction of carbon-dioxide absorption rate during the transient state of rotary type absorber is developed. The rotary type absorber operates using a fast rotating porous structure and clean water. The model for the transient state rotary type absorbers is based on the steady state model of packed tower absorber. The paper manipulates the operating data of an arbitrary quasi-steady state condition of rotary type absorber for the determination of the coefficients involved in the model developed. The prediction accuracy is evaluated from the measured data of rotary type absorber operated under fast transient state. The measured data include the mole fraction of carbon dioxide in mixed gas and the pressure of absorber. The relative error in carbon dioxide prediction is estimated to be 20% at maximum. The model is successfully applied for the prediction of the behavior of a closed cycle diesel engine.

정지궤도 기상위성 자료를 활용한 강우유형별 강우량 추정연구 (A Study on the Algorithm for Estimating Rainfall According to the Rainfall Type Using Geostationary Meteorological Satellite Data)

  • 이은주;서명석
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 춘계학술대회 논문집
    • /
    • pp.117-120
    • /
    • 2006
  • Heavy rainfall events are occurred exceedingly various forms by a complex interaction between synoptic, dynamic and atmospheric stability. As the results, quantitative precipitation forecast is extraordinary difficult because it happens locally in a short time and has a strong spatial and temporal variations. GOES-9 imagery data provides continuous observations of the clouds in time and space at the right resolution. In this study, an power-law type algorithm(KAE: Korea auto estimator) for estimating rainfall based on the rainfall type was developed using geostationary meteorological satellite data. GOES-9 imagery and automatic weather station(AWS) measurements data were used for the classification of rainfall types and the development of estimation algorithm. Subjective and objective classification of rainfall types using GOES-9 imagery data and AWS measurements data showed that most of heavy rainfalls are occurred by the convective and mired type. Statistical analysis between AWS rainfall and GOES-IR data according to the rainfall types showed that estimation of rainfall amount using satellite data could be possible only for the convective and mixed type rainfall. The quality of KAE in estimating the rainfall amount and rainfall area is similar or slightly superior to the National Environmental Satellite Data and Information Service's auto-estimator(NESDIS AE), especially for the multi cell convective and mixed type heavy rainfalls. Also the high estimated level is denoted on the mature stage as well as decaying stages of rainfall system.

  • PDF

Mixed Linear Models with Censored Data

  • Ha, Il-do;Lee, Youngjo-;Song, Jae-Kee
    • Journal of the Korean Statistical Society
    • /
    • 제28권2호
    • /
    • pp.211-223
    • /
    • 1999
  • We propose a simple estimation procedure in the mixed linear models with censored normal data, using both Buckly and James(1979) type pseudo random variables and Lee and Nelder's(1996) estimation procedure. The proposed method is illustrated with the matched pairs data in Pettitt(1986).

  • PDF

Development of the forest type classification technique for the mixed forest with coniferous and broad-leaved species using the high resolution satellite data

  • Sasakawa, Hiroshi;Tsuyuki, Satoshi
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.467-469
    • /
    • 2003
  • This research aimed to develop forest type classification technique for the mixed forest with coniferous and broad-leaved species using the high resolution satellite data. QuickBird data was used as satellite data. The method of this research was to extract satellite data for every single tree crown using image segmentation technique, then to evaluate the accuracy of classification by changing grouping criteria such as tree species, families, coniferous or broad-leaved species, and timber prices. As a result, the classification of tree species and families level was inaccurate, on the other hand, coniferous or broad-leaved species and timber price level was high accurate.

  • PDF

Cluster Analysis with Balancing Weight on Mixed-type Data

  • Chae, Seong-San;Kim, Jong-Min;Yang, Wan-Youn
    • Communications for Statistical Applications and Methods
    • /
    • 제13권3호
    • /
    • pp.719-732
    • /
    • 2006
  • A set of clustering algorithms with proper weight on the formulation of distance which extend to mixed numeric and multiple binary values is presented. A simple matching and Jaccard coefficients are used to measure similarity between objects for multiple binary attributes. Similarities are converted to dissimilarities between i th and j th objects. The performance of clustering algorithms with balancing weight on different similarity measures is demonstrated. Our experiments show that clustering algorithms with application of proper weight give competitive recovery level when a set of data with mixed numeric and multiple binary attributes is clustered.

Extreme wind speeds from multiple wind hazards excluding tropical cyclones

  • Lombardo, Franklin T.
    • Wind and Structures
    • /
    • 제19권5호
    • /
    • pp.467-480
    • /
    • 2014
  • The estimation of wind speed values used in codes and standards is an integral part of the wind load evaluation process. In a number of codes and standards, wind speeds outside of tropical cyclone prone regions are estimated using a single probability distribution developed from observed wind speed data, with no distinction made between the types of causal wind hazard (e.g., thunderstorm). Non-tropical cyclone wind hazards (i.e., thunderstorm, non-thunderstorm) have been shown to possess different probability distributions and estimation of non-tropical cyclone wind speeds based on a single probability distribution has been shown to underestimate wind speeds. Current treatment of non-tropical cyclone wind hazards in worldwide codes and standards is touched upon in this work. Meteorological data is available at a considerable number of United States (U.S.) stations that have information on wind speed as well as the type of causal wind hazard. In this paper, probability distributions are fit to distinct storm types (i.e., thunderstorm and non-thunderstorm) and the results of these distributions are compared to fitting a single probability distribution to all data regardless of storm type (i.e., co-mingled). Distributions fitted to data separated by storm type and co-mingled data will also be compared to a derived (i.e., "mixed") probability distribution considering multiple storm types independently. This paper will analyze two extreme value distributions (e.g., Gumbel, generalized Pareto). It is shown that mixed probability distribution, on average, is a more conservative measure for extreme wind speed estimation. Using a mixed distribution is especially conservative in situations where a given wind speed value for either storm type has a similar probability of occurrence, and/or when a less frequent storm type produces the highest overall wind speeds. U.S. areas prone to multiple non-tropical cyclone wind hazards are identified.