• Title/Abstract/Keywords: Distribution data

Search results: 17,588 items; processing time: 0.044 s

A Study on the Species Distribution Modeling using National Ecosystem Survey Data

  • 김지연;서창완;권혁수;류지은;김명진
    • Journal of Environmental Impact Assessment
    • /
    • Vol. 21, No. 4
    • /
    • pp.593-607
    • /
    • 2012
  • The Ministry of Environment has conducted the 'National Ecosystem Survey' since 1986. It is carried out nationwide every ten years and is the largest survey project in Korea. The second and third surveys produced GIS-based species inventories. The three surveys used different methods, and few studies in Korea have modeled species distributions from national survey data. The purposes of this study are to test species distribution models in order to find the most suitable modeling method for the National Ecosystem Survey data, and to investigate how the modeling results vary by survey method and taxonomic group. Occurrence data for nine species were extracted from the National Ecosystem Survey by taxonomic group (plant, mammal, and bird). The plants are Korean winter hazel (Corylopsis coreana), Iris odaesanensis (Iris odaesanensis), and Berchemia (Berchemia berchemiaefolia). The mammals are Korean goral (Nemorhaedus goral), marten (Martes flavigula koreana), and leopard cat (Felis bengalensis). The birds are black woodpecker (Dryocopus martius), eagle owl (Bubo bubo), and common buzzard (Buteo buteo). The environmental variables consisted of climate, topography, soil, and vegetation structure. Two modeling methods (GAM and Maxent) were tested across the nine species, and predictive distribution maps of the target species were produced. The results were as follows. First, Maxent showed a 5-fold cross-validated AUC similar to that of GAM. Because the National Ecosystem Survey data are presence-only, Maxent is the more useful species distribution model for them. Second, the modeling results for the second and third surveys sometimes differed because of the different survey methods, so the two datasets need to be combined to produce a reasonable result. Lastly, the predicted distribution patterns differed by taxonomic group. These results should be considered when developing a species distribution model from the National Ecosystem Survey and applying it to nationwide biodiversity research.
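
The cross-validated AUC comparison mentioned in the abstract above can be illustrated with a minimal sketch. It assumes each fold supplies model scores for held-out presence sites and for background sites (the usual stand-in for absences with presence-only data); the function names and the fold layout are hypothetical and are not taken from the paper.

```python
def auc(presence_scores, background_scores):
    """AUC as the probability that a presence site outscores a background site."""
    wins = 0.0
    for p in presence_scores:
        for b in background_scores:
            if p > b:
                wins += 1.0
            elif p == b:
                wins += 0.5  # ties count as half a win
    return wins / (len(presence_scores) * len(background_scores))


def k_fold_indices(n_records, k=5):
    """Assign record i to fold i % k; shuffle the records beforehand in practice."""
    return [[i for i in range(n_records) if i % k == fold] for fold in range(k)]


def cross_validated_auc(fold_scores):
    """Average AUC over folds; each fold is (presence_scores, background_scores)."""
    values = [auc(pres, back) for pres, back in fold_scores]
    return sum(values) / len(values)
```

The pairwise definition is quadratic in the number of sites, which is acceptable for a sketch; a rank-based formula would be the usual choice for large datasets.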

Study on Dangerous Cut-slopes Distribution Using Inventory Data in Chungcheongdo

  • 김진환;구호본;이종현;윤천주
    • Korean Geotechnical Society: Conference Proceedings
    • /
    • Korean Geotechnical Society 2008 Spring Conference: Invited Lectures and Proceedings
    • /
    • pp.858-862
    • /
    • 2008
  • KICT has been carrying out inventory research on the cut slopes of national roads. The inventory results are the basic data used in the cut slope management system. Inventory data are classified into general status, cut slope characteristics, and inspector opinion. They are used to identify dangerous slopes and to decide the survey ranking for detailed safety diagnosis. This paper maps the distribution of dangerous cut slopes in Chungcheongdo using inventory data, then verifies the validity of that distribution by comparing it with the occurrence frequency of cut slopes that actually collapsed.

Bayesian estimation in the generalized half logistic distribution under progressively type-II censoring

  • Kim, Yong-Ku;Kang, Suk-Bok;Seo, Jung-In
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 5
    • /
    • pp.977-989
    • /
    • 2011
  • The half logistic distribution has been used intensively in reliability and survival analysis, especially when the data are censored. In this paper, we provide Bayesian estimation of the shape parameter and reliability function in the generalized half logistic distribution based on progressively Type-II censored data under various loss functions. We consider a conjugate prior and a noninformative prior and obtain the corresponding posterior distributions. As an illustration, we examine the validity of our estimation using real data and simulated data.

Estimation in the exponential distribution under progressive Type I interval censoring with semi-missing data

  • Shin, Hyejung;Lee, Kwangho
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 23, No. 6
    • /
    • pp.1271-1277
    • /
    • 2012
  • In this paper, we propose a method for estimating the parameter of an exponential distribution based on a progressive Type I interval censored sample with semi-missing observations. The maximum likelihood estimator (MLE) of the parameter cannot be obtained explicitly because the intervals are not equal in length under this sampling scheme. To obtain the MLE, we propose a method that converts the progressive Type I interval censored sample with semi-missing data into a progressive Type II interval censored sample. Consequently, the estimation procedures for the progressive Type II interval censored sample can be applied, and we obtain the MLE of the parameter and the survival function. It is shown that the obtained estimators perform well in terms of the mean square error (MSE) and mean integrated square error (MISE).
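
To illustrate why unequal intervals rule out a closed-form MLE, here is a minimal sketch that maximizes the standard progressive Type I interval censored log-likelihood numerically. This is not the conversion method proposed in the paper, and it ignores the semi-missing observations; the function names, search bounds, and example counts are assumptions made for illustration.

```python
import math


def log_likelihood(rate, times, failures, removals):
    """Exponential log-likelihood under progressive Type I interval censoring."""
    total = 0.0
    prev = 0.0
    for t, d, r in zip(times, failures, removals):
        # d units failed in (prev, t]; r surviving units were withdrawn at time t
        if d > 0:
            total += d * math.log(math.exp(-rate * prev) - math.exp(-rate * t))
        total -= rate * r * t
        prev = t
    return total


def maximize_rate(times, failures, removals, lo=1e-6, hi=10.0, tol=1e-8):
    """Golden-section search; valid because the log-likelihood is concave in rate."""
    phi = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c = b - phi * (b - a)
        d = a + phi * (b - a)
        if log_likelihood(c, times, failures, removals) < log_likelihood(
            d, times, failures, removals
        ):
            a = c  # the maximum lies to the right of c
        else:
            b = d  # the maximum lies to the left of d
    return (a + b) / 2


# Hypothetical inspection times of unequal length, with made-up counts
rate_hat = maximize_rate([1.0, 2.5, 5.0], [12, 9, 6], [2, 3, 8])
print(rate_hat)
```

With a single interval the maximizer has a closed form, which gives a quick sanity check: 50 failures and 50 withdrawals at time 1 yield a rate of ln 2.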

Profile likelihood estimation of generalized half logistic distribution under progressively type-II censoring

  • Kim, Yong-Ku;Kang, Suk-Bok;Han, Song-Hui;Seo, Jung-In
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 3
    • /
    • pp.597-603
    • /
    • 2011
  • The half logistic distribution has been used intensively in reliability and survival analysis, especially when the data are censored. In this paper, we provide profile likelihood estimation of the shape parameter and scale parameter in the generalized half logistic distribution based on progressively Type-II censored data. We also introduce approximate maximum profile likelihood estimates for the scale parameter. As an illustration, we examine the validity of our estimation using real data and simulated data.

A GEOSENSOR FILTER FOR PROCESSING GEOSENSOR QUERIES ON DATA STREAMS

  • Lee, Dong-Gyu;Ryu, Keun-Ho
    • Korean Society of Remote Sensing: Conference Proceedings
    • /
    • Korean Society of Remote Sensing, 2008 International Symposium on Remote Sensing
    • /
    • pp.119-121
    • /
    • 2008
  • Pattern matching is increasingly being employed in areas such as health care services, RFID-based systems, facility management, and surveillance. A geosensor filter correlates a data stream to match specific patterns in distributed environments. In this paper, we present a geosensor query language for efficiently representing declarative geosensor queries. Geosensor operators are proposed for fast query processing over spatial and temporal areas in distributed environments. We also propose a geosensor filter that matches new query predicates against incoming stream predicates. Our filter can reduce the volume of transmitted data and save sensor power consumption. It can be used in a stream data mining system to process, in real time, various data such as location, time, and geosensor information in distributed environments.

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 28, No. 2
    • /
    • pp.443-449
    • /
    • 2017
  • Population proportions, such as the distribution rate of LED TVs or the prevalence of a disease, are often estimated from survey sample data. A population proportion is generally treated as a special form of population mean. In complex sampling, such as stratified multistage sampling with unequal probability sampling, the denominator of the mean may be a random variable, in which case the mean is estimated like a ratio estimator. In this research, we examined the estimation of the distribution rate under stratified multistage sampling and computed numerical results using stratified random sample data with about 25% missing observations. In the data used for this research, the survey weights were determined in a deterministic way. The weights are therefore not random variables, and the population distribution rate and its variance can be estimated as in population mean estimation. When the weights are not random variables, estimating the variance of the proportion estimator with the ratio method may inflate the variance. Therefore, in estimating the variance of a population proportion, we need to examine the structure of the data and the survey design before choosing an estimation method.
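
As a rough illustration of estimating a proportion "like a population mean" when the weights are fixed, the following sketch applies the textbook stratified random sampling estimator and its variance. It assumes missing responses are simply dropped within each stratum (complete-case analysis), which may differ from how the paper handles them; the data layout and function name are hypothetical.

```python
def stratified_proportion(strata):
    """Estimate a population proportion and its variance from stratified samples.

    Each stratum is a dict with "N" (stratum population size) and "y"
    (list of 0/1 responses, with None marking a missing observation).
    """
    n_total = sum(s["N"] for s in strata)
    p_hat = 0.0
    var_hat = 0.0
    for s in strata:
        observed = [v for v in s["y"] if v is not None]  # complete cases only
        n = len(observed)
        p_h = sum(observed) / n
        weight = s["N"] / n_total  # fixed stratum weight, not a random variable
        fpc = 1 - n / s["N"]  # finite population correction
        p_hat += weight * p_h
        var_hat += weight**2 * fpc * p_h * (1 - p_h) / (n - 1)
    return p_hat, var_hat


# Made-up example: two equally sized strata with some missing responses
example = [
    {"N": 1000, "y": [1] * 6 + [0] * 4 + [None] * 3},
    {"N": 1000, "y": [1] * 2 + [0] * 8 + [None] * 2},
]
print(stratified_proportion(example))
```

Because the stratum weights are known constants here, no ratio-type linearization term enters the variance, which is the situation the abstract describes.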

A Study on the Internet Spatial Data Electronic Distribution System

  • 이기영;서의석;이용수
    • Journal of the Korea Society of Computer and Information
    • /
    • Vol. 5, No. 3
    • /
    • pp.40-45
    • /
    • 2000
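  • With the recent emergence of the WWW, the number of Internet users has grown explosively and the need for and importance of web geographic information systems have become prominent. As a result, many institutions are conducting technology development research to build an environment for distributing spatial data over the Internet. To access spatial data on the Internet, a spatial data electronic distribution system is needed that can register spatial data on the WWW and sell them to those who need them. In this paper, we design and present an efficient model of a spatial data electronic distribution system that works with the WWW. We also propose the functions and implementation methods for each module of the system.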

Adversarial-Mixup: Increasing Robustness to Out-of-Distribution Data and Reliability of Inference

  • 권경필;유준혁
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • Vol. 16, No. 1
    • /
    • pp.1-8
    • /
    • 2021
  • Detecting Out-of-Distribution (OOD) data is a fundamental requirement when a Deep Neural Network (DNN) is applied to real-world AI such as autonomous driving. However, modern DNNs are quite vulnerable to the over-confidence problem even when the test data are far from the training data distribution. To solve this problem, this paper proposes a novel Adversarial-Mixup training method that makes the DNN model more robust by detecting OOD data effectively. Experimental results show that the proposed Adversarial-Mixup method improves the overall performance of OOD detection by 78% compared with state-of-the-art methods. Furthermore, we show that the proposed method alleviates the over-confidence problem by lowering the confidence score on OOD data more than previous methods do, resulting in more reliable and robust DNNs.
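
The two ingredients named in the abstract above, a confidence score and mixup, can be sketched in a few lines. This shows only the standard building blocks (maximum softmax probability as the confidence score, and plain mixup interpolation); it is not the Adversarial-Mixup procedure itself, whose details the abstract does not give. The threshold value and function names are assumptions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confidence(logits):
    """Maximum softmax probability, a common baseline confidence score."""
    return max(softmax(logits))


def is_ood(logits, threshold=0.9):
    """Flag an input as OOD when the confidence falls below a chosen threshold."""
    return confidence(logits) < threshold


def mixup(x_a, y_a, x_b, y_b, lam):
    """Standard mixup: convex combination of two inputs and their label vectors."""
    x = [lam * a + (1 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y
```

An over-confident network assigns a high maximum softmax probability even to inputs far from the training data, so a threshold test like `is_ood` only works once training has pushed those scores down.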

Methods and Sample Size Effect Evaluation for Wafer Level Statistical Bin Limits Determination with Poisson Distributions

  • 박성민;김영식
    • Industrial Engineering
    • /
    • Vol. 17, No. 1
    • /
    • pp.1-12
    • /
    • 2004
  • In the modern semiconductor device manufacturing industry, statistical bin limits on wafer level test bin data are used to minimize the value added to defective product as well as to protect end customers from potential quality and reliability excursions. Most wafer level test bin data show skewed distributions. Using Monte Carlo simulation, this paper evaluates methods for determining statistical bin limits and the effect of sample size. In the simulation, wafer level test bin data are assumed to follow the Poisson distribution, so typical shapes of the data distribution can be specified in terms of the distribution's parameter. This study examines three methods: 1) a percentile based methodology; 2) data transformation; and 3) Poisson model fitting. The mean square error is adopted as the performance measure for each simulation scenario. A case study is then presented. The results show that the percentile and transformation based methods give more stable statistical bin limits on the real dataset. However, with highly skewed distributions, the transformation based method should be used with caution. When the data are well fitted by a particular probability distribution, the model fitting approach can be used. As for the sample size effect, the mean square error appears to decrease exponentially as the sample size increases.
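
The simulation design described in the abstract above can be sketched as follows, under stated assumptions: bin counts are drawn from a Poisson distribution, the upper limit is taken as a chosen percentile, and two of the three methods (empirical percentile and Poisson model fitting) are compared by mean square error against the true Poisson quantile. The data transformation method is omitted, and the percentile level, parameter values, and function names are hypothetical.

```python
import math
import random


def poisson_sample(lam, rng):
    """Draw one Poisson variate with Knuth's method (adequate for small lam)."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def poisson_quantile(lam, p):
    """Smallest k with P(X <= k) >= p for X ~ Poisson(lam), with 0 < p < 1."""
    k, term = 0, math.exp(-lam)
    cdf = term
    while cdf < p and term > 0:
        k += 1
        term *= lam / k
        cdf += term
    return k


def percentile_limit(data, p):
    """Empirical limit: smallest value with at least a fraction p of data at or below it."""
    ordered = sorted(data)
    return ordered[max(0, math.ceil(p * len(ordered)) - 1)]


def model_fit_limit(data, p):
    """Fit a Poisson by its MLE (the sample mean) and return the fitted quantile."""
    return poisson_quantile(sum(data) / len(data), p)


def simulate_mse(lam, n, p, reps, seed=0):
    """Monte Carlo MSE of both limits against the true Poisson quantile."""
    rng = random.Random(seed)
    true_limit = poisson_quantile(lam, p)
    err_pct = err_fit = 0.0
    for _ in range(reps):
        data = [poisson_sample(lam, rng) for _ in range(n)]
        err_pct += (percentile_limit(data, p) - true_limit) ** 2
        err_fit += (model_fit_limit(data, p) - true_limit) ** 2
    return err_pct / reps, err_fit / reps


# Compare the two methods at a few sample sizes
for n in (25, 100, 400):
    print(n, simulate_mse(lam=2.0, n=n, p=0.99, reps=200))
```

Running the loop over increasing sample sizes gives a rough view of how the error of each method changes with sample size under the Poisson assumption.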