• Title/Summary/Keyword: Clustering sampling


Constraints on cosmology and baryonic feedback by the combined analysis of weak lensing and galaxy clustering with the Deep Lens Survey

  • Yoon, Mijin;Jee, M. James;Tyson, Tony
    • The Bulletin of The Korean Astronomical Society / v.43 no.2 / pp.41.1-41.1 / 2018
  • We constrain cosmological parameters by combining three power spectra, measured from galaxy clustering, galaxy-galaxy lensing, and cosmic shear, using the Deep Lens Survey (DLS). Two lens bins (centered at z~0.27 and 0.54) and two source bins (centered at z~0.64 and 1.1), containing more than one million galaxies, are selected to measure the power spectra. We re-calibrate the initial photo-z estimates of the lens bins by matching with SHELS and PRIMUS and confirm their fidelity by measuring the cross-correlation between the bins. We also check the reliability of the lensing signals through null tests: lens-source flipping and cross-shear measurement. Residual systematic errors from photometric redshift and shear calibration uncertainties are marginalized over in the nested sampling used for parameter estimation. For the flat ΛCDM model, we determine S_8 = σ_8(Ω_m/0.3)^0.5 = 0.832 ± 0.028, which is in excellent agreement with the Planck data. We also verify that the two independent constraints, from cosmic shear and from galaxy clustering plus galaxy-galaxy lensing, are consistent with each other. To address baryonic feedback effects on small scales, we marginalize over a baryonic feedback parameter, which we are able to constrain with the DLS data alone and more tightly when combined with Planck data. The constrained value hints that the AGN feedback in the current OWLS simulations might not be strong enough.
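
The S_8 definition quoted in the abstract can be written as a one-line function. The following is only a minimal sketch: the function name `s8` and the input values are hypothetical and chosen to illustrate the arithmetic, not taken from the paper.

```python
def s8(sigma_8, omega_m, pivot=0.3, alpha=0.5):
    """Return S_8 = sigma_8 * (Omega_m / pivot) ** alpha.

    pivot=0.3 and alpha=0.5 follow the definition quoted in the abstract.
    """
    return sigma_8 * (omega_m / pivot) ** alpha


# Hypothetical inputs (not results from the paper): at Omega_m equal to the
# pivot, S_8 reduces to sigma_8 itself.
print(s8(sigma_8=0.8, omega_m=0.3))
```

Because S_8 multiplies σ_8 by a power of Ω_m, a lower Ω_m must be paired with a higher σ_8 to give the same S_8, which is why lensing analyses often quote this combination instead of the two parameters separately.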


Deconstructing Agile Survey to Identify Agile Skeptics

  • Entesar Alanazi;Mohammad Mahdi Hassan
    • International Journal of Computer Science & Network Security / v.24 no.3 / pp.201-210 / 2024
  • In empirical software engineering research, questionnaires and surveys are increasingly used to collect information from practitioners. Typically, such data are then analyzed with overall descriptive statistics, which treat the whole survey population as a single group and rely on sampling techniques to capture variety. In some cases, the population is also partitioned into sub-groups based on background information. However, this does not reveal opinion diversity properly, because similar opinions can exist in different segments of the population, whereas people within the same group might hold different opinions. Although existing approaches can capture general trends, there is a risk that the opinions of different sub-groups are lost. The problem becomes more complex in longitudinal studies, where minority opinions might fade or resolve over time. Survey-based longitudinal data may contain patterns that can be extracted through a clustering process, which may reveal new information and draw attention to alternative perspectives. We suggest using a data mining approach to find the diversity among different groups (agile skeptics) in longitudinal studies. In our study, we show that diversity can be revealed and tracked over time using a clustering approach, giving minorities an opportunity to be heard.

CLUSTER ANALYSIS FOR REGION ELECTRIC LOAD FORECASTING SYSTEM

  • Park, Hong-Kyu;Kim, Young-Il;Park, Jin-Hyoung;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference / 2007.10a / pp.591-593 / 2007
  • This paper clusters AMR (Automatic Meter Reading) data. A load survey system has been applied to record the power consumption of sampled contract types in the KEPRI AMR system. The effect of a change of contract type on customer power consumption is determined by clustering the load survey results, so that power can be supplied to customers according to the usage patterns of the resulting clusters. Korea has fewer classes of electricity supply type than other countries because its electricity market has a single provider, and the supply types need to be divided further for more efficient supply. We found consumption patterns that differ from the supply type assigned to the customer. Our experiments use Clementine, a data mining tool.


Construction of Observational Locations for Measuring Water Quality in the River Area (하천유역 수질 관측망 구성 연구)

  • Kwon, S.H.;Oh, H.S.
    • Journal of Korean Society of Industrial and Systems Engineering / v.35 no.3 / pp.187-191 / 2012
  • Many methods have been proposed for constructing networks of observational locations for measuring water quality in reservoirs, but they are difficult to apply to river areas because they cluster locations awkwardly and struggle to find representative observational locations within each cluster. This paper proposes a statistical approach that detects anomaly locations, whose important water quality measurements differ significantly from those of the preceding locations, and constructs the observational network from them. Anomalies are detected using the sampling distribution of each primary principal component score, the sum of the primary PCs, or the sum of the residual PCs. An empirical study using data from the Nakdong Dam illustrates how to apply the proposed approach and shows the limitations of previous studies.

Suggestion Method of Classific System of Abnormal Genetic using EP (진화프로그래밍을 이용한 이상 유전자 분류 방법 제안)

  • Kim, Young-Gie;Bae, Sang-Hyun
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2008.05a / pp.776-779 / 2008
  • Microarray techniques are expected to enable direct classification and diagnosis from genetic data, but such data contain abnormal values introduced by the DNA technique. Sampled genetic data therefore contain a great deal of noise in the form of abnormal values. This paper reviews the sampling methods used in existing studies and then proposes a new data classification system and modeling method using EP (evolutionary programming), implemented in Matlab and applied to three datasets.


A Study on Selective Sampling using SOM (SOM을 적용한 선택적 샘플링에 관한 연구)

  • Kim, Man-Sun;Yang, Hyung-Jeong;Kim, Jeong-Sik;Kim, Sun-Hee
    • Annual Conference of KIPS / 2007.11a / pp.38-41 / 2007
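  • Applying the large volumes of data collected for data mining to machine learning without any filtering not only requires a great deal of time and cost but is also inefficient in terms of storage space. Selective sampling is a method that generates new training data reflecting the characteristics of the original data as far as possible, so that learning can be carried out efficiently in this situation. In this study, we run experiments to resolve several selection problems that arise when performing selective sampling with SOM, a type of neural network. The experiments yielded two findings: 1) a sufficiently large map size must be chosen for the map to reflect the implicit characteristics of the training data well, and 2) in the unit selection method for selective sampling, removing meaningless units improves classification performance.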
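
The idea of SOM-based selective sampling can be sketched as follows. This is an illustrative toy and not necessarily the authors' procedure: it assumes a 1-D map (a chain of units), a linearly decaying learning rate and neighborhood width, and a selection rule that keeps one representative sample per unit while dropping units that win fewer than `min_hits` samples. All function and parameter names are hypothetical.

```python
import math
import random


def best_unit(weights, x):
    """Index of the unit whose weight vector is closest to sample x."""
    return min(range(len(weights)),
               key=lambda u: sum((a - b) ** 2 for a, b in zip(weights[u], x)))


def train_som(data, n_units, epochs, rng, lr0=0.5):
    """Train a 1-D SOM with a Gaussian neighborhood on a list of vectors."""
    weights = [list(rng.choice(data)) for _ in range(n_units)]
    sigma0 = max(n_units / 2.0, 1.0)
    total = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.sample(data, len(data)):  # shuffled pass over the data
            frac = step / total
            lr = lr0 * (1.0 - frac)                   # decaying learning rate
            sigma = max(sigma0 * (1.0 - frac), 0.5)   # shrinking neighborhood
            bmu = best_unit(weights, x)
            for u, w in enumerate(weights):
                h = math.exp(-((u - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(len(w)):
                    w[d] += lr * h * (x[d] - w[d])
            step += 1
    return weights


def select_samples(data, weights, min_hits=1):
    """Keep the sample nearest to each unit; skip units with too few hits."""
    hits = {}
    for i, x in enumerate(data):
        hits.setdefault(best_unit(weights, x), []).append(i)
    chosen = []
    for u, idxs in hits.items():
        if len(idxs) < min_hits:
            continue  # treat rarely winning units as uninformative
        chosen.append(min(idxs, key=lambda i: sum(
            (a - b) ** 2 for a, b in zip(data[i], weights[u]))))
    return sorted(chosen)
```

The number of selected samples is bounded by the number of units, so the map size directly controls how much of the original data survives, which is one way to read the abstract's first finding about choosing a sufficient map size.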

Removing non-informative features weakening of class separability (클래스 구분력이 없는 특징 소거법)

  • Lee, Jae-Seong;Kim, Dae-Won
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2007.11a / pp.59-62 / 2007
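  • This paper proposes a method that removes features with no class separability from imbalanced and under-sampled bio data, so that subsequent methodologies such as FLDA can be applied. The proposed algorithm avoids the problem of existing methodologies that determine the shape of a class from its mean and variance, and it overcomes the problem that features selected with an emphasis on class separability tend to be highly correlated with one another. As a result, the feature set selected by the algorithm consists of features that have low mutual correlation and high class separability.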


A Differentially Private K-Means Clustering using Quadtree and Uniform Sampling (쿼드트리와 균등 샘플링를 이용한 효과적 차분 프라이버시 K-평균 클러스터링 알고리즘)

  • Hong, Daeyoung;Goo, Hanjun;Shim, Kyuseok
    • Proceedings of the Korea Contents Association Conference / 2018.05a / pp.25-26 / 2018
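  • Methods for protecting privacy when releasing data have recently been studied. Among them, differential privacy is an anonymization technique that has been proven safe even against attacks such as the minimality attack. In this paper, we improve the performance of existing differentially private K-means clustering algorithms and verify the improvement through experiments on real-world data.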
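
For context, a baseline differentially private K-means update can be sketched as a Lloyd step with Laplace noise added to each cluster's count and coordinate sums. This is a generic illustration, not the quadtree and uniform sampling algorithm of the paper. It assumes every coordinate is scaled to [0, 1], neighboring datasets differ by adding or removing one point, and the per-iteration budget `epsilon` is split equally between counts and sums. All names are hypothetical.

```python
import random


def laplace(scale, rng):
    """Laplace(0, scale) noise as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def dp_kmeans_step(points, centers, epsilon, rng):
    """One noisy Lloyd update; assumes all coordinates lie in [0, 1]."""
    dim = len(centers[0])
    counts = [0.0] * len(centers)
    sums = [[0.0] * dim for _ in centers]
    for p in points:
        # Assign each point to its nearest current center.
        j = min(range(len(centers)),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
        counts[j] += 1
        for d in range(dim):
            sums[j][d] += p[d]
    new_centers = []
    for j in range(len(centers)):
        # Counts have L1 sensitivity 1 and sums have L1 sensitivity dim;
        # each query gets half of the per-iteration budget.
        noisy_count = counts[j] + laplace(2.0 / epsilon, rng)
        noisy_sum = [s + laplace(2.0 * dim / epsilon, rng) for s in sums[j]]
        if noisy_count < 1.0:
            new_centers.append(list(centers[j]))  # too few points: keep old
        else:
            new_centers.append(
                [min(1.0, max(0.0, s / noisy_count)) for s in noisy_sum])
    return new_centers
```

Running several such steps consumes the sum of the per-step budgets, so the number of iterations has to be fixed in advance when a total privacy budget is given.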


Recognition of damage pattern and evolution in CFRP cable with a novel bonding anchorage by acoustic emission

  • Wu, Jingyu;Lan, Chengming;Xian, Guijun;Li, Hui
    • Smart Structures and Systems / v.21 no.4 / pp.421-433 / 2018
  • Carbon fiber reinforced polymer (CFRP) cable has good mechanical properties and corrosion resistance. However, anchoring CFRP cable is a major challenge because of the anisotropy of the CFRP material. In this article, a highly efficient bonding anchorage with a novel configuration is developed for CFRP cables. The acoustic emission (AE) technique is employed to evaluate the performance of the anchorage in the fatigue test and the post-fatigue ultimate bearing capacity test. The obtained AE signals are analyzed using a combination of unsupervised K-means clustering and supervised K-nearest neighbor (K-NN) classification to quantify the performance of the anchorage and the damage evolution. An AE feature vector for clustering analysis (including both frequency and energy characteristics of the AE signal) is proposed, and under-sampling approaches are employed to reduce the influence of the imbalanced class distribution in the AE dataset and so improve clustering quality. The results indicate that four classes exist in the AE dataset, corresponding to shear deformation of the potting compound, matrix cracking, fiber-matrix debonding, and fiber fracture in the CFRP bars. The AE intensity released by deformation of the potting compound is very slight during the whole loading process, and no obvious premature damage caused by the anchorage effect is observed in the CFRP bars at relatively low stress levels, indicating that the anchorage configuration in this study is reliable.
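
The combination of under-sampling and K-NN classification mentioned in the abstract can be illustrated with a small sketch. It assumes plain random under-sampling to the size of the smallest class and a majority-vote K-NN with squared Euclidean distance. The abstract does not specify the authors' under-sampling scheme or feature vector, so the function names and the toy inputs used with them are hypothetical.

```python
import random
from collections import Counter, defaultdict


def undersample(samples, labels, rng):
    """Randomly reduce every class to the size of the smallest class."""
    groups = defaultdict(list)
    for x, y in zip(samples, labels):
        groups[y].append(x)
    n_min = min(len(g) for g in groups.values())
    out_x, out_y = [], []
    for y, g in groups.items():
        for x in rng.sample(g, n_min):  # sample without replacement
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training samples nearest to query."""
    nearest = sorted(
        zip(train_x, train_y),
        key=lambda xy: sum((a - b) ** 2 for a, b in zip(xy[0], query)))[:k]
    return Counter(y for _, y in nearest).most_common(1)[0][0]
```

Balancing the classes before clustering or classification keeps a dominant signal type from swamping rare ones, at the cost of discarding some majority-class data.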

Public Perception of the Concentration of Cardiac and Cerebrovascular Surgery to Metropolitan Hospitals

  • Lee, Young-Hoon;Lee, Kun Sei;Jeong, Hyo Seon;Ahn, Hye Mi;Oh, Gyung-Jae
    • Journal of Chest Surgery / v.49 no.sup1 / pp.44-52 / 2016
  • Background: This study investigates the perception of the general public regarding the concentration of cardiac and cerebrovascular surgeries in metropolitan hospitals, and the perceived public need for government policies to resolve this issue. Methods: A total of 800 participants were recruited for our telephone interview survey. Quota sampling by geographic region was performed, adjusting for age and sex. Sampling used random digit dialing; we called the randomly generated telephone numbers and made three attempts for non-responders before moving on to a different telephone number. Results: Our sample population was 818 participants, 401 men (49.0%) and 417 women (51.0%). Our data showed that 85.5% of participants thought that cardiac surgery and neurosurgery patients are concentrated in large hospitals in Seoul. The principal reason regional patients want to receive surgery at major hospitals in Seoul was the poor medical standards associated with regional hospitals (87.7%). We found that a vast majority of participants (97.5%) felt that government policies are needed to even out the clustering of cardiac surgery and neurosurgery patients, and that this clustering may be alleviated if policies are adopted that specifically enhance the quality and capacity of regional hospitals to carry out surgeries (98.3%). Conclusion: Government policy making must reflect public desiderata, and we suggest that these public health needs may be partially resolved through government-designated cardiac and neurosurgery specialist hospitals in regional areas.