• Title/Summary/Keyword: Statistics Attribute

Search Result 95, Processing Time 0.024 seconds

Counterfactual image generation by disentangling data attributes with deep generative models

  • Jieon Lim;Weonyoung Joo
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.6
    • /
    • pp.589-603
    • /
    • 2023
  • Deep generative models target to infer the underlying true data distribution, and it leads to a huge success in generating fake-but-realistic data. Regarding such a perspective, the data attributes can be a crucial factor in the data generation process since non-existent counterfactual samples can be generated by altering certain factors. For example, we can generate new portrait images by flipping the gender attribute or altering the hair color attributes. This paper proposes counterfactual disentangled variational autoencoder generative adversarial networks (CDVAE-GAN), specialized for data attribute level counterfactual data generation. The structure of the proposed CDVAE-GAN consists of variational autoencoders and generative adversarial networks. Specifically, we adopt a Gaussian variational autoencoder to extract low-dimensional disentangled data features and auxiliary Bernoulli latent variables to model the data attributes separately. Also, we utilize a generative adversarial network to generate data with high fidelity. By enjoying the benefits of the variational autoencoder with the additional Bernoulli latent variables and the generative adversarial network, the proposed CDVAE-GAN can control the data attributes, and it enables producing counterfactual data. Our experimental result on the CelebA dataset qualitatively shows that the generated samples from CDVAE-GAN are realistic. Also, the quantitative results support that the proposed model can produce data that can deceive other machine learning classifiers with the altered data attributes.

Factors Affecting Smartphone Purchase Intention of Consumers in Nepal

  • RAI, Bharat
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.2
    • /
    • pp.465-473
    • /
    • 2021
  • The main aim of this research paper is to identify the factors that influence smartphone purchase intention in the Nepalese market. The study identifies how the brand personality, attribute factors, and the price factor influence the purchase intention of a smartphone. The paper puts the emphasis on how the consumer preference functions in the selection of the smartphone and which factor plays the more significant role in smartphone purchase intention. This research paper has used primary data and a 7-point Likert scale survey questionnaire. The primary data has been collected through a structured survey questionnaire by using convenient sampling technique from 294 smartphone users in the Kathmandu Valley. Descriptive statistics, Correlation Analysis and Structural Equation Modeling (SEM) have been carried out to analyze the primary data using the SPSS AMOS 24. Brand personality, attribute factor, and product price were taken as independent variables to identify the impact on purchase intention. The result of the regression path analysis showed that brand personality has no significant effect on purchase intention in the purchasing of smartphone. It is also found that the product attributes and product price have a significant influence on consumer purchase intention of a smartphone in Nepal.

Hierarchical grouping recommendation system based on the attributes of contents: a case study of 'The Movie Dataset' (콘텐츠 속성에 따른 계층적 그룹화 추천시스템: 'The Movie Dataset' 분석사례연구)

  • Kim, Yoon Kyoung;Yeo, In-Kwon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.6
    • /
    • pp.833-842
    • /
    • 2020
  • Global platforms such as Netflix, Amazon, and YouTube have developed a precise recommendation system based on various information from large set of customers and many of the items recommended here are leading to actual purchases. In this paper, a cluster analysis was conducted according to the attribute of the content, expecting that there would be a difference in user preferences according to the attribute of the recommended content. Gower distance was used for use regardless of the type of variables. In this paper, using the data of movie rating site 'The Movie Dataset', the users were grouped hierarchically and recommended movies based on genre, director and actor variables. To evaluate the recommended systems proposed, user group was divided into train set and test set to examine the precision. The results showed that proposed algorithms have far higher precision than UBCF.

AGGREGATION OPERATORS OF CUBIC PICTURE FUZZY QUANTITIES AND THEIR APPLICATION IN DECISION SUPPORT SYSTEMS

  • Ashraf, Shahzaib;Abdullah, Saleem;Mahmood, Tahir
    • Korean Journal of Mathematics
    • /
    • v.28 no.2
    • /
    • pp.343-359
    • /
    • 2020
  • The paper aim is to resolve the issue of ranking to the fuzzy numbers in decision analysis, artificial intelligence and optimization. In the literature lot of ideologies have been established for ranking to the fuzzy numbers, that ideologies have some restrictions and limitations. In this paper, we proposed a method based on cubic picture fuzzy information's, for ranking to defeat the existing restrictions. Further introduced some cubic picture fuzzy algebraic and cubic picture fuzzy algebraic* aggregated operators for aggregated the information. Finally, a multi-attribute decision making problem is assumed as a practical application to establish the appropriateness and suitability of the proposed ranking approach.

A multiplicative unrelated quantitative randomized response model (승법 무관양적속성 확률화응답모형)

  • Lee, Gi-Sung
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.5
    • /
    • pp.897-906
    • /
    • 2016
  • We augment an unrelated quantitative attribute to Bar-Lev et al.'s model (2004) which is composed of sensitive quantitative variable and scrambled one to present a multiplicative unrelated quantitative randomized response model(MUQ RRM). We also establish theoretical grounds to estimate the sensitive quantitative attribute according to circumstances irrespective of known or unknown unrelated quantitative attribute. Finally, we explore the relationship among the suggested model, Eichhorn-Hayre model, Bar-Lev et al.'s model and Gjestvang-Singh's model, and compare the efficiency of our model with Bar-Lev et al.'s model.

Geovisualization of Migration Statistics Using Flow Mapping Based on Web GIS (Web GIS 기반 유선도 작성을 통한 인구이동통계의 지리적 시각화)

  • Kim, Kam-Young;Lee, Sang-Il
    • Journal of the Korean Geographical Society
    • /
    • v.47 no.2
    • /
    • pp.268-281
    • /
    • 2012
  • In spite of the usefulness of migration statistics in spatially understanding social processes and identifying social effects of spatial processes, services and analyses of the statistics have been restricted due to the complexity of their data structure. In addition, flow mapping functionality which is a useful method to explore and visualize the migration statistics has yet to be fully represented in modern GIS applications. Given this, the purpose of this research is to demonstrate the possibility of flow mapping and the exploratory spatial analysis of the migration statistics in a Web GIS environment. For this, the characteristics of the statistics were examined from database, GIS, and cartographic perspectives. Then, O-D structure of the migration statistics was converted to spatial data appropriate to f low mapping based on the characteristics. The interface of Web GIS is specialized the migration statistics and provides exploratory visualization by allowing dynamic interactions such as spatial focusing and attribute filtering.

  • PDF

Cutpoint Selection via Penalization in Credit Scoring (신용평점화에서 벌점화를 이용한 절단값 선택)

  • Jin, Seul-Ki;Kim, Kwang-Rae;Park, Chang-Yi
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.2
    • /
    • pp.261-267
    • /
    • 2012
  • In constructing a credit scorecard, each characteristic variable is divided into a few attributes; subsequently, weights are assigned to those attributes in a process called coarse classification. While partitioning a characteristic variable into attributes, one should determine appropriate cutpoints for the partition. In this paper, we propose a cutpoint selection method via penalization. In addition, we compare the performances of the proposed method with classification spline machine (Koo et al., 2009) on both simulated and real credit data.

A Conditional Unrelated Question Model with Quantitative Attribute

  • Lee, Gi Sung;Hong, Ki Hak
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.753-765
    • /
    • 2001
  • We suggest a quantitative conditional unrelated question model that can be used in obtaining more sensitive information. For whom say "yes" about the less 7han sensitive question .B we ask only about the more sensitive variable X. We extend our model to two sample case when there is no information about the true mean of the unrelated variable Y. Finally we compare the efficiency of our model with that of Greenberg et al.′s.

  • PDF

New Attributes and Variables Control Charts under Repetitive Sampling

  • Aslam, Muhammad;Azam, Muhammad;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • v.13 no.1
    • /
    • pp.101-106
    • /
    • 2014
  • New control charts under repetitive sampling are proposed, which can be used for variables and attributes quality characteristics. The proposed control charts have inner and outer control limits so that repetitive sampling may be needed if the plotted statistic falls between the two limits. Particularly, the new np and variable X-bar control charts under repetitive sampling are considered in detail. The in-control and out-of-control average run lengths are analyzed according to various process shifts. The performance of the proposed control charts is compared with the existing np and the X-bar control charts in terms of the average run lengths.

Cluster Analysis with Balancing Weight on Mixed-type Data

  • Chae, Seong-San;Kim, Jong-Min;Yang, Wan-Youn
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.3
    • /
    • pp.719-732
    • /
    • 2006
  • A set of clustering algorithms with proper weight on the formulation of distance which extend to mixed numeric and multiple binary values is presented. A simple matching and Jaccard coefficients are used to measure similarity between objects for multiple binary attributes. Similarities are converted to dissimilarities between i th and j th objects. The performance of clustering algorithms with balancing weight on different similarity measures is demonstrated. Our experiments show that clustering algorithms with application of proper weight give competitive recovery level when a set of data with mixed numeric and multiple binary attributes is clustered.