• Title/Summary/Keyword: Data Inference

Search Result 1,310, Processing Time 0.03 seconds

Causal inference from nonrandomized data: key concepts and recent trends (비실험 자료로부터의 인과 추론: 핵심 개념과 최근 동향)

  • Choi, Young-Geun;Yu, Donghyeon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.2
    • /
    • pp.173-185
    • /
    • 2019
  • Causal questions are prevalent in scientific research, for example, how effective a treatment was for preventing an infectious disease, how much a policy increased utility, or which advertisement would give the highest click rate for a given customer. Causal inference theory in statistics interprets those questions as inferring the effect of a given intervention (treatment or policy) in the data generating process. Causal inference has been used in medicine, public health, and economics; in addition, it has received recent attention as a tool for data-driven decision making processes. Many recent datasets are observational, rather than experimental, which makes the causal inference theory more complex. This review introduces key concepts and recent trends of statistical causal inference in observational studies. We first introduce the Neyman-Rubin's potential outcome framework to formularize from causal questions to average treatment effects as well as discuss popular methods to estimate treatment effects such as propensity score approaches and regression approaches. For recent trends, we briefly discuss (1) conditional (heterogeneous) treatment effects and machine learning-based approaches, (2) curse of dimensionality on the estimation of treatment effect and its remedies, and (3) Pearl's structural causal model to deal with more complex causal relationships and its connection to the Neyman-Rubin's potential outcome model.

Spatial Information Based Simulator for User Experience's Optimization

  • Bang, Green;Ko, Ilju
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.3
    • /
    • pp.97-104
    • /
    • 2016
  • In this paper, we propose spatial information based simulator for user experience optimization and minimize real space complexity. We focus on developing simulator how to design virtual space model and to implement virtual character using real space data. Especially, we use expanded events-driven inference model for SVM based on machine learning. Our simulator is capable of feature selection by k-fold cross validation method for optimization of data learning. This strategy efficiently throughput of executing inference of user behavior feature by virtual space model. Thus, we aim to develop the user experience optimization system for people to facilitate mapping as the first step toward to daily life data inference. Methodologically, we focus on user behavior and space modeling for implement virtual space.

A study on the fault and diagnosis system for diesel engine using neural network and knowledge based fuzzy inference (뉴럴 네트웍과 지식 기반 퍼지 추론을 이용한 디젤기관 고장진단 시스템에 관한 연구)

  • 천행춘;김영일;김경엽;안순영;오현경;유영호
    • Proceedings of the Korean Society of Marine Engineers Conference
    • /
    • 2002.05a
    • /
    • pp.233-238
    • /
    • 2002
  • This paper propose the construction of fault diagnosis engine for diesel generator engine and rule inference method to induce rule for fuzzy inference from the monitored data of diesel engine. The proposed fault diagnosis system is constructed the Malfunction Diagnosis Engine(MDE) and Hierarchy of Malfunction Hypotheses(HME), It is Proposed the rule reduction method of knowledge base for concerning data among the various analog data.

  • PDF

Secure Healthcare Management: Protecting Sensitive Information from Unauthorized Users

  • Ko, Hye-Kyeong
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.1
    • /
    • pp.82-89
    • /
    • 2021
  • Recently, applications are increasing the importance of security for published documents. This paper deals with data-publishing where the publishers must state sensitive information that they need to protect. If a document containing such sensitive information is accidentally posted, users can use common-sense reasoning to infer unauthorized information. In recent studied of peer-to-peer databases, studies on the security of data of various unique groups are conducted. In this paper, we propose a security framework that fundamentally blocks user inference about sensitive information that may be leaked by XML constraints and prevents sensitive information from leaking from general user. The proposed framework protects sensitive information disclosed through encryption technology. Moreover, the proposed framework is query view security without any three types of XML constraints. As a result of the experiment, the proposed framework has mathematically proved a way to prevent leakage of user information through data inference more than the existing method.

A Probabilistic Tensor Factorization approach for Missing Data Inference in Mobile Crowd-Sensing

  • Akter, Shathee;Yoon, Seokhoon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.3
    • /
    • pp.63-72
    • /
    • 2021
  • Mobile crowd-sensing (MCS) is a promising sensing paradigm that leverages mobile users with smart devices to perform large-scale sensing tasks in order to provide services to specific applications in various domains. However, MCS sensing tasks may not always be successfully completed or timely completed for various reasons, such as accidentally leaving the tasks incomplete by the users, asynchronous transmission, or connection errors. This results in missing sensing data at specific locations and times, which can degrade the performance of the applications and lead to serious casualties. Therefore, in this paper, we propose a missing data inference approach, called missing data approximation with probabilistic tensor factorization (MDI-PTF), to approximate the missing values as closely as possible to the actual values while taking asynchronous data transmission time and different sensing locations of the mobile users into account. The proposed method first normalizes the data to limit the range of the possible values. Next, a probabilistic model of tensor factorization is formulated, and finally, the data are approximated using the gradient descent method. The performance of the proposed algorithm is verified by conducting simulations under various situations using different datasets.

Statistical Inference Concerning Local Dependence between Two Multinomial Populations

  • Oh, Myong-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.413-428
    • /
    • 2003
  • If a restriction is imposed only to a (proper) subset of parameters of interest, we call it a local restriction. Statistical inference under a local restriction in multinomial setting is studied. The maximum likelihood estimation under a local restriction and likelihood ratio tests for and against a local restriction are discussed. A real data is analyzed for illustrative purpose.

  • PDF

Recent advances in Bayesian inference of isolation-with-migration models

  • Chung, Yujin
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.37.1-37.8
    • /
    • 2019
  • Isolation-with-migration (IM) models have become popular for explaining population divergence in the presence of migrations. Bayesian methods are commonly used to estimate IM models, but they are limited to small data analysis or simple model inference. Recently three methods, IMa3, MIST, and AIM, resolved these limitations. Here, we describe the major problems addressed by these three software and compare differences among their inference methods, despite their use of the same standard likelihood function.

Suggestions for the Study of Acupoint Indications in the Era of Artificial Intelligence (인공지능시대의 경혈 주치 연구를 위한 제언)

  • Chae, Youn Byoung
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.35 no.5
    • /
    • pp.132-138
    • /
    • 2021
  • Artificial intelligence technology sheds light on new ways of innovating acupuncture research. As acupoint selection is specific to target diseases, each acupoint is generally believed to have a specific indication. However, the specificity of acupoint selection may be not always same with the specificity of acupoint indication. In this review, we propose that the specificity of acupoint indication can be inferred from clinical data using reverse inference. Using forward inference, the prescribed acupoints for each disease can be quantified for the specificity of acupoint selection. Using reverse inference, targeted diseases for each acupoint can be quantified for the specificity of acupoint indication. It is noteworthy that the selection of an acupoint for a particular disease does not imply the acupoint has specific indications for that disease. Electronic medical record includes various symptoms and chosen acupoint combinations. Data mining approach can be useful to reveal the complex relationships between diseases and acupoints from clinical data. Combining the clinical information and the bodily sensation map, the spatial patterns of acupoint indication can be further estimated. Interoperable medical data should be collected for medical knowledge discovery and clinical decision support system. In the era of artificial intelligence, machine learning can reveal the associations between diseases and prescribed acupoints from large scale clinical data warehouse.

Bayesian Method for Modeling Male Breast Cancer Survival Data

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Rana, Sagar;Ahmed, Nasar Uddin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.2
    • /
    • pp.663-669
    • /
    • 2014
  • Background: With recent progress in health science administration, a huge amount of data has been collected from thousands of subjects. Statistical and computational techniques are very necessary to understand such data and to make valid scientific conclusions. The purpose of this paper was to develop a statistical probability model and to predict future survival times for male breast cancer patients who were diagnosed in the USA during 1973-2009. Materials and Methods: A random sample of 500 male patients was selected from the Surveillance Epidemiology and End Results (SEER) database. The survival times for the male patients were used to derive the statistical probability model. To measure the goodness of fit tests, the model building criterions: Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) were employed. A novel Bayesian method was used to derive the posterior density function for the parameters and the predictive inference for future survival times from the exponentiated Weibull model, assuming that the observed breast cancer survival data follow such type of model. The Markov chain Monte Carlo method was used to determine the inference for the parameters. Results: The summary results of certain demographic and socio-economic variables are reported. It was found that the exponentiated Weibull model fits the male survival data. Statistical inferences of the posterior parameters are presented. Mean predictive survival times, 95% predictive intervals, predictive skewness and kurtosis were obtained. Conclusions: The findings will hopefully be useful in treatment planning, healthcare resource allocation, and may motivate future research on breast cancer related survival issues.

A Design of Effective Inference Methods and Their Application Guidelines for Supporting Various Medical Analytics Schemes (다양한 의료 분석 방식을 지원하는 효과적 추론 기법 설계 및 적용 지침)

  • Kim, Moon Kwon;La, Hyun Jung;Kim, Soo Dong
    • Journal of KIISE
    • /
    • v.42 no.12
    • /
    • pp.1590-1599
    • /
    • 2015
  • As a variety of personal medical devices appear, it is possible to acquire a large number of diverse medical contexts from the devices. There have been efforts to analyze the medical contexts via software applications. In this paper, we propose a generic model of medical analytics schemes that are used by medical experts, identify inference methods for realizing each medical analytics scheme, and present guidelines for applying the inference methods to the medical analytics schemes. Additionally, we develop a PoC inference system and analyze real medical contexts to diagnose relevant diseases so that we can validate the feasibility and effectiveness of the proposed medical analytics schemes and guidelines of applying inference methods.