• 제목/요약/키워드: Missing data

Search Result 1,303, Processing Time 0.028 seconds

Breast Cancer and Modifiable Lifestyle Factors in Argentinean Women: Addressing Missing Data in a Case-Control Study

  • Coquet, Julia Becaria;Tumas, Natalia;Osella, Alberto Ruben;Tanzi, Matteo;Franco, Isabella;Diaz, Maria Del Pilar
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.10
    • /
    • pp.4567-4575
    • /
    • 2016
  • A number of studies have evidenced the effect of modifiable lifestyle factors such as diet, breastfeeding and nutritional status on breast cancer risk. However, none have addressed the missing data problem in nutritional epidemiologic research in South America. Missing data is a frequent problem in breast cancer studies and epidemiological settings in general. Estimates of effect obtained from these studies may be biased, if no appropriate method for handling missing data is applied. We performed Multiple Imputation for missing values on covariates in a breast cancer case-control study of $C{\acute{o}}rdoba$ (Argentina) to optimize risk estimates. Data was obtained from a breast cancer case control study from 2008 to 2015 (318 cases, 526 controls). Complete case analysis and multiple imputation using chained equations were the methods applied to estimate the effects of a Traditional dietary pattern and other recognized factors associated with breast cancer. Physical activity and socioeconomic status were imputed. Logistic regression models were performed. When complete case analysis was performed only 31% of women were considered. Although a positive association of Traditional dietary pattern and breast cancer was observed from both approaches (complete case analysis OR=1.3, 95%CI=1.0-1.7; multiple imputation OR=1.4, 95%CI=1.2-1.7), effects of other covariates, like BMI and breastfeeding, were only identified when multiple imputation was considered. A Traditional dietary pattern, BMI and breastfeeding are associated with the occurrence of breast cancer in this Argentinean population when multiple imputation is appropriately performed. Multiple Imputation is suggested in Latin America's epidemiologic studies to optimize effect estimates in the future.

Association between oral health status and oral health impact profile(OHIP-14) among the community elderlies (노인의 객관적 구강건강상태와 주관적 구강건강수준간의 관련성)

  • Ahn, Kwon-Suk;Shin, Mi-A
    • Journal of Korean society of Dental Hygiene
    • /
    • v.11 no.6
    • /
    • pp.923-938
    • /
    • 2011
  • Objectives : This study was attempted in order to grasp oral health level according to socio-demographic characteristics in elders in some communities, and to evaluate oral health status and its association. Methods : The subjects in this study were performed with 235 people, who were over 65 years and resided in Daejeon Province, from June 20 to July 10, 2011. An individual interview was held, and they got a dental checkup. As for data analysis, chi-square test, t-test, one-way ANOVA, pearson correlation were utilized. Methods : The subjects in this study were performed with 235 people, who were over 65 years and resided in Daejeon Province, from June 20 to July 10, 2011. An individual interview was held, and they got a dental checkup. As for data analysis, chi-square test, t-test, one-way ANOVA, pearson correlation were utilized. Results : The older age in the whole research subjects and the lower educational level led to the less remaining teeth and the larger missing teeth index. The decayed missing filled teeth index and the decayed missing filled teeth rate were higher in more women and older age and in the lower educational level. Tooth mortality rate was higher in the older age, the lower educational level, and the group of living together with spouse. The maxillary-mandibular fixed-bridge status in the mouth was indicated to be the highest in the full-denture mounting ratio as for elders in over 80 years old. Oral Health Impact Profile(OHIP-14) average score was $56.05{\pm}11.64$ in the whole research subjects The decayed missing filled teeth index and the decayed missing filled teeth rate showed significantly positive correlation with the decayed missing filled teeth rate, tooth mortality rate and showed significantly negative correlation with OHIP-14. Tooth mortality rate showed significantly negative correlation with OHIP-14 Oral Health Impact Profile(OHIP-14) showed significantly positive correlation with its factors. Conclusions : Accordingly, the policy effort is considered to be necessary that implements in elders in order to spend active senescence, and that elders' health and oral-health behavior can be implemented continuously and preventively through classification according to elders' physical function.

The effect of oral health behavior of the visually impaired on DMFT index (시각장애인의 구강보건행태가 DMFT지수에 미치는 영향)

  • Lee, Jong-Hwa;Lee, Seung-Hee;Yun, Hyun-Kyung
    • Journal of Korean society of Dental Hygiene
    • /
    • v.17 no.3
    • /
    • pp.331-342
    • /
    • 2017
  • Objectives: This study aimed at helping oral health prevention of the blind and related management plan, which is defined as the influence factors between missing and filled permanent teeth index and general feature and oral health behavior of the blind in Korea (estimates 229,678 persons) using data of the 6th Korea National Health and Nutrition Examination Survey from 2014 Korea Centers For Disease Control and Prevention. Methods: The blind over the age of 30 were selected as study subjects who have conducted health survey and dental inspections in KNHANES VI-2. Estimates of the subjects were 229,67 persons. For analyzing data, general linear models: GLM and covariance analysis were conducted to identify the relation between general feature and oral health behavior and missing and filled permanent teeth index. SPSS 21 statistical program was used, which is possible to conduct complex sampling design, and the significance level was 0.05. Results: The missing and filled permanent teeth index was 8.58 points. Regarding the results of the analysis, R-squared of the missing and filled permanent teeth index depending on general features of the blind was 0.839 points, which shows gender, age, residence, education level, individual income, disability rating, kinds of health insurance, marital status and recipient of basic living had an effect on the missing and filled permanent teeth index. R2 of the missing and filled permanent teeth index depending on oral health form of the blind was 0.728 points, which shows oral examination, dental treatment, smoking and toothbrushing after lunch had an effect on the missing and filled permanent teeth index. Conclusions: With the result of this study, we found the oral health actual condition of the blind in Korea. Therefore, it is considered that the government needs to introduce the personalized oral health education program to maintain oral health of the blind and to develop a program that uses braille and voice device which enables to access and utilize to improve oral health behavior that the government could use it as a reference to establish the policy plan.

Spatial-Temporal Modelling of Road Traffic Data in Seoul City

  • Lee, Sang-Yeol;Ahn, Soo-Han;Park, Chang-Yi;Jeon, Jong-Woo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.13 no.2
    • /
    • pp.261-270
    • /
    • 2002
  • Recently, the demand of the Intelligent Transportation System(ITS) has been increased to a large extent, and a real-time traffic information service based on the internet system became very important. When ITS companies carry out real-time traffic services, they find some traffic data missing, and use the conventional method of reconstructing missing values by calculating average time trend. However, the method is found unsatisfactory, so that we develop a new method based the spatial and spatial-temporal models. A cross-validation technique shows that the spatial-temporal model outperforms the others.

  • PDF

Reverse Engineering of Compound Surfaces on the Machine Tool using a Vision Probe (비전 프로브를 이용한 기상에서의 복합곡면의 역공학)

  • 김경진;윤길상;초명우;권혁동;서태일
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2002.05a
    • /
    • pp.287-292
    • /
    • 2002
  • This paper presents a reverse engineering method for compound surfaces using vision system. A CNC machining center is used as a measuring station, which is equipped with slit beam generator and vision probe. Since obtained data using slit beam or laser scanner may have much data loss along the edge of compound surfaces, an algorithm is presented in this study to recover missing geometric data at such region. First, b-spline interpolation is applied to extract edge information of the surface, and as a next step, b-spline approximation is applied to recover the missing geometric data. Finally, b-spline skinning method is applied to regenerate the surface information. Appropriate simulation and experimental works are preformed to very the effectiveness of the proposed methods.

  • PDF

Study on promoting the educational role of security sector to prevent child missing (아동실종 예방을 위한 시큐리티 분야의 교육적 역할증진)

  • Park, SangKyun;Kim, JinHwan
    • Convergence Security Journal
    • /
    • v.13 no.5
    • /
    • pp.215-222
    • /
    • 2013
  • This study aims to provide future direction recognizing the educational importance and to present the way ahead that practices correctly for improving an educational role on security sector to prevent child missing. Therefore, it was conducted with questionnaire that is "Research on participation grade of education to prevent child missing and the actual condition" made by researcher of this study which is on 363of parents of pre-chirdren of 6,7 and 8years old in a kindergarten and an elementary school where is located in metropolitan area. It operated to take processing enterprise statistics using SPSS/WIN 12.0 for getting data, and analyzed frequency and t-verification.It investigated correct selection of an education specialist on preventive education and participation grade of education to prevent missing on home, how often, how it is conducted, then what requirement for educationto prevent child missing and participation grade is, whether difference is in accordance with gender of parentsand gender of child.

A Implementation of Acer Pictum Sap Integrated Management System based on Energy Harvesting and Monitoring System (에너지 하베스팅 및 모니터링 기반의 고로쇠 수액 통합 관리 시스템 구현)

  • Jung, SeHoon;Jo, KyeongHo;Kim, JunYeoung;Park, Jun;Kim, JongChan;Choi, SooIm;Sim, ChunBo
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.11
    • /
    • pp.1324-1337
    • /
    • 2019
  • This study set out to investigate an energy harvesting device to ensure stable energy supply to batteries and data collection devices and a monitoring system for acer pictum sap to check collected data. Acer pictum sap farmers have written down weather information and yield of acer pictum sap manually for data storage. Since the job is done manually, there are many missing values in their data. In addition, it is not easy to manage batteries due to the characteristics of the areas where acer pictum sap is collected. The present study thus decided to build an energy harvesting device based on new renewable energy to ensure stable energy supply by taking into consideration power load, daily power consumption, and number of days with no sunshine for various devices. For a monitoring system, the investigator proposed a JSP-based web page to monitor temperature, humidity, volume of collected water, and battery state in real time. The proposed energy harvesting device was applied to reduce missing values in data. It promoted stable energy supply to the batteries and data collection devices, reducing the percentage of missing values in data from 30.55% to 0%.

Survival Analysis of Gastric Cancer Patients with Incomplete Data

  • Moghimbeigi, Abbas;Tapak, Lily;Roshanaei, Ghodaratolla;Mahjub, Hossein
    • Journal of Gastric Cancer
    • /
    • v.14 no.4
    • /
    • pp.259-265
    • /
    • 2014
  • Purpose: Survival analysis of gastric cancer patients requires knowledge about factors that affect survival time. This paper attempted to analyze the survival of patients with incomplete registered data by using imputation methods. Materials and Methods: Three missing data imputation methods, including regression, expectation maximization algorithm, and multiple imputation (MI) using Monte Carlo Markov Chain methods, were applied to the data of cancer patients referred to the cancer institute at Imam Khomeini Hospital in Tehran in 2003 to 2008. The data included demographic variables, survival times, and censored variable of 471 patients with gastric cancer. After using imputation methods to account for missing covariate data, the data were analyzed using a Cox regression model and the results were compared. Results: The mean patient survival time after diagnosis was $49.1{\pm}4.4$ months. In the complete case analysis, which used information from 100 of the 471 patients, very wide and uninformative confidence intervals were obtained for the chemotherapy and surgery hazard ratios (HRs). However, after imputation, the maximum confidence interval widths for the chemotherapy and surgery HRs were 8.470 and 0.806, respectively. The minimum width corresponded with MI. Furthermore, the minimum Bayesian and Akaike information criteria values correlated with MI (-821.236 and -827.866, respectively). Conclusions: Missing value imputation increased the estimate precision and accuracy. In addition, MI yielded better results when compared with the expectation maximization algorithm and regression simple imputation methods.

An Estimation of Link Travel Time by Using BMS Data (BMS 데이터를 활용한 링크단위 여행시간 산출방안에 관한 연구)

  • Jeon, Ok-Hee;Ahn, Gye-Hyeong;Hyun, Cheol-Seung;Hong, Kyung-Sik;Kim, Hyun-Ju;Lee, Choul-Ki
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.3
    • /
    • pp.78-88
    • /
    • 2014
  • Now, UTIS collects and provides traffic information by building RSE 1,150(unit) and OBE about 51,000(vehicle). it's inevitable to enlarge traffic information sources which use to improve quality of UTIS traffic information for Stabilizing UTIS's service. but there are missing data sections. And, In this study as a way to overcome these problems, based on BIS(Bus information system) installed and operating in the capital area to develop normal vehicle's link transit time estimation model which is used realtime collecting BMS data, we'll utilize the model to provide missing data section's information. For these problem, we selected partial section of suwon-city, anyang-city followed by drive only way or not and conducted model estimating and verification each of BMS data and UTIS traffic information. Consequently, Case2,4,6,8 presented highly credibility between UTIS communication data and estimated value but In the Case 3,5 we determined to replace communication data of UTIS' missing data section too hard for large error. So we need to apply high credibility model formula adjusting road managing condition and the situation of object section.

Development of Machine Learning Based Precipitation Imputation Method (머신러닝 기반의 강우추정 방법 개발)

  • Heechan Han;Changju Kim;Donghyun Kim
    • Journal of Wetlands Research
    • /
    • v.25 no.3
    • /
    • pp.167-175
    • /
    • 2023
  • Precipitation data is one of the essential input datasets used in various fields such as wetland management, hydrological simulation, and water resource management. In order to efficiently manage water resources using precipitation data, it is essential to secure as much data as possible by minimizing the missing rate of data. In addition, more efficient hydrological simulation is possible if precipitation data for ungauged areas are secured. However, missing precipitation data have been estimated mainly by statistical equations. The purpose of this study is to propose a new method to restore missing precipitation data using machine learning algorithms that can predict new data based on correlations between data. Moreover, compared to existing statistical methods, the applicability of machine learning techniques for restoring missing precipitation data is evaluated. Representative machine learning algorithms, Artificial Neural Network (ANN) and Random Forest (RF), were applied. For the performance of classifying the occurrence of precipitation, the RF algorithm has higher accuracy in classifying the occurrence of precipitation than the ANN algorithm. The F1-score and Accuracy values, which are evaluation indicators of the classification model, were calculated as 0.80 and 0.77, while the ANN was calculated as 0.76 and 0.71. In addition, the performance of estimating precipitation also showed higher accuracy in RF than in ANN algorithm. The RMSE of the RF and ANN algorithms was 2.8 mm/day and 2.9 mm/day, and the values were calculated as 0.68 and 0.73.