• Title/Summary/Keyword: Root-mean-square-error method

Search Result 435, Processing Time 0.03 seconds

Comparison of total energy intakes estimated by 24-hour diet recall with total energy expenditure measured by the doubly labeled water method in adults

  • Kim, Eun-Kyung;Fenyi, Justice Otoo;Kim, Jae-Hee;Kim, Myung-Hee;Yean, Seo-Eun;Park, Kye-Wol;Oh, Kyungwon;Yoon, Sungha;Ishikawa-Takata, Kazuko;Park, Jonghoon;Kim, Jung-Hyun;Yoon, Jin-Sook
    • Nutrition Research and Practice
    • /
    • v.16 no.5
    • /
    • pp.646-657
    • /
    • 2022
  • BACKGROUND/OBJECTIVES: The doubly labeled water (DLW) method is the gold standard for estimating total energy expenditure (TEE) and is also useful for verifying the validities of dietary evaluation tools. In this study, we compared the accuracy of total energy intakes (TEI) estimated by the 24-h diet recall method with TEE obtained using the doubly labeled water method. SUBJECTS/METHODS: This study involved 71 subjects aged 20-49 yrs. Over a 14-day period, three 24-h diet recalls per subject (2 weekdays and 1 weekend day) were used to estimate energy intakes, while TEE was measured using the DLW method. The paired t-test was used to determine the significance of differences between TEI and TEE results, and the accuracy of the 24-h recall method was determined by accuracy predictions percentage, root mean square error, and bias. RESULTS: Average study subject age was 33.4 ± 8.6 yrs. The association between TEI and TEE was positive and significant (r = 0.463, P < 0.001), and the difference between TEI (2,084.3 ± 684.2 kcal/day) and TEE (2,401.7 ± 480.3 kcal/day) was also significant (P < 0.001). In all study subjects, mean TEI was 12.0% (307.5 ± 629.3 kcal/day) less than mean TEE, and 12.2% (349.4 ± 632.5 kcal/day) less in men and 11.8% (266.7 ± 632.5 kcal/day) less in women. Rates of TEI underprediction for all study subjects, men, and women, were 60.5%, 51.4%, and 66.7%, respectively. CONCLUSIONS: This study shows that 24-h diet recall underreports energy intakes. More research is needed to corroborate our findings and evaluate the accuracy of 24-h recall with respect to additional demographics.

Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks

  • Kim, Joo-Chang;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.4
    • /
    • pp.2060-2077
    • /
    • 2019
  • Recently, mobile healthcare services have attracted significant attention because of the emerging development and supply of diverse wearable devices. Smartwatches and health bands are the most common type of mobile-based wearable devices and their market size is increasing considerably. However, simple value comparisons based on accumulated data have revealed certain problems, such as the standardized nature of health management and the lack of personalized health management service models. The convergence of information technology (IT) and biotechnology (BT) has shifted the medical paradigm from continuous health management and disease prevention to the development of a system that can be used to provide ground-based medical services regardless of the user's location. Moreover, the IT-BT convergence has necessitated the development of lifestyle improvement models and services that utilize big data analysis and machine learning to provide mobile healthcare-based personal health management and disease prevention information. Users' health data, which are specific as they change over time, are collected by different means according to the users' lifestyle and surrounding circumstances. In this paper, we propose a prediction model of user physical activity that uses data characteristics-based long short-term memory (DC-LSTM) recurrent neural networks (RNNs). To provide personalized services, the characteristics and surrounding circumstances of data collectable from mobile host devices were considered in the selection of variables for the model. The data characteristics considered were ease of collection, which represents whether or not variables are collectable, and frequency of occurrence, which represents whether or not changes made to input values constitute significant variables in terms of activity. The variables selected for providing personalized services were activity, weather, temperature, mean daily temperature, humidity, UV, fine dust, asthma and lung disease probability index, skin disease probability index, cadence, travel distance, mean heart rate, and sleep hours. The selected variables were classified according to the data characteristics. To predict activity, an LSTM RNN was built that uses the classified variables as input data and learns the dynamic characteristics of time series data. LSTM RNNs resolve the vanishing gradient problem that occurs in existing RNNs. They are classified into three different types according to data characteristics and constructed through connections among the LSTMs. The constructed neural network learns training data and predicts user activity. To evaluate the proposed model, the root mean square error (RMSE) was used in the performance evaluation of the user physical activity prediction method for which an autoregressive integrated moving average (ARIMA) model, a convolutional neural network (CNN), and an RNN were used. The results show that the proposed DC-LSTM RNN method yields an excellent mean RMSE value of 0.616. The proposed method is used for predicting significant activity considering the surrounding circumstances and user status utilizing the existing standardized activity prediction services. It can also be used to predict user physical activity and provide personalized healthcare based on the data collectable from mobile host devices.

Outlier Detection and Treatment for the Conversion of Chemical Oxygen Demand to Total Organic Carbon (화학적산소요구량의 총유기탄소 변환을 위한 이상자료의 탐지와 처리)

  • Cho, Beom Jun;Cho, Hong Yeon;Kim, Sung
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.26 no.4
    • /
    • pp.207-216
    • /
    • 2014
  • Total organic carbon (TOC) is an important indicator used as an direct biological index in the research field of the marine carbon cycle. It is possible to produce the sufficient TOC estimation data by using the Chemical Oxygen Demand(COD) data because the available TOC data is relatively poor than the COD data. The outlier detection and treatment (removal) should be carried out reasonably and objectively because the equation for a COD-TOC conversion is directly affected the TOC estimation. In this study, it aims to suggest the optimal regression model using the available salinity, COD, and TOC data observed in the Korean coastal zone. The optimal regression model is selected by the comparison and analysis on the changes of data numbers before and after removal, variation coefficients and root mean square (RMS) error of the diverse detection methods of the outlier and influential observations. According to research result, it is shown that a diagnostic case combining SIQR (Semi - Inter-Quartile Range) boxplot and Cook's distance method is most suitable for the outlier detection. The optimal regression function is estimated as the TOC(mg/L) = $0.44{\cdot}COD(mg/L)+1.53$, then determination coefficient is showed a value of 0.47 and RMS error is 0.85 mg/L. The RMS error and the variation coefficients of the leverage values are greatly reduced to the 31% and 80% of the value before the outlier removal condition. The method suggested in this study can provide more appropriate regression curve because the excessive impacts of the outlier frequently included in the COD and TOC monitoring data is removed.

Spatial Estimation of the Site Index for Pinus densiplora using Kriging (크리깅을 이용한 소나무림 지위지수 공간분포 추정)

  • Kim, Kyoung-Min;Park, Key-Ho
    • Journal of Korean Society of Forest Science
    • /
    • v.102 no.4
    • /
    • pp.467-476
    • /
    • 2013
  • Site index information given from forest site map only exist in the sampled locations. In this study, site index for unsampled locations were estimated using kriging interpolation method which can interpolate values between point samples to generate a continuous surface. Site index of Pinus densiplora in Danyang area were calculated using Chapman-Richards model by plot unit. Then site index for unsampled locations were interpolated by theoretical variogram models and ordinary kriging. Also in order to assess parameter selection, cross-validation was performed by calculating mean error (ME), average standard error (ASE) and root mean square error (RMSE). In result, gaussian model was excluded because of the biggest relative nugget (37.40%). Then spherical model (16.80%) and exponential model (8.77%) were selected. Site index estimates of Pinus densiplora throughout the entire area in Danyang showed 4.39~19.53 based on exponential model, and 4.54~19.23 based on spherical model. By cross-validation, RMSE had almost no difference. But ME and ASE from spherical model were slightly lower than exponential model. Therefore site index prediction map from spherical model were finally selected. Average site index from site prediction map was 10.78. It can be expected that regional variance can be considered by site index prediction map in order to estimate forest biomass which has big spatial variance and eventually it is helpful to improve an accuracy of forest carbon estimation.

The Optimal Partition of Initial Input Space for Fuzzy Neural System : Measure of Fuzziness (퍼지뉴럴 시스템을 위한 초기 입력공간분할의 최적화 : Measure of Fuzziness)

  • Baek, Deok-Soo;Park, In-Kue
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.3
    • /
    • pp.97-104
    • /
    • 2002
  • In this paper we describe the method which optimizes the partition of the input space by means of measure of fuzziness for fuzzy neural network. It covers its generation of fuzzy rules for input sub space. It verifies the performance of the system depended on the various time interval of the input. This method divides the input space into several fuzzy regions and assigns a degree of each of the generated rules for the partitioned subspaces from the given data using the Shannon function and fuzzy entropy function generating the optimal knowledge base without the irrelevant rules. In this scheme the basic idea of the fuzzy neural network is to realize the fuzzy rule base and the process of reasoning by neural network and to make the corresponding parameters of the fuzzy control rules be adapted by the steepest descent algorithm. According to the input interval the proposed inference procedure proves that the fast convergence of root mean square error (RMSE) owes to the optimal partition of the input space

Prediction of movie audience numbers using hybrid model combining GLS and Bass models (GLS와 Bass 모형을 결합한 하이브리드 모형을 이용한 영화 관객 수 예측)

  • Kim, Bokyung;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.4
    • /
    • pp.447-461
    • /
    • 2018
  • Domestic film industry sales are increasing every year. Theaters are the primary sales channels for movies and the number of audiences using the theater affects additional selling rights. Therefore, the number of audiences using the theater is an important factor directly linked to movie industry sales. In this paper we consider a hybrid model that combines a multiple linear regression model and the Bass model to predict the audience numbers for a specific day. By combining the two models, the predictive value of the regression analysis was corrected to that of the Bass model. In the analysis, three films with different release dates were used. All subset regression method is used to generate all possible combinations and 5-fold cross validation to estimate the model 5 times. In this case, the predicted value is obtained from the model with the smallest root mean square error and then combined with the predicted value of the Bass model to obtain the final predicted value. With the existence of past data, it was confirmed that the weight of the Bass model increases and the compensation is added to the predicted value.

Prediction of Nutrient Composition and In-Vitro Dry Matter Digestibility of Corn Kernel Using Near Infrared Reflectance Spectroscopy

  • Choi, Sung Won;Lee, Chang Sug;Park, Chang Hee;Kim, Dong Hee;Park, Sung Kwon;Kim, Beob Gyun;Moon, Sang Ho
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.34 no.4
    • /
    • pp.277-282
    • /
    • 2014
  • Nutritive value analysis of feed is very important for the growth of livestock, and ensures the efficiency of feeds as well as economic status. However, general laboratory analyses require considerable time and high cost. Near-infrared reflectance spectroscopy (NIRS) is a spectroscopic technique used to analyze the nutritive values of seeds. It is very effective and less costly than the conventional method. The sample used in this study was a corn kernel and the partial least square regression method was used for evaluating nutrient composition, digestibility, and energy value based on the calibration equation. The evaluation methods employed were the coefficient of determination ($R^2$) and the root mean squared error of prediction (RMSEP). The results showed the moisture content ($R^2_{val}=0.97$, RMSEP=0.109), crude protein content ($R^2_{val}=0.94$, RMSEP=0.212), neutral detergent fiber content ($R^2_{val}=0.96$, RMSEP=0.763), acid detergent fiber content ($R^2_{val}=0.96$, RMSEP=0.142), gross energy ($R^2_{val}=0.82$, RMSEP=23.249), in vitro dry matter digestibility ($R^2_{val}=0.68$, RMSEP=1.69), and metabolizable energy (approximately $R^2_{val}$ >0.80). This study confirmed that the nutritive components of corn kernels can be predicted using near-infrared reflectance spectroscopy.

Estimation of Design Rainfall by the Regional Frequency Analysis using Higher Probability Weighted Moments and GIS Techniques (III) - On the Method of LH-moments and GIS Techniques - (고차확률가중모멘트법에 의한 지역화빈도분석과 GIS기법에 의한 설계강우량 추정 (III) - LH-모멘트법과 GIS 기법을 중심으로 -)

  • 이순혁;박종화;류경식;지호근;신용희
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.44 no.5
    • /
    • pp.41-53
    • /
    • 2002
  • This study was conducted to derive the regional design rainfall by the regional frequency analysis based on the regionalization of the precipitation suggested by the first report of this project. According to the regions and consecutive durations, optimal design rainfalls were derived by the regional frequency analysis for L-moment in the second report of this project. Using the LH-moment ratios and Kolmogorov-Smirnov test, the optimal regional probability distribution was identified to be the Generalized extreme value (GEV) distribution among applied distributions. regional and at-site parameters of the GEV distribution were estimated by the linear combination of the higher probability weighted moments, LH-moment. Design rainfall using LH-moments following the consecutive duration were derived by the regional and at-site analysis using the observed and simulated data resulted from Monte Carlo techniques. Relative root-mean-square error (RRMSE), relative bias (RBIAS) and relative reduction (RR) in RRMSE for the design rainfall were computed and compared in the regional and at-site frequency analysis. Consequently, it was shown that the regional analysis can substantially more reduce the RRMSE, RBIAS and RR in RRMSE than at-site analysis in the prediction of design rainfall. Relative efficiency (RE) for an optimal order of L-moments was also computed by the methods of L, L1, L2, L3 and L4-moments for GEV distribution. It was found that the method of L-moments is more effective than the others for getting optimal design rainfall according to the regions and consecutive durations in the regional frequency analysis. Diagrams for the design rainfall derived by the regional frequency analysis using L-moments were drawn according to the regions and consecutive durations by GIS techniques.

The solid angle estimation of acetabular coverage of the femoral head using 3D method (입체각 측정을 통한 대퇴골두에 대한 관골구 coverage 측정)

  • Choi, K.H.;Kim, M.C.;Lim, C.T.;Kim, S.I.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.05
    • /
    • pp.123-126
    • /
    • 1997
  • We present a method for the estimation of 3D solid angle assessment of the acetabular coverage of the femoral head in 3D space. At first, femoral head and acetabulum is segmented from the original CT scan images. The slice thickness is 1.5mm and the number of slices is usually 30-40 to cover the entire acetabulum. The superior half of the femoral head is modeled as part of a sphere. Thus, the axial cross sections of the upper half of the femoral head are also modeled as circles. A set of points from each outline image of femoral head is fitted recursively into a circle by minimizing root-mean-square (RMS) error. With these fitted circles, a center point of the femoral head model is evaluated. This is a reference point for calculating the solid angle of the acetabular inner surface. Next, the tangent lines connecting from a set of points of the acetabular edge to the center of the fitted sphere are obtained. The lines pass through the unit sphere whose center is the same as that of the femoral head. With the points on the unit sphere, we calculate area and estimate the solid angle. Based on this solid angle, the deformity of the acetabulum is analyzed. In case of normal subject, the solid angle is about 4.3 (rad) and acetabular coverage is 68%.

  • PDF

Performance Analysis for Mirrors of 30 cm Cryogenic Space Infrared Telescope

  • Park, Kwi-Jong;Moon, Bong-Kon;Lee, Dae-Hee;Jeong, Woong-Seob;Nam, Uk-Won;Park, Young-Sik;Pyo, Jeong-Hyun;Han, Won-Yong
    • Journal of Astronomy and Space Sciences
    • /
    • v.29 no.3
    • /
    • pp.321-328
    • /
    • 2012
  • We have designed a 30 cm cryogenic space infrared telescope for astronomical observation. The telescope is designed to observe in the wavelength range of 0.5~2.1 ${\mu}m$, when it is cooled down to 77 K. The result of the preliminary design of the support structure and support method of the mirror of a 30 cm cryogenic space infrared telescope is shown in this paper. As a Cassegrain prescription, the optical system of a 30 cm cryogenic space infrared telescope has a focal ratio of f/3.1 with a 300 mm primary mirror (M-1) and 113 mm secondary mirror (M-2). The material of the whole structure including mirrors is aluminum alloy (Al6061-T6). Flexures that can withstand random vibration were designed, and it was validated through opto-mechanical analysis that both primary and secondary mirrors, which are assembled in the support structure, meet the requirement of root mean square wavefront error < ${\lambda}/8$ for all gravity direction. Additionally, when the M-1 and flexures are assembled by bolts, the effect of thermal stress occurring from a stainless steel bolt when cooled and bolt torque on the M-1 was analyzed.