• 제목/요약/키워드: Data Quality Validation

검색결과 379건 처리시간 0.031초

객토 농경지의 토양특성을 고려한 도암댐 유역에서의 수문 및 유사 거동 모의 (Simulation of Hydrological and Sediment Behaviors in the Doam-dam Watershed considering Soil Properties of the Soil Reconditioned Agricultural Fields)

  • 허성구;김재영;유동선;김기성;안재훈;윤정숙;임경재
    • 한국농공학회논문집
    • /
    • 제49권2호
    • /
    • pp.49-60
    • /
    • 2007
  • The alpine agricultural activities are usually performed at higher and steep areas in nature. Thus, significant amounts of soil erosion are occurring compared with those from other areas. Thus, the soil erosion induced environmental impacts in these areas are getting greater. The Doam watershed is located at alpine areas and it has been well known that the agricultural activities in the watershed are causing accelerated soil erosion and water quality degradations. Many modeling approaches were employed to solve soil erosion and water quality issues. In this study, the Soil and Water Assessment Tool (SWAT) model was utilized to simulate the hydrologic and sediment behaviors in the Doam watershed. In many previous modeling studies, the digital soil map and its corresponding soil properties were used without modification to reflect soil conditioning at many agricultural fields of the Doam watershed. Thus, the soil sample was taken at the agricultural field within the Doam watershed and analyzed for its physical properties. In this study, the digital topsoil properties in the agricultural fields within the Doam watershed were replaced with the soil properties for reconditioned soil analyzed in this study to simulate the impacts of using soil properties for reconditioned soil in hydrologic and sediment modeling at the Doam watershed using the SWAT model. The hydrologic component of the SWAT model was calibrated and validated for measured flow data from 2002 to 2003. The $R^2$ value was 0.79 and the EI value was 0.53 for weekly simulated data. The calibrated model parameters were used for hydrologic component validation and the $R^2$ value was 0.86 and the EI value was 0.74 for weekly data. For sediment comparison, the $R^2$ value was 0.67 and the EI value was 0.59. These statistics improved with the use of soil properties of the reconditioned soil in the field compared with the results obtained without considering soil reconditioning. The simulated sediment amounts with and without considering the soil properties of the reconditioned soil were 284,813 ton and 158,369 ton, respectively. This result indicates that there could be approximately 79% of errors in estimated sediment yield at the Doam watershed, although the model comparison with the measured data gave similar satisfactory statistics with and without considering soil properties from the reconditioned soil.

지각된 품질이 브랜드이미지와 고객충성도에 미치는 영향 -복합기제품을 중심으로- (The Effects of the Perceived Quality on Brand Image and Customer Loyalty -Focusing on Multi-Function Printer-)

  • 송거영;유연우
    • 디지털융복합연구
    • /
    • 제11권3호
    • /
    • pp.263-272
    • /
    • 2013
  • 본 연구는 복합기 제품에 있어서의 지각된 품질 요인, 브랜드이미지, 고객충성도 간의 관계를 규명함으로써 충성고객 확보에 효과적으로 접근할 수 있는 컨설팅 영역을 알아보고자 진행하였다. 연구목적의 달성을 위하여 관련 선행연구를 근거로 연구모형과 가설을 수립한 후 산업체의 구매의사 결정자를 대상으로 자료를 수집하여 타당성 분석 및 신뢰도 분석과 구조방정식모형 분석을 실시하였다. 본 연구의 결과를 요약하면 다음과 같다. 첫째, 복합기의 지각된 품질 구성요인이 브랜드이미지에 미치는 영향 정도는 제품품질, 판매원 서비스 속성, 서비스 품질의 순으로 확인되었다. 둘째, 복합기 제품의 브랜드이미지는 고객충성도에 긍정적인 영향을 미치는 것으로 나타났다. 본 연구는 복합기 제품과 관련한 연구범위를 경영분야까지 확장하였다는데 의의가 있으며 복합기 제품과 유사한 사업구조를 갖는 산업에 있어서의 경영전략 수립 실무 및 경영 컨설팅에 도움을 줄 것으로 기대된다.

환경부 토지피복도 사용여부에 따른 예측 SWAT 오류 평가 (Analysis of SWAT Simulated Errors with the Use of MOE Land Cover Data)

  • 허성구;김남원;유동선;김기성;임경재
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2008년도 학술발표회 논문집
    • /
    • pp.194-198
    • /
    • 2008
  • Significant soil erosion and water quality degradation issues are occurring at highland agricultural areas of Kangwon province because of agronomic and topographical specialities of the region. Thus spatial and temporal modeling techniques are often utilized to analyze soil erosion and sediment behaviors at watershed scale. The Soil and Water Assessment Tool (SWAT) model is one of the watershed scale models that have been widely used for these ends in Korea. In most cases, the SWAT users tend to use the readily available input dataset, such as the Ministry of Environment (MOE) land cover data ignoring temporal and spatial changes in land cover. Spatial and temporal resolutions of the MOE land cover data are not good enough to reflect field condition for accurate assesment of soil erosion and sediment behaviors. Especially accelerated soil erosion is occurring from agricultural fields, which is sometimes not possible to identify with low-resolution MOD land cover data. Thus new land cover data is prepared with cadastral map and high spatial resolution images of the Doam-dam watershed. The SWAT model was calibrated and validated with this land cover data. The EI values were 0.79 and 0.85 for streamflow calibration and validation, respectively. The EI were 0.79 and 0.86 for sediment calibration and validation, respectively. These EI values were greater than those with MOE land cover data. With newly prepared land cover dataset for the Doam-dam watershed, the SWAT model better predicts hydrologic and sediment behaviors. The number of HRUs with new land cover data increased by 70.2% compared with that with the MOE land cover, indicating better representation of small-sized agricultural field boundaries. The SWAT estimated annual average sediment yield with the MOE land cover data was 61.8 ton/ha/year for the Doam-dam watershed, while 36.2 ton/ha/year (70.7% difference) of annual sediment yield with new land cover data. Especially the most significant difference in estimated sediment yield was 548.0% for the subwatershed #2 (165.9 ton/ha/year with the MOE land cover data and 25.6 ton/ha/year with new land cover data developed in this study). The results obtained in this study implies that the use of MOE land cover data in SWAT sediment simulation for the Doam-dam watershed could results in 70.7% differences in overall sediment estimation and incorrect identification of sediment hot spot areas (such as subwatershed #2) for effective sediment management. Therefore it is recommended that one needs to carefully validate land cover for the study watershed for accurate hydrologic and sediment simulation with the SWAT model.

  • PDF

Application of artificial neural networks to predict total dissolved solids in the river Zayanderud, Iran

  • Gholamreza, Asadollahfardi;Afshin, Meshkat-Dini;Shiva, Homayoun Aria;Nasrin, Roohani
    • Environmental Engineering Research
    • /
    • 제21권4호
    • /
    • pp.333-340
    • /
    • 2016
  • An Artificial Neural Network including a Radial Basis Function (RBF) and a Time Delay Neural Network (TDNN) was used to predict total dissolved solid (TDS) in the river Zayanderud. Water quality parameters in the river for ten years, 2001-2010, were prepared from data monitored by the Isfahan Regional Water Authority. A factor analysis was applied to select the inputs of water quality parameters, which obtained total hardness, bicarbonate, chloride and calcium. Input data to the neural networks were pH, $Na^+$, $Mg^{2+}$, Carbonate ($CO{_3}^{-2}$), $HCO{_3}^{-1}$, $Cl^-$, $Ca^{2+}$ and Total hardness. For learning process 5-fold cross validation were applied. In the best situation, the TDNN contained 2 hidden layers of 15 neurons in each of the layers and the RBF had one hidden layer with 100 neurons. The Mean Squared Error and the Mean Bias Error for the TDNN during the training process were 0.0006 and 0.0603 and for the RBF neural network the mentioned errors were 0.0001 and 0.0006, respectively. In the RBF, the coefficient of determination ($R^2$) and the index of agreement (IA) between the observed data and predicted data were 0.997 and 0.999, respectively. In the TDNN, the $R^2$ and the IA between the actual and predicted data were 0.957 and 0.985, respectively. The results of sensitivity illustrated that $Ca^{2+}$ and $SO{_4}^{2-}$ parameters had the highest effect on the TDS prediction.

Analysis of Infertility Keywords in the Largest Domestic Mom Cafe Bulletin Board in Korea Using Text Mining

  • Sangmin Lee
    • 인터넷정보학회논문지
    • /
    • 제24권4호
    • /
    • pp.137-144
    • /
    • 2023
  • The purpose of this study is to examine consumers' perceptions of domestic infertility support policies based on infertility-related keywords and the trends of their changes. To this end, Momsholic, a mom cafe which has the most active infertility-related bulletin boards on Naver, was selected as the analysis target, and 'infertility' was selected as a keyword for data search. The data was collected for three months. In addition, network analysis and visualization were performed using R for data collection and analysis, and cross-validation was attempted using the NetDraw function of 'textom 1.0' and the UCINET6 program. As a result of the analysis, the main keywords were cost, artificial insemination, in vitro fertilization, freezing, harvest, ovulation, and how much. Next, looking at the central value of the degree of connection, it was found that the degree of connection between the words cost, cost, how much, problem, public health center, and artificial insemination was high. According to the results of this study, women who visit mom cafes due to infertility in Korea are more interested in the cost. It is believed to be closely related to infertility treatment as well as in vitro fertilization and egg freezing. Therefore, by examining keywords related toinfertility, it has academic significance in that it is possible to identify major factors that end users are interested in. Furthermore, it is possible to redefine the guidelines for domestic infertility support policies by presenting infertility support policies that reflect the factors of interest of end consumers.

원격탐사를 이용한 대형 수체의 수질 모델 검증 효과 제고 방안에 관한 연구 (Application of Remote Sensing Technique to Enhance the Water Quality Model Validation in a Large Water Body)

  • 임현주;최정현;박석순
    • 대한환경공학회지
    • /
    • 제28권4호
    • /
    • pp.447-452
    • /
    • 2006
  • 대형 수체의 수질 모델 검증 효과를 향상시키기 위하여 원격탐사 기술이 적용되었다. 인공위성 영상은 대형 수체의 넓은 표면을 한꺼번에 파악할 수 있으므로 모델의 보정 및 검증에 사용되는 관측 자료의 부족함을 보완할 수 있다. 이 논문은 2000년 4월 29일과 9월 4일에 촬영된 Landsat FTM+영상을 분석하여 팔당호 표층 수온 검증 연구를 제시하고 있다. 영상으로부터 계산된 수온과 모델의 표층 수온의 자료를 획득하여 3가지 방법으로 영상에 의한 수온과 모델의 결과를 비교하였다. 4월 29일 영상의 경우 모델 결과를 기준으로 오차율이 0.13이며 9월 4일에는 오차율이 0.04로 모델의 표층 수온이 영상으로부터 계산된 수온과 잘 일치함을 알 수 있다. 그러나 영상촬영 시점의 대기의 간섭을 고려하지 못한 것이 4월 29일 결과의 오차를 발생시킨 주요 원인으로 사료된다. 그러므로 정확한 수질자료를 얻기 위해서는 영상촬영 시점의 대기의 효과를 고려한 대기보정이 필요할 것이라 사료된다.

생체모방로봇 소프트웨어 검증 지원 다중 HILS 기반 로봇 테스트베드 설계 및 구현 (Design and Implementation of Multi-HILS based Robot Testbed to Support Software Validation of Biomimetic Robots)

  • 김한진;김관혁;하범수;김주영;심성준;구지훈;김원태
    • 정보처리학회 논문지
    • /
    • 제13권6호
    • /
    • pp.243-250
    • /
    • 2024
  • 생체모방로봇은 조류나 곤충과 같은 생체의 특성을 모방하여 미래 전장에서 은밀한 감시와 정찰 작업에 큰 역할을 할 것으로 기대된다. 이 로봇들의 효과적인 활용을 위해서는 새의 날갯짓이나 바퀴벌레의 움직임 등을 모방하는 기술이 중요하지만, 이를 지원하는 하드웨어 확보와 소프트웨어 개발 및 검증 과정의 복잡성으로 인해 어려움이 따른다. 본 논문에서는 모델링 및 시뮬레이션(M&S) 기술을 적용한 다중 HILS 기반 생체모방로봇 소프트웨어 검증 테스트베드를 설계하고 구현한 결과를 소개한다. 테스트베드를 사용함으로써 개발자들은 하드웨어 부재 문제를 극복하고, 미래 전장 시나리오를 시뮬레이션하며 소프트웨어 개발과 테스트를 수행할 수 있다. 그러나, 다중 HILS 기반 테스트베드는 테스트 대상 로봇 수의 증가에 따른 장치 간 연동 지연 문제를 경험할 수 있으며, 이는 시뮬레이션 결과의 신뢰도에 크게 영향을 미칠 수 있다. 이를 해결하기 위해, 우리는 우선순위 기반 미들웨어인 data distribution service prority (DDSP)를 추가로 제안한다. DDSP는 기존 DDS 대비 1.95 ms의 평균 지연 감소 효과를 보이며, 테스트베드에서 요구되는 데이터 전송 품질을 보장할 수 있음을 입증하였다.

Using Artificial Neural Networks for Forecasting Algae Counts in a Surface Water System

  • Coppola, Emery A. Jr.;Jacinto, Adorable B.;Atherholt, Tom;Poulton, Mary;Pasquarello, Linda;Szidarvoszky, Ferenc;Lohbauer, Scott
    • 생태와환경
    • /
    • 제46권1호
    • /
    • pp.1-9
    • /
    • 2013
  • Algal blooms in potable water supplies are becoming an increasingly prevalent and serious water quality problem around the world. In addition to precipitating taste and odor problems, blooms damage the environment, and some classes like cyanobacteria (blue-green algae) release toxins that can threaten human health, even causing death. There is a recognized need in the water industry for models that can accurately forecast in real-time algal bloom events for planning and mitigation purposes. In this study, using data for an interconnected system of rivers and reservoirs operated by a New Jersey water utility, various ANN models, including both discrete prediction and classification models, were developed and tested for forecasting counts of three different algal classes for one-week and two-weeks ahead periods. Predictor model inputs included physical, meteorological, chemical, and biological variables, and two different temporal schemes for processing inputs relative to the prediction event were used. Despite relatively limited historical data, the discrete prediction ANN models generally performed well during validation, achieving relatively high correlation coefficients, and often predicting the formation and dissipation of high algae count periods. The ANN classification models also performed well, with average classification percentages averaging 94 percent accuracy. Despite relatively limited data events, this study demonstrates that with adequate data collection, both in terms of the number of historical events and availability of important predictor variables, ANNs can provide accurate real-time forecasts of algal population counts, as well as foster increased understanding of important cause and effect relationships, which can be used to both improve monitoring programs and forecasting efforts.

건강보험 청구자료를 이용한 일반 질 지표로서의 위험도 표준화 재입원율 산출: 방법론적 탐색과 시사점 (Developing a Hospital-Wide All-Cause Risk-Standardized Readmission Measure Using Administrative Claims Data in Korea: Methodological Explorations and Implications)

  • 김명화;김홍수;황수희
    • 보건행정학회지
    • /
    • 제25권3호
    • /
    • pp.197-206
    • /
    • 2015
  • Background: The purpose of this study was to propose a method for developing a measure of hospital-wide all-cause risk-standardized readmissions using administrative claims data in Korea and to discuss further considerations in the refinement and implementation of the readmission measure. Methods: By adapting the methodology of the United States Center for Medicare & Medicaid Services for creating a 30-day readmission measure, we developed a 6-step approach for generating a comparable measure using Korean datasets. Using the 2010 Korean National Health Insurance (NHI) claims data as the development dataset, hierarchical regression models were fitted to calculate a hospital-wide all-cause risk-standardized readmission measure. Six regression models were fitted to calculate the readmission rates of six clinical condition groups, respectively and a single, weighted, overall readmission rate was calculated from the readmission rates of these subgroups. Lastly, the case mix differences among hospitals were risk-adjusted using patient-level comorbidity variables. The model was validated using the 2009 NHI claims data as the validation dataset. Results: The unadjusted, hospital-wide all-cause readmission rate was 13.37%, and the adjusted risk-standardized rate was 10.90%, varying by hospital type. The highest risk-standardized readmission rate was in hospitals (11.43%), followed by general hospitals (9.40%) and tertiary hospitals (7.04%). Conclusion: The newly developed, hospital-wide all-cause readmission measure can be used in quality and performance evaluations of hospitals in Korea. Needed are further methodological refinements of the readmission measures and also strategies to implement the measure as a hospital performance indicator.

쾌삭 303계 스테인리스강 소형 압연 선재 제조 공정의 생산품질 예측 모형 (Quality Prediction Model for Manufacturing Process of Free-Machining 303-series Stainless Steel Small Rolling Wire Rods)

  • 서석준;김흥섭
    • 산업경영시스템학회지
    • /
    • 제44권4호
    • /
    • pp.12-22
    • /
    • 2021
  • This article suggests the machine learning model, i.e., classifier, for predicting the production quality of free-machining 303-series stainless steel(STS303) small rolling wire rods according to the operating condition of the manufacturing process. For the development of the classifier, manufacturing data for 37 operating variables were collected from the manufacturing execution system(MES) of Company S, and the 12 types of derived variables were generated based on literature review and interviews with field experts. This research was performed with data preprocessing, exploratory data analysis, feature selection, machine learning modeling, and the evaluation of alternative models. In the preprocessing stage, missing values and outliers are removed, and oversampling using SMOTE(Synthetic oversampling technique) to resolve data imbalance. Features are selected by variable importance of LASSO(Least absolute shrinkage and selection operator) regression, extreme gradient boosting(XGBoost), and random forest models. Finally, logistic regression, support vector machine(SVM), random forest, and XGBoost are developed as a classifier to predict the adequate or defective products with new operating conditions. The optimal hyper-parameters for each model are investigated by the grid search and random search methods based on k-fold cross-validation. As a result of the experiment, XGBoost showed relatively high predictive performance compared to other models with an accuracy of 0.9929, specificity of 0.9372, F1-score of 0.9963, and logarithmic loss of 0.0209. The classifier developed in this study is expected to improve productivity by enabling effective management of the manufacturing process for the STS303 small rolling wire rods.