• Title/Summary/Keyword: Accuracy of Selection

Search Result 1,181, Processing Time 0.025 seconds

Numerical Simulations of Water Quality in ManKyong River (QUAL-II E 모델에 의(依)한 만경강(萬頃江)의 수질예측(水質豫測))

  • Shim, Jae-Hwan;Choi, Moon-Sul
    • Korean Journal of Environmental Agriculture
    • /
    • v.10 no.1
    • /
    • pp.67-75
    • /
    • 1991
  • The QUAL-II E Model was applied to predict the water quality of the Mankyong drainage System, and lead to following conclusion. 1. The difference between computed and measured BOD at the M-3 (Bakgugeong) station was within 10%, indicating that the application of the QUAL-IIE Model for the prediction of water quality was satisfactory thus far. 2. The application of the model states that the discharge of concentrated pollutants at the M-1 station on the Jeonju stream, located 41Km upstream from the estuary, causes the worst problems. The sluice which extends residence time and enlarges watery surface improves water quality by a Self-purification process at the M-3 station, 28km upstream from the estuary. 3. The accuracy of the model diminished when this model was applied on the estuary downstream of the sluice. Hence, the application of the model on the estuary needs to be used with caution. 4. Among the conputed water quality parameters, BOD is the worst problem. At the M-3 station, BOD is computed to be 26.6 mg/1 in 1996, 30.7 mg/1 in 2,001, 33.0 mg/l in 2006, and 37.5 mg/1 in 2011. When preventive measures against water pollution are not properly exercised, severe problems in irrigation and water resources are expected. This study will be of used in the selection of irrigation water intake points, the criteria of effluent treatment, the management of water resources, and the establishment of water quality managemont policy.

  • PDF

Development of Prediction Model for Prevalence of Metabolic Syndrome Using Data Mining: Korea National Health and Nutrition Examination Study (국민건강영양조사를 활용한 대사증후군 유병 예측모형 개발을 위한 융복합 연구: 데이터마이닝을 활용하여)

  • Kim, Han-Kyoul;Choi, Keun-Ho;Lim, Sung-Won;Rhee, Hyun-Sill
    • Journal of Digital Convergence
    • /
    • v.14 no.2
    • /
    • pp.325-332
    • /
    • 2016
  • The purpose of this study is to investigate the attributes influencing the prevalence of metabolic syndrome and develop the prediction model for metabolic syndrome over 40-aged people from Korea Health and Nutrition Examination Study 2012. The researcher chose the attributes for prediction model through literature review. Also, we used the decision tree, logistic regression, artificial neural network of data mining algorithm through Weka 3.6. As results, social economic status factors of input attributes were ranked higher than health-related factors. Additionally, prediction model using decision tree algorithm showed finally the highest accuracy. This study suggests that, first of all, prevention and management of metabolic syndrome will be approached by aspect of social economic status and health-related factors. Also, decision tree algorithms known from other research are useful in the field of public health due to their usefulness of interpretation.

Experimental Study of Estimating the Optimized Parameters in OI (서남해안 관측자료를 활용한 OI 자료동화의 최적 매개변수 산정 연구)

  • Gu, Bon-Ho;Woo, Seung-Buhm;Kim, Sangil
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.31 no.6
    • /
    • pp.458-467
    • /
    • 2019
  • The purpose of this study is the suggestion of optimized parameters in OI (Optimal Interpolation) by experimental study. The observation of applying optimal interpolation is ADCP (Acoustic Doppler Current Profiler) data at the southwestern sea of Korea. FVCOM (Finite Volume Coastal Ocean Model) is used for the barotropic model. OI is to the estimation of the gain matrix by a minimum value between the background error covariance and the observation error covariance using the least square method. The scaling factor and correlation radius are very important parameters for OI. It is used to calculate the weight between observation data and model data in the model domain. The optimized parameters from the experiments were found by the Taylor diagram. Constantly each observation point requires optimizing each parameter for the best assimilation. Also, a high accuracy of numerical model means background error covariance is low and then it can decrease all of the parameters in OI. In conclusion, it is expected to have prepared the foundation for research for the selection of ocean observation points and the construction of ocean prediction systems in the future.

A Study on Automatic Threshold Selection in Line Simplification for Pedestrian Road Network Using Road Attribute Data (보행자용 도로망 선형단순화를 위한 도로속성정보 기반 임계값 자동 선정 연구)

  • Park, Bumsub;Yang, Sungchul;Yu, Kiyun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.31 no.4
    • /
    • pp.269-275
    • /
    • 2013
  • Recently, importance of pedestrian road network is getting emphasized as it is possible to provide mobile device users with both route guidance services and surrounding spatial information. However, it costs a tremendous amount of budget for generating and renovating pedestrian road network nationally, which hinder further advances of these services. Hence, algorithms extracting pedestrian road network automatically based on raster data are needed. On the other hand, road dataset generated from raster data usually has unnecessary vertices which lead to maintenance disutility such as excessive turns and increase in data memory. Therefore, this study proposed a method of selecting a proper threshold automatically for separate road entity using not only Douglas-Peucker algorithm but also road attribute data of digital map in order to remove redundant vertices, which maximizes line simplification efficiency and minimizes distortion of shape of roads simultaneously. As a result of the test, proposed method was suitable for automatic line simplification in terms of reduction ratio of vertices and accuracy of position.

A study on TOC monitoring and spatial distribution analysis using a spectrometer in rivers (하천에서의 분광측정기를 이용한 TOC 모니터링 및 공간분포 분석 연구)

  • Yoon, Soo Bin;Lee, Chang Hyun;Kim, Young Do
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.11
    • /
    • pp.815-822
    • /
    • 2023
  • Organic pollution is one of the most common forms of water contamination. Under the Water Quality Conservation Act, indicators for measuring organic substances include BOD, COD, and TOC. Analysis of BOD and COD is labor-intensive, and in the case of organic substances where biological decomposition is not feasible or toxic substances are present, the accuracy is often low. Therefore, the Ministry of Environment is shifting towards TOC-centric management. With advancements in sensor technology today, various parameters can be monitored using sensors. In this study, digital monitoring of river TOC using a spectrophotometer called Spectro::lyser V3 was conducted. Initially, experiments were carried out at the Andong River Experiment Center to assess the applicability of the measurement equipment. Subsequently, data collected at the confluence of the Nakdong River was analyzed for the spatial distribution of TOC using the Kriging technique. This research proposes the utilization of sensors for river TOC monitoring and spatial distribution analysis. Real-time monitoring of changes in river TOC concentration can serve as fundamental data for pollution monitoring and response. Sensor-based river monitoring offers advantages in terms of temporal resolution and real-time data acquisition. When various spatial information interpretation methods are applied, it is expected to contribute to diverse studies such as aquatic ecological health, river water source selection, and stratification analysis in the future.

Adaptability Questions of O-D Table Estimation Models (기종점 통행표 산출모형의 적용성 평가)

  • 오상진;박병호
    • Journal of Korean Society of Transportation
    • /
    • v.17 no.5
    • /
    • pp.99-110
    • /
    • 1999
  • This study deals with the adaptability questions of O-D table estimation models. Its objectives are two-fold; (1) to estimate the characteristics of various O-D table estimation models(i.e. linear regression models. entropy models and statistic models) and (2) to find the model which estimates the O-D table with the best accuracy under the various data conditions. In Pursuing the above, this study gives the particular attentions to the test of the models, using the Sioux Falls network and equilibrium assignment method of MINUTP. The major findings are the followings. Firstly. it finds that the statistic models have the most goodness of fat among all models, if the required data are all Prepared. But it Presents that statistic models are the most sensitive against the underspecification and inconsistency problems of link data. Secondly, It shows that the linear regression models have the worst goodness of fat among all models. But the linear regression models are the most insensitive to the underspecification and inconsistency problems. Thirdly, THE/1 model of entropy model is sensitive against the underspecification and incon-sistency problems, but THE/2 model is insensitive. Finally, other informations like total volume, zonal Production and attraction volumes in 0-D table, help models to gain the better goodness of fit. Especially, in the statistic models. both the zonal production and attraction volume data are helpful to estimate the link volumes. It can be expected that the results dive some implications not only to the selection of optimal model under the various given data, but also to the development or modification of model.

  • PDF

A Study on the Geomagnetic Reference Field Modeling from the Triaxial Magnetometer Data Onboard KOMPSAT-II (아리랑위성 2호의 삼축자력계로부터 관측된 지구자기장 모델 연구)

  • Kim, Hyung-Rae;Hwang, Jong-Sun;Kim, Jeong-Woo;Lee, Seon-Ho
    • Economic and Environmental Geology
    • /
    • v.45 no.4
    • /
    • pp.377-384
    • /
    • 2012
  • The main field component of the Earth's magnetic field was modeled from the tri-axial magnetometer onboard KOrean MultiPurpose SATellite-II (KOMPSAT-II) for the purpose of satellite attitude control. The model computed by the KOMPSAT-II magnetometer measurement data is compared with the International Geomagnetic Reference Field (IGRF) model of a degree of up to 13 in spherical harmonic coefficients. The previous study with KOMPSAT-I (Kim et al. 2004) indicated a good correlation of power spectrum of spherical harmonic coefficients with respect to the degree up to 5. This study, however, showed an agreement of the degree up to 8-9 of the coefficient power spectrum and a discrepancy between degrees 10 and 13. We have concluded that relevant data selection process, removal of the external field from the data in the high latitude region, an accuracy of the magnetometer all play an important role in finding a coherence with the IGRF model. This study will be extended to the secular variation model of geomagnetism if longer-period data become available.

User Oriented clustering of news articles using Tweets Heterogeneous Information Network (트위트 이형 정보 망을 이용한 뉴스 기사의 사용자 지향적 클러스터링)

  • Shoaib, Muhammad;Song, Wang-Cheol
    • Journal of Internet Computing and Services
    • /
    • v.14 no.6
    • /
    • pp.85-94
    • /
    • 2013
  • With the emergence of world wide web, in particular web 2.0 the rapidly growing amount of news articles has created a problem for users in selection of news articles according to their requirements. To overcome this problem different clustering mechanism has been proposed to broadly categorize news articles. However these techniques are totally machine oriented techniques and lack users' participation in the process of decision making for membership of clustering. In order to overcome the issue of zero-participation in the process of clustering news articles in this paper we have proposed a framework for clustering news articles by combining users' judgments that they post on twitter with the news articles to cluster the objects. We have employed twitter hash-tags for this purpose. Furthermore we have computed the credibility of users' based on frequency of retweets for their tweets in order to enhance the accuracy of the clustering membership function. In order to test performance of proposed methodology, we performed experiments on tweets messages tweeted during general election 2013 in Pakistan. Our results proved over claim that using users' output better outcome can be achieved then ordinary clustering algorithms.

Prediction of Genomic Relationship Matrices using Single Nucleotide Polymorphisms in Hanwoo (한우의 유전체 표지인자 활용 개체 혈연관계 추정)

  • Lee, Deuk-Hwan;Cho, Chung-Il;Kim, Nae-Soo
    • Journal of Animal Science and Technology
    • /
    • v.52 no.5
    • /
    • pp.357-366
    • /
    • 2010
  • The emergence of next-generation sequencing technologies has lead to application of new computational and statistical methodologies that allow incorporating genetic information from entire genomes of many individuals composing the population. For example, using single-nucleotide polymorphisms (SNP) obtained from whole genome amplification platforms such as the Ilummina BovineSNP50 chip, many researchers are actively engaged in the genetic evaluation of cattle livestock using whole genome relationship analyses. In this study, we estimated the genomic relationship matrix (GRM) and compared it with one computed using a pedigree relationship matrix (PRM) using a population of Hanwoo. This project is a preliminary study that will eventually include future work on genomic selection and prediction. Data used in this study were obtained from 187 blood samples consisting of the progeny of 20 young bulls collected after parentage testing from the Hanwoo improvement center, National Agriculture Cooperative Federation as well as 103 blood samples from the progeny of 12 proven bulls collected from farms around the Kyong-buk area in South Korea. The data set was divided into two cases for analysis. In the first case missing genotypes were included. In the second case missing genotypes were excluded. The effect of missing genotypes on the accuracy of genomic relationship estimation was investigated. Estimation of relationships using genomic information was also carried out chromosome by chromosome for whole genomic SNP markers based on the regression method using allele frequencies across loci. The average correlation coefficient and standard deviation between relationships using pedigree information and chromosomal genomic information using data which was verified using a parentage test andeliminated missing genotypes was $0.81{\pm}0.04$ and their correlation coefficient when using whole genomic information was 0.98, which was higher. Variation in relationships between non-inbred half sibs was $0.22{\pm}0.17$ on chromosomal and $0.22{\pm}0.04$ on whole genomic SNP markers. The variations were larger and unusual values were observed when non-parentage test data were included. So, relationship matrix by genomic information can be useful for genetic evaluation of animal breeding.

Neural Network Applications to Determining Suitable Tree Species for Site-Specific Conditions (적지적수(適地適樹) 판정(判定)을 위한 Neural Network 기법(技法)의 응용(應用))

  • Kim, Hyungho;Chung, Joosang
    • Journal of Korean Society of Forest Science
    • /
    • v.90 no.4
    • /
    • pp.437-444
    • /
    • 2001
  • This paper discusses applications of neural network to forest stand field data processing and determining suitable tree species for site-specific stand characteristics. For site-specific species selection, considered were 5 major coniferous species : P. densiflora for. erecta, L. leptolepis, P. koraiensis, P. densiflora, P. thunbergii. Among 1,320 sample plot data sets, 200 data sets with the highest site index (40 data sets for each species) were chosen as the test sets for investigation. Each data set includes 13 factors describing the site characteristics of the corresponding sample plot. The results of this investigation indicate high performance of neural network in data processing procedures for extracting data sets or measurement parameters without any recognizable pattern. These data sets or measurement parameters are those which have rare effect on site-specific species suitability or disturb pattern classification procedures of neural network because of unrecognizable patterns involved. Also the results have shown high potential of neural network in determining the best-suitable tree species for site characteristics. The % accuracy of the neural network model in determining the best-suitable tree species for site characteristics ranges from 77.6% to 91.8% associated with the combination of Site factors.

  • PDF