• Title/Summary/Keyword: Data Set Records


ADVANTAGES OF USING ARTIFICIAL NEURAL NETWORKS CALIBRATION TECHNIQUES TO NEAR-INFRARED AGRICULTURAL DATA

  • Buchmann, Nils-Bo;Cowe, Ian A.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference / 2001.06a / pp.1032-1032 / 2001
  • Artificial Neural Network (ANN) calibration techniques have been used commercially for agricultural applications since the mid-nineties. Global models, based on transmission data from 850 to 1050 nm, are used routinely to measure protein and moisture in wheat and barley and also moisture in triticale, rye, and oats. These models are currently used commercially in approx. 15 countries throughout the world. Results concerning earlier European ANN models are being published elsewhere. Some of the findings from that study will be discussed here. ANN models have also been developed for coarsely ground samples of compound feed and feed ingredients, again measured in transmission mode from 850 to 1050 nm. The performance of models for pig and poultry feed will be discussed briefly. These models were developed from a very large data set (more than 20,000 records) and cover a very broad range of finished products. The prediction curves are linear over the entire range for protein, fat, moisture, fibre, and starch (measured only on poultry feed), and accuracy is in line with the performance of smaller models based on Partial Least Squares (PLS). A simple bias adjustment is sufficient for calibration transfer across instruments. Recently, we have investigated the possible use of ANN for a different type of NIR spectrometer, based on reflectance data from 1100 to 2500 nm. In one study, based on data for protein, fat, and moisture measured on unground compound feed samples, dedicated ANN models for specific product classes (cattle feed, pig feed, broiler feed, and layers feed) gave moderately better Standard Errors of Prediction (SEP) compared to modified PLS (MPLS). However, if the four product classes were combined into one general calibration model, the performance of the ANN model deteriorated only slightly compared to the class-specific models, while the SEP values for the MPLS predictions doubled. Brix value in molasses is a measure of sugar content. Even with a huge dataset, PLS models were not sufficiently accurate for commercial use. In contrast, an ANN model based on the same data improved the accuracy considerably and straightened out non-linearity in the prediction plot. The work of Mr. David Funk (GIPSA, U.S. Department of Agriculture), who has studied the influence of various types of spectral distortions on ANN and PLS models, thereby providing comparative information on the robustness of these models towards instrument differences, will also be discussed. This study was based on data from different classes of North American wheat measured in transmission from 850 to 1050 nm. The distortions studied included the effect of absorbance offset, pathlength variation, presence of stray light, bandwidth, and wavelength stretch and offset (either individually or combined). It was shown that a global ANN model was much less sensitive to most perturbations than class-specific GIPSA PLS calibrations. It is concluded that ANN models based on large data sets offer substantial advantages over PLS models with respect to accuracy, range of materials that can be handled by a single calibration, stability, transferability, and sensitivity to perturbations.
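
The bias adjustment and SEP mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes the definitions commonly used in NIR calibration work (bias as the mean prediction residual, SEP as the sample standard deviation of the residuals); the abstract gives no formulas, and the function names and numbers below are hypothetical.

```python
from statistics import mean, stdev

def bias_and_sep(reference, predicted):
    """Return (bias, SEP) for predictions against reference values.

    Assumed definitions: bias is the mean residual, SEP is the sample
    standard deviation of the residuals (i.e. bias-corrected).
    """
    residuals = [p - r for p, r in zip(predicted, reference)]
    return mean(residuals), stdev(residuals)

def bias_adjust(predicted, bias):
    """Shift every prediction by the measured bias (simple transfer step)."""
    return [p - bias for p in predicted]

# Hypothetical protein values (%) for one instrument; not data from the study.
reference = [10.0, 12.0, 14.0, 16.0]
predicted = [10.5, 12.25, 14.75, 16.5]

bias, sep = bias_and_sep(reference, predicted)
adjusted = bias_adjust(predicted, bias)
adjusted_bias, adjusted_sep = bias_and_sep(reference, adjusted)
```

Under these assumptions the adjustment removes the systematic offset while leaving SEP unchanged, which is why a bias shift alone can only suffice when the instruments differ by a constant offset.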


Improving Efficiency of Food Hygiene Surveillance System by Using Machine Learning-Based Approaches (기계학습을 이용한 식품위생점검 체계의 효율성 개선 연구)

  • Cho, Sanggoo;Cho, Seung Yong
    • The Journal of Bigdata / v.5 no.2 / pp.53-67 / 2020
  • This study employs a supervised learning prediction model to detect nonconformity in advance at processed food manufacturing and processing businesses. The study followed the standard machine learning procedure: definition of the objective function, data preprocessing and feature engineering, and model selection and evaluation. The dependent variable was set as the number of supervised inspection detections over the five years from 2014 to 2018, and the objective function was to maximize the probability of detecting nonconforming companies. The data were preprocessed to reflect not only basic attributes such as revenue, operating duration, and number of employees, but also inspection track records and external climate data. After applying a feature extraction method, the machine learning algorithm was applied to the data, with company risk, item risk, environmental risk, and past violation history derived as the feature variables that affect the determination of nonconformity. The F1-score of the decision tree, one of the ensemble models, was much higher than those of the other models. Based on the results of this study, it is expected that official food control for food safety management will be enhanced and geared toward data-evidence-based management and a scientific administrative system.

A Study on the Problem and Improvement Plan of Management of Public Land Parcels (국공유지 필지 관리의 문제점 및 개선 방안)

  • Kim, Jin;Jung, Young-Jin;Kim, Jun-Ho;Lee, Hyun-Joon;Hong, Sung-Eon
    • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.1 / pp.136-145 / 2015
  • This study analyzes the status and problems of current parcel management for public land and suggests measures for improvement based on the analysis. One of the major problems was that the number of parcels was continuously increasing due to insufficient parcel management. The study found that the number was increasing every year by 128,520 parcels on average as of 2008. It also found a mismatch between the actual status of parcels and the land use conditions in the cadastral records. Because of these outstanding issues, a growing number of minute parcels (minute polygons) has appeared, and these errors have been included in the statistical survey on parcels, reducing the credibility of the data. This study suggests ways to address these issues, including reducing the number of parcels through annexation, enhancing the credibility of cadastral information through cadastral confirmation surveys, and registering and managing public land parcels in consideration of their current status.

A Metadata Representation Scheme based on XML Schema for Special Digital Collections (디지털 특수자료를 위한 XML 스키마 기반의 메타데이터 표현 체계)

  • Oh, Sam-Gyun;Chae, Jin-Seok
    • Journal of the Korean Society for Information Management / v.21 no.4 s.54 / pp.109-131 / 2004
  • As there are diverse delivery media and forms of information resources, their management schemes are diverse as well. In the library community, cataloguing rules for describing information resources, such as AACR and KCR, have been developed. Efforts to automate the management of information resources based on these rules resulted in the development of MARC. However, MARC records are restricted in describing information resources, and MARC is structurally rigid in ways that do not support the representation of the extended semantic structures that exist among bibliographic entities. Because the data model for MARC is a single-layer data model, it is not appropriate for describing information resources represented by a multi-layer data model, which can be used to set up relationships among various objects in digital libraries. In this paper, we propose a metadata model for digital libraries based on the IFLA FRBR basic model, which supports a multi-layer data model, and a representation scheme based on XML Schema to manage metadata about old books, old documents, resources related to music, and conferences and seminars.

Cumulative survival rate and associated risk factors of Implantium implants: A 10-year retrospective clinical study

  • Park, Jin-Hong;Kim, Young-Soo;Ryu, Jae-Jun;Shin, Sang-Wan;Lee, Jeong-Yol
    • The Journal of Advanced Prosthodontics / v.9 no.3 / pp.195-199 / 2017
  • PURPOSE. The objective of this study was to determine the cumulative survival rate (CSR) and associated risk factors of Implantium implants in a retrospective clinical study. MATERIALS AND METHODS. Patients who received Implantium implants (Dentium Co., Seoul, Korea) at Korea University Guro Hospital from 2004 to 2011 were included. The period between the first surgery and the last hospital visit until December 2015 was set as the observation period for this study. Clinical and radiographic data were collected from patient records, including all complications observed during the follow-up period. Kaplan-Meier analysis was performed to examine CSR. A multiple Cox proportional hazard model was employed to assess the associations between potential risk factors and CSR. RESULTS. A total of 370 implants were placed in 121 patients (mean age, 56.1 years; range, 19 to 75 years). Of the 370 implants, 13 failed, including 7 implants that were lost before loading. The 10-year cumulative survival rate of implants was 94.8%. The multiple Cox proportional hazard model revealed that the significant risk factors for implant failure were smoking and maxillary implant placement (P<.05). CONCLUSION. The 10-year CSR of Implantium implants was 94.8%. Risk factors for implant failure were smoking and maxillary implant placement.
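
The Kaplan-Meier analysis named in the abstract above can be sketched in a few lines of Python. This is a minimal illustration of the standard product-limit estimator, not the authors' analysis; the function name and the toy follow-up data are hypothetical.

```python
from collections import Counter

def kaplan_meier(durations, events):
    """Return [(time, survival)] at each distinct failure time.

    durations: follow-up time for each implant (e.g. in years).
    events:    1 if the implant failed at that time, 0 if censored.
    """
    failures = Counter(t for t, e in zip(durations, events) if e)
    exits = Counter(durations)      # failures plus censorings at each time
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = failures.get(t, 0)
        if d:
            # Product-limit step: multiply by the fraction surviving time t.
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]         # drop everyone who left at time t
    return curve

# Hypothetical toy data (not from the study): five implants.
toy_curve = kaplan_meier([1, 2, 2, 3, 4], [1, 0, 1, 1, 0])
```

Censored implants lower the number at risk without lowering the survival estimate, which is how the estimator can use patients whose follow-up ended before the 10-year mark.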

Severity Measurement Methods and Comparing Hospital Death Rates for Coronary Artery Bypass Graft Surgery (관상동맥우회술의 중증도 측정과 병원 사망률 비교에 관한 연구)

  • Ahn, Hyung-Sik;Shin, Young-Soo;Kwon, Young-Dae
    • Journal of Preventive Medicine and Public Health / v.34 no.3 / pp.244-252 / 2001
  • Objective: Health insurers and policy makers are increasingly examining the hospital mortality rate as an indicator of hospital quality and performance. To be meaningful, a risk adjustment of the death rates must be implemented. This study reviewed 5 severity measurement methods and applied them to the same data set to determine whether judgments regarding the severity-adjusted hospital mortality rates were sensitive to the specific severity measure. Methods: The medical records of 584 patients who underwent coronary artery bypass graft surgery in 6 general hospitals during 1996 and 1997 were reviewed by trained nurses. The MedisGroups, Disease Staging, Computerized Severity Index, APACHE III and KDRG were used to quantify the severity of the patients. The predicted probability of death was calculated for each patient in the sample from a multivariate logistic regression model including the severity score, age and sex. To evaluate the hospitals' performance, the ratio of the observed number of deaths to the expected number was calculated for each hospital. Results: The overall in-hospital mortality rate was 7.0%, ranging from 2.7% to 15.7% depending on the particular hospital. After the severity adjustment, the mortality rates for each hospital showed little difference according to the severity measure. The 5 severity measurement methods varied in their statistical performance. All had a higher c statistic and $R^2$ than the model containing only age and sex. There was a little difference in the relative hospital performance evaluation by the severity measure. Conclusion: These results suggest that judgments regarding a hospital's performance based on severity-adjusted mortality can be sensitive to the severity measurement method. Although the 5 severity measures concurred regarding hospital performance more often than would be expected by chance, the assessment of an individual hospital's mortality rate varied with the severity measurement method used.
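
The observed-to-expected comparison described in the Methods above can be sketched as follows. This is a minimal Python illustration that assumes each patient's predicted probability of death has already been produced by some risk model; the record layout, function name, and numbers are hypothetical and are not taken from the study.

```python
from collections import defaultdict

def observed_expected_ratios(records):
    """Return {hospital: observed deaths / expected deaths}.

    records: iterable of (hospital, died, predicted_probability), where
    died is 1 or 0 and predicted_probability comes from a risk model
    (in the abstract, a logistic regression on severity, age and sex).
    """
    observed = defaultdict(int)
    expected = defaultdict(float)
    for hospital, died, probability in records:
        observed[hospital] += died
        # Expected deaths = sum of the patients' predicted probabilities.
        expected[hospital] += probability
    return {h: observed[h] / expected[h] for h in observed if expected[h] > 0}

# Hypothetical toy records: (hospital, died, predicted probability of death).
toy_records = [
    ("A", 1, 0.2), ("A", 0, 0.3),
    ("B", 0, 0.1), ("B", 1, 0.4), ("B", 1, 0.5),
]
toy_ratios = observed_expected_ratios(toy_records)
```

A ratio above 1 means more deaths than the risk model expected for that hospital's case mix, and a ratio below 1 means fewer.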


BioPebble: Stone-type physiological sensing device Supporting personalized physiological signal analysis (BioPebble: 개인화된 해석을 지원하는 돌 타입 휴대용 생체신호 측정센서)

  • Choi, Ah-Young;Park, Go-Eun;Woo, Woon-Tack
    • Proceedings of the HCI Society of Korea Conference (한국HCI학회:학술대회논문집) / 2008.02a / pp.13-18 / 2008
  • Wearable and mobile physiological sensing devices have been studied in response to growing interest in health and well-being. However, these sensing devices display only the sensing results, such as heart rate, skin temperature, and their daily records. In this work, we propose a novel type of mobile physiological sensing device that is comfortable for the user to hold. In addition, we present personalized physiological signal analysis results, which differ from person to person. To verify this sensing device, we collected a data set from 4 different users over one week, measuring physiological signals such as heart rate, hand temperature, and skin conductance, and observed how the analysis results differ between users. We expect that this work can be applied to various health care applications in the near future.


Standard Translation of Terms of Korean Medicine through Consideration of Chinese-Korean Collated Medical Classics - With focus on 『Eonhaeduchangjipyo』, 『Eonhaegugeupbang』 and 『Eonhaetaesanjipyo』 - (언해의서 비교고찰을 통한 한의학용어의 번역표준안 - 『언해두창집요』, 『언해구급방』, 『언해태산집요』를 중심으로)

  • Ku, Hyunhee;Kim, Hyunkoo;Lee, JungHyun;Oh, Junho;Kwon, Ohmin
    • Korean Journal of Oriental Medicine / v.18 no.3 / pp.49-61 / 2012
  • This article sets out to develop an old Chinese - modern Korean collated terminology by analyzing and aligning Chinese-Korean translational terms relevant to Korean medicine, at the level of the minimum unit of meaning, from "Eonhaeduchangjipyo", "Eonhaegugeupbang" and "Eonhaetaesanjipyo". These works are composed of original Chinese texts and their corresponding Korean translations. The article compiles a list of translation standards for Korean medicine terms by classifying the cases of translational ambiguity in terms of disease, body position, thumbnail-pressing acupuncture method, and disease-curing method. The above-mentioned books are medical classics written by Huh Jun, the representative physician, and published by the Joseon government. Thus, they are appropriate as historically legitimate medical documents from which to draw words and terms for an old Chinese - modern Korean collation dictionary. This collation glossary will contribute to the increased relevance of data mining, or information retrieval, in database systems and search engines for massive Korean medical records, by providing a novel way to obtain synchronized results between the original writings in old Chinese and their translations into modern Korean. The glossary will promote the collective but consistent translation of numerous old archives of Korean medicine and of other related fields as well.

Molecular biomarkers in extrahepatic bile duct cancer patients undergoing chemoradiotherapy for gross residual disease after surgery

  • Koh, Hyeon Kang;Park, Hae Jin;Kim, Kyubo;Chie, Eui Kyu;Min, Hye Sook;Ha, Sung W.
    • Radiation Oncology Journal / v.30 no.4 / pp.197-204 / 2012
  • Purpose: To analyze the outcomes of chemoradiotherapy for extrahepatic bile duct (EHBD) cancer patients who underwent R2 resection or bypass surgery and to identify prognostic factors affecting clinical outcomes, especially in terms of molecular biomarkers. Materials and Methods: Medical records of 21 patients with EHBD cancer who underwent R2 resection or bypass surgery followed by chemoradiotherapy from May 2001 to June 2010 were retrospectively reviewed. All surgical specimens were reevaluated by immunohistochemical staining using phosphorylated protein kinase B (pAKT), CD24, matrix metalloproteinase 9 (MMP9), survivin, and $\beta$-catenin antibodies. The relationship between clinical outcomes and immunohistochemical results was investigated. Results: At a median follow-up of 20 months, the actuarial 2-year locoregional progression-free, distant metastasis-free and overall survival rates were 37%, 56%, and 54%, respectively. On univariate analysis using clinicopathologic factors, there was no significant prognostic factor. On immunohistochemical staining, cytoplasmic and nuclear staining of pAKT were positive in 10 and 6 patients, respectively. CD24 was positive in 7 patients, MMP9 in 16 patients, survivin in 8 patients, and $\beta$-catenin in 3 patients. On univariate analysis, the immunohistochemical results had no significant prognostic value for clinical outcomes. Conclusion: There was no significant association between the clinical outcomes of patients with EHBD cancer who received chemoradiotherapy after R2 resection or bypass surgery and pAKT, CD24, MMP9, survivin, or $\beta$-catenin. Future research is needed on a larger data set or with other molecular biomarkers.

3-Points Average Pivot Quicksort (3-점 평균 피벗 퀵정렬)

  • Lee, Sang-Un
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.14 no.6 / pp.295-301 / 2014
  • In the absence of a sorting algorithm faster than O(n log n), Quicksort remains the best and fastest of its kind in practice. For n data items, Quicksort runs in O(n log n) at best and $O(n^2)$ at worst. In this paper, I propose an algorithm in which the 3-point average $P=(L+M+H)/3$ is set as the pivot, where $L=a[s]$ is the first element, $H=a[e]$ is the last element, and $M=a[\lfloor (s+e)/2 \rfloor]$ is the middle element of the array, in order to obtain a sort faster than Quicksort. Test results indicate that the proposed 3-points average pivot Quicksort has a time complexity of O(n log n) in its best, average, and worst cases. The proposed algorithm can thus reduce the $O(n^2)$ worst-case time of Quicksort to O(n log n).
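
The pivot rule in the abstract above, $P=(L+M+H)/3$, can be sketched in Python as below. This is one possible reading, not the paper's implementation: the three-way partition, the explicit stack, and the clamping of the pivot to the sampled range (a guard against floating-point rounding) are assumptions added here, and the sketch does not verify the complexity claims made in the abstract.

```python
def three_point_average_quicksort(a):
    """Sort the list a in place using the pivot value P = (L + M + H) / 3."""
    stack = [(0, len(a) - 1)]           # explicit stack instead of recursion
    while stack:
        s, e = stack.pop()
        if s >= e:
            continue
        low, mid, high = a[s], a[(s + e) // 2], a[e]
        pivot = (low + mid + high) / 3  # a value, not necessarily an element
        # Assumed guard: keep the pivot inside the sampled range so that
        # rounding cannot push it past every element in the subarray.
        pivot = min(max(pivot, min(low, mid, high)), max(low, mid, high))
        # Three-way partition: [s, lt) < pivot, [lt, i) == pivot, (gt, e] > pivot.
        lt, i, gt = s, s, e
        while i <= gt:
            if a[i] < pivot:
                a[lt], a[i] = a[i], a[lt]
                lt += 1
                i += 1
            elif a[i] > pivot:
                a[i], a[gt] = a[gt], a[i]
                gt -= 1
            else:
                i += 1
        stack.append((s, lt - 1))       # elements smaller than the pivot
        stack.append((gt + 1, e))       # elements larger than the pivot
    return a

example = three_point_average_quicksort([5, 2, 9, 1, 5, 6])
```

Because the pivot is an average and may not occur in the array, the partition compares against a value instead of swapping a pivot element into place; the three-way split keeps each subproblem strictly smaller than its parent even when many keys are equal.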