• Title/Summary/Keyword: Binary Data

Search Result 1,673, Processing Time 0.04 seconds

A Case Study: Improvement of Wind Risk Prediction by Reclassifying the Detection Results (풍해 예측 결과 재분류를 통한 위험 감지확률의 개선 연구)

  • Kim, Soo-ock;Hwang, Kyu-Hong
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.23 no.3
    • /
    • pp.149-155
    • /
    • 2021
  • Early warning systems for weather risk management in the agricultural sector have been developed to predict potential wind damage to crops. These systems take into account the daily maximum wind speed to determine the critical wind speed that causes fruit drops and provide the weather risk information to farmers. In an effort to increase the accuracy of wind risk predictions, an artificial neural network for binary classification was implemented. In the present study, the daily wind speed and other weather data, which were measured at weather stations at sites of interest in Jeollabuk-do and Jeollanam-do as well as Gyeongsangbuk- do and part of Gyeongsangnam- do provinces in 2019, were used for training the neural network. These weather stations include 210 synoptic and automated weather stations operated by the Korean Meteorological Administration (KMA). The wind speed data collected at the same locations between January 1 and December 12, 2020 were used to validate the neural network model. The data collected from December 13, 2020 to February 18, 2021 were used to evaluate the wind risk prediction performance before and after the use of the artificial neural network. The critical wind speed of damage risk was determined to be 11 m/s, which is the wind speed reported to cause fruit drops and damages. Furthermore, the maximum wind speeds were expressed using Weibull distribution probability density function for warning of wind damage. It was found that the accuracy of wind damage risk prediction was improved from 65.36% to 93.62% after re-classification using the artificial neural network. Nevertheless, the error rate also increased from 13.46% to 37.64%, as well. It is likely that the machine learning approach used in the present study would benefit case studies where no prediction by risk warning systems becomes a relatively serious issue.

Corporate Bond Rating Using Various Multiclass Support Vector Machines (다양한 다분류 SVM을 적용한 기업채권평가)

  • Ahn, Hyun-Chul;Kim, Kyoung-Jae
    • Asia pacific journal of information systems
    • /
    • v.19 no.2
    • /
    • pp.157-178
    • /
    • 2009
  • Corporate credit rating is a very important factor in the market for corporate debt. Information concerning corporate operations is often disseminated to market participants through the changes in credit ratings that are published by professional rating agencies, such as Standard and Poor's (S&P) and Moody's Investor Service. Since these agencies generally require a large fee for the service, and the periodically provided ratings sometimes do not reflect the default risk of the company at the time, it may be advantageous for bond-market participants to be able to classify credit ratings before the agencies actually publish them. As a result, it is very important for companies (especially, financial companies) to develop a proper model of credit rating. From a technical perspective, the credit rating constitutes a typical, multiclass, classification problem because rating agencies generally have ten or more categories of ratings. For example, S&P's ratings range from AAA for the highest-quality bonds to D for the lowest-quality bonds. The professional rating agencies emphasize the importance of analysts' subjective judgments in the determination of credit ratings. However, in practice, a mathematical model that uses the financial variables of companies plays an important role in determining credit ratings, since it is convenient to apply and cost efficient. These financial variables include the ratios that represent a company's leverage status, liquidity status, and profitability status. Several statistical and artificial intelligence (AI) techniques have been applied as tools for predicting credit ratings. Among them, artificial neural networks are most prevalent in the area of finance because of their broad applicability to many business problems and their preeminent ability to adapt. However, artificial neural networks also have many defects, including the difficulty in determining the values of the control parameters and the number of processing elements in the layer as well as the risk of over-fitting. Of late, because of their robustness and high accuracy, support vector machines (SVMs) have become popular as a solution for problems with generating accurate prediction. An SVM's solution may be globally optimal because SVMs seek to minimize structural risk. On the other hand, artificial neural network models may tend to find locally optimal solutions because they seek to minimize empirical risk. In addition, no parameters need to be tuned in SVMs, barring the upper bound for non-separable cases in linear SVMs. Since SVMs were originally devised for binary classification, however they are not intrinsically geared for multiclass classifications as in credit ratings. Thus, researchers have tried to extend the original SVM to multiclass classification. Hitherto, a variety of techniques to extend standard SVMs to multiclass SVMs (MSVMs) has been proposed in the literature Only a few types of MSVM are, however, tested using prior studies that apply MSVMs to credit ratings studies. In this study, we examined six different techniques of MSVMs: (1) One-Against-One, (2) One-Against-AIL (3) DAGSVM, (4) ECOC, (5) Method of Weston and Watkins, and (6) Method of Crammer and Singer. In addition, we examined the prediction accuracy of some modified version of conventional MSVM techniques. To find the most appropriate technique of MSVMs for corporate bond rating, we applied all the techniques of MSVMs to a real-world case of credit rating in Korea. The best application is in corporate bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. For our study the research data were collected from National Information and Credit Evaluation, Inc., a major bond-rating company in Korea. The data set is comprised of the bond-ratings for the year 2002 and various financial variables for 1,295 companies from the manufacturing industry in Korea. We compared the results of these techniques with one another, and with those of traditional methods for credit ratings, such as multiple discriminant analysis (MDA), multinomial logistic regression (MLOGIT), and artificial neural networks (ANNs). As a result, we found that DAGSVM with an ordered list was the best approach for the prediction of bond rating. In addition, we found that the modified version of ECOC approach can yield higher prediction accuracy for the cases showing clear patterns.

A Study on the Perception Changes of Physicians toward Duty to Inform - Focusing on the Influence of the Revised Medical Law - (설명의무에 대한 의사의 인식 변화 조사 연구 -의료법 개정의 영향을 중심으로-)

  • Kim, Rosa
    • The Korean Society of Law and Medicine
    • /
    • v.19 no.2
    • /
    • pp.235-261
    • /
    • 2018
  • The Medical law stipulates regulations about the physician's duty to inform to contribute to patient's self-determination. This law was most recently revised on December 20, 2016, and came into effect on June 21, 2017. There has been much controversy about this, and it has been questioned whether or not it will be effective for physicians to comply with the duty to inform. Therefore, this study investigated perceptions of physicians of whether they observed the duty to inform and their legal judgment about that duty, and analyzed how the revision of the medical law may have affected the legal cognition of physician's duty to inform. This study was conducted through an online questionnaire survey involving 109 physicians over 2 weeks from March 29 to April 12, 2018, and 108 of the collected data were used for analysis. The questionnaire was developed by revising and supplementing the previous research (Lee, 2004). It consisted of 41 items, including 26 items related to the experience of and legal judgment about the duty to inform, 6 items related to awareness of revised medical law, and 9 items on general characteristics. The data were analyzed using SAS 9.4 program and descriptive statistics, Chi-square test, Fisher's exact test and Binary logistic regression were performed. The results are as follows. • Out of eight situations, the median number of situations that did not fulfill the duty to inform was 5 (IQR, 4-6). In addition, 12 respondents (11%) answered that they did not fulfill the duty to inform in all eight cases, while only one (1%) responded that he/she performed explanation obligations in all cases. • The median number of the legal judgment score on the duty to inform was 8 out of 13 (IQR, 7-9), and the scores ranged from a minimum of 4 (4 respondents) to a maximum of 11 (3 respondents). • More than half of the respondents (n=26, 52%) were unaware of the revision of the medical law, 27 (25%) were aware of the fact that the medical law had been revised, 20(18%) had a rough knowledge of the contents of the law, and only 5(5%) said they knew the contents of the law in detail. The level of awareness of the revised medical law was statistically significant difference according to respondents' sex (p<.49), age (p<.0001), career (p<.0001), working type (p<.024), and department (p<.049). • There was no statistically significant relationship between the level of awareness of the revised medical law and the level of legal judgment on the duty to inform. These results suggest that efforts to improve the implementation and cognition of physician's duty to inform are needed, and it is difficult to expect a direct positive effect from the legal regulations per se. Considering the distinct characteristics of medical institutions and hierarchical organizational culture of physicians, it is necessary to develop a credible guideline on the duty to inform within the medical system, and to strengthen the education of physicians about their duty to inform and its purpose.

Comparison of Housewives' Agricultural Food Consumption Characteristics by Age (주부의 연령대별 농식품 소비 특성 비교)

  • Hong, Jun-Ho;Kim, Jin-Sil;Yu, Yeon-Ju;Lee, Kyung-Hee;Cho, Wan-Sup
    • The Journal of Bigdata
    • /
    • v.6 no.1
    • /
    • pp.83-89
    • /
    • 2021
  • Lifestyle is changing rapidly, and food consumption patterns vary widely among households as dietary and food processing technologies evolve. This paper reclassified the food group of consumer panel data established by the Rural Development Administration, which contains information on purchasing agricultural products by household unit, and compared the consumption characteristics of agricultural products by age group. The criteria for age classification were divided into groups in their 60s and older with a prevalence of 20% or more metabolic diseases and groups in their 30s and 40s with less than 10%. Using the LightGBM algorithm, we classified the differences in food consumption patterns in their 30s and 50s and 60s and found that the precision was 0.85, the reproducibility was 0.71, and F1_score was 0.77. The results of variable importance were confectionery, folio, seasoned vegetables, fruit vegetables, and marine products, followed by the top five values of the SHAP indicator: confectionery, marine products, seasoned vegetables, fruit vegetables, and folio vegetables. As a result of binary classification of consumption patterns as a median instead of the average sensitive to outliers, confectionery showed that those in their 30s and 40s were more than twice as high as those in their 60s. Other variables also showed significant differences between those in their 30s and 40s and those in their 60s and older. According to the study, people in their 30s and 40s consumed more than twice as much confectionery as those in their 60s, while those in their 60s consumed more than twice as much marine products, seasoned vegetables, fruit vegetables, and folioce or logistics as much as those in their 30s and 40s. In addition to the top five items, consumption of 30s and 40s in wheat-processed snacks, breads and noodles was high, which differed from food consumption patterns in their 60s.

The Relationship between Social Relations and Physical Activity in the Young-old and Old-old Elderly (전·후기 노인들의 사회적 관계와 신체활동 실천과의 관련성)

  • So Youn Jeon;Sok Goo Lee
    • Journal of agricultural medicine and community health
    • /
    • v.48 no.2
    • /
    • pp.103-117
    • /
    • 2023
  • Objectives: This study aims to reveal the relationship between social relations and physical activity in the young-old and old-old elderly. Methods: Data from 2020 National survey of Older Koreans were used, and a total of 10,097 subjects over the age of 65 were included in analysis. The dependent variable was physical activity, and the independent variables were social relations barrier and motivational factors. x2-test and binary logistic regression were performed for data analysis. Results: The physical activity rate in the elderly were 40.8% in the young-old and 29.2% in the old-old. The socio-demographic characteristics affecting physical activity were the young-old elderly were sex, residential area, employment status and household income, and the old-old elderly were sex, age, residential area, education level and household income. The social relations barrier factors affecting physical activity were the young-old elderly were number of close friends, family care, exercise information search and video viewing, and the old-old elderly were household type, number of close friends, participation in exercise education, exercise information search and video viewing. The social relations motivational factors affecting physical activity were the young-old elderly were call with children/relative/friend, participation in sports activity, access time from home to parks, and the old-old elderly were call with children/relative/friend, participation in sports activity, satisfaction with green spaces. Conclusions: It was found that social relations barrier and motivational factors of the elderly are important factors to consider when developing physical activity promotion strategy, and there are also difference between the age of the elderly.

A Study on the Effect of Network Centralities on Recommendation Performance (네트워크 중심성 척도가 추천 성능에 미치는 영향에 대한 연구)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.23-46
    • /
    • 2021
  • Collaborative filtering, which is often used in personalization recommendations, is recognized as a very useful technique to find similar customers and recommend products to them based on their purchase history. However, the traditional collaborative filtering technique has raised the question of having difficulty calculating the similarity for new customers or products due to the method of calculating similaritiesbased on direct connections and common features among customers. For this reason, a hybrid technique was designed to use content-based filtering techniques together. On the one hand, efforts have been made to solve these problems by applying the structural characteristics of social networks. This applies a method of indirectly calculating similarities through their similar customers placed between them. This means creating a customer's network based on purchasing data and calculating the similarity between the two based on the features of the network that indirectly connects the two customers within this network. Such similarity can be used as a measure to predict whether the target customer accepts recommendations. The centrality metrics of networks can be utilized for the calculation of these similarities. Different centrality metrics have important implications in that they may have different effects on recommended performance. In this study, furthermore, the effect of these centrality metrics on the performance of recommendation may vary depending on recommender algorithms. In addition, recommendation techniques using network analysis can be expected to contribute to increasing recommendation performance even if they apply not only to new customers or products but also to entire customers or products. By considering a customer's purchase of an item as a link generated between the customer and the item on the network, the prediction of user acceptance of recommendation is solved as a prediction of whether a new link will be created between them. As the classification models fit the purpose of solving the binary problem of whether the link is engaged or not, decision tree, k-nearest neighbors (KNN), logistic regression, artificial neural network, and support vector machine (SVM) are selected in the research. The data for performance evaluation used order data collected from an online shopping mall over four years and two months. Among them, the previous three years and eight months constitute social networks composed of and the experiment was conducted by organizing the data collected into the social network. The next four months' records were used to train and evaluate recommender models. Experiments with the centrality metrics applied to each model show that the recommendation acceptance rates of the centrality metrics are different for each algorithm at a meaningful level. In this work, we analyzed only four commonly used centrality metrics: degree centrality, betweenness centrality, closeness centrality, and eigenvector centrality. Eigenvector centrality records the lowest performance in all models except support vector machines. Closeness centrality and betweenness centrality show similar performance across all models. Degree centrality ranking moderate across overall models while betweenness centrality always ranking higher than degree centrality. Finally, closeness centrality is characterized by distinct differences in performance according to the model. It ranks first in logistic regression, artificial neural network, and decision tree withnumerically high performance. However, it only records very low rankings in support vector machine and K-neighborhood with low-performance levels. As the experiment results reveal, in a classification model, network centrality metrics over a subnetwork that connects the two nodes can effectively predict the connectivity between two nodes in a social network. Furthermore, each metric has a different performance depending on the classification model type. This result implies that choosing appropriate metrics for each algorithm can lead to achieving higher recommendation performance. In general, betweenness centrality can guarantee a high level of performance in any model. It would be possible to consider the introduction of proximity centrality to obtain higher performance for certain models.

A Studyof Psychiatric Treatment Compliance in Referred Patients at a General Hospital (자문의뢰된 입원환자의 특성과 정신과 치료 순응도에 대한 연구)

  • Shim, In-Bo;Ko, Young-Hoon;Lee, Moon-Soo;Kim, Yong-Ku;Han, Chang-Su
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.19 no.2
    • /
    • pp.66-73
    • /
    • 2011
  • Objectives:The present study investigates the status of inpatient psychiatric consultations at a general hospital in order to find factors that contribute to treatment compliance related to psychiatric consultations. Methods:The subjects were 333 patients who were hospitalized at Korea University Medical Center Ansan Hospital from 1 September 2009 to 31 July 2010.The patients were referred for psychiatric consultation during hospitalization. This study investigates demographic data, request department, referral causes, requestor, psychiatric history and diagnosis, andpsychiatric treatment compliance. Treatment compliance was defined as whether or not the patient had accepted psychiatric treatment during hospitalization or outpatient department(OPD) follow-up. This study ascertains the factors that have impact on compliance, by taking binary logistic regression with compliance and other variables. Results:Among the patients that were offered psychiatric treatment during hospitalization(N=310), treatment compliance was 82.9%. Among the patients that were offered OPD treatment(N=111), compliance was 55.8%. Elderly group(>65 years) showed better compliance to treatment during hospitalization than the younger patient group(OR=4.838, p=0.004). Patients with secondary psychiatric disorders showed better OPD follow-up compliance than patients with secondary psychiatric disorders(OR=8.520, p=.008). Conclusion:Elderly patients showed better compliance for psychiatric treatment during hospitalization. However they commonly have disorders such as delirium and mood disorders that have impact on the patient's physical state, hence further active measures should be carried out. Patients referred due to primary psychiatric disorders showed poor OPD compliance. Therefore clinicians have to suggest multidisciplinary interventions that will improve treatment compliance of such patients.

  • PDF

Related Factors to Handwashing with Soap in Korean Adults (우리나라 성인의 비누로 손씻기 실천 관련요인)

  • Lee, Youn-Hee;Lee, Moo-Sik;Hong, SuJin;Yang, Nam-Young;Hwang, Hae-Jung;Kim, Byung-Hee;Kim, Hyun-Soo;Kim, Eun-Young;Park, Yun-Jin;Lim, Go-Un;Kim, Young-Tek
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.17 no.1
    • /
    • pp.89-99
    • /
    • 2016
  • Objectives: This cross-sectional study aims to investigate the prevalence and factors relating to handwashing with soap among Korean adults. Methods: Study subjects consist of 755 adults who have been contacted in September 2013 via telephone surveys. The data collected has been analyzed using descriptive statistics, a chi-square test and a logistic regression analysis. A primary purpose is to understand the prevalence of handwashing with soap more than 8 times daily and for 30 seconds per wash among adults. Independent variables include socioeconomic levels, the participants' perception and knowledge of handwashing and their educational experiences relating to handwashing. Results: The overall percentile of people who wash their hands with soap 8 time per day for 30 seconds or more per wash was 16.0%, which is 121 people out of 755 study subjects. In univariate analysis, age, education levels, monthly average income, handwashing habits, perceptions relate to the importance of handwashing, self-assessment of handwashing, environment of public toilet, and the completion of handwashing education shows significant result. Significant differences also appear (p<0.05) in logistic regression analysis on binary variables. There is a strong correlation between daily frequency of handwashing and willingness to wash hands while outside. For example, people who wash their hands very often while outside are 2.24 times (95% C.I. 1.29-3.87) more likely to practice handwashing with soap 8 times per day for 30 seconds or more per wash than those people who only intermittently wash their hands while outside. Furthermore, people with general unwillingness to wash their hands while outside are 4.61 times (95% C.I. 1.22-3.28) less likely to practice handwashing with soap 8 times per day for 30 seconds or more per wash than those with general willingness. Conclusions: This study has been carried out to identify the decision factors in practicing handwashing with soap for Korean adults. In univariate analysis, age, education level, monthly average income, handwashing habits, handwashing self-assessment, public toilet environment, completion of handwashing education and so forth have been identified to be the decision factors. This study result shows that the overall level of cleanliness of public toilet perceives to be poor and it suggests that the environment of public toilet needs to be enhanced. As the handwashing habits and handwashing-self assessment have been identified to be the significant decision factors for handwashing, there search and approach in these factors need to be developed further.

Quantitative Analysis of Feldspar Mixture Samples Using the Rietveld Refinement Method (Rietveld Refinement 방법을 응용한 장석 혼합시료의 정량분석 연구)

  • Shim, Sang-Heon;Ahn, Jung-Ho;Kim, Soo-Jin
    • Journal of the Mineralogical Society of Korea
    • /
    • v.7 no.1
    • /
    • pp.62-79
    • /
    • 1994
  • The quanttative and structural analysis of the binary standard mixtures of albite and quartz, and microcline and albite were carried out using the Rietveld refinement method in order to investigate the accuracy and precision of the method. The quantitative analysis using the Rietveld method results in a standard deviation of 4 wt % for the albite-quartz standard mixtures and 1 wt % for the microcline-albite standard mixtures, suggesting that its accuracy is far better than that of the conventional XRD method in which only a few selected peaks are utilized. Furthermore, the unit-cell parameters of component minerals in mixtures were also estimated accurately during the analysis. It was observed that the refined weight fractions deviate systematically from their measured values when the method is applied to the mixtures that contain minerals with different degrees of preferred orientation, such as albite-quartz mixtures. The preferred orientation parameters and R-values suggest that the systematic deviation is caused as a result of the preferred orientation effect of feldspar crystallites. It is evident that the preferred orientation corrections are of help for the accurate determination of unit-cell parameters, although they may not improve the result of quantitative analysis significantly. The refined weight fraction of the mineral with higher degree of preferred orientation in mixture is greater than the measured one. This is apparently caused by the effect of geometry of feldspar crystallites in the surface of the mounted sample. The Rietveld refinement method minimizes the problems inherent in the traditional XRD methods, such as the line overlap, primary extinction, and preferred orientation effect, by fitting every data point in a whole pattern explicitly. Furthermore, accurate unit-cell parameters as well as scale factors that can be obtained from the Rietveld refinement are used for the quqantification. The present stdudy demonstrates that the Rietveld method yields far more accurate analytical result than the conventional XRD quantitative analysis method does.

  • PDF

Comparative Study on the Estimation of CO2 absorption Equilibrium in Methanol using PC-SAFT equation of state and Two-model approach. (메탄올의 이산화탄소 흡수평형 추산에 대한 PC-SAFT모델식과 Two-model approach 모델식의 비교연구)

  • Noh, Jaehyun;Park, Hoey Kyung;Kim, Dongsun;Cho, Jungho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.10
    • /
    • pp.136-152
    • /
    • 2017
  • The thermodynamic models, PC-SAFT (Perturbed-Chain Statistical Associated Fluid Theory) state equation and the Two-model approach liquid activity coefficient model NRTL (Non Random Two Liquid) + Henry + Peng-Robinson, for modeling the Rectisol process using methanol aqueous solution as the $CO_2$ removal solvent were compared. In addition, to determine the new binary interaction parameters of the PC-SAFT state equations and the Henry's constant of the two-model approach, absorption equilibrium experiments between carbon dioxide and methanol at 273.25K and 262.35K were carried out and regression analysis was performed. The accuracy of the newly determined parameters was verified through the regression results of the experimental data. These model equations and validated parameters were used to model the carbon dioxide removal process. In the case of using the two-model approach, the methanol solvent flow rate required to remove 99.00% of $CO_2$ was estimated to be approximately 43.72% higher, the cooling water consumption in the distillation tower was 39.22% higher, and the steam consumption was 43.09% higher than that using PC-SAFT EOS. In conclusion, the Rectisol process operating under high pressure was designed to be larger than that using the PC-SAFT state equation when modeled using the liquid activity coefficient model equation with Henry's relation. For this reason, if the quantity of low-solubility gas components dissolved in a liquid at a constant temperature is proportional to the partial pressure of the gas phase, the carbon dioxide with high solubility in methanol does not predict the absorption characteristics between methanol and carbon dioxide.