• Title/Summary/Keyword: classification criterion

Search Result 284, Processing Time 0.029 seconds

Is it true?: A Meta-analysis on the Efficacy of CBCA in Detecting Truths (그 말은 진실일까?: CBCA의 진실 탐지 효용성에 대한 메타분석적 고찰)

  • Kim, Hye Jin;Lee, Sangmin;Hur, Taekyun;Choi, Seung-Hyuk
    • Korean Journal of Forensic Psychology
    • /
    • v.12 no.2
    • /
    • pp.121-149
    • /
    • 2021
  • Statement Validity Analysis (SVA) is utilized in criminal investigations and the court to assess the credibility of given statements. During this procedure, the criteria for Criteria-Based Content Analysis (CBCA) are used to evaluate whether statements include the characteristics reflecting actual experiences about the event in question. Various studies had been conducted on the efficacy (classification rates) of CBCA criteria, yet the consistency of the findings was not investigated. In the current study, a meta-analysis was conducted with Korean CBCA studies reported from 2004 to 2020 (a total of fourteen studies). As a result, the total score of CBCA was found to successfully discriminate truth and fabrication. A significant positive (+) effect size was found with four criteria (3, 4, 10, and 12), all of which are classified as cognitive criteria. However, contrary to the underlying assumption for CBCA, criterion 18, classified as one of the motivational criteria, showed a significant negative (-) effect size. Meanwhile, moderator analyses were possible for eleven criteria (2~9, 12, 13, 15) and the results showed the significant effects of potential moderator variables such as the gender and status of the participants, study types and designs, number of raters, and publication status. The current results suggests that more careful attention is required to each criterion-especially the cognitive criteria-rather than the total CBCA score as well as the possible moderator effects in order to assess truthfulness of the statements. The implication, limitations, and suggestions for future studies were discussed.

Ensemble Learning with Support Vector Machines for Bond Rating (회사채 신용등급 예측을 위한 SVM 앙상블학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.29-45
    • /
    • 2012
  • Bond rating is regarded as an important event for measuring financial risk of companies and for determining the investment returns of investors. As a result, it has been a popular research topic for researchers to predict companies' credit ratings by applying statistical and machine learning techniques. The statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have been traditionally used in bond rating. However, one major drawback is that it should be based on strict assumptions. Such strict assumptions include linearity, normality, independence among predictor variables and pre-existing functional forms relating the criterion variablesand the predictor variables. Those strict assumptions of traditional statistics have limited their application to the real world. Machine learning techniques also used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). Especially, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that can maximize the margin between two categories. SVM is simple enough to be analyzed mathematical, and leads to high performance in practical applications. SVM implements the structuralrisk minimization principle and searches to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum and thus, overfitting is unlikely to occur with SVM. In addition, SVM does not require too many data sample for training since it builds prediction models by only using some representative sample near the boundaries called support vectors. A number of experimental researches have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can be potential causes for degrading SVM's performance. First, SVM is originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification such as One-Against-One, One-Against-All have been proposed, but they do not improve the performance in multi-class classification problem as much as SVM for binary-class classification. Second, approximation algorithms (e.g. decomposition methods, sequential minimal optimization algorithm) could be used for effective multi-class computation to reduce computation time, but it could deteriorate classification performance. Third, the difficulty in multi-class prediction problems is in data imbalance problem that can occur when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. SVM ensemble learning is one of machine learning methods to cope with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing weight on the misclassified observations through iterations. The observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus Boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve multiclass prediction problem. Since MGM-Boost introduces the notion of geometric mean into AdaBoost, it can perform learning process considering the geometric mean-based accuracy and errors of multiclass. This study applies MGM-Boost to the real-world bond rating case for Korean companies to examine the feasibility of MGM-Boost. 10-fold cross validations for threetimes with different random seeds are performed in order to ensure that the comparison among three different classifiers does not happen by chance. For each of 10-fold cross validation, the entire data set is first partitioned into tenequal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we have obtained the results for classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows the higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%)in terms of geometric mean-based prediction accuracy. T-test is used to examine whether the performance of each classifiers for 30 folds is significantly different. The results indicate that performance of MGM-Boost is significantly different from AdaBoost and SVM classifiers at 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-classproblems such as bond rating.

Determination of Mean Shear Wave Velocity to the Depth of 30m Based on Shallow Shear Wave Velocity Profile (얕은 심도 전단파속도 분포를 이용한 30m 심도 평균 전단파속도의 결정)

  • Sun, Chang-Guk;Chung, Choong-Ki;Kim, Dong-Soo
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.11 no.1 s.53
    • /
    • pp.45-57
    • /
    • 2007
  • The mean shear wave velocity to the depth of 30 m (Vs30) derived from the western Vs is the current site classification criterion for determining the design seismic ground motion taking into account the site amplification potential. In order to evaluate the Vs30 at a site, a shear wave velocity (Vs) Profile extending to at least 30 m in depth must be acquired from in-situ seismic test. In many cases, however, the resultant depth of the Vs profile may not extend to 30 m, owing to the unfavorable field condition and the limitation of adopted testing techniques. In this study, the Vs30 and the mean shear wave velocity to a depth shallower, than 30 m (VsDs) were computed from the Vs profiles more than 30 m in depth obtained by performing various seismic tests at total 72 sites in Korea, and a correlation between Vs30 and VsDs was drawn based on the computed mean Vs data. In addition, a method for extrapolating the Vs profile from shallow depth to 30 m was developed by building a shape curve based on the average data of all Vs profiles. For evaluating the Vs30 from the shallow Vs profiles, both the methods using VsDs and shape curve result in less bias than the simplest method of extending the lowermost Vs equally to 30 m in depth, and are usefully applicable particularly in the cases of the Vs profiles extending to at least 10 m in depth.

Transient Torsional Vibration Response due to Ice Impact Torque Excitation on Marine Diesel Engine Propulsion Shafting (선박용 디젤엔진 추진축에서 빙 충격 토크 기진에 의한 과도 비틀림 진동 응답)

  • Barro, Ronald D.;Eom, Ki Tak;Lee, Don Chool
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.25 no.5
    • /
    • pp.321-328
    • /
    • 2015
  • In recent years, there has been an increasing demand to apply the new IACS(International Association of Classification Societies) standards for ice and polar-classed ships. For ice-class vessel propulsion system, the ice impact torque design criterion is defined as a periodic harmonic function in relation to the number of the propeller blades. However, irregular or transient ice impact torque is assumed to occur likely in actual circumstances rather than these periodic loadings. In this paper, the reliability and torsional vibration characteristics of a comparatively large six-cylinder marine diesel engine for propulsion shafting system was examined and reviewed in accordance with current regulations. In this particular, the transient ice impact torque and excessive vibratory torque originating from diesel engine were interpreted and the resonant points identified through theoretical analysis. Several floating ice impacts were carried out to evaluate torque responses using the calculation method of classification rule requirement. The Newmark method was used for the transient response analysis of the whole system.

Study on the influence of sewer network simplification on urban inundation modelling results (하수관망의 간소화가 도시침수 모의에 미치는 영향 분석에 관한 연구)

  • Lee, Seung-Soo;Pakdimanivong, Mary;Jung, Kwan-Sue;Kim, Yeonsu
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.4
    • /
    • pp.347-354
    • /
    • 2018
  • In urban areas, runoff flow is drained through sewer networks as well as surface areas. Therefore, it is very important to consider sewer networks as a component of hydrological drainage processes when conducting urban inundation modelling. However, most researchers who have implemented urban inundation/flood modelling, instinctively simplified the sewer networks without the appropriate criteria. In this research, a 1D-2D fully coupled urban inundation model is applied to estimate the influence of sewer network simplification on urban inundation modelling based on the dendritic network classification. The one-dimensional (1D) sewerage system analysis model, which was introduced by Lee et al. (2017), is used to simulate inlet and overflow phenomena by interacting with surface flow. Two-dimensional (2D) unstructured meshes are also applied to simulate surface flow and are combined with the 1D sewerage analysis model. Sewer network pipes are simplified based on the dendritic network classification method, namely the second and third order, and all cases of pipes are conducted as a control group. Each classified network case, including a control group, is evaluated through their application to the 27 July 2011 extreme rainfall event, which caused severe inundation damages in the Sadang area in Seoul, South Korea. All cases are compared together regarding inundation area, inflow discharge and overflow discharge. Finally, relevant criterion for the simplification method is recommended.

An Exploratory Study on the Introduction of Loyalty to Segmentation of Theme Park Users (주제공원 이용자의 시장세분화를 위한 충성도의 사용가능성 검토)

    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.26 no.1
    • /
    • pp.1-11
    • /
    • 1998
  • The purpose of this paper is twofold : to identify loyalty applicable to segmentation of theme park users and to find characteristics of the segments. Thetheme park was regarded as a product and Lotte World was regarded as a brand. One hundred thirty five college students were selected by nonprobability sampling for two waves thirty of data collection. Both behavioral and attitudinal dimension of loyalty were measured in the first wave by the proportion of visit of the Lotte World to 3 major theme parks for one year, including the Lotte World, and by calculating the mean score of selected 7 attitudinal items, respectively. After 14 weeks, the same respondents were asked the number of actual visits of the Lotte World. Medians of two dimensions and cluster analyses were utilized to classify the respondents into 4 categories : high, spurious, latent, and low loyalty. Then ANOVA and $$\chi$^2$ test of independence were conducted to find the difference in intention to visit the Lotte World and actual visitation of it among groups. Only intention was significantly different by the group and the mean score of intention was highest in the high loyalty group. Although no statistical difference was found in actual visitation among groups, the theory of planned behavior provided a theoretical support to conclude that the loyalty is a useful variable for segmentation of theme park users because intention is an antecedent variable to the behavior. Discriminant analyses showed that characteristics of each loyalty group can be differentiated by motivations and constraints. When median was a group classification criterion, 73.2 percent of high loyalty group was correctly classified. A few comments were suggested on data collection, and inclusion of new discriminant variables was discussed for the future research.

  • PDF

Influences of Environmental Factors on Wheat Quality I. Relationship between Grain Yield and Quality of the Wheat as related to Cultivated Locations (재배 환경조건이 소맥품질에 미치는 영향 I. 지역별 수량성과 품질과의 관계)

  • 류인수;장학길;안완식;송현숙
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.22 no.2
    • /
    • pp.59-64
    • /
    • 1977
  • The relationship between grain yield. protein content and sedimentation value in wheat were studied using 9 varieties cultivated at 8 locations in Korea. The grain yields of wheat varied widely, according to varieties and locations. Negative correlations between grain yield and protein content, and grain yield and sedimentation value were observed, while a positive correlation between protein content and sedimentation value was observed. Specific sedimentation values of soft wheats were below 4, while those of hard wheats were above 5. The intermediate varieties had sedimentation values of 4-5. This showed that specific sedimentation values could be used as a criterion in the classification of wheat quality.

  • PDF

Cluster-Based Selection of Diverse Query Examples for Active Learning (능동적 학습을 위한 군집화 기반의 다양한 복수 문의 예제 선정 방법)

  • Kang, Jae-Ho;Ryu, Kwang-Ryel;Kwon, Hyuk-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.11 no.1
    • /
    • pp.169-189
    • /
    • 2005
  • In order to derive a better classifier with a limited number of training examples, active teaming alternately repeats the querying stage fur category labeling and the subsequent learning stage fur rebuilding the calssifier with the newly expanded training set. To relieve the user from the burden of labeling, especially in an on-line environment, it is important to minimize the number of querying steps as well as the total number of query examples. We can derive a good classifier in a small number of querying steps by using only a small number of examples if we can select multiple of diverse, representative, and ambiguous examples to present to the user at each querying step. In this paper, we propose a cluster-based batch query selection method which can select diverse, representative, and highly ambiguous examples for efficient active learning. Experiments with various text data sets have shown that our method can derive a better classifier than other methods which only take into account the ambiguity as the criterion to select multiple query examples.

  • PDF

A Study on the Evaluation of Optimal Program Applicability for Face Recognition Using Machine Learning (기계학습을 이용한 얼굴 인식을 위한 최적 프로그램 적용성 평가에 대한 연구)

  • Kim, Min-Ho;Jo, Ki-Yong;You, Hee-Won;Lee, Jung-Yeal;Baek, Un-Bae
    • Korean Journal of Artificial Intelligence
    • /
    • v.5 no.1
    • /
    • pp.10-17
    • /
    • 2017
  • This study is the first attempt to raise face recognition ability through machine learning algorithm and apply to CRM's information gathering, analysis and application. In other words, through face recognition of VIP customer in distribution field, we can proceed more prompt and subdivided customized services. The interest in machine learning, which is used to implement artificial intelligence, has increased, and it has become an age to automate it by using machine learning beyond the way that a person directly models an object recognition process. Among them, Deep Learning is evaluated as an advanced technology that shows amazing performance in various fields, and is applied to various fields of image recognition. Face recognition, which is widely used in real life, has been developed to recognize criminals' faces and catch criminals. In this study, two image analysis models, TF-SLIM and Inception-V3, which are likely to be used for criminal face recognition, were selected, analyzed, and implemented. As an evaluation criterion, the image recognition model was evaluated based on the accuracy of the face recognition program which is already being commercialized. In this experiment, it was evaluated that the recognition accuracy was good when the accuracy of the image classification was more than 90%. A limit of our study which is a way to raise face recognition is left as a further research subjects.

Geological Characteristics of Kyongju-Ulsan Area : Palaeomagnetism and Magnetic Susceptibility of the Granitic Rocks in the Ulsan Fault Area (경주-울산일원에 대한 지역지질 특성연구 : 울산단층주변 화강암류의 잔류자기와 대자율)

  • Kim, In-Soo;Son, Moon;Jung, Hyun-Jung;Lee, Joon-Dong;Kim, Jeong-Jin;Paik, In Sung
    • Economic and Environmental Geology
    • /
    • v.31 no.1
    • /
    • pp.31-43
    • /
    • 1998
  • A total of 469 granitic samples were collected from 44 sites in the Ulsan fault area, southeast Korea. According to the previous petrographic studies, the granitic rocks have been divided into four groups (Hornblende biotite granodiorite, Hornblende granite, Biotite granite and Alkali-feldspar granite). NRM intensities, values of low field magnetic susceptibility, and magnetic behaviors during stepwise demagnetization experiments suggest rather a three-fold classification: In this scheme, Hornblende granite and Biotite granite are grouped together, as they did not show any significant differences in magnetic characteristics. Based on the Ishihara (1979)'s criterion, Alkali-feldspar granite is classified as ilmenite-series granite, whereas others are classified as magnetite-series granite. In the eastern part of the study area including the Tertiary basin area, declinations of site-mean characteristic remanent magnetizations (ChRMs) show clockwise deflection of more than 30 from the reference direction of east Asia. Both along and in the adjacent region of the Ulsan fault-line, however, no deflection of remanent direction was observed. A boundary line between the deflected and undeflected site-mean ChRMs is defined in this study, which runs roughly parallel to the Ulsan fault-line at the distance of about 6km eastward from the fault-line. We suggest that this newly found boundary line, which we call Yonil tectonic line, released dextral simple shear stress acted in the southeastern part of the Korean peninsula during the opening stage of the East Sea in the Early Cenozoic.

  • PDF