• Title/Summary/Keyword: False Positive data

Search Result 241, Processing Time 0.027 seconds

The cutoff criterion and the accuracy of the polygraph test for crime investigation (범죄수사를 위한 거짓말탐지 검사(polygraph test)의 판정기준과 정확성)

  • Yu Hwa Han ;Kwangbai Park
    • Korean Journal of Culture and Social Issue
    • /
    • v.14 no.4
    • /
    • pp.103-117
    • /
    • 2008
  • The polygraph test administered by the Korean Prosecutors Office for crime investigations customarily uses the score of -12 as the cutoff point separating the subjects who lie from those who tell the truth. The criterion used by the KPO is different from the one (-13) suggested by Backster (1963) who invented the particular method for lie detection. Based on the signal detection theory applied to the real polygraph test data obtained from real crime suspects by the KPO, the present study identified the score of -8 as an optimal criterion resulting in the highest overall accuracy of the polygraph test. The classification of the subjects with the score of -8 as the criterion resulted in the highest accuracy (83.17%) compared with the accuracies of classifications with the Backster's criterion (76.24%) and the KPO's criterion (80.20%). However, the new criterion was also found to result in more false-positive cases. Based on the results from the present study, it was recommended to use the score of -8 as the criterion when the overall accuracy is important but the score of -12 or -13 when avoiding false-positive is more important than securing the overall accuracy.

  • PDF

Understanding the Asymptotic Convergence of Domain of Attraction in Extreme Value Distribution for Establishing Baseline Distribution in Statistical Damage Assessment of a Structure (통계적 구조물 손상진단에서 기저분포 구성을 위한 극치분포의 점근적 수렴성 이해)

  • Kang, Joo-Sung;Park, Hyun-Woo
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.13 no.2 s.54
    • /
    • pp.231-242
    • /
    • 2009
  • The baseline distribution of a structure represents the statistical distribution of dynamic response feature from the healthy state of the structure. Generally, damage-sensitive dynamic response feature of a structure manifest themselves near the tail of a baseline statistical distribution. In this regard, some researchers have paid attention to extreme value distribution for modeling the tail of a baseline distribution. However, few researches have been conducted to theoretically understand the extreme value distribution from a perspective of statistical damage assessment. This study investigates the asymptotic convergence of domain of attraction in extreme value distribution through parameter estimation, which is needed for reliable statistical damage assessment. In particular, the asymptotic convergence of a domain of attraction is quantified with respect to the sample size out of which each extreme value is extracted. The effect of the sample size on false positive alarms in statistical damage assessment is quantitatively investigated as well. The validity of the proposed method is demonstrated through numerically simulated acceleration data on a two span continuous truss bridge.

Malicious Packet Detection Technology Using Machine Learning and Deep Learning (머신러닝과 딥러닝을 활용한 악성 패킷 탐지 기술 연구)

  • Byounguk An;JongChan Lee;JeSung Chi;Wonhyung Park
    • Convergence Security Journal
    • /
    • v.21 no.4
    • /
    • pp.109-115
    • /
    • 2021
  • Currently, with the development of 5G and IoT technology, it is being used in connection with the things used in real life through a network. However, attempts to use networked computers for malicious purposes are increasing, and attacks using malicious codes that infringe the confidentiality and integrity of user information are becoming more intelligent. As a countermeasure to this, research is being conducted on a method of detecting malicious packets using a security control system and AI technology, supervised learning. The cyber security control system is being operated inefficiently in terms of manpower and cost. In addition, in the era of the COVID-19 pandemic, remote work has increased, making it difficult to respond immediately. In addition, malicious code detection using the existing AI technology, supervised learning, does not detect variant malicious code, and has an inaccurate malicious code detection rate depending on the quantity and quality of data. Therefore, in this study, by converging malicious packet detection technologies through various machine learning and deep learning models, the accuracy of malicious packet detection is increased, the false positive rate and the false positive rate are reduced, and a new type of malicious packet can be efficiently detected when intrusion. We propose a malicious packet detection technology.

Surveillance Evaluation of the National Cancer Registry in Sabah, Malaysia

  • Jeffree, Saffree Mohammad;Mihat, Omar;Lukman, Khamisah Awang;Ibrahim, Mohd Yusof;Kamaludin, Fadzilah;Hassan, Mohd Rohaizat;Kaur, Nirmal;Myint, Than
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.17 no.7
    • /
    • pp.3123-3129
    • /
    • 2016
  • Background: Cancer is the fourth leading cause of death in Sabah Malaysia with a reported age-standardized incidence rate was 104.9 per 100,000 in 2007. The incidence rate depends on non-mandatory notification in the registry. Under-reporting will provide the false picture of cancer control program effectiveness. The present study was to evaluate the performance of the cancer registry system in terms of representativeness, data quality, simplicity, acceptability and timeliness and provision of recommendations for improvement. Materials and Methods: The evaluation was conducted among key informants in the National Cancer Registry (NCR) and reporting facilities from Feb-May 2012 and was based on US CDC guidelines. Representativeness was assessed by matching cancer case in the Health Information System (HIS) and state pathology records with those in NCR. Data quality was measured through case finding and re-abstracting of medical records by independent auditors. The re-abstracting portion comprised 15 data items. Self-administered questionnaires were used to assess simplicity and acceptability. Timeliness was measured from date of diagnosis to date of notification received and data dissemination. Results: Of 4613 cancer cases reported in HIS, 83.3% were matched with cancer registry. In the state pathology centre, 99.8% was notified to registry. Duplication of notification was 3%. Data completeness calculated for 104 samples was 63.4%. Registrars perceived simplicity in coding diagnosis as moderate. Notification process was moderately acceptable. Median duration of interval 1 was 5.7 months. Conclusions: The performances of registry's attributes are fairly positive in terms of simplicity, case reporting sensitivity, and predictive value positive. It is moderately acceptable, data completeness and inflexible. The usefulness of registry is the area of concern to achieve registry objectives. Timeliness of reporting is within international standard, whereas timeliness to data dissemination was longer up to 4 years. Integration between existing HIS and national registration department will improve data quality.

Parallel Bayesian Network Learning For Inferring Gene Regulatory Networks

  • Kim, Young-Hoon;Lee, Do-Heon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.202-205
    • /
    • 2005
  • Cell phenotypes are determined by the concerted activity of thousands of genes and their products. This activity is coordinated by a complex network that regulates the expression of genes. Understanding this organization is crucial to elucidate cellular activities, and many researches have tried to construct gene regulatory networks from mRNA expression data which are nowadays the most available and have a lot of information for cellular processes. Several computational tools, such as Boolean network, Qualitative network, Bayesian network, and so on, have been applied to infer these networks. Among them, Bayesian networks that we chose as the inference tool have been often used in this field recently due to their well-established theoretical foundation and statistical robustness. However, the relative insufficiency of experiments with respect to the number of genes leads to many false positive inferences. To alleviate this problem, we had developed the algorithm of MONET(MOdularized NETwork learning), which is a new method for inferring modularized gene networks by utilizing two complementary sources of information: biological annotations and gene expression. Afterward, we have packaged and improved MONET by combining dispersed functional blocks, extending species which can be inputted in this system, reducing the time complexities by improving algorithms, and simplifying input/output formats and parameters so that it can be utilized in actual fields. In this paper, we present the architecture of MONET system that we have improved.

  • PDF

Performance Comparison of Commercial and Customized CNN for Detection in Nodular Lung Cancer (결절성 폐암 검출을 위한 상용 및 맞춤형 CNN의 성능 비교)

  • Park, Sung-Wook;Kim, Seunghyun;Lim, Su-Chang;Kim, Do-Yeon
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.6
    • /
    • pp.729-737
    • /
    • 2020
  • Screening with low-dose spiral computed tomography (LDCT) has been shown to reduce lung cancer mortality by about 20% when compared to standard chest radiography. One of the problems arising from screening programs is that large amounts of CT image data must be interpreted by radiologists. To solve this problem, automated detection of pulmonary nodules is necessary; however, this is a challenging task because of the high number of false positive results. Here we demonstrate detection of pulmonary nodules using six off-the-shelf convolutional neural network (CNN) models after modification of the input/output layers and end-to-end training based on publicly databases for comparative evaluation. We used the well-known CNN models, LeNet-5, VGG-16, GoogLeNet Inception V3, ResNet-152, DensNet-201, and NASNet. Most of the CNN models provided superior results to those of obtained using customized CNN models. It is more desirable to modify the proven off-the-shelf network model than to customize the network model to detect the pulmonary nodules.

Developing a Molecular Prognostic Predictor of a Cancer based on a Small Sample

  • Kim Inyoung;Lee Sunho;Rha Sun Young;Kim Byungsoo
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.195-198
    • /
    • 2004
  • One Important problem in a cancer microarray study is to identify a set of genes from which a molecular prognostic indicator can be developed. In parallel with this problem is to validate the chosen set of genes. We develop in this note a K-fold cross validation procedure by combining a 'pre-validation' technique and a bootstrap resampling procedure in the Cox regression . The pre-validation technique predicts the microarray predictor of a case without having seen the true class level of the case. It was suggested by Tibshirani and Efron (2002) to avoid the possible over-fitting in the regression in which a microarray based predictor is employed. The bootstrap resampling procedure for the Cox regression was proposed by Sauerbrei and Schumacher (1992) as a means of overcoming the instability of a stepwise selection procedure. We apply this K-fold cross validation to the microarray data of 92 gastric cancers of which the experiment was conducted at Cancer Metastasis Research Center, Yonsei University. We also share some of our experience on the 'false positive' result due to the information leak.

  • PDF

Semi-automatic System for Mass Detection in Digital Mammogram (디지털 마모그램 반자동 종괴검출 방법)

  • Cho, Sun-Il;Kwon, Ju-Won;Ro, Yong-Man
    • Journal of Biomedical Engineering Research
    • /
    • v.30 no.2
    • /
    • pp.153-161
    • /
    • 2009
  • Mammogram is one of the important techniques for mass detection, which is the early diagnosis stage of a breast cancer. Especially, the CAD(Computer Aided Diagnosis) using mammogram improves the working performance of radiologists as it offers an effective mass detection. There are two types of CAD systems using mammogram; automatic and semi-automatic CAD systems. However, the automatic segmentation is limited in performance due to the difficulty of obtaining an accurate segmentation since mass occurs in the dense areas of the breast tissue and has smoother boundaries. Semi-automatic CAD systems overcome these limitations, however, they also have problems including high FP (False Positive) rate and a large amount of training data required for training a classifier. The proposed system which overcomes the aforementioned problems to detect mass is composed of the suspected area selection, the level set segmentation and SVM (Support Vector Machine) classification. To assess the efficacy of the system, 60 test images from the FFDM (Full-Field Digital Mammography) are analyzed and compared with the previous semi-automatic system, which uses the ANN classifier. The experimental results of the proposed system indicate higher accuracy of detecting mass in comparison to the previous systems.

AUTOMATIC DETECTION OF EPILEPTIFORM ACTIVITY USING WAVELET AND ARTIFICIAL NEURAL NETWORK (웨이브렛과 신경회로망을 이용한 간질 파형 자동 검출)

  • Park, H.S.;Park, C.H.;Lee, Y.H.;Lee, D.S.;Kim, S.I.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.05
    • /
    • pp.358-361
    • /
    • 1997
  • This paper describes a multichannel epileptic seizure detection algorithm based on wavelet transform(WT), artificial neural network(ANN) and expert system. First, through the WT, a small number of wavelet coefficients is used to represent the single channel epileptic spike. Next, 3-layer feed-forward network employing the error back propagation algorithm is trained and tested using parameters obtained above. Finally, 16 channel expert system which is based on clinical experience is introduced as a artifact rejection and reliable detection. The suggested algorithm was implemented on personal computer(PC). Two main events i.e., epileptiform and normal activities, were selected from 32 person's EEGs(normal: 20, seizure disorder: 12) in consensus among experts. The result was that WT reduced data input size and ANN detected 97 of the 100 EEGs containing definite spike - sensitivity of 97%. Expert rule system was capable of rejecting a wide variety of artifacts commonly found in EEG recordings. It also reduced false positive detections of ANN.

  • PDF

DIAGNOSTIC ACCURACY OF $^{18}F$-FDG IN NODAL NEGATIVE ORAL SQUAMOUS CELL CARCINOMA (구강암 환자에서 경부 임파절 평가에 대한 $^{18}F$-FDG PET(Fluorine 18-Labelled Deoxyglucose Positron Emission Tomography)의 유용성)

  • Choi, Eun-Joo;Kang, Sang-Hoon;Kim, Ki-Ho;Nam, Woong;Kim, Hyung-Jun;Cha, In-Ho
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • v.33 no.6
    • /
    • pp.597-600
    • /
    • 2007
  • PET is one of the most widely used verification methods for evaluation of metastasis on the lymph nodes of the neck in oral cancer patients. The purpose of this study was to assess the correlation between PET findings and histopathologic findings in patients who had been diagnosed as squamous cell carcinoma and performed neck dissection. Thirty-four necks in 25 patients had been evaluated on pathologic lymph nodes and the data were compared with preoperative PET scan. The sensitivity of PET at the level of the neck was 72.7%, specificity was 60%, and accuracy was 79.2%. Since FDG-PET show high false-positive results, it should be used with other diagnostic tools for evaluation of lymph node metastasis.