Search | Korea Science

The Unified Framework for AUC Maximizer

Jun, Jong-Jun;Kim, Yong-Dai;Han, Sang-Tae;Kang, Hyun-Cheol;Choi, Ho-Sik
- Communications for Statistical Applications and Methods
- /
- v.16 no.6
- /
- pp.1005-1012
- /
- 2009
The area under the curve(AUC) is commonly used as a measure of the receiver operating characteristic(ROC) curve which displays the performance of a set of binary classifiers for all feasible ratios of the costs associated with true positive rate(TPR) and false positive rate(FPR). In the bipartite ranking problem where one has to compare two different observations and decide which one is "better", the AUC measures the quantity that ranking score of a randomly chosen sample in one class is larger than that of a randomly chosen sample in the other class and hence, the function which maximizes an AUC of bipartite ranking problem is different to the function which maximizes (minimizes) accuracy (misclassification error rate) of binary classification problem. In this paper, we develop a way to construct the unified framework for AUC maximizer including support vector machines based on maximizing large margin and logistic regression based on estimating posterior probability. Moreover, we develop an efficient algorithm for the proposed unified framework. Numerical results show that the propose unified framework can treat various methodologies successfully.
https://doi.org/10.5351/CKSS.2009.16.6.1005 인용 PDF KSCI

Parameter Optimization and Uncertainty Analysis of the Rainfall-Runoff Model Coupled with Hierarchical Bayesian Inference Scheme (Hierarchical Bayesian 기법을 통한 강우-유출모형 매개변수의 최적화 및 불확실성 분석)

Mun, Yeong-Il;Gwon, Hyeon-Han
- Proceedings of the Korea Water Resources Association Conference
- /
- 2007.05a
- /
- pp.1752-1756
- /
- 2007
정교한 강우-유출 모의를 위해서는 적절한 매개변수의 추정이 필수적이며, 매개변수 추정 방법은 시행착오(trial and error)에 의한 수동보정법과 최적화방법을 사용한 자동보정법으로 구분할 수 있다. 모형의 매개변수의 수가 많은 경우 수동보정법에 의한 매개변수 추정은 매우 어렵다. 자동 보정법에 사용되는 최적화방법은 Rosenbrock 알고리즘, patten search, 컴플렉스(complex) 방법, Powell 방법 등과 같은 지역최적화 방법과 전역최적화 방법으로 나눌 수 있다. 그러나 기존 방법론들은 매개변수의 최적화를 추적하기 위한 알고리즘이 대부분이며 이들 매개변수에 관련된 불확실성을 평가하는데는 미흡한 단접이 있다. 이러한 점에서 본 연구에서는 강우-유출모형의 매개변수 추정에 있어서 불확실성을 평가할 수 있는 새로운 방법론을 검토하고자 한다. 매개변수와 관련된 불확실성을 평가하기 위한 방법은 여러 가지가 있으나 통계적으로 매우 우수한 능력을 보이는 Hierarchical Bayesian 알고리즘을 Probability-Distributed 강우-유출 모형에 적용하였다. 본 방법론은 최적화와 동시에 각 매개변수에 관련된 사후분포(posterior distribution)의 추정이 가능하므로 모형이 갖는 불확실성을 효과적으로 평가할 수 있다. 따라서, 수자원 관리에 있어서 불확실성을 고려할 수 있으므로 보다 수리수문학적 위험도를 저감할 수 있을 것으로 판단된다.
PDF

Texture segmentation using Neural Networks and multi-scale Bayesian image segmentation technique (신경회로망과 다중스케일 Bayesian 영상 분할 기법을 이용한 결 분할)

Kim Tae-Hyung;Eom Il-Kyu;Kim Yoo-Shin
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.42 no.4 s.304
- /
- pp.39-48
- /
- 2005
This paper proposes novel texture segmentation method using Bayesian estimation method and neural networks. We use multi-scale wavelet coefficients and the context information of neighboring wavelets coefficients as the input of networks. The output of neural networks is modeled as a posterior probability. The context information is obtained by HMT(Hidden Markov Tree) model. This proposed segmentation method shows better performance than ML(Maximum Likelihood) segmentation using HMT model. And post-processed texture segmentation results as using multi-scale Bayesian image segmentation technique called HMTseg in each segmentation by HMT and the proposed method also show that the proposed method is superior to the method using HMT.
PDF KSCI

Spatial-Temporal Frough Analysis of South Korea Based On Neural Networks (신경망을 이용한 우리나라의 시공 간적 가뭄의 해석)

신현석
- Proceedings of the Korea Water Resources Association Conference
- /
- 1998.05b
- /
- pp.7-13
- /
- 1998
A methodology to analyze and quantify regional meteorological drough based on annual precipitation data has been introduced in this paper In this study, based on posterior probability estimator and Bayesian classifier in Spatial Analysis Neural Network ISANN), point drought probabilities categorized as extreme, severe, mild, and non drought events has been defined, and a Bayesian Drought Severity Index (BPSI) has been introduced to classify the region of interest into four drought serverities. For example, the proposed methodology has been applied to analyze the regional drought of South Korea. This is a new method to classify and quantify the spatial or regional drought based on neural network pattern recognition technique and the results show that it may be apprepriate and valuable to analyze the spatial drought.
PDF

Bayesian Rules Based Optimal Defense Strategies for Clustered WSNs

Zhou, Weiwei;Yu, Bin
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.12
- /
- pp.5819-5840
- /
- 2018
Considering the topology of hierarchical tree structure, each cluster in WSNs is faced with various attacks launched by malicious nodes, which include network eavesdropping, channel interference and data tampering. The existing intrusion detection algorithm does not take into consideration the resource constraints of cluster heads and sensor nodes. Due to application requirements, sensor nodes in WSNs are deployed with approximately uncorrelated security weights. In our study, a novel and versatile intrusion detection system (IDS) for the optimal defense strategy is primarily introduced. Given the flexibility that wireless communication provides, it is unreasonable to expect malicious nodes will demonstrate a fixed behavior over time. Instead, malicious nodes can dynamically update the attack strategy in response to the IDS in each game stage. Thus, a multi-stage intrusion detection game (MIDG) based on Bayesian rules is proposed. In order to formulate the solution of MIDG, an in-depth analysis on the Bayesian equilibrium is performed iteratively. Depending on the MIDG theoretical analysis, the optimal behaviors of rational attackers and defenders are derived and calculated accurately. The numerical experimental results validate the effectiveness and robustness of the proposed scheme.
https://doi.org/10.3837/tiis.2018.12.012 인용 PDF KSCI

Delayed rupture of a posttraumatic retromaxillary pseudoaneurysm causing massive bleeding: a case report

Hwang, Jae Ha;Kim, Woo Hyeong;Choi, Jun Ho;Kim, Kwang Seog;Lee, Sam Yong
- Archives of Craniofacial Surgery
- /
- v.22 no.3
- /
- pp.168-172
- /
- 2021
Posttraumatic pseudoaneurysm of the face is caused by blunt, penetrating, or surgical trauma. Although its incidence is low, pseudoaneurysm rupture can cause a life-threatening, massive hemorrhage. A 48-year-old man visited our emergency center due to a fall-down accident. Three-dimensional computed tomography (CT) showed a comminuted zygomaticomaxillary complex fracture of the left face. After open reduction and internal fixation, the surgical wound healed without any complications. However, the patient was readmitted 10 days after surgery due to pus-like discharge from the wound. Contrast-enhanced CT to find the abscess unexpectedly revealed a pseudoaneurysm in the left retromaxillary area. Massive oral bleeding occurred on the night of re-hospitalization and emergency surgery was done. The bleeding site was identified as a pseudo-aneurysmal rupture of the posterior superior alveolar artery in the retromaxillary area. Hemostasis was achieved by packing Vaseline gauze in the maxillary sinus using an endoscope. Delayed rupture and massive bleeding of posttraumatic retromaxillary pseudoaneurysm after a zygomaticomaxillary fracture is a low-probability, but high-impact event. Therefore, additional contrast-enhanced CT should be considered to evaluate the possibility of a posttraumatic pseudoaneurysm in cases of severe comminuted zygomaticomaxillary fracture.
https://doi.org/10.7181/acfs.2021.00213 인용 PDF KSCI

A novel Metropolis-within-Gibbs sampler for Bayesian model updating using modal data based on dynamic reduction

Ayan Das;Raj Purohit Kiran;Sahil Bansal
- Structural Engineering and Mechanics
- /
- v.87 no.1
- /
- pp.1-18
- /
- 2023
The paper presents a Bayesian Finite element (FE) model updating methodology by utilizing modal data. The dynamic condensation technique is adopted in this work to reduce the full system model to a smaller model version such that the degrees of freedom (DOFs) in the reduced model correspond to the observed DOFs, which facilitates the model updating procedure without any mode-matching. The present work considers both the MPV and the covariance matrix of the modal parameters as the modal data. Besides, the modal data identified from multiple setups is considered for the model updating procedure, keeping in view of the realistic scenario of inability of limited number of sensors to measure the response of all the interested DOFs of a large structure. A relationship is established between the modal data and structural parameters based on the eigensystem equation through the introduction of additional uncertain parameters in the form of modal frequencies and partial mode shapes. A novel sampling strategy known as the Metropolis-within-Gibbs (MWG) sampler is proposed to sample from the posterior Probability Density Function (PDF). The effectiveness of the proposed approach is demonstrated by considering both simulated and experimental examples.
https://doi.org/10.12989/sem.2023.87.1.001 인용

Recurrent Neural Network Modeling of Etch Tool Data: a Preliminary for Fault Inference via Bayesian Networks

Nawaz, Javeria;Arshad, Muhammad Zeeshan;Park, Jin-Su;Shin, Sung-Won;Hong, Sang-Jeen
- Proceedings of the Korean Vacuum Society Conference
- /
- 2012.02a
- /
- pp.239-240
- /
- 2012
With advancements in semiconductor device technologies, manufacturing processes are getting more complex and it became more difficult to maintain tighter process control. As the number of processing step increased for fabricating complex chip structure, potential fault inducing factors are prevail and their allowable margins are continuously reduced. Therefore, one of the key to success in semiconductor manufacturing is highly accurate and fast fault detection and classification at each stage to reduce any undesired variation and identify the cause of the fault. Sensors in the equipment are used to monitor the state of the process. The idea is that whenever there is a fault in the process, it appears as some variation in the output from any of the sensors monitoring the process. These sensors may refer to information about pressure, RF power or gas flow and etc. in the equipment. By relating the data from these sensors to the process condition, any abnormality in the process can be identified, but it still holds some degree of certainty. Our hypothesis in this research is to capture the features of equipment condition data from healthy process library. We can use the health data as a reference for upcoming processes and this is made possible by mathematically modeling of the acquired data. In this work we demonstrate the use of recurrent neural network (RNN) has been used. RNN is a dynamic neural network that makes the output as a function of previous inputs. In our case we have etch equipment tool set data, consisting of 22 parameters and 9 runs. This data was first synchronized using the Dynamic Time Warping (DTW) algorithm. The synchronized data from the sensors in the form of time series is then provided to RNN which trains and restructures itself according to the input and then predicts a value, one step ahead in time, which depends on the past values of data. Eight runs of process data were used to train the network, while in order to check the performance of the network, one run was used as a test input. Next, a mean squared error based probability generating function was used to assign probability of fault in each parameter by comparing the predicted and actual values of the data. In the future we will make use of the Bayesian Networks to classify the detected faults. Bayesian Networks use directed acyclic graphs that relate different parameters through their conditional dependencies in order to find inference among them. The relationships between parameters from the data will be used to generate the structure of Bayesian Network and then posterior probability of different faults will be calculated using inference algorithms.
PDF

Prediction of the Gold-silver Deposits from Geochemical Maps - Applications to the Bayesian Geostatistics and Decision Tree Techniques (지화학자료를 이용한 금${\cdot}$은 광산의 배태 예상지역 추정-베이시안 지구통계학과 의사나무 결정기법의 활용)

Hwang, Sang-Gi;Lee, Pyeong-Koo
- Economic and Environmental Geology
- /
- v.38 no.6 s.175
- /
- pp.663-673
- /
- 2005
This study investigates the relationship between the geochemical maps and the gold-silver deposit locations. Geochemical maps of 21 elements, which are published by KIGAM, locations of gold-silver deposits, and 1:1,000,000 scale geological map of Korea are utilized far this investigation. Pixel size of the basic geochemical maps is 250m and these data are resampled in 1km spacing for the statistical analyses. Relationship between the mine location and the geochemical data are investigated using bayesian statistics and decision tree algorithms. For the bayesian statistics, each geochemical maps are reclassified by percentile divisions which divides the data by 5, 25, 50, 75, 95, and $100\%$ data groups. Number of mine locations in these divisions are counted and the probabilities are calculated. Posterior probabilities of each pixel are calculated using the probability of 21 geochemical maps and the geological map. A prediction map of the mining locations is made by plotting the posterior probability. The input parameters for the decision tree construction are 21 geochemical elements and lithology, and the output parameters are 5 types of mines (Ag/Au, Cu, Fe, Pb/Zn, W) and absence of the mine. The locations for the absence of the mine are selected by resampling the overall area by 1 km spacing and eliminating my resampled points, which is in 750m distance from mine locations. A prediction map of each mine area is produced by applying the decision tree to every pixels. The prediction by Bayesian method is slightly better than the decision tree. However both prediction maps show reasonable match with the input mine locations. We interpret that such match indicate the rules produced by both methods are reasonable and therefore the geochemical data has strong relations with the mine locations. This implies that the geochemical rules could be used as background values oi mine locations, therefore could be used for evaluation of mine contamination. Bayesian statistics indicated that the probability of Au/Ag deposit increases as CaO, Cu, MgO, MnO, Pb and Li increases, and Zr decreases.
PDF KSCI

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
- Asian Pacific Journal of Cancer Prevention
- /
- v.15 no.9
- /
- pp.4049-4054
- /
- 2014
Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients, who were diagnosed between the years 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for the purpose of this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criterion were used to measure and compare goodness of fit of the models. These include Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model having the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age of diagnosis was 63.61 years (SD = 14.24) and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC= 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameter into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in making scientific and clinical conclusions when assessing survival time of breast cancer patients.
https://doi.org/10.7314/APJCP.2014.15.9.4049 인용 PDF KSCI

Search Result 224, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)