• Title/Summary/Keyword: Discovery method

Search Result 650, Processing Time 0.026 seconds

Comparison of Association Rule Learning and Subgroup Discovery for Mining Traffic Accident Data (교통사고 데이터의 마이닝을 위한 연관규칙 학습기법과 서브그룹 발견기법의 비교)

  • Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.4
    • /
    • pp.1-16
    • /
    • 2015
  • Traffic accident is one of the major cause of death worldwide for the last several decades. According to the statistics of world health organization, approximately 1.24 million deaths occurred on the world's roads in 2010. In order to reduce future traffic accident, multipronged approaches have been adopted including traffic regulations, injury-reducing technologies, driving training program and so on. Records on traffic accidents are generated and maintained for this purpose. To make these records meaningful and effective, it is necessary to analyze relationship between traffic accident and related factors including vehicle design, road design, weather, driver behavior etc. Insight derived from these analysis can be used for accident prevention approaches. Traffic accident data mining is an activity to find useful knowledges about such relationship that is not well-known and user may interested in it. Many studies about mining accident data have been reported over the past two decades. Most of studies mainly focused on predict risk of accident using accident related factors. Supervised learning methods like decision tree, logistic regression, k-nearest neighbor, neural network are used for these prediction. However, derived prediction model from these algorithms are too complex to understand for human itself because the main purpose of these algorithms are prediction, not explanation of the data. Some of studies use unsupervised clustering algorithm to dividing the data into several groups, but derived group itself is still not easy to understand for human, so it is necessary to do some additional analytic works. Rule based learning methods are adequate when we want to derive comprehensive form of knowledge about the target domain. It derives a set of if-then rules that represent relationship between the target feature with other features. Rules are fairly easy for human to understand its meaning therefore it can help provide insight and comprehensible results for human. Association rule learning methods and subgroup discovery methods are representing rule based learning methods for descriptive task. These two algorithms have been used in a wide range of area from transaction analysis, accident data analysis, detection of statistically significant patient risk groups, discovering key person in social communities and so on. We use both the association rule learning method and the subgroup discovery method to discover useful patterns from a traffic accident dataset consisting of many features including profile of driver, location of accident, types of accident, information of vehicle, violation of regulation and so on. The association rule learning method, which is one of the unsupervised learning methods, searches for frequent item sets from the data and translates them into rules. In contrast, the subgroup discovery method is a kind of supervised learning method that discovers rules of user specified concepts satisfying certain degree of generality and unusualness. Depending on what aspect of the data we are focusing our attention to, we may combine different multiple relevant features of interest to make a synthetic target feature, and give it to the rule learning algorithms. After a set of rules is derived, some postprocessing steps are taken to make the ruleset more compact and easier to understand by removing some uninteresting or redundant rules. We conducted a set of experiments of mining our traffic accident data in both unsupervised mode and supervised mode for comparison of these rule based learning algorithms. Experiments with the traffic accident data reveals that the association rule learning, in its pure unsupervised mode, can discover some hidden relationship among the features. Under supervised learning setting with combinatorial target feature, however, the subgroup discovery method finds good rules much more easily than the association rule learning method that requires a lot of efforts to tune the parameters.

Search for Semantic Web Services Based on the Integrated Concept Model (통합 개념 모델에 기반한 시맨틱 웹 서비스 탐색)

  • Du, Hwa-Jun;Shin, Dong-Hoon;Lee, Kyong-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.2
    • /
    • pp.147-169
    • /
    • 2007
  • Semantic Web Services Discovery matches between users' requirements and the ontological description of Web Services. However, concepts of an ontology can be interpreted differently according to a point of view. Previous works are limited in interpreting concepts. Since they lack a precise scheme of describing the advertisements and requirements of services and users, respectively, and even do not support a sophisticated matching, semantic mismatches may occur. This paper presents a sophisticated method of discovering Web services. The proposed method facilitates specifying semantics precisely and flexibly based on a proposed integrated concept model. Additionally, more sophisticated discovery is supported by computing complex matchings with many-to-many relationships. Experimental results show that the proposed method performs more efficiently for various kinds of user requests, compared with previous works.

  • PDF

Development of Jaspine B analysis using LC-MS/MS and its application: Dose-independent pharmacokinetics of Jaspine B in rats

  • Song, Im-Sook;Jeon, Ji-Hyeon;Lee, Jihoon;Lim, Dong Yu;Lee, Chul Haeng;Lee, Dongjoo;Choi, Min-Koo
    • Analytical Science and Technology
    • /
    • v.34 no.2
    • /
    • pp.37-45
    • /
    • 2021
  • A rapid and simple LC-MS/MS analytical method in determining Jaspine B has been developed and validated in rat plasma. The standard curve value was 25 - 5000 ng/mL and the linearity, inter-day and intra-day accuracy and precision were within 15.0 % of relative standard deviation (RSD). The mean recoveries of Jaspine B ranged from 87.5 % to 91.2 % with less than 3.70 % RSD and the matrix effects ranged from 91.1 % to 108.2 % with less than 2.6 % RSD. The validated LC-MS/MS analytical method of Jaspine B was successfully applied to investigate the dose-escalated pharmacokinetic study of Jaspine B in rats following an intravenous injection of Jaspine B at a dose range of 1 - 10 mg/kg. The initial plasma concentrations and area under plasma concentration curves showed a good correlation with intravenous Jaspine B dose, indicating the dose independent pharmacokinetics of Jaspine B in rats. In conclusion, this analytical method for Jaspine B can be easily applied in the bioanalysis and pharmacokinetic studies of Jaspine B, including its administration at multiple therapeutic doses, or for making pharmacokinetic comparisons for the oral formulations of Jaspine B in small experimental animals as well as in vivo pharmacokinetic-pharmacodynamic correlation studies.

Rule Discovery for Cancer Classification using Genetic Programming based on Arithmetic Operators (산술 연산자 기반 유전자 프로그래밍을 이용한 암 분류 규칙 발견)

  • 홍진혁;조성배
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.8
    • /
    • pp.999-1009
    • /
    • 2004
  • As a new approach to the diagnosis of cancers, bioinformatics attracts great interest these days. Machine teaming techniques have produced valuable results, but the field of medicine requires not only highly accurate classifiers but also the effective analysis and interpretation of them. Since gene expression data in bioinformatics consist of tens of thousands of features, it is nearly impossible to represent their relations directly. In this paper, we propose a method composed of a feature selection method and genetic programming. Rank-based feature selection is adopted to select useful features and genetic programming based arithmetic operators is used to generate classification rules with features selected. Experimental results on Lymphoma cancer dataset, in which the proposed method obtained 96.6% test accuracy as well as useful classification rules, have shown the validity of the proposed method.

Development and validation of an analytical method to quantify baphicacanthin A by LC-MS/MS and its application to pharmacokinetic studies in mice

  • Jeon, So Yeon;Kim, San;Park, Jin-Hyang;Song, Im-Sook;Han, Young Taek;Choi, Min-Koo
    • Analytical Science and Technology
    • /
    • v.35 no.2
    • /
    • pp.60-68
    • /
    • 2022
  • In this study, we developed and validated a sensitive analytical method to quantify baphicacanthin A in mouse plasma using liquid chromatography-tandem mass spectrometry (LC-MS/MS). The standard calibration curves for baphicacanthin A ranged from 0.5 to 200 ng/mL and were linear, with an r2 of 0.985. The inter- and intra-day accuracy and precision and the stability fell within the acceptance criteria. Besides, we investigated the pharmacokinetics of baphicacanthin A following its intravenous (5 mg/kg) and oral administration (30 mg/kg). Intravenously injected baphicacanthin A showed biphasic elimination kinetics with high clearance and volume of distribution values. Furthermore, baphicacanthin A showed a rapid absorption but low aqueous solubility (182.51±0.20 mg/mL), resulting in low plasma concentrations and low oral bioavailability (2.49 %). Thus, we successfully documented the pharmacokinetic properties of baphicacanthin A using this newly developed sensitive LC-MS/MS quantification method, which could be used in future lead optimization and biopharmaceutic studies.

A Study on the Creation Process of Dance based on the Concept of Murray Schafer's Soundscape (Murray Schafer의 사운드스케이프 개념을 바탕으로 한 무용작품 의 창작과정 연구)

  • Ra, Se-Young;Choe, Sang-Cheul
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.6
    • /
    • pp.425-434
    • /
    • 2021
  • This study is subjected to two linked research. First is the creation process of dance work applying noise and non-musical sounds in our daily life based on understanding the classification, perception, and factor of the sound, through Murray Schafer's concept of 'soundscape'. Second is to find the value of new type of choreography and musical effect of creation process of the dance work . According to methodological research of practice-based research, three stages which is practice, theory and evaluation were accumulated as somatic data, And the analysis was provided a basis by presenting in a figuration, form of the movement and method of specialization with reference to the paper 『space design and forming practice』(2003). As a result, the creation process was able to discover the musical effect of the sound in daily life and new method of choreography, and also find the possibilities that sound could convey the theme of the dance work, the meaning of the movement and the overall atmosphere of the work to audience. In addition, It is expected that will have been made another new creation environment by potential that music has based on concept 'soundscape'.

Knowledge Discovery in Nursing Minimum Data Set Using Data Mining

  • Park Myong-Hwa;Park Jeong-Sook;Kim Chong-Nam;Park Kyung-Min;Kwon Young-Sook
    • Journal of Korean Academy of Nursing
    • /
    • v.36 no.4
    • /
    • pp.652-661
    • /
    • 2006
  • Purpose. The purposes of this study were to apply data mining tool to nursing specific knowledge discovery process and to identify the utilization of data mining skill for clinical decision making. Methods. Data mining based on rough set model was conducted on a large clinical data set containing NMDS elements. Randomized 1000 patient data were selected from year 1998 database which had at least one of the five most frequently used nursing diagnoses. Patient characteristics and care service characteristics including nursing diagnoses, interventions and outcomes were analyzed to derive the meaningful decision rules. Results. Number of comorbidity, marital status, nursing diagnosis related to risk for infection and nursing intervention related to infection protection, and discharge status were the predictors that could determine the length of stay. Four variables (age, impaired skin integrity, pain, and discharge status) were identified as valuable predictors for nursing outcome, relived pain. Five variables (age, pain, potential for infection, marital status, and primary disease) were identified as important predictors for mortality. Conclusions. This study demonstrated the utilization of data mining method through a large data set with stan dardized language format to identify the contribution of nursing care to patient's health.

Frequency Resource Obtaining Method Based on D2D Device Discovery in Public Safety Communication Networks (재난 무선통신을 위한 D2D 단말탐색 기반 주파수 자원 확보 기술)

  • Wu, Shanai;Shin, Oh-Soon;Shin, Yoan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.11
    • /
    • pp.1440-1442
    • /
    • 2016
  • As long term evolution (LTE) is the most widely deployed broadband communication technology so far, efforts are being made to develop LTE-based mission critical public safety (PS) communication systems. In this paper, we propose a device-to-device (D2D) discovery-based radio resource acquisition scheme to support the LTE D2D communication to PS systems and the realization of resource forwarding for user equipments in emergency area.

An Implementation of Optimal Rules Discovery System: An Integrated Approach Based on Concept Hierarchies, Information Gain, and Rough Sets (최적 규칙 발견 시스템의 구현: 개념 계층과 정보 이득 및 라프셋에 의한 통합 접근)

  • 김진상
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.10 no.3
    • /
    • pp.232-241
    • /
    • 2000
  • This study suggests an integrated method based on concept hierarchies, information gain, and rough set theory for efficient discovery rules from a large amount of data, and implements an optimal rules discovery system. Our approach applies attribute-oriented concept ascension technique to extract generalized knowledge from a database, knowledge reduction technique to remove superfluous attributes and attribute values, and significance of attributes to induce optimal rules. The system first reduces the size of database by removing the duplicate tuples through the condition attributes which have no influences on the decision attributes, and finally induces simplified optimal rules by removing the superfluous attribute values by analyzing the dependency relationships among the attributes. And we induce some decision rules from actual data by using the system and test rules to new data, and evaluate that the rules are well suited to them.

  • PDF