• Title/Summary/Keyword: Knowledge extraction

Search Result 384, Processing Time 0.029 seconds

Probabilistic filtering for a biological knowledge discovery system with text mining and automatic inference (텍스트 마이닝 및 자동 추론 기반 생물학 지식 발견 시스템을 위한 확률 기반 필터링)

  • Lee, Hee-Jin;Park, Jong-C.
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.2
    • /
    • pp.139-147
    • /
    • 2012
  • In this paper, we discuss the structure of biological knowledge discovery system based on text mining and automatic inference. Given a set of biology documents, the system produces a new hypothesis in an integrated manner. The text mining module of the system first extracts the 'event' information of predefined types from the documents. The inference module then produces a new hypothesis based on the extracted results. Such an integrated system can use information more up-to-date and diverse than other automatic knowledge discovery systems use. However, for the success of such an integrated system, the precision of the text mining module becomes crucial, as any hypothesis based on a single piece of false positive information would highly likely be erroneous. In this paper, we propose a probabilistic filtering method that filters out false positives from the extraction results. Our proposed method shows higher performance over an occurrence-based baseline method.

A Efficient Rule Extraction Method Using Hidden Unit Clarification in Trained Neural Network (인공 신경망에서 은닉 유닛 명확화를 이용한 효율적인 규칙추출 방법)

  • Lee, Hurn-joo;Kim, Hyeoncheol
    • The Journal of Korean Association of Computer Education
    • /
    • v.21 no.1
    • /
    • pp.51-58
    • /
    • 2018
  • Recently artificial neural networks have shown excellent performance in various fields. However, there is a problem that it is difficult for a person to understand what is the knowledge that artificial neural network trained. One of the methods to solve these problems is an algorithm for extracting rules from trained neural network. In this paper, we extracted rules from artificial neural networks using ordered-attribute search(OAS) algorithm, which is one of the methods of extracting rules, and analyzed result to improve extracted rules. As a result, we have found that the distribution of output values of the hidden layer unit affects the accuracy of rules extracted by using OAS algorithm, and it is suggested that efficient rules can be extracted by binarizing hidden layer output values using hidden unit clarification.

Locating Text in Web Images Using Image Based Approaches (웹 이미지로부터 이미지기반 문자추출)

  • Chin, Seongah;Choo, Moonwon
    • Journal of Intelligence and Information Systems
    • /
    • v.8 no.1
    • /
    • pp.27-39
    • /
    • 2002
  • A locating text technique capable of locating and extracting text blocks in various Web images is presented here. Until now this area of work has been ignored by researchers even if this sort of text may be meaningful for internet users. The algorithms associated with the technique work without prior knowledge of the text orientation, size or font. In the work presented in this research, our text extraction algorithm utilizes useful edge detection followed by histogram analysis on the genuine characteristics of letters defined by text clustering region, to properly perform extraction of the text region that does not depend on font styles and sizes. By a number of experiments we have showed impressively acceptable results.

  • PDF

Extraction of Pivotal Entities of Construction Project Management using the CMBOK Framework (CMBOK Framework을 이용한 건설 프로젝트 핵심관리요소의 도출)

  • Lee Jong-Kook;Lee Hyun-Soo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.5 no.1 s.17
    • /
    • pp.140-148
    • /
    • 2004
  • Based on the CMBOK (Construction Project Management Body of Knowledge) Framework previously developed in early study by the authors in conjunction with use of some questionnaire surveys and personal interviews with industry professionals, the authors analyze interactions among the entities in the CMBOK framework for the extraction of pivotal entities of construction project management and identify twelve pivotal entities in construction project management, then verify the existence of twelve pivotal entities in real construction project management of construction company and checked the validity of the entities with a real case of interaction phenomenon. This research provides the construction industry with a starting point for improving construction project management efficiency by identifying the pivotal entities.

Constrained Independent Component Analysis Based Extraction and Mapping of the Brain Alpha Activity in EEG

  • Ahn, S.H.;Rasheed, T.;Lee, W.H.;Kim, T.S.;Cho, M.H.;Lee, S.Y..
    • Journal of Biomedical Engineering Research
    • /
    • v.29 no.5
    • /
    • pp.355-363
    • /
    • 2008
  • In order to extract only the alpha activity related signals from EEG recordings, we have applied Constrained Independent Component Analysis (cICA), a new extension of ICA in which some a priori knowledge of the alpha activity is utilized to extract only desired components. Its extraction (or filtering) performance has been compared to that of the conventional band-pass filtering via the scalp alpha power maps and cortical source maps of the alpha activity. Our results demonstrate that the alpha power maps and cortical source maps from the cICA-extracted alpha signals reveal more focalized alpha generating regions of the brain than those from the band-pass filtered alpha EEG signals. Furthermore they match more closely the activated regions of the brain mapped using fMRI, validating our results. We believe that the cICA-based filtering approach of EEG signals is a more effective means of extracting a specific brain activity reflected in EEG signals that will result in more accurate source localization or imaging maps.

Face region detection algorithm of natural-image (자연 영상에서 얼굴영역 검출 알고리즘)

  • Lee, Joo-shin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.7 no.1
    • /
    • pp.55-60
    • /
    • 2014
  • In this paper, we proposed a method for face region extraction by skin-color hue, saturation and facial feature extraction in natural images. The proposed algorithm is composed of lighting correction and face detection process. In the lighting correction step, performing correction function for a lighting change. The face detection process extracts the area of skin color by calculating Euclidian distances to the input images using as characteristic vectors color and chroma in 20 skin color sample images. Eye detection using C element in the CMY color model and mouth detection using Q element in the YIQ color model for extracted candidate areas. Face area detected based on human face knowledge for extracted candidate areas. When an experiment was conducted with 10 natural images of face as input images, the method showed a face detection rate of 100%.

Semantic Ontology Speech Information Extraction using Non-parametric Correlation Coefficient (비모수적 상관계수를 이용한 시맨틱 온톨로지 음성 정보 추출)

  • Lee, Byungwook
    • Journal of Digital Convergence
    • /
    • v.11 no.9
    • /
    • pp.147-151
    • /
    • 2013
  • On retrieving high frequency keywords in information retrieval system, mismatchings to user's request are problems because of the various meanings of keywords in the existing ontology configuration. In this paper, it is to construct personnel selection ontology and rules in personnel management which are composed of various concepts and knowledges based on semantic web technology and suggest selection procedures to support these rules and knowledge retrieval system to verify suitability of selection results. This system utilizes a method of extraction of speech features by using non-parametric correlation coefficient. This proposed method has been validated by showing that the result average SNR of the experiment evaluation of the proposed techniques was shown to be decreased by .752dB.

Footprint extraction of urban buildings with LIDAR data

  • Kanniah, Kasturi Devi;Gunaratnam, Kasturi;Mohd, Mohd Ibrahim Seeni
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.113-119
    • /
    • 2003
  • Building information is extremely important for many applications within the urban environment. Sufficient techniques and user-friendly tools for information extraction from remotely sensed imagery are urgently needed. This paper presents an automatic and manual approach for extracting footprints of buildings in urban areas from airborne Light Detection and Ranging (LIDAR) data. First a digital surface model (DSM) was generated from the LIDAR point data. Then, objects higher than the ground surface are extracted using the generated DSM. Based on general knowledge on the study area and field visits, buildings were separated from other objects. The automatic technique for extracting the building footprints was based on different window sizes and different values of image add backs, while the manual technique was based on image segmentation. A comparison was then made to see how precise the two techniques are in detecting and extracting building footprints. Finally, the results were compared with manually digitized building reference data to conduct an accuracy assessment and the result shows that LIDAR data provide a better shape characterization of each buildings.

  • PDF

Extraction of ObjectProperty-UsageMethod Relation from Web Documents

  • Pechsiri, Chaveevan;Phainoun, Sumran;Piriyakul, Rapeepun
    • Journal of Information Processing Systems
    • /
    • v.13 no.5
    • /
    • pp.1103-1125
    • /
    • 2017
  • This paper aims to extract an ObjectProperty-UsageMethod relation, in particular the HerbalMedicinalProperty-UsageMethod relation of the herb-plant object, as a semantic relation between two related sets, a herbal-medicinal-property concept set and a usage-method concept set from several web documents. This HerbalMedicinalProperty-UsageMethod relation benefits people by providing an alternative treatment/solution knowledge to health problems. The research includes three main problems: how to determine EDU (where EDU is an elementary discourse unit or a simple sentence/clause) with a medicinal-property/usage-method concept; how to determine the usage-method boundary; and how to determine the HerbalMedicinalProperty-UsageMethod relation between the two related sets. We propose using N-Word-Co on the verb phrase with the medicinal-property/usage-method concept to solve the first and second problems where the N-Word-Co size is determined by the learning of maximum entropy, support vector machine, and naïve Bayes. We also apply naïve Bayes to solve the third problem of determining the HerbalMedicinalProperty-UsageMethod relation with N-Word-Co elements as features. The research results can provide high precision in the HerbalMedicinalProperty-UsageMethod relation extraction.

An Integrated Accurate-Secure Heart Disease Prediction (IAS) Model using Cryptographic and Machine Learning Methods

  • Syed Anwar Hussainy F;Senthil Kumar Thillaigovindan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.2
    • /
    • pp.504-519
    • /
    • 2023
  • Heart disease is becoming the top reason of death all around the world. Diagnosing cardiac illness is a difficult endeavor that necessitates both expertise and extensive knowledge. Machine learning (ML) is becoming gradually more important in the medical field. Most of the works have concentrated on the prediction of cardiac disease, however the precision of the results is minimal, and data integrity is uncertain. To solve these difficulties, this research creates an Integrated Accurate-Secure Heart Disease Prediction (IAS) Model based on Deep Convolutional Neural Networks. Heart-related medical data is collected and pre-processed. Secondly, feature extraction is processed with two factors, from signals and acquired data, which are further trained for classification. The Deep Convolutional Neural Networks (DCNN) is used to categorize received sensor data as normal or abnormal. Furthermore, the results are safeguarded by implementing an integrity validation mechanism based on the hash algorithm. The system's performance is evaluated by comparing the proposed to existing models. The results explain that the proposed model-based cardiac disease diagnosis model surpasses previous techniques. The proposed method demonstrates that it attains accuracy of 98.5 % for the maximum amount of records, which is higher than available classifiers.