• 제목/요약/키워드: Classification Database

검색결과 940건 처리시간 0.017초

BAYESIAN CLASSIFICATION AND FREQUENT PATTERN MINING FOR APPLYING INTRUSION DETECTION

  • Lee, Heon-Gyu;Noh, Ki-Yong;Ryu, Keun-Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2005년도 Proceedings of ISRS 2005
    • /
    • pp.713-716
    • /
    • 2005
  • In this paper, in order to identify and recognize attack patterns, we propose a Bayesian classification using frequent patterns. In theory, Bayesian classifiers guarantee the minimum error rate compared to all other classifiers. However, in practice this is not always the case owing to inaccuracies in the unrealistic assumption{ class conditional independence) made for its use. Our method addresses the problem of attribute dependence by discovering frequent patterns. It generates frequent patterns using an efficient FP-growth approach. Since the volume of patterns produced can be large, we propose a pruning technique for selection only interesting patterns. Also, this method estimates the probability of a new case using different product approximations, where each product approximation assumes different independence of the attributes. Our experiments show that the proposed classifier achieves higher accuracy and is more efficient than other classifiers.

  • PDF

Preliminary Study of Bioinformatics Patents and Their Classifications Registered in the KIPRIS Database

  • Park, Hyun-Seok
    • Genomics & Informatics
    • /
    • 제10권4호
    • /
    • pp.271-274
    • /
    • 2012
  • Whereas a vast amount of new information on bioinformatics is made available to the public through patents, only a small set of patents are cited in academic papers. A detailed analysis of registered bioinformatics patents, using the existing patent search system, can provide valuable information links between science and technology. However, it is extremely difficult to select keywords to capture bioinformatics patents, reflecting the convergence of several underlying technologies. No single word or even several words are sufficient to identify such patents. The analysis of patent subclasses can provide valuable information. In this paper, I did a preliminary study of the current status of bioinformatics patents and their International Patent Classification (IPC) groups registered in the Korea Intellectual Property Rights Information Service (KIPRIS) database.

옹벽 구조물의 표준 DB화 방안 및 유지관리 특성 연구 (A Study on Characteristics of Maintenance and Standarization Plan Concerned with DB of Retainging Wall)

  • 이송;심민보
    • 한국구조물진단유지관리공학회 논문집
    • /
    • 제4권4호
    • /
    • pp.129-140
    • /
    • 2000
  • Retaining wall is a constructed structure in order to construct road, rail, building for effective use and obtainments of the limited ground. Recently, many kinds of research have been actively developed for a standardization and information to the maintenance and management of bridge, tunnel, road. With the works of database construction of that, many kinds of data with respect to statistics is cumulated. Database work of retaining wall is wholly lacking and lagged behind in the works of database construction. This paper suggests classification system on inspection data. On the basis of that, code work with classification system was practised and DB program of inspection data of retaining wall was developed. And input work for a data of maintenance and management was practised. The purpose of this paper is to suggest a kind of statistics data and investigate a characteristics of inspection using statistic data on retaining wall.

  • PDF

외연적 객체모델의 정형화 (A Formal Presentation of the Extensional Object Model)

  • 정철용
    • Asia pacific journal of information systems
    • /
    • 제5권2호
    • /
    • pp.143-176
    • /
    • 1995
  • We present an overview of the Extensional Object Model (ExOM) and describe in detail the learning and classification components which integrate concepts from machine learning and object-oriented databases. The ExOM emphasizes flexibility in information acquisition, learning, and classification which are useful to support tasks such as diagnosis, planning, design, and database mining. As a vehicle to integrate machine learning and databases, the ExOM supports a broad range of learning and classification methods and integrates the learning and classification components with traditional database functions. To ensure the integrity of ExOM databases, a subsumption testing rule is developed that encompasses categories defined by type expressions as well as concept definitions generated by machine learning algorithms. A prototype of the learning and classification components of the ExOM is implemented in Smalltalk/V Windows.

  • PDF

A Preliminary Study on the Multiple Mapping Structure of Classification Systems for Heterogeneous Databases

  • Lee, Seok-Hyoung;Kim, Hwan-Min;Choe, Ho-Seop
    • International Journal of Knowledge Content Development & Technology
    • /
    • 제2권1호
    • /
    • pp.51-65
    • /
    • 2012
  • While science and technology information service portals and heterogeneous databases produced in Korea and other countries are integrated, methods of connecting the unique classification systems applied to each database have been studied. Results of technologists' research, such as, journal articles, patent specifications, and research reports, are organically related to each other. In this case, if the most basic and meaningful classification systems are not connected, it is difficult to achieve interoperability of the information and thus not easy to implement meaningful science technology information services through information convergence. This study aims to address the aforementioned issue by analyzing mapping systems between classification systems in order to design a structure to connect a variety of classification systems used in the academic information database of the Korea Institute of Science and Technology Information, which provides science and technology information portal service. This study also aims to design a mapping system for the classification systems to be applied to actual science and technology information services and information management systems.

지명 데이터베이스 구축을 통한 웹지도화 방안 (An Web-based Mapping by Constructing Database of Geographical Names)

  • 김남신
    • 한국지역지리학회지
    • /
    • 제16권4호
    • /
    • pp.428-439
    • /
    • 2010
  • 지명은 인간의 지역인식을 반영하기 때문에 지명지도는 지역이해를 위한 정보를 제공해 줄 수 있다. 본 연구는 지명데이터베이스 구축을 통한 웹기반 지도를 제작하고자 하였다. 주요 연구내용은 웹사이트 상에서 지명분류 방법, 데이터베이스 구축 방법 그리고 지명지도화 방법에 관한 연구를 수행하였다. 지명은 분류기준에 따라 자연지리, 문화역사지리, 경제지리, 기타의 4개 영역으로 나누었으며, 다시 18 가지 세부영역으로 분류하였다. 지명은 지형도상의 지명과 지역에만 알려진 소지명을 수집하여 입력할 수 있도록 하였다. 데이터베이스 항목은 주소, 좌표, 지명(한자, 한글), 지명분류, 설명, 사진 자료로 구성하였다. 지명지도는 지역의 지리정보와 함께 표현될 수 있게 하였다. 연구결과는 지명의 지리적 분포는 물론 지역해석을 위한 정보를 제공해 줄 수 있을 것으로 기대된다.

  • PDF

과학기술문헌 데이터베이스의 검색효율 향상을 위한 색인 보완 방안 (A Study on Adding Index Terms for improving the retrieval efficiency of the STI database)

  • 김병규;김태중;강무영;류범종
    • 한국콘텐츠학회:학술대회논문집
    • /
    • 한국콘텐츠학회 2011년도 춘계 종합학술대회 논문집
    • /
    • pp.293-294
    • /
    • 2011
  • ISTI는 국내에서 발간되는 과학기술 학술논문을 가공해서 데이터베이스로 구축, 제공하고 있으며 그 규모는 2010년에 100만건을 넘어서고 있다. 규모가 늘어남에 따라 체계적인 주제 분류 등 검색의 효율화를 위한 부가적인 가공이 필요하다. 전통적으로 정보를 가공하는 방법으로 초록화, 분류, 색인, 추록화 등을 혼용하여 사용하고 있다. 이 가운데 색인과 분류는 특히 정보 검색에 유용한 도구로 활용되고 있다. 이 논문에서는 기 구축된 과학기술문헌 데이터베이스에 분류 코드와 색인어를 부여하여 검색효율을 향상시키기 위한 방안을 제안한다.

  • PDF

전력배전 시스템에서의 취약 선로 분류를 위한 출현 패턴 마이닝 (Emerging Patterns Mining for Classifying Non-Safe Electrical Sections in Power Distribution System)

  • ;;이헌규;신진호;류근호
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2008년도 추계학술발표대회
    • /
    • pp.325-327
    • /
    • 2008
  • In electrical industry, classification methodology has been an important issue for analyzing power consumption patterns. It has many applications including decisions on energy purchasing, load switching as well as helping in infrastructure development. Our aim in this work is to classify the electrical section and find potentially non-safe electrical sections. For this purpose, we use Emerging Patterns based classification. The classification method uses the aggregate score of emerging patterns to build classifier. The proposed methodology was applied to a set of electrical section data of the Korea power. The test data and relational electricity information and knowledge are supported by Korea Electric Power Research Institute (KEPRI).

Classification of HTTP Automated Software Communication Behavior Using a NoSQL Database

  • Tran, Manh Cong;Nakamura, Yasuhiro
    • IEIE Transactions on Smart Processing and Computing
    • /
    • 제5권2호
    • /
    • pp.94-99
    • /
    • 2016
  • Application layer attacks have for years posed an ever-serious threat to network security, since they always come after a technically legitimate connection has been established. In recent years, cyber criminals have turned to fully exploiting the web as a medium of communication to launch a variety of forbidden or illicit activities by spreading malicious automated software (auto-ware) such as adware, spyware, or bots. When this malicious auto-ware infects a network, it will act like a robot, mimic normal behavior of web access, and bypass the network firewall or intrusion detection system. Besides that, in a private and large network, with huge Hypertext Transfer Protocol (HTTP) traffic generated each day, communication behavior identification and classification of auto-ware is a challenge. In this paper, based on a previous study, analysis of auto-ware communication behavior, and with the addition of new features, a method for classification of HTTP auto-ware communication is proposed. For that, a Not Only Structured Query Language (NoSQL) database is applied to handle large volumes of unstructured HTTP requests captured every day. The method is tested with real HTTP traffic data collected through a proxy server of a private network, providing good results in the classification and detection of suspicious auto-ware web access.

프레임 기반의 효율적인 수중 천이신호 식별을 위한 참조 신호의 벡터 양자화 (Vector Quantization of Reference Signals for Efficient Frame-Based Classification of Underwater Transient Signals)

  • 임태균;김태환;배건성;황찬식
    • 한국통신학회논문지
    • /
    • 제34권2C호
    • /
    • pp.181-185
    • /
    • 2009
  • 프레임 단위로 식별 데이터베이스에 저장된 참조 신호의 특징 벡터와 유사성을 비교하여 입력 신호를 식별하는 경우에, 참조 신호의 특징 벡터로 데이터베이스를 어떻게 구성하는가에 따라 식별 성능은 영향을 받을 수 있다. 즉, 식별 데이터베이스의 구성 방법에 따라 데이터베이스의 크기와 식별을 위한 계산량, 식별 성능 등이 결정되며, 이것은 실제로 수중 천이신호 식별 시스템을 구성할 때 중요한 문제이다. 본 논문에서는 LBG 벡터 양자화 기법을 이용하여 식별 데이터베이스의 크기를 줄여 줌으로써 프레임 기반 수중 천이신호 식별 기법의 효율성을 증가시킬 수 있는 방법을 제안하고, 실험을 통하여 제안한 방법의 타당성을 검증하였다.