• Title/Summary/Keyword: database structuring

Search Result 339, Processing Time 0.029 seconds

Web Document Classification Based on Hangeul Morpheme and Keyword Analyses (한글 형태소 및 키워드 분석에 기반한 웹 문서 분류)

  • Park, Dan-Ho;Choi, Won-Sik;Kim, Hong-Jo;Lee, Seok-Lyong
    • The KIPS Transactions:PartD / v.19D no.4 / pp.263-270 / 2012
  • With the development of high-speed Internet and massive database technology, the number of web documents is increasing rapidly, and classifying those documents automatically is therefore becoming important. In this study, we propose an effective method to extract document features based on Hangeul morpheme and keyword analyses, and to classify unstructured documents automatically by predicting their subjects. To extract document features, we first select terms using a morpheme analyzer, form the keyword set based on term frequency and subject-discriminating power, and score each keyword using the discriminating power. We then generate the classification model using commercial software that implements the decision tree, neural network, and SVM (support vector machine). Experimental results show that the proposed feature extraction method achieves considerable performance in classifying web documents by subject, i.e., an average precision of 0.90 and recall of 0.84 with the decision tree.
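The keyword selection and scoring step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not define "subject-discriminating power", so the sketch assumes it is the largest share of a term's occurrences that falls within a single subject. The function name `score_keywords`, the `min_tf` threshold, and the sample documents are all hypothetical, and the morpheme analysis is assumed to have been done already.

```python
from collections import Counter, defaultdict

def score_keywords(docs, min_tf=2):
    """docs: list of (subject, [terms]) pairs; terms are assumed to come
    from a morpheme analyzer (not shown).
    Returns {term: (best_subject, score)}."""
    tf = Counter()                         # total frequency of each term
    tf_by_subject = defaultdict(Counter)   # frequency of each term per subject
    for subject, terms in docs:
        tf.update(terms)
        tf_by_subject[subject].update(terms)

    scores = {}
    for term, total in tf.items():
        if total < min_tf:                 # drop rare terms (assumed threshold)
            continue
        # Assumed discriminating power: the largest share of the term's
        # occurrences found in one subject (1.0 means fully subject-specific).
        best_subject, best_count = max(
            ((s, c[term]) for s, c in tf_by_subject.items()),
            key=lambda pair: pair[1],
        )
        scores[term] = (best_subject, best_count / total)
    return scores

# Hypothetical toy corpus: (subject, extracted terms)
docs = [
    ("sports", ["goal", "team", "goal", "league"]),
    ("sports", ["team", "coach", "goal"]),
    ("finance", ["bank", "rate", "team", "bank"]),
]
keyword_scores = score_keywords(docs)
```

The resulting scores could then serve as feature values for a decision tree, neural network, or SVM classifier; that stage is left out here because the abstract only says commercial software was used.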

A Method of Efficient Conference Event Package Processing in Distributed Conference Environment (분산형 컨퍼런스 환경에서의 효율적인 컨퍼런스 이벤트 패키지 처리 방식)

  • Jang, Choon-Seo;Jo, Hyun-Gyu;Lee, Ky-Soo
    • Journal of the Korea Society of Computer and Information / v.13 no.7 / pp.199-205 / 2008
  • The centralized conference model has advantages in conference management and control. However, its scalability is limited because performance degrades substantially as the number of conference users increases. New distributed conference models that improve on the scalability of the centralized model have therefore been suggested recently. In the distributed conference model, when the number of conference users exceeds a predefined maximum, a new conference server is dynamically added to the conference. In this paper, we propose a new method that increases the efficiency of the conference event package processing for which the primary conference server is responsible in the distributed conference environment. The primary conference server exchanges information with each secondary conference server and with conference users by using the conference event package. From the conference information database, it selects the SIP (Session Initiation Protocol) UAs (User Agents) that will share the task of notifying conference users, and transfers the lists to each conference server. The conference servers make the selected UAs share the processing of the conference event package, so the SIP signal processing load decreases and the scalability of the distributed conference model improves. The performance of the proposed model is evaluated by experiments.

Generation of Artificial Time History Earthquake Record Family using the Least Squares Fitting Method (최소오차 최적합화 방법에 의한 인공 시간이력 지진기록군의 생성)

  • Kim, Yong-Seok
    • Journal of the Earthquake Engineering Society of Korea / v.12 no.5 / pp.31-38 / 2008
  • The need for time history analyses in the seismic analysis of structures has been increasing recently, and the seismic design provisions of IBC2003, ASCE and KBC2005 require the use of at least seven earthquake records for time history analyses. Earthquake records for time history analyses can be selected from a database of field-measured earthquake records with site conditions similar to those of the design site, or from simulated sites satisfying the design spectrum. In this study, however, seven earthquake records were generated from 50 earthquake records, classified as records measured on rock, in the database of the Pacific Earthquake Engineering Research Center (PEER). Seven earthquake records were first selected by the least squares fitting method, which compares the scaled response spectra with the specified design spectrum. A family of seven artificial time history earthquake records was then generated by multiplying the selected records by their corresponding scaling factors, which were calculated by the least squares fitting method and the SRSS averaging method.
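The least squares selection step can be illustrated with a short sketch. It assumes each response spectrum is sampled at the same periods as the design spectrum and that one scalar factor per record is fitted by minimizing the sum of squared differences; the closed form used below follows from setting the derivative of that sum to zero. The function names, record names, and spectral values are hypothetical, and the SRSS averaging step mentioned in the abstract is not covered.

```python
def scale_factor(record, design):
    """Scale factor a that minimizes sum((a * record[i] - design[i]) ** 2)."""
    num = sum(r * d for r, d in zip(record, design))
    den = sum(r * r for r in record)
    return num / den

def residual(record, design, a):
    """Sum of squared differences between the scaled record and the design."""
    return sum((a * r - d) ** 2 for r, d in zip(record, design))

def select_records(spectra, design, n=7):
    """Rank records by least-squares misfit after scaling; keep the best n."""
    ranked = []
    for name, record in spectra.items():
        a = scale_factor(record, design)
        ranked.append((residual(record, design, a), name, a))
    ranked.sort()
    return [(name, a) for _, name, a in ranked[:n]]

# Hypothetical spectral ordinates at four common periods
design = [0.5, 1.0, 0.8, 0.4]
spectra = {
    "rec_A": [0.25, 0.5, 0.4, 0.2],   # same shape as the design, half amplitude
    "rec_B": [1.0, 1.0, 1.0, 1.0],    # flat spectrum, poor shape match
}
selected = select_records(spectra, design, n=1)
```

With these toy values, `rec_A` is selected with a factor close to 2, because scaling it reproduces the design spectrum almost exactly.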

Site Application of Artificial Neural Network for Tunnel Construction (인공신경망을 이용한 터널시공에서 현장 적용성)

  • Song, Joohyeon;Chae, Hwiyoung;Chun, Byungsik
    • Journal of the Korean GEO-environmental Society / v.13 no.8 / pp.25-33 / 2012
  • Although it is important to reflect accurate information on ground conditions in tunnel design, analysis and design are conducted with limited information because it is very difficult to account for varied geographic and geotechnical conditions. This study uses an Artificial Neural Network (ANN) to overcome the limits of safety assessment and behavior prediction while a tunnel is under construction. First, after the field data were verified, a suitable network structure was built using the multi-layer back-propagation algorithm. Measured data from a database were used, and influence factors of the tunnel, such as support pattern, RMR, Q, rock type, excavation length, excavation shape, and over-excavation, were considered in order to carry out a reliable analysis of the field applicability of the ANN. The ANN model was then used to predict shearing displacement, convergence displacement, underground displacement, and rock bolt output at the tunnel construction site, and its field applicability was assessed by comparing the predictions with values measured in the field during construction.

Recognition and Modeling of 3D Environment based on Local Invariant Features (지역적 불변특징 기반의 3차원 환경인식 및 모델링)

  • Jang, Dae-Sik
    • Journal of the Korea Society of Computer and Information / v.11 no.3 / pp.31-39 / 2006
  • This paper presents a novel approach to real-time recognition of 3D environments and objects for applications such as intelligent robots, intelligent vehicles, and intelligent buildings. First, we establish three fundamental principles that humans use for recognizing and interacting with the environment. These principles have led to the development of an integrated approach to real-time 3D recognition and modeling, as follows: 1) It starts with a rapid but approximate characterization of the geometric configuration of the workspace by identifying global plane features. 2) It quickly recognizes known objects in the environment and replaces them with their models in a database based on 3D registration. 3) It models the geometric details on the fly, adapting to the needs of the given task, based on a multi-resolution octree representation. SIFT features with their 3D position data, referred to here as stereo-sis SIFT, are used extensively, together with point clouds, for fast extraction of global plane features, fast recognition of objects, and fast registration of scenes, as well as for overcoming the incomplete and noisy nature of point clouds.

Study on Determining Core Journals and Network Analysis in the Field of Disaster & Safety (재난안전 분야 핵심 학술지 탐색 및 네트워크 분석 연구)

  • Kim, Byungkyu;You, Beom-Jong
    • Journal of the Korean Society for Library and Information Science / v.53 no.4 / pp.373-397 / 2019
  • Recent disasters show an increasingly complex and growing trend. To prepare for and respond effectively to disasters that occur without notice, it is very important to use scientific information related to disaster and safety in addition to the standardized disaster safety information already in use. In this paper, we searched for and selected major journals in the field of disaster & safety and conducted various network analyses using KSCD and the classification scheme for integrated metadata for disaster & safety information, which was developed through the Disaster & Safety Information Sharing Platform R&D project. We also constructed and analyzed a citation network, a co-authorship network, and a keyword network through data identification and preprocessing of research paper contents. As a result, based on the networks constructed for each information analysis unit, the network structure among core domestic and foreign journals, major research institutes, core keywords, and individual information by disaster & safety type was identified in detail, and the analysis results are presented on a case-by-case basis.
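A co-authorship or keyword network of the kind mentioned in this abstract is commonly built by counting how often two items appear on the same paper. The sketch below shows that construction under simple assumptions: each paper is reduced to a list of items (authors or keywords), edges are undirected, and the edge weight is the number of shared papers. The function name and the sample keywords are hypothetical and do not come from the study's data.

```python
from collections import Counter
from itertools import combinations

def cooccurrence_edges(records):
    """records: iterable of item lists (authors or keywords of one paper).
    Returns a Counter mapping each unordered pair to its co-occurrence count."""
    edges = Counter()
    for items in records:
        unique = sorted(set(items))            # ignore duplicates within a paper
        for a, b in combinations(unique, 2):   # every unordered pair
            edges[(a, b)] += 1
    return edges

# Hypothetical keyword lists for three papers
papers = [
    ["flood", "evacuation", "risk"],
    ["flood", "risk"],
    ["earthquake", "risk"],
]
edges = cooccurrence_edges(papers)

# Weighted degree: sum of edge weights attached to each node
degree = Counter()
for (a, b), weight in edges.items():
    degree[a] += weight
    degree[b] += weight
```

Sorting each pair before counting keeps `("flood", "risk")` and `("risk", "flood")` from being treated as different edges, and the weighted degree gives a first, rough indication of which keywords are central.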

The Beginning of Decentralization: Seongbuk Village Archive (자치분권의 시작, 성북마을아카이브)

  • Kang, Sungbong
    • Journal of Korean Society of Archives and Records Management / v.22 no.1 / pp.237-243 / 2022
  • Seongbuk Village Archive is a village archive built by Seongbuk-gu Office and Seongbuk Cultural Center to capture the uniqueness and specificity of the region. It is both a community archive that preserves the records of the community and a digital archive that builds a database by digitizing source data. The management system and website were established through annual, step-by-step implementation under public-private governance. The Seongbuk Village Archive system is designed to facilitate data accumulation and connections between individual records based on an advanced standard classification system for village records. On this basis, Seongbuk Cultural Center has tried to produce convergence cultural content by linking records online and offline. The composition of items displayed on the website has also been diversified, not only to preserve records but also to produce and utilize content. The structure is the result of considering how to show users, in context, the creation and existence of Seongbuk's historical and cultural resources. In addition, a richer archive platform was built through various curations and the activities of the resident record group.

A Keyword Network Analysis of Standard Medical Terminology for Musculoskeletal System Using Big Data (빅데이터를 활용한 근골격계 표준의료용어에 대한 키워드 네트워크 분석)

  • Choi, Byung-Kwan;Choi, Eun-A;Nam, Moon-Hee
    • Journal of Digital Convergence / v.20 no.5 / pp.681-693 / 2022
  • The purpose of this study is to suggest a plan for utilizing unstructured data in the health care field by inferring standard medical terms related to the musculoskeletal system through keyword network analysis of the medical records of patients hospitalized for musculoskeletal disorders. The analysis targets were 145 discharge summaries for musculoskeletal disorders from 2015 to 2019, which were analyzed using TEXTOM, a big data analysis solution developed by The IMC. The 177 musculoskeletal-related terms derived through the primary and secondary refining processes were used in the final analysis. In the analysis results for each medical terminology system, the frequent term was 'Metastasis', the clinical finding was 'Metastasis', the symptom was 'Weakness', the diagnosis was 'Hepatitis', the treatment was 'Remove', and the body structure was 'Spine'. 'Oxycodone' was used the most. Based on these results, we suggest implications for the analysis, utilization, and management of unstructured medical data.

A Study on the Development and Performance Improvement of Chatbot for Office Automation (행정업무 자동화 챗봇 개발 및 성능 향상에 관한 연구)

  • Park, Junsoo;Kim, Youngjun;Jung, Yoonkyo
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.05a / pp.158-160 / 2022
  • Many office workers spend a lot of time performing repetitive office tasks in inefficient ways. We developed a user-friendly chatbot system based on KakaoTalk to automate repetitive tasks and used the chatbot in a real workplace. While the chatbot was in operation, the server went down or failed to respond when several people used it at the same time. To address these issues, we migrated the code of the programs used by the chatbot back-end servers and tried several ways to improve server performance, such as database redesign and load balancing. To determine how much each method contributed to the performance improvement, we measured total requests per second and average latency. Finally, we propose ways to resolve the problems of using the chatbot in the work environment.
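The two metrics named in this abstract, requests per second and average latency, can be computed from per-request timestamps as in the sketch below. The abstract does not say how the measurements were taken, so the input format (start and end times in seconds for each completed request), the function name `summarize`, and the sample values are assumptions made for illustration.

```python
def summarize(requests):
    """requests: list of (start_s, end_s) timestamps for completed requests.
    Returns (requests_per_second, average_latency_s)."""
    if not requests:
        return 0.0, 0.0
    first_start = min(start for start, _ in requests)
    last_end = max(end for _, end in requests)
    window = last_end - first_start            # wall-clock span of the test
    avg_latency = sum(end - start for start, end in requests) / len(requests)
    rps = len(requests) / window if window > 0 else float("inf")
    return rps, avg_latency

# Hypothetical measurements: four requests, some overlapping in time
samples = [(0.0, 0.5), (0.5, 1.0), (1.0, 2.0), (1.5, 2.0)]
rps, avg_latency = summarize(samples)
```

Running the same summary before and after a change such as load balancing gives a simple way to compare how much each change affects throughput and response time.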

A Study on Significant Properties for Dataset Type Preservation Format (데이터세트 유형 전자기록의 필수보존속성 연구)

  • Jung-eun Lee;Dongmin Yang
    • Journal of the Korean BIBLIA Society for library and Information Science / v.34 no.4 / pp.259-283 / 2023
  • This study acknowledges that prevailing regulations on the long-term preservation of electronic records focus mainly on document types, neglecting the preservation of electronic records from various administrative information systems. With the growing interest in data management in the era of big data, it is imperative to establish clear standards for the long-term preservation of datasets. The choice of preservation format for electronic records is based on the specific standards for each type of electronic record. These standards are formulated according to the significant properties relevant to the electronic record type. This study aims to identify the significant properties of electronic records of each record type before creating specific preservation format selection criteria for these record types. To achieve this, we reviewed and analyzed R&D studies by the National Archives of Korea and NARA in the United States. As a result, 9 significant properties were identified for database-type entities, and 7 significant properties were identified for structured data-type entities.