Building Modeling for Unstructured Data Analysis Using Big Data Processing Technology (빅데이터 처리 기술을 활용한 비정형데이터 분석 모델링 구축)

  • Kim, Jung-Hoon;Kim, Sung-Jin;Kwon, Gi-Yeol;Ju, Da-Hye;Oh, Jae-Yong;Lee, Jun-Dong
    • Proceedings of the Korean Society of Computer Information Conference
    • 2020.07a
    • pp.253-255
    • 2020
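  • Enterprise and institutional data consist of text-based unstructured data such as word-processor documents, presentations, e-mail, open APIs, Excel, XML, and JSON. Text mining applies natural language processing and machine learning techniques to carry out information extraction, summarization, classification, clustering, and association analysis. After building various models that can present diverse visualization data, the contents of civil complaint applications are analyzed and transformed. This paper deals with a technique that uses AI and big data to analyze civil complaints and automatically assign each one to the appropriate department.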
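
The routing idea in the abstract above (analyze a complaint's text, then assign it to a department) can be illustrated with a minimal sketch. The abstract does not say which model the authors used, so the bag-of-words cosine similarity below is a stand-in, and the department names, training sentences, stopword list, and function names are all hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical labeled complaints (text, department); not taken from the paper.
TRAINING = [
    ("pothole on the road near the bus stop", "roads"),
    ("street light broken on main road", "roads"),
    ("garbage not collected this week", "sanitation"),
    ("illegal dumping of garbage in the park", "sanitation"),
    ("water bill charged twice", "billing"),
    ("tax bill payment error", "billing"),
]

# Tiny illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"the", "on", "of", "in", "this", "not", "near", "is", "was", "my", "our"}


def tokenize(text):
    """Lowercase the text, split it into alphabetic tokens, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def build_profiles(examples):
    """Sum the term counts of all training complaints for each department."""
    profiles = {}
    for text, dept in examples:
        profiles.setdefault(dept, Counter()).update(tokenize(text))
    return profiles


def cosine(a, b):
    """Cosine similarity between two term-count vectors stored as Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def route(complaint, profiles):
    """Return the department whose term profile is most similar to the complaint."""
    vec = Counter(tokenize(complaint))
    return max(profiles, key=lambda dept: cosine(vec, profiles[dept]))


profiles = build_profiles(TRAINING)
print(route("garbage was not collected on my street", profiles))
print(route("water bill is wrong", profiles))
```

A production system would need a morphological analyzer for Korean text and a trained classifier; the sketch only shows the shape of the text-to-department mapping.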

Big Data Analytics Applied to the Construction Site Accident Factor Analysis

  • KIM, Joon-soo;Lee, Ji-su;KIM, Byung-soo
    • International conference on construction engineering and project management
    • 2015.10a
    • pp.678-679
    • 2015
  • Safety accidents on construction sites have been increasing recently. This study therefore develops a 'Big-Data Analysis Modeling' approach that collects Internet news articles from the last 10 years and derives the causes of accidents occurring in each season. To this end, a Web Crawling Model that can collect 98% of the desired information from the Internet was developed using the 'XML', 'tm', and 'RCurl' packages of R, a statistical analysis program, together with a Data Mining Model that derives useful information by applying 'Principal Component Analysis' to the word frequencies obtained from text mining. The Web Crawling Model collected 7,384 of the 7,534 Internet news articles posted over the past 10 years on "safety accidents in construction sites", and the seasonal characteristics of safety accidents were identified. The results showed that accidents caused by abnormal temperature and localized heavy rain occurred frequently in spring and winter, and accidents caused by violation of safety regulations and breakdown of structures occurred frequently in spring and fall. In addition, accidents caused by collisions of heavy equipment were found to occur consistently in every season. The results obtained from 'Big-Data Analysis Modeling' correspond with prior studies. Thus, the approach is reliable and can be applied not only to construction sites but also to industry in general.

Data hub system based on SQL/XMDR message using Wrapper for distributed data interoperability (분산 데이터 상호운용을 위한 SQL/XMDR 메시지 기반의 Wrapper를 이용한 데이터 허브 시스템)

  • Moon, Seok-Jae;Jung, Gye-Dong;Choi, Young-Keun
    • Journal of the Korea Institute of Information and Communication Engineering
    • v.11 no.11
    • pp.2047-2058
    • 2007
  • In enterprise business environments that are geographically and spatially distributed, it can be difficult to eliminate redundancy by filtering data sources, to integrate data under standard rules and metadata, and to present the integrated data through a single view. In particular, exchanging diverse data among heterogeneous systems and applications regardless of type and format, and keeping the integrated information continuously and accurately synchronized, is of paramount concern. This paper therefore proposes a data hub system based on SQL/XMDR messages to overcome the semantic interoperability problems that arise when legacy systems exchange or join data. The system uses the message mapping technique of a query transformation system to maintain, in real time, data modified during cooperation. It consistently maintains data modified in real time when cooperating legacy systems exchange or join data, and it improves the clarity and availability of data by providing a single interface for data retrieval.

Generalization by LoD and Coordinate Transformation in On-the-demand Web Mapping (웹환경에서 LoD와 좌표변형에 의한 지도일반화)

  • Kim, Nam-Shin
    • Journal of the Korean association of regional geographers
    • v.15 no.2
    • pp.307-315
    • 2009
  • The purpose of map generalization is to convey a concise cartographic representation and geographic meaning. With the development of computer cartography, new generalization algorithms have been developed for digital environments. This study explores the possibilities of multiscale mapping through generalization, applying coordinate transformation and LoD (level of detail) in web cartography. Coordinate transformation is a method for improving the transmission of spatial data. LoD is a method of building a web map by selecting spatial data according to the user's zoom level. The test layers consisted of contour lines, the stream network, place names, mountain summits, and administrative offices. Generalization was applied to linear and polygonal features at zoom levels by scale, using XML-based Scalable Vector Graphics (SVG). As a result, the data storage size in SVG was reduced from 9.76 MB to 4.08 MB, to about 41% of the original size. LoD generalization was applied to map elements in stages by zoom level. At the first zoom levels, major place names and administrative offices, higher-order stream channels, and main mountain summits were represented, and the number of map elements increased at higher levels. The results can help improve map aesthetics and reduce data volume in web cartography, and further research is needed on algorithms for map generalization over the web.
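
The LoD step described above (select map elements by the user's zoom level, then emit SVG) can be sketched as follows. This is an illustration under assumptions rather than the paper's algorithm: the feature list, the `min_zoom` attribute, and the function names are hypothetical, and only point features with labels are handled.

```python
from xml.sax.saxutils import escape

# Hypothetical point features: (name, x, y, min_zoom).
# A feature is drawn only when the current zoom level is >= its min_zoom.
FEATURES = [
    ("Main Summit", 40, 30, 1),
    ("City Hall", 120, 80, 1),
    ("Village Office", 60, 140, 2),
    ("Minor Peak", 150, 20, 3),
]


def select_by_lod(features, zoom):
    """Keep only the features whose minimum zoom level has been reached."""
    return [f for f in features if f[3] <= zoom]


def to_svg(features, zoom, width=200, height=200):
    """Render the LoD-filtered features as a small SVG document string."""
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}">'
    ]
    for name, x, y, _ in select_by_lod(features, zoom):
        parts.append(f'  <circle cx="{x}" cy="{y}" r="3"/>')
        parts.append(f'  <text x="{x + 5}" y="{y}">{escape(name)}</text>')
    parts.append("</svg>")
    return "\n".join(parts)


for zoom in (1, 2, 3):
    svg = to_svg(FEATURES, zoom)
    print(zoom, len(select_by_lod(FEATURES, zoom)), len(svg))
```

Because fewer elements are emitted at low zoom levels, the SVG sent to the client is smaller there, which is the data-reduction effect the abstract reports.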

The Utilization of Electronic Journal Files in the Production of an Abstract Database: A case of KoreaMed System (초록 데이터베이스 구축에 있어서 학술지 전자출판 파일의 활용과 문제점: KoreaMed를 중심으로)

  • 이춘실
    • Journal of the Korean BIBLIA Society for library and Information Science
    • 2001
  • The study examined the current status and use of electronic publishing files in producing a bibliographic database. In particular, it examined the problems faced in producing KoreaMed, an abstract database of Korean medical journals. KoreaMed's method of reusing the computer files produced while publishing a print journal was found to be very effective: it assures the accuracy of data, speeds up input, and reduces input costs. However, such a project cannot be accomplished satisfactorily without the cooperation of the publishers involved. It turns out that many small publishers and academic societies have rarely kept the electronic publishing files of previous issues. Moreover, it is hard to maintain a reliable channel for receiving the files continuously. The input and processing of special characters are delicate problems. In addition, the diversity of journal layouts and formats, of the electronic publishing software used, and of the storage media makes using the electronic publishing files a very complex process. To operate the KoreaMed system more efficiently by requiring publishers to submit XML files that meet the KoreaMed standard, it is necessary to educate and train journal publishers' personnel in managing electronic publishing files.

A Basic Study of iBUM Development based on BIM/GIS Standard Information for Construction of Spatial Database (공간자료 구축을 위한 BIM/GIS 표준정보 기반 건축도시통합모델(iBUM)의 개발에 관한 기초연구)

  • Ryu, Jung Rim;Choo, Seung Yeon
    • Spatial Information Research
    • v.22 no.5
    • pp.27-41
    • 2014
  • Recently, BIM (Building Information Modeling) has been applied to infrastructure such as roads and bridges, and information about the environment outside buildings is needed to maintain and manage large urban facilities. In addition, the convergence of spatial information and big data has great potential, since interoperability of information between BIM and GIS may bring considerable benefits and advances in applications such as simulation, urban planning, and national land security. This study therefore compares and analyzes the differences in information structure and shape representation among IFC, CityGML, and LandXML, in order to link information efficiently between IFC, the standard model for BIM, and CityGML, a standard model in the GIS sector, and to prepare a basic fusion strategy and utilization method for BIM and GIS. On that basis it suggests a development direction for a model that integrates buildings and cities for spatial information analysis. The results of the study are as follows. First, the contents and structure of IFC, CityGML, and LandXML are compared and analyzed. Second, a development direction is suggested for iBUM (Integrated Building and Urban Model), which is based on convergence technology for spatial information analysis. Finally, a strategy and method for using BIM and GIS in the iBUM environment are proposed.

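Development of an Optimal Deal Search Strategy for User Agents in a Proposal-Based Automated Negotiation Market (제안기반 자동 거래협상 시장에서의 사용자 에이전트를 위한 최적 거래안 탐색 전략의 개발)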

  • 홍준석;김우주;송용욱
    • Proceedings of the Korea Intelligent Information System Society Conference
    • 2002.05a
    • pp.140-148
    • 2002
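  • People, who have pursued a more convenient life through computers, have come to demand automated negotiation to satisfy this desire in electronic commerce as well. Because automated negotiation using intelligent agents relieves much of the burden of human deal negotiation, research on automated negotiation agents has become active. In consumer-to-consumer e-commerce, most studies of automated negotiation agents have focused on auction markets, whereas studies of automated negotiation agents in proposal-based negotiation markets, for goods with several deal attributes besides price, have recently become active. This study examines the characteristics required of a proposal-based negotiation market for multi-attribute goods in consumer-to-consumer e-commerce, one that exploits differences in individual utility as deal attributes change, and on that basis develops a method for representing an individual consumer's preference structure over changing deal attributes, as needed for automated deal negotiation. It also proposes the features and protocol that the negotiation market needs in order to conduct such automated negotiation fairly, and designs the architecture of a market-operating agent system. Finally, it develops in detail a search method by which a user agent system participating in a proposal-based negotiation market with this distributed market structure finds the optimal trading partner and the optimal deal. With the results of this study, it will be possible to operate a fair negotiation market in consumer-to-consumer e-commerce in which sellers as well as buyers can maximize the utility they gain from deals reached through negotiation, and users will be able to express their negotiation preference structures easily and conduct automated deal negotiation that reflects them. It interoperates with EDI environments that use the existing UN/EDIFACT standard and with existing VAN-based EDI relay systems, and it is positioned as a leading study that prepares in advance for the Korea Customs Service's future adoption of an XML/EDI standard. The XML/EDI customs clearance system developed in this study will make it possible to escape the burden of expensive EDI usage fees, which have been the greatest obstacle to the service, and will be an important opportunity to promote electronic document interchange at low EDI construction and operating costs and to establish XML as the standard for Internet-based document distribution. The artificial neural network technique that considered both financial and non-financial indicators was found to have a high prediction accuracy. That is, in the logistic regression analysis the financial-indicator model scored 84.45% and 85.10% on the training and test sets, while the financial/non-financial-indicator model scored 84.45% and 85.08%, nearly the same prediction accuracy; in the artificial neural network analysis, however, the financial-indicator model scored 92.23% and 85.10%, while the financial/non-financial-indicator model scored 91.12% and 88.06%, an improved prediction accuracy. ting LMS according to increasing the step-size parameter $\mu$ in the experimentally computed. learning curve. Also we find that convergence speed of proposed algorithm is increased by (B+1) time proportional to B which B is the number of recycled data buffer without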

Framework and Tools for Rapid M&S Component Development and Reusability (M&S 컴포넌트의 신속 개발과 재사용을 위한 프레임워크 및 도구 개발)

  • Lee, Yong Heon;Cho, Kyu Tae;Lee, Seung Young;Hwang, Keun Chul;Kim, Sae Hwan
    • Journal of the Korea Society for Simulation
    • v.22 no.4
    • pp.29-38
    • 2013
  • Modeling and simulation technology is used in various fields. In the military field in particular, Simulation-Based Acquisition (SBA) is recognized as an essential policy. To carry out SBA effectively, modeling and simulation techniques should be applied throughout the life cycle of weapon system development, and a framework and tools that support rapid component development and reuse are needed. In this research, we use a simulation framework based on a modeling formalism to enhance reusability, and a GUI-based modeling environment for rapid M&S component development. In the proposed framework, components act as plug-ins on the basis of an XML-based object model, so the design stays flexible when the model and simulation structure change. In addition, our methods are effective for implementing simulation support functions such as model data logging and communication with external systems. In this paper, we describe the architecture and functions of the framework and tools.

Research of Soft-Interface Creation and Provision Methodology According to Applications Based on Mobile Device Environment (모바일 디바이스 환경에서 어플리케이션에 따른 소프트 인터페이스 제작 및 제공 방안 연구)

  • Cho, Changhee;Park, Sanghyun;Lee, Sang-Joon;Kim, Jinsul
    • Journal of Digital Contents Society
    • v.14 no.4
    • pp.513-519
    • 2013
  • In this paper, we provide interfaces suited to users' application environments and offer web-based tools with which users can create interfaces for a wide range of application environments. HTML5 is used in the creation process, so users can create various interfaces by dragging the mouse and apply them to multimedia and game applications as well as documents, using the ASCII codes and key events provided by the Android OS. The interface database is stored in HDFS (Hadoop Distributed File System) for management, and users can keep their own designed interfaces or select interfaces at any time through a simple login. To provide interfaces quickly, Hive, which runs on Hadoop, is used for search, and the data is delivered as an XML file that smart mobile devices can process quickly.

Development of a GML 3.0 Encoding System Using Mapping Rules (매핑 규칙을 활용한 GML 3.0 엔코딩 시스템의 개발)

  • Lee, Yong-Soo;Kim, Dong-O;Son, Hoon-Soo;Han, Ki-Joon
    • Journal of Korea Spatial Information System Society
    • v.5 no.1 s.9
    • pp.27-38
    • 2003
  • Because geographic information is now used extensively in many fields, there is a growing need for individuals, organizations, societies, and countries to circulate and use it easily. To support the interoperability of heterogeneous geographic information across fields, the OGC (Open GIS Consortium) proposed the GML (Geography Markup Language) specification, which defines XML encoding rules for heterogeneous geographic information. In addition, ISO/TC 211 adopted the GML specification in order to make it an international standard. In this paper, we therefore first analyze in detail the GML 3.0 specification, which can support the interoperability of heterogeneous geographic information. We then propose and apply mapping rules that define the encoding method, so that encoding can be done easily and efficiently. Finally, we design and implement a GML 3.0 encoding system that uses the mapping rules to encode geographic information stored in spatial databases into GML 3.0 documents. In particular, we use ZEUS as the spatial database system for testing the encoding system.
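
To make the mapping-rule idea above concrete, here is a minimal sketch of encoding one database row as XML with a GML point. The abstract does not give the paper's actual rule format, so the `MAPPING_RULE` dictionary, the `Feature` and `location` element names, and the sample row are hypothetical; only the GML namespace and the `gml:Point` / `gml:pos` elements come from the GML 3 specification.

```python
import xml.etree.ElementTree as ET

GML_NS = "http://www.opengis.net/gml"
ET.register_namespace("gml", GML_NS)

# Hypothetical mapping rule: database column name -> XML element name.
MAPPING_RULE = {"name": "name", "category": "type"}


def encode_feature(row, rule=MAPPING_RULE):
    """Encode one row (a dict) as a feature element containing a gml:Point."""
    feature = ET.Element("Feature")
    # Attribute columns are renamed according to the mapping rule.
    for column, element_name in rule.items():
        ET.SubElement(feature, element_name).text = str(row[column])
    # The geometry columns x and y become a gml:Point with a gml:pos child.
    location = ET.SubElement(feature, "location")
    point = ET.SubElement(location, f"{{{GML_NS}}}Point")
    pos = ET.SubElement(point, f"{{{GML_NS}}}pos")
    pos.text = f"{row['x']} {row['y']}"
    return feature


# Hypothetical row, as it might be read from a spatial database table.
row = {"name": "City Hall", "category": "administrative", "x": 127.0, "y": 37.5}
xml_text = ET.tostring(encode_feature(row), encoding="unicode")
print(xml_text)
```

Keeping the column-to-element mapping in data rather than in code is what lets the same encoder serve different tables; a real GML 3.0 encoder would also need an application schema, `srsName` handling, and line and polygon geometries.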

