• Title/Summary/Keyword: Text Construction

Search Result 386, Processing Time 0.028 seconds

Characteristics Analysis of Seasonal Construction Site Fall Accident using Text Mining (텍스트 마이닝을 활용한 계절별 건설현장 추락사고 특징 분석)

  • Kim, Joon-Soo;Kim, Byung-Soo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.20 no.3
    • /
    • pp.113-121
    • /
    • 2019
  • The death rate of industrial accidents per 10,000 people in Korea is two to three times higher than that of major countries. Falling accidents at the construction site happened to have caused the most deaths. Analysis of existing research and measures by national institutions showed that the industrial accident management concentrated on falling accidents was insufficient and the seasonal safety management measures were not enough. There is thus the need for research that provides detailed and enough information on falling accidents. This study, therefore, aims to overcome the limitations of existing research and safety management accident response using a methodology that provides the necessary information for the prevention of fall accidents by deriving seasonal crash characteristics of the construction site. In order to provide enough information, 387 cases of seasonal construction site falling were collected, which describes the causal relationship of accidents. Text mining using principal component analysis and cluster analysis was carried out. The analysis showed that: In the spring, snowfall and unreasonable operation of equipment including lifts were the major cause. In summer, most accidents were caused by form, insufficient safety inspection, and installation work. In autumn, weather factors such as wind and typhoon were the cause. In winter, material transportation, exterior wall work, and ignore safety precautions were the cause of the crash.

Development of KTRIMS Using the Technology of Full Text DB Construction (전문(全文) DB 구축(構築)에 의한 한국통신연구정보관리(韓國通信硏究情報管理) 시스템 개발(開發))

  • Lee, Sang-Yeob;Ahn, Hyun-Soo;Lee, Yang-Ok
    • Journal of Information Management
    • /
    • v.24 no.1
    • /
    • pp.1-20
    • /
    • 1993
  • KTRC(Korea Telecom Research Center) has developed the KTRIMS(Korea Telecom Research Information Management System) to keep and share the full text of the various up-to-date research information which many research institutes in KT have produced. This paper has presented the structure and the features of the KTRIMS.

  • PDF

A Study of on Extension Compression Algorithm of Mixed Text by Hangeul-Alphabet

  • Ji, Kang-yoo;Cho, Mi-nam;Hong, Sung-soo;Park, Soo-bong
    • Proceedings of the IEEK Conference
    • /
    • 2002.07a
    • /
    • pp.446-449
    • /
    • 2002
  • This paper represents a improved data compression algorithm of mixed text file by 2 byte completion Hangout and 1 byte alphabet from. Original LZW algorithm efficiently compress a alphabet text file but inefficiently compress a 2 byte completion Hangout text file. To solve this problem, data compression algorithm using 2 byte prefix field and 2 byte suffix field for compression table have developed. But it have a another problem that is compression ratio of alphabet text file decreased. In this paper, we proposes improved LZW algorithm, that is, compression table in the Extended LZW(ELZW) algorithm uses 2 byte prefix field for pointer of a table and 1 byte suffix field for repeat counter. where, a prefix field uses a pointer(index) of compression table and a suffix field uses a counter of overlapping or recursion text data in compression table. To increase compression ratio, after construction of compression table, table data are properly packed as different bit string in accordance with a alphabet, Hangout, and pointer respectively. Therefore, proposed ELZW algorithm is superior to 1 byte LZW algorithm as 7.0125 percent and superior to 2 byte LZW algorithm as 11.725 percent. This paper represents a improved data Compression algorithm of mixed text file by 2 byte completion Hangout and 1 byte alphabet form. This document is an example of what your camera-ready manuscript to ITC-CSCC 2002 should look like. Authors are asked to conform to the directions reported in this document.

  • PDF

CALS oriented design/fabrication information system for steel bridges

  • Isohata, Hiroshi;Fukuda, Masahiko;Watanabe, Sueo
    • Steel and Composite Structures
    • /
    • v.3 no.1
    • /
    • pp.13-32
    • /
    • 2003
  • In this paper design and fabrication information system for steel bridge construction is studied and proposed according to the progress of Construction CALS/EC in the construction industry in Japan. The data exchange in this system bases on the text file as well as CAD data with simplified drawings. The concept of this system is discussed following the analysis on the issues of the conventional system. The application of the product model is also discussed including effects and issues on the inspection system. This paper is based on the study carried out by Special Committee on Construction CALS of JASBC to which author belong.

Character Segmentation in Chinese Handwritten Text Based on Gap and Character Construction Estimation

  • Zhang, Cheng Dong;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • v.8 no.1
    • /
    • pp.39-46
    • /
    • 2012
  • Character segmentation is a preprocessing step in many offline handwriting recognition systems. In this paper, Chinese characters are categorized into seven different structures. In each structure, the character size with the range of variations is estimated considering typical handwritten samples. The component removal and merge criteria are presented to remove punctuation symbols or to merge small components which are part of a character. Finally, the criteria for segmenting the adjacent characters concerning each other or overlapped are proposed.

Automated Construction Activities Extraction from Accident Reports Using Deep Neural Network and Natural Language Processing Techniques

  • Do, Quan;Le, Tuyen;Le, Chau
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.744-751
    • /
    • 2022
  • Construction is among the most dangerous industries with numerous accidents occurring at job sites. Following an accident, an investigation report is issued, containing all of the specifics. Analyzing the text information in construction accident reports can help enhance our understanding of historical data and be utilized for accident prevention. However, the conventional method requires a significant amount of time and effort to read and identify crucial information. The previous studies primarily focused on analyzing related objects and causes of accidents rather than the construction activities. This study aims to extract construction activities taken by workers associated with accidents by presenting an automated framework that adopts a deep learning-based approach and natural language processing (NLP) techniques to automatically classify sentences obtained from previous construction accident reports into predefined categories, namely TRADE (i.e., a construction activity before an accident), EVENT (i.e., an accident), and CONSEQUENCE (i.e., the outcome of an accident). The classification model was developed using Convolutional Neural Network (CNN) showed a robust accuracy of 88.7%, indicating that the proposed model is capable of investigating the occurrence of accidents with minimal manual involvement and sophisticated engineering. Also, this study is expected to support safety assessments and build risk management systems.

  • PDF

Machine Learning Based Automatic Categorization Model for Text Lines in Invoice Documents

  • Shin, Hyun-Kyung
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.12
    • /
    • pp.1786-1797
    • /
    • 2010
  • Automatic understanding of contents in document image is a very hard problem due to involvement with mathematically challenging problems originated mainly from the over-determined system induced by document segmentation process. In both academic and industrial areas, there have been incessant and various efforts to improve core parts of content retrieval technologies by the means of separating out segmentation related issues using semi-structured document, e.g., invoice,. In this paper we proposed classification models for text lines on invoice document in which text lines were clustered into the five categories in accordance with their contents: purchase order header, invoice header, summary header, surcharge header, purchase items. Our investigation was concentrated on the performance of machine learning based models in aspect of linear-discriminant-analysis (LDA) and non-LDA (logic based). In the group of LDA, na$\"{\i}$ve baysian, k-nearest neighbor, and SVM were used, in the group of non LDA, decision tree, random forest, and boost were used. We described the details of feature vector construction and the selection processes of the model and the parameter including training and validation. We also presented the experimental results of comparison on training/classification error levels for the models employed.

A Basic Study on the Yuarye of Ji Cheng (계성의『원치』에 관한 기초적 연구)

  • 이유직;황기원
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.23 no.2
    • /
    • pp.223-241
    • /
    • 1995
  • Ji Cheng's great work on garden design, the 'Yuanye'(Craft of garden), written in 1631 and originally published in 1634 is the first surviving manual on landscape gardening in the Chinese tradition. This study aims at investigating not only Ji Cheng's life, achievements companionship and design activities, but also the xylographic copies, literary style, and framework of Yuanye in their historical context in order to provide the bases for further study, Ji Cheng was exellent in poetry and painting. And he constructed Dongdiyuan in Changzhou around 1623, Wuyuan in Yiaheng in 1631, and Yingyuan in Yangzhou around 1634 But no poems, paintings, and gardens designed by hi shill exist Therefore his design phi philosophy is able to be interpreted only by his work, Yuanye. After publishing, Yuanye fell into obscurity for several centuries in Chlna. It was redescovered and reprinted for the first time in 1931. Yuanye is composed of prefaces and main text The main text is divided into 'the Theory of Construction' and 'on Gardens', and the latter also into 10 sections. In this text Ji Cheng explains the aesthetic principles underlying garden design and the appropriate emotional response to various efftcts Especially, he emphasizes the importance of basin the garden design on the taxi ting nature and features of landscape and making use of natural scenery. The literary style of the book is highly mannered, and there are so many poetic descriptions and Ji Cheng's native Jiangsu dialects. So the translation of the original text is very difficult After this, the major design concepts of Ji Cheng's landscape gardening theory and whole network of these concepts have to be studied.

  • PDF

On the Analysis of Natural Language Processing Morphology for the Specialized Corpus in the Railway Domain

  • Won, Jong Un;Jeon, Hong Kyu;Kim, Min Joong;Kim, Beak Hyun;Kim, Young Min
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.14 no.4
    • /
    • pp.189-197
    • /
    • 2022
  • Today, we are exposed to various text-based media such as newspapers, Internet articles, and SNS, and the amount of text data we encounter has increased exponentially due to the recent availability of Internet access using mobile devices such as smartphones. Collecting useful information from a lot of text information is called text analysis, and in order to extract information, it is performed using technologies such as Natural Language Processing (NLP) for processing natural language with the recent development of artificial intelligence. For this purpose, a morpheme analyzer based on everyday language has been disclosed and is being used. Pre-learning language models, which can acquire natural language knowledge through unsupervised learning based on large numbers of corpus, are a very common factor in natural language processing recently, but conventional morpheme analysts are limited in their use in specialized fields. In this paper, as a preliminary work to develop a natural language analysis language model specialized in the railway field, the procedure for construction a corpus specialized in the railway field is presented.

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.