• Title/Summary/Keyword: Text Construction

Automated Prioritization of Construction Project Requirements using Machine Learning and Fuzzy Logic System

  • Hassan, Fahad ul;Le, Tuyen;Le, Chau;Shrestha, K. Joseph
    • International conference on construction engineering and project management / 2022.06a / pp.304-311 / 2022
  • Construction inspection is a crucial stage that ensures that all contractual requirements of a construction project are verified. The construction inspection capabilities of state highway agencies have been greatly affected by budget reductions. As a result, efficient inspection practices such as risk-based inspection are required to optimize the use of limited resources without compromising inspection quality. Automated prioritization of textual requirements according to their criticality would be extremely helpful since contractual requirements are typically presented in unstructured natural language in voluminous text documents. The current study introduces a novel model for predicting the risk level of requirements using machine learning (ML) algorithms. The ML algorithms tested in this study included naïve Bayes, support vector machines, logistic regression, and random forest. The training data includes sequences of requirement texts which were labeled with risk levels (such as very low, low, medium, high, very high) using a fuzzy logic system. The fuzzy model treats the three risk factors (severity, probability, detectability) as fuzzy input variables, and implements fuzzy inference rules to determine the labels of requirements. The performance of the model was examined on a labeled dataset created by fuzzy inference rules and three different membership functions. The developed requirement risk prediction model yielded a precision, recall, and f-score of 78.18%, 77.75%, and 75.82%, respectively. The proposed model is expected to provide construction inspectors with a means for the automated prioritization of voluminous requirements by their importance, thus helping to maximize the effectiveness of inspection activities under resource constraints.
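
The fuzzy labeling step described in this abstract can be illustrated with a minimal sketch. It is not the authors' model: the 0 to 10 input scale, the triangular membership functions, the rule consequents (an average of the input terms), the weighted-average defuzzification, and the cut points for the five labels are all assumptions made for illustration. All three inputs are assumed to be scored so that a higher value means higher risk.

```python
from itertools import product

# Assumed fuzzy terms on a 0-10 scale: (left foot, peak, right foot).
TERMS = {"low": (0, 0, 5), "medium": (0, 5, 10), "high": (5, 10, 10)}
# Assumed numeric value of each term, used to build rule consequents.
TERM_SCORE = {"low": 0.0, "medium": 0.5, "high": 1.0}
LABELS = ["very low", "low", "medium", "high", "very high"]


def tri(x, a, b, c):
    """Triangular membership with feet at a and c and peak at b."""
    if x <= a or x >= c:
        # Shoulder sets (a == b or b == c) keep full membership at the peak.
        return 1.0 if x == b else 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)


def fuzzify(x):
    """Membership degree of x in each assumed term."""
    return {name: tri(x, *abc) for name, abc in TERMS.items()}


def risk_label(severity, probability, detectability):
    """Return (label, crisp score in [0, 1]) for inputs in [0, 10]."""
    memberships = [fuzzify(v) for v in (severity, probability, detectability)]
    num = den = 0.0
    # One rule per combination of terms; firing strength is the minimum
    # membership, and the consequent is the mean of the term scores.
    for combo in product(TERMS, repeat=3):
        strength = min(m[t] for m, t in zip(memberships, combo))
        consequent = sum(TERM_SCORE[t] for t in combo) / 3
        num += strength * consequent
        den += strength
    score = num / den if den else 0.0
    index = min(int(score * len(LABELS)), len(LABELS) - 1)
    return LABELS[index], score
```

Labels produced this way could then serve as training targets for a text classifier, which is the role the abstract assigns to the fuzzy system.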

Keyword Network Visualization for Text Summarization and Comparative Analysis (문서 요약 및 비교분석을 위한 주제어 네트워크 가시화)

  • Kim, Kyeong-rim;Lee, Da-yeong;Cho, Hwan-Gue
    • Journal of KIISE / v.44 no.2 / pp.139-147 / 2017
  • Most of the information on the Internet is textual. One of the main problems in large-scale document analysis in the "big data" era is therefore the development of systems that automatically understand textual data, and automated keyword extraction for text summarization and abstraction is a typical research problem. However, simply listing a few keywords is insufficient to reveal the complex semantic structure of general texts. In this paper, a text-visualization method is developed that constructs a graph by computing the degree of relation among keywords selected from the target text. Two construction models that define the edge relation are proposed for computing this relation degree: the influence-interval model and the word-distance model. The graph visualized from the keyword-derived edge relations is more flexible and useful for displaying the meaning structure of the target text, and this abstract graph enables fast and easy understanding of the text. The authors' experiment showed that the proposed abstract-graph model is superior to a keyword list for attaining a semantic and comparative understanding of text.
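
As a rough illustration of building a keyword graph from token distances, the sketch below links two keywords whenever they occur within a fixed window of tokens and adds the inverse of their distance to the edge weight. This is only one plausible reading of a word-distance edge model. The abstract does not define either of the paper's models, so the window size, the inverse-distance weighting, and the function name `keyword_graph` are assumptions.

```python
from collections import defaultdict


def keyword_graph(tokens, keywords, window=5):
    """Build weighted edges between keywords that occur close together.

    Assumed rule (not from the paper): each pair of distinct keywords
    found at most `window` tokens apart adds 1/distance to their edge.
    """
    # Positions of keyword occurrences, in text order.
    positions = [(i, t) for i, t in enumerate(tokens) if t in keywords]
    edges = defaultdict(float)
    for a in range(len(positions)):
        i, u = positions[a]
        for b in range(a + 1, len(positions)):
            j, v = positions[b]
            if j - i > window:
                break  # later occurrences are even farther away
            if u != v:
                edges[tuple(sorted((u, v)))] += 1.0 / (j - i)
    return dict(edges)


if __name__ == "__main__":
    text = "text graph shows keyword links while another text uses graph layout"
    print(keyword_graph(text.split(), {"text", "graph", "keyword"}))
```

The resulting edge dictionary can be passed to any graph-drawing tool; heavier edges mark keywords that tend to appear near each other.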

Construction of Text Summarization Corpus in Economics Domain and Baseline Models

  • Sawittree Jumpathong;Akkharawoot Takhom;Prachya Boonkwan;Vipas Sutantayawalee;Peerachet Porkaew;Sitthaa Phaholphinyo;Charun Phrombut;Khemarath Choke-mangmi;Saran Yamasathien;Nattachai Tretasayuth;Kasidis Kanwatchara;Atiwat Aiemleuk;Thepchai Supnithi
    • Journal of information and communication convergence engineering / v.22 no.1 / pp.33-43 / 2024
  • Automated text summarization (ATS) systems rely on language resources as datasets. However, creating these datasets is a complex and labor-intensive task requiring linguists to extensively annotate the data. Consequently, certain public datasets for ATS, particularly in languages such as Thai, are not as readily available as those for the more popular languages. The primary objective of the ATS approach is to condense large volumes of text into shorter summaries, thereby reducing the time required to extract information from extensive textual data. Owing to the challenges involved in preparing language resources, publicly accessible datasets for Thai ATS are relatively scarce compared to those for widely used languages. The goal is to produce concise summaries and accelerate the information extraction process using vast amounts of textual input. This study introduced ThEconSum, an ATS architecture specifically designed for the Thai language, using economy-related data. An evaluation of this research revealed the significant remaining tasks and limitations of the Thai language.

A Multi-level Representation of the Korean Narrative Text Processing and Construction-Integration Theory: Morpho-syntactic and Discourse-Pragmatic Effects of Verb Modality on Topic Continuity (한국어 서사 텍스트 처리의 다중 표상과 구성 통합 이론: 주제어 연속성에 대한 양태 어미의 형태 통사적, 담화 화용적 기능)

  • Cho Sook-Whan;Kim Say-Young
    • Korean Journal of Cognitive Science / v.17 no.2 / pp.103-118 / 2006
  • The main purpose of this paper is to investigate the effects of discourse topic and morpho-syntactic verbal information on the resolution of null pronouns in the Korean narrative text within the framework of the construction-integration theory (Kintsch, 1988; Singer & Kintsch, 2001; Graesser, Gernsbacher, & Goldman, 2003). For the purpose of this paper, two conditions were designed: an explicit condition with both a consistently maintained discourse topic and the person-specific verb modals on the one hand, and a neutral condition with no discourse topic or morpho-syntactic information provided, on the other. We measured the reading times for the target sentence containing a null pronoun, the question response times for finding an antecedent, and the accuracy rates for finding an antecedent. During the experiments each passage was presented one sentence at a time on a computer-controlled display. Each new sentence was presented on the screen at the moment the participant pressed the button on the computer keyboard. Main findings indicate that processing is facilitated by macro-structure (topicality) in conjunction with micro-structure (morpho-syntax) in pronoun interpretation. It is speculated that global processing alone may not be able to determine which potential antecedent is to be focused unless aided by lexical information. It is argued that the results largely support the resonance-based model, but not the minimalist hypothesis.

Study on the construction of digital library (디지탈도서관의 구축을 위한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society / v.25 / pp.529-567 / 1996
  • This paper surveys the theoretical background of the digital library. Its definition and functions, case studies, and basic techniques for constructing a digital library system are presented. The differences between the traditional library and the digital library are compared, and the conditions that should be taken into consideration in digital library construction are suggested. The suggestions are summarized as follows: 1. For the construction of a digital library, the library collection should be digitalized by using CD-ROM and commercial online services. 2. The digitalization of library collections should be planned by sharing subjects between libraries. To control this cooperation, an organization to drive the digitalization should be established, and standards for the digital library need to be enacted. 3. The connection between MARC-formatted bibliographic databases and full text should be studied. 4. All types of information, including texts, pictures, sounds, and films, should also be digitalized. 5. To satisfy the needs of many users, various user interfaces suited to different kinds of users must be established. 6. When a digital library is constructed, copyright and resource sharing must be guaranteed, depending on the cost of database usage. 7. Because library digitalization will involve various kinds of libraries, the interface for resource sharing will be a constant concern. 8. The sharing of information resources between libraries will take place on the Internet, so attention must be paid to various Internet tools such as telecommunication software and media conversion programs. 9. By training staff continuously, all libraries must prepare for the library of the future.

The Role of a Local Authority of Multi-Family Housing Management upon the Revision of Housing Act (주택법개정에 따른 공동주택관리영역에서의 지방자치단체의 역할)

  • 곽인숙
    • Journal of Families and Better Life / v.21 no.5 / pp.145-153 / 2003
  • In October 2002, the Ministry of Construction & Transportation revised the full text of 'The Act for Promoting Housing Construction', which concentrated on the quantitative supply of houses, into the 'Housing Act' in order to improve the quantity as well as the quality of housing construction and management, including housing welfare and the management or improvement of existing houses. Accordingly, local authorities need to play more critical roles in the area of multi-family housing management and remodeling. The desirable roles of local authorities are as follows: 1. Local authorities should provide professional knowledge for education, guidance and consultation on multi-family housing management rather than keeping their previous role of controlling, managing and regulating it. 2. Multi-family housing management should change from administration and punishment to incentive-centered institutions. 3. It is necessary to consider neglected people, such as occupants of rental apartments or of small multi-family housing, who have been excluded from obligatory management under the current law. 4. For harmonious and professional housing management, local authorities need to support the establishment of special companies for housing management and to strengthen audits of trust management companies. 5. Studies are needed on management guidelines for multi-family housing, standardization of management specifics, a reasonable standard for the special mending appropriation amount, etc. 6. Local authorities should lead residents in forming a harmonious community and support the encouragement of community consciousness for living together.

Neural network based approach for dissemination of field measurement information

  • Shin Hyu-Soung;Pande Gyan N.;Kim Chang-Yong;Bae Gyu-Jin;Hong Sung-Wan
    • 한국지구물리탐사학회:학술대회논문집 / 2003.11a / pp.176-183 / 2003
  • This paper presents a neural network based approach to disseminating information relating to experimental and field observations in engineering. Although the methodology is generic and can be applied to many areas of engineering science, attention is focussed here solely on geotechnical engineering applications. Field data relating to the settlement of foundations presented by Burland and Burbidge (1985), which led to their well-known equation for the calculation of settlement, now included in most textbooks, are revisited. A part of the data, chosen randomly, is used to train an Artificial Neural Network (ANN), which relates foundation settlement to various causes as identified by the authors. Predictions are made for situations for which data were not used in training. These indicate sufficient accuracy when compared to the original field data. Accuracy of predictions is further improved when all the data are included in the training set. The finally trained ANN is shown to represent these data more accurately than the Burland and Burbidge equation. Based on the above heuristic example, an ANN is presented as an alternative to developing equations and design rules in geotechnical engineering practice. Significant advantages are shown to arise from this methodology, the most important being the ease of updating the ANN as and when additional data become available. Loss of transparency, however, seems to be the main disadvantage.
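
The evaluation protocol in this abstract (train on a randomly chosen part of the records, predict the rest, compare against a closed-form equation) can be sketched as a small harness. The sketch does not reproduce the paper's network, the Burland and Burbidge equation, or any settlement data. The helper names `split_records` and `rmse`, the 70/30 split, and the choice of root-mean-square error are assumptions, and any predictor is passed in as a plain callable.

```python
import math
import random


def split_records(records, train_fraction=0.7, seed=0):
    """Randomly split (inputs, target) records into training and held-out parts."""
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def rmse(predict, records):
    """Root-mean-square error of a predictor over (inputs, target) records."""
    squared = sum((predict(x) - y) ** 2 for x, y in records)
    return math.sqrt(squared / len(records))


if __name__ == "__main__":
    # Synthetic placeholder records, NOT field data: target = 2 * input.
    records = [(x, 2 * x) for x in range(10)]
    train, held_out = split_records(records)
    # Two stand-in predictors; in practice one would be a trained network
    # and the other an empirical design equation.
    print(rmse(lambda x: 2 * x, held_out), rmse(lambda x: 2 * x + 1, held_out))
```

Comparing the two errors on the held-out records alone matches the abstract's first comparison; including all records in training, as the authors also did, no longer measures generalization.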

Adoption of Virtual Technology to the Development of a BIM based PMIS

  • Suh, Bong-Gyo;Lee, Ghang;Yun, Seok-Heon
    • Journal of the Korea Institute of Building Construction / v.13 no.4 / pp.333-340 / 2013
  • As construction projects become bigger, PMIS is being used as a project collaboration tool for project participants, owners, designers, inspectors and contractors. As the data type used in PMIS is usually text and most PMIS have no standard information classification system, there is a problem with data usability, such as the capacity for data search and analysis. BIM uses Objects and Properties, and this information might be used for relating with other construction information. As such, BIM technologies can be used with PMIS to enhance the data usability. The web environment is very convenient for multiple users, but the problem is that the data transfer speed is low for big files such as BIM model files. In this study, we suggested a Virtual Technology (VT) application to enhance the performance of BIM data exchange in PMIS, and tested and analyzed its efficiency when it is used to integrate BIM and PMIS in the web environment. The results of the study showed that VT can be used to enhance the efficiency of BIM data exchange in the web environment.

Text mining-based Data Preprocessing and Accident Type Analysis for Construction Accident Analysis (건설사고 분석을 위한 텍스트 마이닝 기반 데이터 전처리 및 사고유형 분석)

  • Yoon, Young Geun;Lee, Jae Yun;Oh, Tae Keun
    • Journal of the Korean Society of Safety / v.37 no.2 / pp.18-27 / 2022
  • Construction accidents are difficult to prevent because several different types of activities occur simultaneously. The current method of accident analysis only indicates the number of occurrences for one or two variables, and accidents have not decreased as a result of safety measures that focus solely on individual variables. Even if accident data are analyzed to establish appropriate safety measures, it is difficult to derive significant results because of the large number of data variables, elements, and qualitative records. In this study, in order to simplify the analysis and approach this complex problem logically, data preprocessing techniques such as latent class cluster analysis (LCCA) and predictor importance were used to discover the most influential variables. Finally, the correlation was analyzed using an alluvial flow diagram consisting of seven variables and fourteen elements based on accident data. The alluvial diagram analysis using reduced variables and elements enabled the classification of accident trends into four categories. The findings of this study demonstrate that complex and diverse construction accident data can yield relevant analysis results, assisting in the prevention of accidents.

A Review of Current Status and Applications of Korean Industrial Standards (KS) on the Floor Slip Resistance Testing (바닥의 미끄럼 시험에 관한 한국산업표준(KS) 현황 및 적용 실태)

  • Baik, Kwon-Hyuk;Ji, Suk-Won;Choi, Soo-Kyung
    • Proceedings of the Korean Institute of Building Construction Conference / 2021.05a / pp.19-20 / 2021
  • Although various laws and regulations have been put in place to prevent slips and falls, many accidents still occur. In this study, the root causes of the failure of slip and fall accidents to decrease were investigated. The Korean Industrial Standards (KS) contain five slip resistance test methods: KS F 2375:2016, KS F 2601:2020, KS F 2602:2016, KS L 1001:2020, and KS G 5821-1:2020. These test methods are cited in building certification standards (BF and G-SEED), construction specifications, and other documents that specify slip safety criteria. The investigation found a number of errors in KS regulations and legal text, errors in the manufacture and operation of slip testers, and errors in the use of measured values. These errors threaten public safety and disrupt industrial sites, and must be corrected immediately.
