• Title/Summary/Keyword: open information extraction

Search Result 107, Processing Time 0.02 seconds

Knowledge Mining from Many-valued Triadic Dataset based on Concept Hierarchy (개념계층구조를 기반으로 하는 다치 삼원 데이터집합의 지식 추출)

  • Suk-Hyung Hwang;Young-Ae Jung;Se-Woong Hwang
    • Journal of Platform Technology
    • /
    • v.12 no.3
    • /
    • pp.3-15
    • /
    • 2024
  • Knowledge mining is a research field that applies various techniques such as data modeling, information extraction, analysis, visualization, and result interpretation to find valuable knowledge from diverse large datasets. It plays a crucial role in transforming raw data into useful knowledge across various domains like business, healthcare, and scientific research etc. In this paper, we propose analytical techniques for performing knowledge discovery and data mining from various data by extending the Formal Concept Analysis method. It defines algorithms for representing diverse formats and structures of the data to be analyzed, including models such as many-valued data table data and triadic data table, as well as algorithms for data processing (dyadic scaling and flattening) and the construction of concept hierarchies and the extraction of association rules. The usefulness of the proposed technique is empirically demonstrated by conducting experiments applying the proposed method to public open data.

  • PDF

Analysis of Impact Between Data Analysis Performance and Database

  • Kyoungju Min;Jeongyun Cho;Manho Jung;Hyangbae Lee
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.3
    • /
    • pp.244-251
    • /
    • 2023
  • Engineering or humanities data are stored in databases and are often used for search services. While the latest deep-learning technologies, such like BART and BERT, are utilized for data analysis, humanities data still rely on traditional databases. Representative analysis methods include n-gram and lexical statistical extraction. However, when using a database, performance limitation is often imposed on the result calculations. This study presents an experimental process using MariaDB on a PC, which is easily accessible in a laboratory, to analyze the impact of the database on data analysis performance. The findings highlight the fact that the database becomes a bottleneck when analyzing large-scale text data, particularly over hundreds of thousands of records. To address this issue, a method was proposed to provide real-time humanities data analysis web services by leveraging the open source database, with a focus on the Seungjeongwon-Ilgy, one of the largest datasets in the humanities fields.

Development of a Blocks Recognition Application for Children's Education using a Smartphone Camera (스마트폰 카메라 기반 아동 교육용 산수 블록 인식 애플리케이션 개발)

  • Park, Sang-A;Oh, Ji-Won;Hong, In-Sik;Nam, Yunyoung
    • Journal of Internet Computing and Services
    • /
    • v.20 no.4
    • /
    • pp.29-38
    • /
    • 2019
  • Currently, information society is rapidly changing and demands innovation and creativity in various fields. Therefore, the importance of mathematics, which can be the basis of creativity and logic, is emphasized. The purpose of this paper is to develop a math education application that can further expand the logical thinking of mathematics and allow voluntary learning to occur through the use of readily available teaching aid for children to form motivation and interest in learning. This paper provides math education applications using a smartphone and blocks for children. The main function of the application is to shoot with the camera and show the calculated values. When a child uses a block to make a formula and shoots a block using a camera, you can directly see the calculated value of your formula. The preprocessing process, text extraction, and character recognition of the photographed images have been implemented using OpenCV libraries and Tesseract-OCR libraries.

Schedule Optimization in Resource Leveling through Open BIM Based Computer Simulations

  • Kim, Hyun-Joo
    • Journal of KIBIM
    • /
    • v.9 no.2
    • /
    • pp.1-10
    • /
    • 2019
  • In this research, schedule optimization is defined as balancing the number of workers while keeping the demand and needs of the project resources, creating the perfect schedule for each activity. Therefore, when one optimizes a schedule, multiple potentials of schedule changes are assessed to get an instant view of changes that avoid any over and under staffing while maximizing productivity levels for the available labor cost. Optimizing the number of workers in the scheduling process is not a simple task since it usually involves many different factors to be considered such as the development of quantity take-offs, cost estimating, scheduling, direct/indirect costs, and borrowing costs in cash flow while each factor affecting the others simultaneously. That is why the optimization process usually requires complex computational simulations/modeling. This research attempts to find an optimal selection of daily maximum workers in a project while considering the impacts of other factors at the same time through OPEN BIM based multiple computer simulations in resource leveling. This paper integrates several different processes such as quantity take-offs, cost estimating, and scheduling processes through computer aided simulations and prediction in generating/comparing different outcomes of each process. To achieve interoperability among different simulation processes, this research utilized data exchanges supported by building SMART-IFC effort in automating the data extraction and retrieval. Numerous computer simulations were run, which included necessary aspects of construction scheduling, to produce sufficient alternatives for a given project.

A Video Expression Recognition Method Based on Multi-mode Convolution Neural Network and Multiplicative Feature Fusion

  • Ren, Qun
    • Journal of Information Processing Systems
    • /
    • v.17 no.3
    • /
    • pp.556-570
    • /
    • 2021
  • The existing video expression recognition methods mainly focus on the spatial feature extraction of video expression images, but tend to ignore the dynamic features of video sequences. To solve this problem, a multi-mode convolution neural network method is proposed to effectively improve the performance of facial expression recognition in video. Firstly, OpenFace 2.0 is used to detect face images in video, and two deep convolution neural networks are used to extract spatiotemporal expression features. Furthermore, spatial convolution neural network is used to extract the spatial information features of each static expression image, and the dynamic information feature is extracted from the optical flow information of multiple expression images based on temporal convolution neural network. Then, the spatiotemporal features learned by the two deep convolution neural networks are fused by multiplication. Finally, the fused features are input into support vector machine to realize the facial expression classification. Experimental results show that the recognition accuracy of the proposed method can reach 64.57% and 60.89%, respectively on RML and Baum-ls datasets. It is better than that of other contrast methods.

A Study on an Open/Closed Eye Detection Algorithm for Drowsy Driver Detection (운전자 졸음 검출을 위한 눈 개폐 검출 알고리즘 연구)

  • Kim, TaeHyeong;Lim, Woong;Sim, Donggyu
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.7
    • /
    • pp.67-77
    • /
    • 2016
  • In this paper, we propose an algorithm for open/closed eye detection based on modified Hausdorff distance. The proposed algorithm consists of two parts, face detection and open/closed eye detection parts. To detect faces in an image, MCT (Modified Census Transform) is employed based on characteristics of the local structure which uses relative pixel values in the area with fixed size. Then, the coordinates of eyes are found and open/closed eyes are detected using MHD (Modified Hausdorff Distance) in the detected face region. Firstly, face detection process creates an MCT image in terms of various face images and extract criteria features by PCA(Principle Component Analysis) on offline. After extraction of criteria features, it detects a face region via the process which compares features newly extracted from the input face image and criteria features by using Euclidean distance. Afterward, the process finds out the coordinates of eyes and detects open/closed eye using template matching based on MHD in each eye region. In performance evaluation, the proposed algorithm achieved 94.04% accuracy in average for open/closed eye detection in terms of test video sequences of gray scale with 30FPS/$320{\times}180$ resolution.

The Geometric Modeling for 3D Information of X-ray Inspection (스테레오 X-선 검색장치를 이용한 3차원 정보 가시화에 관한 연구)

  • Hwang, Young-Gwan;Lee, Seung-Min;Park, Jong-Won
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.63 no.1
    • /
    • pp.145-149
    • /
    • 2014
  • In this study, using X-ray cargo container scanning device and to differentiate the concept of three-dimensional information extraction applied for X-ray scanning device as an ingredient in the rotation of the X-Ray Linear Pushbroom Stereo System by introducing the geometric How to model was introduced. Three-dimensional information obtained through the matching of a single voxel space filled with a random vector operations for each voxel in the three dimensional shape reconstruction algorithm using the definition, and in time, the time required for each step were analyzed. Using OpenCV in each step by applying parallelization techniques approximately 1.8 times improvement in the processing time of the check, but do not meet the target within one minute levels. The other hand, X-ray images by the primary process to convert the point View the results of real-time stereo through a three-dimensional could feel the comfort level.

Automatic Building Extraction Using SpaceNet Building Dataset and Context-based ResU-Net (SpaceNet 건물 데이터셋과 Context-based ResU-Net을 이용한 건물 자동 추출)

  • Yoo, Suhong;Kim, Cheol Hwan;Kwon, Youngmok;Choi, Wonjun;Sohn, Hong-Gyoo
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_2
    • /
    • pp.685-694
    • /
    • 2022
  • Building information is essential for various urban spatial analyses. For this reason, continuous building monitoring is required, but it is a subject with many practical difficulties. To this end, research is being conducted to extract buildings from satellite images that can be continuously observed over a wide area. Recently, deep learning-based semantic segmentation techniques have been used. In this study, a part of the structure of the context-based ResU-Net was modified, and training was conducted to automatically extract a building from a 30 cm Worldview-3 RGB image using SpaceNet's building v2 free open data. As a result of the classification accuracy evaluation, the f1-score, which was higher than the classification accuracy of the 2nd SpaceNet competition winners. Therefore, if Worldview-3 satellite imagery can be continuously provided, it will be possible to use the building extraction results of this study to generate an automatic model of building around the world.

Extension of the IFC Schema for Road Subsidiary Facility (도로 부대시설 수용을 위한 IFC 스키마 확장 개발)

  • Cho, Geun-Ha;Ju, Ki-Beom
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.12
    • /
    • pp.7385-7392
    • /
    • 2014
  • Extension IFC schema of subsidiary facilities were developed for the purpose of establishing an information model standard for roads. The IFC entities, types and properties for subsidiary facilities were defined through an analysis of the road design documents for the extraction physical component and design information. The converter and viewer for applying the new schema were then developed. Subsidiary facilities BIM models were converted to new IFC models to verify the research results. Standard BIM-based delivery and verification systems are enabled by using a standard model converted by new schema. Furthermore, it can establish an open BIM environment using an IFC over the entire life cycle of the civil engineering project.

DEVELOPMENT OF AUGMENTED 3D STEREO URBAN CITY MODELLING SYSTEM BASED ON ANAGLYPH APPROACH

  • Kim, Hak-Hoon;Kim, Seung-Yub;Lee, Ki-Won
    • Proceedings of the KSRS Conference
    • /
    • v.1
    • /
    • pp.98-101
    • /
    • 2006
  • In general, stereo images are widely used to remote sensing or photogrametric applications for the purpose of image understanding and feature extraction or cognition. However, the most cases of these stereo-based application deal with 2-D satellite images or the airborne photos so that its main targets are generation of small-scaled or large-scaled DEM(Digital Elevation Model) or DSM(Digital Surface Model), in the 2.5-D. Contrast to these previous approaches, the scope of this study is to investigate 3-D stereo processing and visualization of true geo-referenced 3-D features based on anaglyph technique, and the aim is at the prototype development for stereo visualization system of complex typed 3-D GIS features. As for complex typed 3-D features, the various kinds of urban landscape components are taken into account with their geometric characteristics and attributes. The main functions in this prototype are composed of 3-D feature authoring and modeling along with database schema, stereo matching, and volumetric visualization. Using these functions, several technical aspects for migration into actual 3-D GIS application are provided with experiment results. It is concluded that this result will contribute to more specialized and realistic applications by linking 3-D graphics with geo-spatial information.

  • PDF