• Title/Summary/Keyword: Segmentation process

Shot Boundary Detection of Video Sequence Using Hierarchical Hidden Markov Models (계층적 은닉 마코프 모델을 이용한 비디오 시퀀스의 셧 경계 검출)

  • Park, Jong-Hyun;Cho, Wan-Hyun;Park, Soon-Young
    • The Journal of Korean Institute of Communications and Information Sciences / v.27 no.8A / pp.786-795 / 2002
  • In this paper, we present a histogram- and moment-based video scene change detection technique using hierarchical Hidden Markov Models (HMMs). The proposed method extracts histograms from the low-frequency subband and moments of edge components from the high-frequency subbands of wavelet-transformed images. Each HMM is then trained on the histogram difference and the directional moment difference, respectively, extracted from manually labeled video. The video segmentation process consists of two steps. First, a histogram-based HMM segments the input video sequence into three categories: shot, cut, and gradual scene change. In the second stage, a moment-based HMM further separates the gradual changes into fades and dissolves. The experimental results show that the proposed technique partitions video frames more effectively than previous threshold-based methods.
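
The histogram-difference feature mentioned in the abstract can be illustrated with a minimal sketch. It assumes frames are plain lists of 8-bit gray values and uses an L1 distance between normalized histograms; the wavelet decomposition and the HMM training described in the paper are not reproduced, and all function names are hypothetical.

```python
def gray_histogram(frame, bins=8, max_value=256):
    """Normalized intensity histogram of a frame given as a list of pixel rows."""
    counts = [0] * bins
    total = 0
    for row in frame:
        for pixel in row:
            # Map the gray value to one of `bins` equal-width bins.
            counts[pixel * bins // max_value] += 1
            total += 1
    return [c / total for c in counts]


def histogram_difference(frame_a, frame_b, bins=8):
    """L1 distance between the normalized histograms of two frames (0 to 2)."""
    hist_a = gray_histogram(frame_a, bins)
    hist_b = gray_histogram(frame_b, bins)
    return sum(abs(a - b) for a, b in zip(hist_a, hist_b))


def difference_sequence(frames, bins=8):
    """Histogram difference for each pair of consecutive frames."""
    return [histogram_difference(frames[i], frames[i + 1], bins)
            for i in range(len(frames) - 1)]
```

A sequence such as this one would be the kind of observation fed to a classifier: values near zero suggest frames within one shot, while a large value suggests an abrupt change.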

Design and Implementation of a Road Congestion Analysis System using Regional Information (영역정보를 이용한 교통 혼잡도 측정 시스템의 설계 및 구현)

  • Choe, Byeong-Geol;Jeong, Seong-Il;An, Cheol-Ung;Kim, Seung-Ho
    • Journal of KIISE:Computing Practices and Letters / v.5 no.6 / pp.748-757 / 1999
  • In this paper, we design and implement an efficient road congestion analysis system using regional information. To extract vehicle regions from a road image, the system processes the image in five steps: segmentation, small-region elimination, region rectangularization, region merging, and region deletion. First, we segment the road image by a threshold value. Then we eliminate small regions that do not contribute to vehicle region extraction and rectangularize the remaining regions. Finally, we extract the vehicle regions of each lane by region merging and deletion. This method has the advantage of measuring road congestion from the road image alone, without additional information such as background images. However, it is applicable only to road images without shadows.
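
The first three steps (threshold segmentation, small-region elimination, rectangularization) can be sketched as follows. This is an illustrative sketch only: it assumes a gray image stored as a list of rows, treats pixels at or above the threshold as foreground, and uses 4-connected flood fill to form regions. The per-lane merging and deletion steps are omitted, and the function name and parameters are hypothetical.

```python
from collections import deque


def extract_regions(image, threshold, min_area):
    """Return bounding rectangles (top, left, bottom, right) of bright regions.

    image: list of rows of gray values; pixels >= threshold are foreground.
    Regions with fewer than min_area pixels are discarded.
    """
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    rectangles = []
    for r in range(height):
        for c in range(width):
            if seen[r][c] or image[r][c] < threshold:
                continue
            # Breadth-first flood fill over 4-connected foreground pixels.
            queue = deque([(r, c)])
            seen[r][c] = True
            area = 0
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx]
                            and image[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Small-region elimination, then rectangularization.
            if area >= min_area:
                rectangles.append((top, left, bottom, right))
    return rectangles
```

Raising `min_area` removes isolated noise pixels before any later merging step sees them.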

Face Recognition Using Histograms of Multi-resolution Segments Based on Discriminant Face Descriptor (판별 얼굴 기술자 기반의 다중 해상도 분할 영역 히스토그램을 이용한 얼굴인식 방법)

  • Lee, Jang-yoon;Lee, Yonggeol;Choi, Sang-Il
    • Journal of the Institute of Electronics and Information Engineers / v.53 no.2 / pp.97-105 / 2016
  • We propose a face recognition method using the histograms of multi-resolution segments in order to effectively utilize the local information of faces. Since variations in faces can occur at various sizes, the Discriminant Face Descriptor (DFD) method, which uses histograms from sub-regions of the same size, is not effective at capturing local information of faces. In this paper, we first divide an image into several sub-regions and extract the DFD from each sub-region. By dividing each sub-region into several segments at multiple resolutions and extracting a histogram for each segment, we reduce the loss of local information in the recognition process. The experimental results on the Yale B, AR, and CAS-PEAL-R1 databases show that the proposed method improves recognition performance compared to the existing DFD-based method.
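
One way to picture histograms over multi-resolution segments is the sketch below. It assumes a sub-region is already encoded as a 2D grid of integer descriptor codes and that "multi-resolution" means splitting it into g x g grids for several values of g; the abstract does not specify the actual partitioning scheme or the DFD encoding, so both are assumptions, and the function names are hypothetical.

```python
def segment_histogram(codes, top, left, height, width, num_codes):
    """Count descriptor codes inside one rectangular segment."""
    hist = [0] * num_codes
    for r in range(top, top + height):
        for c in range(left, left + width):
            hist[codes[r][c]] += 1
    return hist


def multi_resolution_histograms(codes, num_codes, grids=(1, 2)):
    """Concatenate segment histograms over a g x g grid for each g in grids.

    Rows and columns left over when the size is not divisible by g are ignored.
    """
    rows, cols = len(codes), len(codes[0])
    feature = []
    for g in grids:
        seg_h, seg_w = rows // g, cols // g
        for i in range(g):
            for j in range(g):
                feature.extend(segment_histogram(
                    codes, i * seg_h, j * seg_w, seg_h, seg_w, num_codes))
    return feature
```

The coarse grid keeps the overall code distribution, while finer grids record where in the sub-region each code occurs.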

Feature Extraction Using Trace Transform for Insect Footprint Recognition (곤충 발자국 패턴 인식을 위한 Trace Transform 기반의 특징값 추출)

  • Shin, Bok-Suk;Cho, Kyoung-Won;Cha, Eui-Young
    • Journal of the Korea Institute of Information and Communication Engineering / v.12 no.6 / pp.1095-1100 / 2008
  • In insect footprint recognition, footprint segments, the basic areas for recognition, need to be extracted from scanned insect footprints, and appropriate features must be found in those segments to discriminate between kinds of insects, because the characteristics of the features are important for classification. In this paper, we propose methods for automatic footprint segmentation and feature extraction. We use the Trace transform to find appropriate features in the extracted segments. The Trace transform builds a new type of data structure from the segmented images by applying functions along parallel trace lines, and this data structure is invariant to translation, rotation, and reflection of images. The data structure is converted to Triple features by Diametric and Circus functions, and the Triple features are used to discriminate patterns of insect footprints. We show that the Triple features found by the proposed methods are sufficiently distinguishable and appropriate for classifying kinds of insects.

Object Detection and Post-processing of LNGC CCS Scaffolding System using 3D Point Cloud Based on Deep Learning (딥러닝 기반 LNGC 화물창 스캐닝 점군 데이터의 비계 시스템 객체 탐지 및 후처리)

  • Lee, Dong-Kun;Ji, Seung-Hwan;Park, Bon-Yeong
    • Journal of the Society of Naval Architects of Korea / v.58 no.5 / pp.303-313 / 2021
  • Recently, quality control of the Liquefied Natural Gas Carrier (LNGC) cargo hold and block-erection interference areas using 3D scanners has been performed, centered on large shipyards and the international association of classification societies. In this study, as part of research on advancing LNGC cargo hold quality management, deep-learning-based object detection and post-processing of a scaffolding system were conducted using an LNGC cargo hold 3D point cloud. The scaffolding system object detection is based on the PointNet deep learning architecture, which detects objects using point clouds, and achieves 70% prediction accuracy. In addition, the possibility of improving object detection accuracy through parameter adjustment is confirmed, and the Intersection over Union (IoU) criterion, an index for determining whether two objects are the same, is met. To avoid manual post-processing work, the object detection architecture performs the task automatically and can achieve stable prediction accuracy through supplementation and improvement of the learning data. In the future, an improved study will address not only the flat surfaces of the LNGC cargo hold but also complex systems such as curved surfaces, and the results are expected to be applicable to monitoring the process progress automation rate and to ship quality control.
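
For reference, Intersection over Union for two 3D boxes can be computed as in the sketch below. It assumes axis-aligned boxes given as (xmin, ymin, zmin, xmax, ymax, zmax); the abstract does not state the box representation or the IoU threshold used in the study, so these are assumptions, and the function name is hypothetical.

```python
def iou_3d(box_a, box_b):
    """IoU of two axis-aligned boxes (xmin, ymin, zmin, xmax, ymax, zmax)."""
    intersection = 1.0
    for axis in range(3):
        low = max(box_a[axis], box_b[axis])
        high = min(box_a[axis + 3], box_b[axis + 3])
        if high <= low:
            # No overlap along this axis means no overlap at all.
            return 0.0
        intersection *= high - low

    def volume(box):
        return (box[3] - box[0]) * (box[4] - box[1]) * (box[5] - box[2])

    union = volume(box_a) + volume(box_b) - intersection
    return intersection / union
```

A predicted box is typically counted as matching a ground-truth box when this value exceeds a chosen threshold.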

Jacking Force and Camber for Precast Concrete Slab Reinforcing (프리캐스트 콘크리트 슬래브 보강을 위한 잭킹력과 솟음)

  • Lho, Byeong-Cheol
    • Journal of the Korea institute for structural maintenance and inspection / v.25 no.2 / pp.43-48 / 2021
  • Precast concrete can be used to shorten the construction period and enhance constructability. However, structural problems can occur when boundary conditions are applied incorrectly or structural behavior is misunderstood in the process of segmenting the original structural system. We observed serious deflections and cracks caused by increased bending moment and creep after the construction of a precast concrete slab, and found that they resulted from a misunderstanding of the support conditions and structural behavior of the precast slab panel. Two support columns are inserted under the precast slab to reduce the bending moment, and the camber produced by the jacking force should be estimated to ensure structural safety during the reinforcing work. Proper support conditions and the flexural stiffness of the precast concrete slab were applied to check the deflection and cracking of the existing structure by inverse analysis. On this basis, we estimate the camber according to the jacking force of the precast concrete slab and suggest a method for making the structure safe.

Research on Human Posture Recognition System Based on The Object Detection Dataset (객체 감지 데이터 셋 기반 인체 자세 인식시스템 연구)

  • Liu, Yan;Li, Lai-Cun;Lu, Jing-Xuan;Xu, Meng;Jeong, Yang-Kwon
    • The Journal of the Korea institute of electronic communication sciences / v.17 no.1 / pp.111-118 / 2022
  • In computer vision research, two-dimensional human pose is a broad research direction that is especially important for pose tracking and behavior recognition. Acquiring human pose targets, which is essentially the problem of accurately identifying human targets in pictures, has been a hot research topic in recent years. Human pose recognition is used in artificial intelligence on the one hand and in daily life on the other. The quality of pose recognition is mainly determined by the success rate and accuracy of the recognition process, which is why the recognition rate matters. In this work, the human body is divided into 17 key points for labeling, and the key points are further segmented to ensure the accuracy of the labeling information. In the recognition design, the comprehensive MS COCO data set is used to train a deep neural network model on a large number of samples, progressing from simple step-by-step training to efficient training, so that a good accuracy rate can be obtained.

Pedestrian and Vehicle Distance Estimation Based on Hard Parameter Sharing (하드 파라미터 쉐어링 기반의 보행자 및 운송 수단 거리 추정)

  • Seo, Ji-Won;Cha, Eui-Young
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.3 / pp.389-395 / 2022
  • Owing to improvements in deep learning techniques, deep learning for computer vision tasks such as classification, detection, and segmentation has been widely used in many fields. In particular, automatic driving is one of the major fields that applies computer vision systems. There is also much research on combining multiple tasks in a single network. In this study, we propose a network that predicts the individual depth of pedestrians and vehicles. The proposed model is built on YOLOv3 for object detection and Monodepth for depth estimation, and it performs object detection and depth estimation consecutively using an encoder and decoder based on hard parameter sharing. We also use an attention module to improve the accuracy of both object detection and depth estimation. Depth is predicted from a monocular image, and the model is trained with a self-supervised training method.

Hot Keyword Extraction of Sci-tech Periodicals Based on the Improved BERT Model

  • Liu, Bing;Lv, Zhijun;Zhu, Nan;Chang, Dongyu;Lu, Mengxin
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.6 / pp.1800-1817 / 2022
  • With the development of the economy and the improvement of living standards, hot issues in a subject area have become a main research direction, but mining them currently faces problems such as large data volumes and complex algorithm structures. In response, this study proposes a method for extracting hot keywords from scientific journals based on an improved BERT model, which can also serve as a reference for researchers. The method improves the overall similarity measure of the ensemble by introducing compound keyword word density and combining word segmentation, word sense set distance, and density clustering to construct an improved BERT (I-BERT) framework, on which a composite keyword heat analysis model is established. Taking the 14420 articles published in 21 social science management periodicals collected by CNKI (China National Knowledge Infrastructure) in 2017-2019 as the experimental data, the superiority of the proposed method is verified in terms of word spacing, class spacing, and the extraction accuracy and recall of hot keywords. The experiments show that the proposed method extracts hot keywords with higher accuracy than other methods, which helps scientific journals capture hot topics in the discipline in a timely and accurate manner and, ultimately, use information technology to keep track of popular keywords.

Class Imbalance Resolution Method and Classification Algorithm Suggesting Based on Dataset Type Segmentation (데이터셋 유형 분류를 통한 클래스 불균형 해소 방법 및 분류 알고리즘 추천)

  • Kim, Jeonghun;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems / v.28 no.3 / pp.23-43 / 2022
  • As AI (Artificial Intelligence) is applied in various industries, interest in algorithm selection is increasing. Algorithm selection is largely determined by the experience of a data scientist. For an inexperienced data scientist, an algorithm is instead selected through meta-learning based on dataset characteristics. However, because this selection process is a black box, it has not been possible to know on what basis an algorithm recommendation was derived. Accordingly, this study uses k-means cluster analysis to classify datasets into types according to their characteristics and explores suitable classification algorithms and methods for resolving class imbalance for each type. As a result, four types were derived, and an appropriate class imbalance resolution method and classification algorithm were recommended for each dataset type.
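
The clustering step can be illustrated with a minimal k-means (Lloyd's algorithm) sketch. It assumes each dataset is described by a small numeric vector of characteristics; which characteristics the study actually used is not stated in the abstract, so the toy values in the usage note are invented for illustration, and the deterministic initialization from the first k points is a simplification.

```python
def kmeans(points, k, iterations=20):
    """Lloyd's algorithm; initial centroids are the first k points."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2
                                  for a, b in zip(p, centroids[j])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members)
                                for col in zip(*members)]
    return labels, centroids
```

For example, `kmeans([(0.0, 0.0), (10.0, 10.0), (0.0, 1.0), (10.0, 11.0)], 2)` groups the first and third vectors into one type and the second and fourth into another. The study itself reports four types, which would correspond to `k=4` on its own feature vectors.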