• Title/Summary/Keyword: Adaptive feature extraction

Search Result 92, Processing Time 0.028 seconds

Development of surface defect inspection algorithms for cold mill strip (냉연 표면흠 검사 알고리듬 개발에 관한 연구)

  • Kim, Kyoung-Min;Park, Gwi-Tae;Park, Joong-Jo;Lee, Jong-Hak;Jung, Jin-Yang;Lee, Joo-Kang
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.3 no.2
    • /
    • pp.179-186
    • /
    • 1997
  • In this paper we suggest a development of surface defect inspection algorithms for cold mill strip. The defects which exist in a surface of cold mill strip have a scattering or singular distribution. This paper consists of preprocessing, feature extraction and defect classification. By preprocessing, the binarized defect image is achieved. In this procedure, Top-hit transform, adaptive thresholding, thinning and noise rejection are used. Especially, Top-hit transform using local min/max operation diminishes the effect of bad lighting. In feature extraction, geometric, moment and co-occurrence matrix features are calculated. For the defect classification, multilayer neural network is used. The proposed algorithm showed 15% error rate.

  • PDF

Three-dimensional Head Tracking Using Adaptive Local Binary Pattern in Depth Images

  • Kim, Joongrock;Yoon, Changyong
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.16 no.2
    • /
    • pp.131-139
    • /
    • 2016
  • Recognition of human motions has become a main area of computer vision due to its potential human-computer interface (HCI) and surveillance. Among those existing recognition techniques for human motions, head detection and tracking is basis for all human motion recognitions. Various approaches have been tried to detect and trace the position of human head in two-dimensional (2D) images precisely. However, it is still a challenging problem because the human appearance is too changeable by pose, and images are affected by illumination change. To enhance the performance of head detection and tracking, the real-time three-dimensional (3D) data acquisition sensors such as time-of-flight and Kinect depth sensor are recently used. In this paper, we propose an effective feature extraction method, called adaptive local binary pattern (ALBP), for depth image based applications. Contrasting to well-known conventional local binary pattern (LBP), the proposed ALBP cannot only extract shape information without texture in depth images, but also is invariant distance change in range images. We apply the proposed ALBP for head detection and tracking in depth images to show its effectiveness and its usefulness.

Photomosaic Algorithm with Adaptive Tilting and Block Matching (적응적 타일링 및 블록 매칭을 통한 포토 모자이크 알고리즘)

  • Seo, Sung-Jin;Kim, Ki-Wong;Kim, Sun-Myeng;Lee, Hae-Yeoun
    • The KIPS Transactions:PartB
    • /
    • v.19B no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Mosaic is to make a big image by gathering lots of small materials having various colors. With advance of digital imaging techniques, photomosaic techniques using photos are widely used. In this paper, we presents an automatic photomosaic algorithm based on adaptive tiling and block matching. The proposed algorithm is composed of two processes: photo database generation and photomosaic generation. Photo database is a set of photos (or tiles) used for mosaic, where a tile is divided into $4{\times}4$ regions and the average RGB value of each region is the feature of the tile. Photomosaic generation is composed of 4 steps: feature extraction, adaptive tiling, block matching, and intensity adjustment. In feature extraction, the feature of each block is calculated after the image is splitted into the preset size of blocks. In adaptive tiling, the blocks having similar similarities are merged. Then, the blocks are compared with tiles in photo database by comparing euclidean distance as a similarity measure in block matching. Finally, in intensity adjustment, the intensity of the matched tile is replaced as that of the block to increase the similarity between the tile and the block. Also, a tile redundancy minimization scheme of adjacent blocks is applied to enhance the quality of mosaic photos. In comparison with Andrea mosaic software, the proposed algorithm outperforms in quantitative and qualitative analysis.

Adaptive Digital Watermarking with Perceptually Tuned Characteristic Based on Wavelet Transform (웨이브릿 변환영역에서 지각적 동조특성을 갖는 적응적 디지털 워터마킹)

  • 김현천;장봉주;서용수;김종진
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.6
    • /
    • pp.1008-1014
    • /
    • 2003
  • In this paper, we propose the image retrieval method based on object regions using bidirectional round filter in the wavelet transform domain. A conventional method that includes unnecessary background information reduce retrieval efficiency, because of the extraction of feature vectors from the whole region of subband. On proposed method, it extracts accurate feature vectors and keep certainly retrieval efficiency in case of reduced feature vectors, because of the extraction of feature vectors from the only extracted object region. Furthermore, it improve retrieval efficiency by removing unnecessary background information. Consequently, the retrieval efficiency is improved with 2.5%∼5.5% values, which have a little chances to vary according to characteristics of image.

  • PDF

A Novel Two-Stage Training Method for Unbiased Scene Graph Generation via Distribution Alignment

  • Dongdong Jia;Meili Zhou;Wei WEI;Dong Wang;Zongwen Bai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.12
    • /
    • pp.3383-3397
    • /
    • 2023
  • Scene graphs serve as semantic abstractions of images and play a crucial role in enhancing visual comprehension and reasoning. However, the performance of Scene Graph Generation is often compromised when working with biased data in real-world situations. While many existing systems focus on a single stage of learning for both feature extraction and classification, some employ Class-Balancing strategies, such as Re-weighting, Data Resampling, and Transfer Learning from head to tail. In this paper, we propose a novel approach that decouples the feature extraction and classification phases of the scene graph generation process. For feature extraction, we leverage a transformer-based architecture and design an adaptive calibration function specifically for predicate classification. This function enables us to dynamically adjust the classification scores for each predicate category. Additionally, we introduce a Distribution Alignment technique that effectively balances the class distribution after the feature extraction phase reaches a stable state, thereby facilitating the retraining of the classification head. Importantly, our Distribution Alignment strategy is model-independent and does not require additional supervision, making it applicable to a wide range of SGG models. Using the scene graph diagnostic toolkit on Visual Genome and several popular models, we achieved significant improvements over the previous state-of-the-art methods with our model. Compared to the TDE model, our model improved mR@100 by 70.5% for PredCls, by 84.0% for SGCls, and by 97.6% for SGDet tasks.

A New Tempo Feature Extraction Based on Modulation Spectrum Analysis for Music Information Retrieval Tasks

  • Kim, Hyoung-Gook
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.6 no.2
    • /
    • pp.95-106
    • /
    • 2007
  • This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by the narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In implementation, the tempo feature is directly extracted from the modified discrete cosine transform coefficients, which is the output of partial MP3(MPEG 1 Layer 3) decoder. Then, different features are extracted from the amplitudes of modulation spectrum and applied to different music information retrieval tasks. The logarithmic scale modulation frequency coefficients are employed in automatic music emotion classification and music genre classification. The classification precision in both systems is improved significantly. The bit vectors derived from adaptive modulation spectrum is used in audio fingerprinting task That is proved to be able to achieve high robustness in this application. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.

  • PDF

A Study on Implementation of the High Speed Feature Extraction System Based on Block Type Classification (블록 유형 분류 알고리즘 기반 고속 특징추출 시스템 구현에 관한 연구)

  • Lee, Juseong;An, Ho-Myoung
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.12 no.3
    • /
    • pp.186-191
    • /
    • 2019
  • In this paper, we propose a implementation approach of the high-speed feature extraction algorithm. The proposed method is based on the block type classification algorithm which reduces the computation time when target macro block is divided to smooth block type that has no image features. It is quantitatively identified that occurs at 29.5% of the total image using 200 standard test images with $64{\times}64$ macro block size. This means that within a standard test image containing various image information, 29.5% can reduce the complexity of the operation. When the proposed approach is applied to the Canny edge detection, the required latency of the edge detection can be completely eliminated, such as 2D derivative filter, gradient magnitude/direction computation, non-maximal suppression, adaptive threshold calculation, hysteresis thresholding. Also, it is expected that operation time of the feature detection can be reduced by applying block type classification algorithm to various feature extraction algorithms in this way.

Classification of the PVC Using The Fuzzy-ART Network Based on Wavelet Coefficient (웨이브렛 계수에 근거한 Fuzzy-ART 네트워크를 이용한 PVC 분류)

  • Park, K. L;Lee, K. J.;lee, Y. S.;Yoon, H. R.
    • Journal of Biomedical Engineering Research
    • /
    • v.20 no.4
    • /
    • pp.435-442
    • /
    • 1999
  • A fuzzy-ART(adaptive resonance theory) network for the PVC(premature ventricular contraction) classification using wavelet coefficient is designed. This network consists of the feature extraction and learning of the fuzzy-ART network. In the first step, we have detected the QRS from the ECG signal in order to set the threshold range for feature extraction and the detected QRS was divided into several frequency bands by wavelet transformation using Haar wavelet. Among the low-frequency bands, only the 6th coefficient(D6) are selected as the input feature. After that, the fuzzy-ART network for classification of the PVC is learned by using input feature which comprises of binary data converted by applying threshold to D6. The MIT/BIH database including the PVC is used for the evaluation. The designed fuzzy-ART network showed the PVC classification ratio of 96.52%.

  • PDF

A Hardware Implementation of Pyramidal KLT Feature Tracker (계층적 KLT 특징 추적기의 하드웨어 구현)

  • Kim, Hyun-Jin;Kim, Gyeong-Hwan
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.2
    • /
    • pp.57-64
    • /
    • 2009
  • This paper presents the hardware implementation of the pyramidal KLT(Kanade-Lucas-Tomasi) feature tracker. Because of its high computational complexity, it is not easy to implement a real-time KLT feature tracker using general-purpose processors. A hardware implementation of the pyramidal KLT feature tracker using FPGA(Field Programmable Gate Array) is described in this paper with emphasis on 1) adaptive adjustment of threshold in feature extraction under diverse lighting conditions, and 2) modification of the tracking algorithm to accomodate parallel processing and to overcome memory constraints such as capacity and bandwidth limitation. The effectiveness of the implementation was evaluated over ones produced by its software implementation. The throughput of the FPGA-based tracker was 30 frames/sec for video images with size of $720{\times}480$.

Feature Point Extraction of Sea Urchin using Adaptive Edge Detection (적응적 경계 검출을 이용한 성게의 특징점 추출)

  • Jeon, Young-Cheol;Woo, Young-Bae;Choi, Chul-Jae
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.12 no.1
    • /
    • pp.173-180
    • /
    • 2017
  • The albinism phenomenon of the East Sea is now in progress, and the area of the intensified bedrock has reached up to 61.7% of the whole bedrock area of the East Sea. The methods to eradicate the sea urchin that is known as the main cause of albinism and that influences huge damage on the selfish farm have been continuously studied but they have focused on the food using the sea urchin and recycling and the research on the recognition of the sea urchin has not been performed yet. Therefore, this study suggested the adaptive boundary detection to extract the characteristics of the sea urchin in order to catch the sea urchin in quantity that is the pirate of the sea, and it is believed to help the sea urchin recognition program a lot in the future.