• Title/Summary/Keyword: transform domain processing

Search Result 233, Processing Time 0.021 seconds

Lofargram analysis and identification of ship noise based on Hough transform and convolutional neural network model (허프 변환과 convolutional neural network 모델 기반 선박 소음의 로파그램 분석 및 식별)

  • Junbeom Cho;Yonghoon Ha
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.1
    • /
    • pp.19-28
    • /
    • 2024
  • This paper proposes a method to improve the performance of ship identification through lofargram analysis of ship noise by applying the Hough Transform to a Convolutional Neural Network (CNN) model. When processing the signals received by a passive sonar, the time-frequency domain representation known as lofargram is generated. The machinery noise radiated by ships appears as tonal signals on the lofargram, and the class of the ship can be specified by analyzing it. However, analyzing lofargram is a specialized and time-consuming task performed by well-trained analysts. Additionally, the analysis for target identification is very challenging because the lofargram also displays various background noises due to the characteristics of the underwater environment. To address this issue, the Hough Transform is applied to the lofargram to add lines, thereby emphasizing the tonal signals. As a result of identification using CNN models on both the original lofargrams and the lofargrams with Hough transform, it is shown that the application of the Hough transform improves lofargram identification performance, as indicated by increased accuracy and macro F1 scores for three different CNN models.

An Adaptive Image Enhancement of the DCT Compressed Image using the Spatial Frequency Property (공간주파수 특성을 이용한 DCT 압축영상의 적응 영상 향상)

  • Jeon, Seon-Dong;Kim, Sang-Hee
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.2
    • /
    • pp.104-111
    • /
    • 2010
  • This paper presents an adaptive image enhancement method using the spatial frequency property in the DCT(discrete cosine transform) compressed domain. The dc coefficients, the illumination components of image, are adjusted to compress the dynamic range of image, and the ac coefficients are modified to enhance the contrast by using the human visual system(HVS) and the spatial frequency property. The ac coefficients are separated into vertical direction, horizontal direction, and mixed spatial frequency components, and adaptively modified to minimize the block artifacts that possibly occur in the image enhancement. The proposed method using dynamic range compression and adaptive contrast enhancement shows the advanced performance without the block artifact compared with existing method.

Power Allocation Method of Downlink Non-orthogonal Multiple Access System Based on α Fair Utility Function

  • Li, Jianpo;Wang, Qiwei
    • Journal of Information Processing Systems
    • /
    • v.17 no.2
    • /
    • pp.306-317
    • /
    • 2021
  • The unbalance between system ergodic sum rate and high fairness is one of the key issues affecting the performance of non-orthogonal multiple access (NOMA) system. To solve the problem, this paper proposes a power allocation algorithm to realize the ergodic sum rate maximization of NOMA system. The scheme is mainly achieved by the construction algorithm of fair model based on α fair utility function and the optimal solution algorithm based on the interior point method of penalty function. Aiming at the construction of fair model, the fair target is added to the traditional power allocation model to set the reasonable target function. Simultaneously, the problem of ergodic sum rate and fairness in power allocation is weighed by adjusting the value of α. Aiming at the optimal solution algorithm, the interior point method of penalty function is used to transform the fair objective function with unequal constraints into the unconstrained problem in the feasible domain. Then the optimal solution of the original constrained optimization problem is gradually approximated within the feasible domain. The simulation results show that, compared with NOMA and time division multiple address (TDMA) schemes, the proposed method has larger ergodic sum rate and lower Fairness Index (FI) values.

Block Classifier for Fractal Image Coding (프랙탈 영상 부호화용 블럭 분류기)

  • Park, Gyeong-Bae;Jeong, U-Seok;Kim, Jeong-Il;Jeong, Geun-Won;Lee, Gwang-Bae;Kim, Hyeon-Uk
    • The Transactions of the Korea Information Processing Society
    • /
    • v.2 no.5
    • /
    • pp.691-700
    • /
    • 1995
  • Most fractal image codings using fractal concept require long encoding time because a large amount of computation is needed to find an optimal affine transformation point. Such a problem can be solved by designing a block classifier fitted to characteristics of image blocks. In general, it is possible to predict more precise and various types of blocks in frequency domain than in spatial domain. In this paper, we propose a block classifier to predict the block type using characteristics of DCT(Discrete Cosine Transform). This classifier has merits to enhance the quality of decoded images as well as to reduce the encoding time meeting fractal features. AC coefficient values in frequency domain make it possible to predict various types of blocks. As the results, the number of comparisons between a range block and the correspoding domain blocks to reach an optimal affine transformation point can be reduced. Specially, signs of DCT coefficients help to find the optimal affine transformation point with only two isometric transformations by eliminating unnecessary isometric transformations among eight isometric transformations used in traditional fractal codings.

  • PDF

Frequency Domain Encoding Matrices for Computationally Efficient STBC/SFBC OFDM Transmitters (복잡도 감소를 위한 MIMO-STBC/SFBC OFDM 방식의 주파수 도메인 부호화 행렬)

  • Cho, Se-Young;Shang, Pingping;Kim, Soo-Young;Lim, Kwang-Jae
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.47 no.12
    • /
    • pp.8-16
    • /
    • 2010
  • In this paper, we present a structure of computationally efficient MIMO-OFDM transmitters, using STBC. Straightforward implementation of these systems requires separate WFT processing blocks for each of the transmit antennas. In our proposed scheme, we use symmetry properties of Fourier transform, and thus use a frequency domain encoding strategy. This greatly reduces the computational burden. The proposed scheme uses just a single IFFT block at each OFDM symbol period, and thus resulting in the same complexity as a single antenna OFDM system. As specific examples of the proposed schemes, we present frequency domain encoding matrices for QO-STBC-OFDM and QO-SFBC-OFDM transmitters.

Hardware Architecture and its Design of Real-Time Video Compression Processor for Motion JPEG2000 (Motion JPEG2000을 위한 실시간 비디오 압축 프로세서의 하드웨어 구조 및 설계)

  • 서영호;김동욱
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.1
    • /
    • pp.1-9
    • /
    • 2004
  • In this paper, we proposed a hardware(H/W) structure which can compress and recontruct the input image in real time operation and implemented it into a FPGA platform using VHDL(VHSIC Hardware Description Language). All the image processing element to process both compression and reconstruction in a FPGA were considered each of them was mapped into a H/W with the efficient structure for FPGA. We used the DWT(discrete wavelet transform) which transforms the data from spatial domain to the frequency domain, because use considered the motion JPEG2000 as the application. The implemented H/W is separated to both the data path part and the control part. The data path part consisted of the image processing blocks and the data processing blocks. The image processing blocks consisted of the DWT Kernel for the filtering by DWT, Quantizer/Huffman Encoder, Inverse Adder/Buffer for adding the low frequency coefficient to the high frequency one in the inverse DWT operation, and Huffman Decoder. Also there existed the interface blocks for communicating with the external application environments and the timing blocks for buffering between the internal blocks. The global operations of the designed H/W are the image compression and the reconstruction, and it is operated by the unit or a field synchronized with the A/D converter. The implemented H/W used the 54%(12943) LAB(Logic Array Block) and 9%(28352) ESB(Embedded System Block) in the APEX20KC EP20K600CB652-7 FPGA chip of ALTERA, and stably operated in the 70MHz clock frequency. So we verified the real time operation. that is. processing 60 fields/sec(30 frames/sec).

Post-processing Technique Based on POCS Using EZW (EZW를 이용한 POCS 기반의 후처리 기법)

  • Kim, Hyo-Kak;Kwon, Goo-Rak;Kim, Yoon;Ko, Sung-Jea
    • Proceedings of the IEEK Conference
    • /
    • 2005.11a
    • /
    • pp.427-430
    • /
    • 2005
  • In this paper, we propose a new post-processing method, based on the theory of the projection onto convex sets (POCS) to reduce the blocking artifacts in decoded images. We propose a new smoothness constraint set (SCS) and its projection operator in the wavelet transform (WT) domain to remove unnecessary high-frequency components caused by blocking artifacts. We also propose a new method to find and preserve the original high frequency components of the image edge. Experimental results show that the proposed method can not only achieve a significantly enhanced subjective quality, but also have the PSNR improvement in the output image.

  • PDF

Fast Codebook Search Algorithm for VQ of Subband Images (서브밴드 영상의 VQ를 위한 고속 코드북 탐색 알고리즘)

  • Hwang, Jae-Ho;Hong, Choong-Seon;Lee, Dae-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10b
    • /
    • pp.883-886
    • /
    • 2000
  • Two fast search algorithms are proposed for VQ encoding in subband/VQ coding schemes. These algorithms exploit the property of the transform domain that the large coefficients rarely exist in the decomposed subbands. And the exit condition of PDE algorithm can be satisfied by comparing the large values of the codeword with the corresponding ones of the input vector. The computational complexity can be reduced at the expense of memories without extra coding errors.

  • PDF

An Efficient Watermarking Technique in Wavelet Transform Domain (웨이브릿 변환 영역에서의 효율적인 워터마킹 기법)

  • Kim, Hyun-Soon;Bae, Sung-Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.04a
    • /
    • pp.1071-1076
    • /
    • 2000
  • 워터마킹은 소유권을 주장하기 위하여 '워터마크'라고 불리는 신호를 눈에 보이지 않게 영상에 첨가하는 것이다. 본 논문에서는 웨이브릿 변환 영역에서 이웃한 계수 쌍의 차이와 평균을 이용한 워터마킹 기법을 제안한다. 제안한 방법에서는 영상을 1계층의 웨이브릿 변환 영역으로 변환한 다음, 이진 도장 영상으로 된 워터마크를 첨가할 이웃한 계수 쌍을 기저 대역에서 선택한다. 워터마크 첨가를 위하여 도장 영상의 이름이 새겨진 부분에 대하여 해당 웨이브릿 계수 쌍의 대소 관계를 반전시킨다. 이러한 방법은 워터마크를 직접 첨가하는 방법에 비하여 손실 압축 및 다양한 영상 처리 등의 공격에 강인하다. 또한 대소 관계를 반전시키는 과정에서 이웃 계수 쌍의 평균과 차이를 구하여, 차이에 반비례하는 일정한 값을 평균에 더함으로써 주관적인 화질의 저하를 줄인다. 실험 결과 제안한 방법은 44 dB 정도의 우수한 화질에서 손실 JPEG 압축, 잡음 첨가, 클리핑, 블러링 등의 공격에 강인함을 알 수 있었다.

  • PDF

Copyright Protection for Digital Image by Watermarking Technique

  • Ali, Suhad A.;Jawad, Majid Jabbar;Naser, Mohammed Abdullah
    • Journal of Information Processing Systems
    • /
    • v.13 no.3
    • /
    • pp.599-617
    • /
    • 2017
  • Due to the rapid growth and expansion of the Internet, the digital multimedia such as image, audio and video are available for everyone. Anyone can make unauthorized copying for any digital product. Accordingly, the owner of these products cannot protect his ownership. Unfortunately, this situation will restrict any improvement which can be done on the digital media production in the future. Some procedures have been proposed to protect these products such as cryptography and watermarking techniques. Watermarking means embedding a message such as text, the image is called watermark, yet, in a host such as a text, an image, an audio, or a video, it is called a cover. Watermarking can provide and ensure security, data authentication and copyright protection for the digital media. In this paper, a new watermarking method of still image is proposed for the purpose of copyright protection. The procedure of embedding watermark is done in a transform domain. The discrete cosine transform (DCT) is exploited in the proposed method, where the watermark is embedded in the selected coefficients according to several criteria. With this procedure, the deterioration on the image is minimized to achieve high invisibility. Unlike the traditional techniques, in this paper, a new method is suggested for selecting the best blocks of DCT coefficients. After selecting the best DCT coefficients blocks, the best coefficients in the selected blocks are selected as a host in which the watermark bit is embedded. The coefficients selection is done depending on a weighting function method, where this function exploits the values and locations of the selected coefficients for choosing them. The experimental results proved that the proposed method has produced good imperceptibility and robustness for different types of attacks.