• Title/Summary/Keyword: Image Gradient

Search Result 714, Processing Time 0.027 seconds

3D Object Generation and Renderer System based on VAE ResNet-GAN

  • Min-Su Yu;Tae-Won Jung;GyoungHyun Kim;Soonchul Kwon;Kye-Dong Jung
    • International journal of advanced smart convergence
    • /
    • v.12 no.4
    • /
    • pp.142-146
    • /
    • 2023
  • We present a method for generating 3D structures and rendering objects by combining VAE (Variational Autoencoder) and GAN (Generative Adversarial Network). This approach focuses on generating and rendering 3D models with improved quality using residual learning as the learning method for the encoder. We deep stack the encoder layers to accurately reflect the features of the image and apply residual blocks to solve the problems of deep layers to improve the encoder performance. This solves the problems of gradient vanishing and exploding, which are problems when constructing a deep neural network, and creates a 3D model of improved quality. To accurately extract image features, we construct deep layers of the encoder model and apply the residual function to learning to model with more detailed information. The generated model has more detailed voxels for more accurate representation, is rendered by adding materials and lighting, and is finally converted into a mesh model. 3D models have excellent visual quality and accuracy, making them useful in various fields such as virtual reality, game development, and metaverse.

The Construction Method of Precise DTM of UAV Images Using Sobel-median Filtering (소벨-메디언 필터링을 이용한 UAV 영상의 정밀 DTM 구축 방법에 관한 연구)

  • Na, Young-Woo
    • Journal of Urban Science
    • /
    • v.12 no.2
    • /
    • pp.43-52
    • /
    • 2023
  • UAV have the disadvantage that are weak from rainfall or winds due to the light platform, so use Scale-Invariant Feature Transform (SIFT) method which extrude keypoints in image matching process. To find the efficient filtering method for the construction of precise Digital Terrain Model (DTM) using UAV images, comparatively analyzed sobel and Differential of Gaussian (DoG) and found sobel is more efficient way to extrude buildings, trees, and so on. And edges are extruded more clearly when applying median additionally which have the merit of preserving edge and eliminating noise. In this study, applied sobel-median filtering which plus median to sobel and constructed the 1st filtered DTM that extrude building and trees and 2nd filtered DTM that extrude cars by threshold of gradient, Analysis of the degree of accuracy improvement showed that standard deviations of 1st filtered DTM and 2nd filtered DTM are 0.32m, 0.287m respectively, and both are acceptable for the tolerance of 0.33m for elevation points of 1/1,000 digital map, and the accuracy was increased about 10% by filtering automobiles. Plus, moving things are changed those position and direction in every image, and these are not target to filter because of the characteristic that is excluded from SIFT method.

Image Watermarking for Copyright Protection of Images on Shopping Mall (쇼핑몰 이미지 저작권보호를 위한 영상 워터마킹)

  • Bae, Kyoung-Yul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.147-157
    • /
    • 2013
  • With the advent of the digital environment that can be accessed anytime, anywhere with the introduction of high-speed network, the free distribution and use of digital content were made possible. Ironically this environment is raising a variety of copyright infringement, and product images used in the online shopping mall are pirated frequently. There are many controversial issues whether shopping mall images are creative works or not. According to Supreme Court's decision in 2001, to ad pictures taken with ham products is simply a clone of the appearance of objects to deliver nothing but the decision was not only creative expression. But for the photographer's losses recognized in the advertising photo shoot takes the typical cost was estimated damages. According to Seoul District Court precedents in 2003, if there are the photographer's personality and creativity in the selection of the subject, the composition of the set, the direction and amount of light control, set the angle of the camera, shutter speed, shutter chance, other shooting methods for capturing, developing and printing process, the works should be protected by copyright law by the Court's sentence. In order to receive copyright protection of the shopping mall images by the law, it is simply not to convey the status of the product, the photographer's personality and creativity can be recognized that it requires effort. Accordingly, the cost of making the mall image increases, and the necessity for copyright protection becomes higher. The product images of the online shopping mall have a very unique configuration unlike the general pictures such as portraits and landscape photos and, therefore, the general image watermarking technique can not satisfy the requirements of the image watermarking. Because background of product images commonly used in shopping malls is white or black, or gray scale (gradient) color, it is difficult to utilize the space to embed a watermark and the area is very sensitive even a slight change. In this paper, the characteristics of images used in shopping malls are analyzed and a watermarking technology which is suitable to the shopping mall images is proposed. The proposed image watermarking technology divide a product image into smaller blocks, and the corresponding blocks are transformed by DCT (Discrete Cosine Transform), and then the watermark information was inserted into images using quantization of DCT coefficients. Because uniform treatment of the DCT coefficients for quantization cause visual blocking artifacts, the proposed algorithm used weighted mask which quantizes finely the coefficients located block boundaries and coarsely the coefficients located center area of the block. This mask improves subjective visual quality as well as the objective quality of the images. In addition, in order to improve the safety of the algorithm, the blocks which is embedded the watermark are randomly selected and the turbo code is used to reduce the BER when extracting the watermark. The PSNR(Peak Signal to Noise Ratio) of the shopping mall image watermarked by the proposed algorithm is 40.7~48.5[dB] and BER(Bit Error Rate) after JPEG with QF = 70 is 0. This means the watermarked image is high quality and the algorithm is robust to JPEG compression that is used generally at the online shopping malls. Also, for 40% change in size and 40 degrees of rotation, the BER is 0. In general, the shopping malls are used compressed images with QF which is higher than 90. Because the pirated image is used to replicate from original image, the proposed algorithm can identify the copyright infringement in the most cases. As shown the experimental results, the proposed algorithm is suitable to the shopping mall images with simple background. However, the future study should be carried out to enhance the robustness of the proposed algorithm because the robustness loss is occurred after mask process.

Splitting between Region of Chromatic and Achromatic by Brightness and Chroma (명암과 채도에 의한 색상영역과 비색상영역의 분할)

  • Kwak, Nae-Joung;Hwang, Jae-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.7
    • /
    • pp.107-114
    • /
    • 2010
  • Color is a sense signal for human to perceive being through light, and the color is divided into chromatic color and achromatic color. Chromatic color has hue, intensity, and saturation, but achromatic color has only intensity among the properties of chromatic color and doesn't have hue and saturation. Therefore it is important to split colors of image into area for human to perceive colors and not to perceive ones based on vision of human being. In this paper, we find a function to split colors of image into chromatic region of chromatic color region and achromatic region of achromatic color region. First, the input image of RGB color space is converted into the image of HSI color space in consideration of human vision and get a binary image from the converted image. After then, a function to split colors into ROC(ROC: Region of chromatic.) and ROA(ROA:Region of achromatic) is yield. It is difficult to split color of a general image into ROC and ROA. Therefore, to get the chromatic area and achromatic area, we make gradient images to have all range of intensity and range of saturation and to have a little range of hue and yield the function. The evaluation is tested using subjective-quality by 50 non-experts for result images of test images and general images. The results of the proposed method get better 27.5~32.96% than these of the conventional method

Image Super Resolution Using Neural Architecture Search (심층 신경망 검색 기법을 통한 이미지 고해상도화)

  • Ahn, Joon Young;Cho, Nam Ik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2019.11a
    • /
    • pp.102-105
    • /
    • 2019
  • 본 논문에서는 심층 신경망 검색 방법을 사용하여 이미지 고해상도화를 위한 심층 신경망을 설계하는 방법을 구현하였다. 일반적으로 이미지 고해상도화, 잡음 제거 및 번짐 제거를 위한 심층신경망 구조는 사람이 설계하였다. 최근에는 이미지 분류 등 다른 영상처리 기법에서 사용하는 심층 신경망 구조를 검색하기 위한 방법이 연구되었다. 본 논문에서는 강화학습을 사용하여 이미지 고해상도화를 위한 심층 신경망 구조를 검색하는 방법을 제안하였다. 제안된 방법은 policy gradient 방법의 일종인 REINFORCE 알고리즘을 사용하여 심층 신경망 구조를 출력하여 주는 제어용 RNN(recurrent neural network)을 학습하고, 최종적으로 이미지 고해상도화를 잘 실현할 수 있는 심층 신경망 구조를 검색하여 설계하였다. 제안된 심층 신경망 구조를 사용하여 이미지 고해상도화를 구현하였고, 약 36.54dB 의 피크 신호 대비 잡음 비율(PSNR)을 가지는 것을 확인할 수 있었다.

  • PDF

Model-Based Simulation Analysis of Wicking Behavior in Hygroscopic Cotton Fabric

  • Hong, Cheol-Jae;Kim, Byung-Jick
    • Journal of Fashion Business
    • /
    • v.20 no.6
    • /
    • pp.66-78
    • /
    • 2016
  • Hygroscopic knitted cotton fabric was found to spontaneously absorb water showing a significantly wide concentration gradient in the absorption direction. A semi-empirical diffusion model was introduced to describe how the wicking behavior compared to the classical capillary model (Washburn's equation), which has been widely used in the textiles industry. The capillary sorption curve and the permeability coefficient, which are key variables for the model equations, were measured using an electronic balance. The concentration profile as a function of the wicking distance and the elapsed time was derived, based on the diffusion model. From the concentration profile, the wicking distance detectable by the human eye or a digital camera with the aid of an image-analysis system, could be described realistically as a function of the time. The classical capillary model could be modified by introducing the tortuous correction factor to match the diffusion model. Wicking models and data-processing techniques in the work could provide useful tools for objectively evaluating the textile's wicking performances.

Lineament Extraction from DEM Using Raindrop Tracing Algorithm

  • Yun, Sang-ho
    • Proceedings of the KSRS Conference
    • /
    • 1999.11a
    • /
    • pp.290-295
    • /
    • 1999
  • Lineament extraction from mountain area often provides valuable geological information. In many cases, the lineaments correspond to a series of continuous large valleys. This paper introduces a new lineament extraction method from Digital Elevation Model (DEM) using Raindrop Tracing Algorithm (RTA). The main advantage of this algorithm over conventional Segment Tracing Algorithm (STA) is that it utilizes DEM directly unlike the STA Which utilizes the shaded relief of DEM. The RTA simulates the real life of raindrops that converge into a large valley. The simulation has been done by sprinkling the randomized raindrops over DEM and counting the number of raindrop path that follows the negative gradient of the DEM. The large counting number indicates the location of a big valley where the raindrops converge. With the help of the counting number array (accumulator array) recording the flowing path information, RTA can produce perfectly unbiased binary image of the lineament.

  • PDF

Recovering Surface Orientation from Texture Gradient by Monoculer View (단안시에 의한 무늬그래디언트로부터 연 방향 복구)

  • 정성칠;최연성;최종수
    • Proceedings of the Korean Institute of Communication Sciences Conference
    • /
    • 1987.04a
    • /
    • pp.22-26
    • /
    • 1987
  • Texture provides an important acurce of information about the threedicensfornarry information of visible surface particulary for stationary conccular views. To recover three dicmensinoary information, the distorging effects of pro jection must be distinguished from properties of the texture on which the distrortion acts. In this paper, we show an approximated maximum likelihood estimation method by which we find surface oriemtation of the visible surface in gaussian sphere using local analysis of the texture, In addition assuming that an orthographic projection and a circle is an image formation system and a texel(texture element)respectively we derive the surface orientation from the distribution of variation by means of orthographic pro jemction of a tangent directon which exstis regulary in the are length of a circle we present the orientation parameters of textured surface with saint and tilt and also the surface normal of the resvlted surface orimentation as needle map. This algorithm was applied to geograghic contour and synthetic textures.

  • PDF

Pedestrian Detection using HOG Feature and Multi-Frame Operation (HOG 특징과 다중 프레임 연산을 이용한 보행자 탐지)

  • Seo, Chang-jin;Ji, Hong-il
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.64 no.3
    • /
    • pp.193-198
    • /
    • 2015
  • A large number of vision applications rely on matching keypoints across images. Pedestrian detection is under constant pressure to increase both its quality and speed. Such progress allows for new application. A higher speed enables its inclusion into large systems with extensive subsequent processing, and its deployment in computationally constrained scenarios. In this paper, we focus on improving the speed of pedestrian detection using HOG(histogram of oriented gradient) and multi frame operation which is robust to illumination changes in cluttering images. The result of our simulation indicates that the detection rate and speed of the proposed method is much faster than that of conventional HOG and differential images.

Recovering Incomplete Data using Tucker Model for Tensor with Low-n-rank

  • Thieu, Thao Nguyen;Yang, Hyung-Jeong;Vu, Tien Duong;Kim, Sun-Hee
    • International Journal of Contents
    • /
    • v.12 no.3
    • /
    • pp.22-28
    • /
    • 2016
  • Tensor with missing or incomplete values is a ubiquitous problem in various fields such as biomedical signal processing, image processing, and social network analysis. In this paper, we considered how to reconstruct a dataset with missing values by using tensor form which is called tensor completion process. We applied Tucker factorization to solve tensor completion which was built base on optimization problem. We formulated the optimization objective function using components of Tucker model after decomposing. The weighted least square matric contained only known values of the tensor with low rank in its modes. A first order optimization method, namely Nonlinear Conjugated Gradient, was applied to solve the optimization problem. We demonstrated the effectiveness of the proposed method in EEG signals with about 70% missing entries compared to other algorithms. The relative error was proposed to compare the difference between original tensor and the process output.