Search | Korea Science

3D-Distortion Based Rate Distortion Optimization for Video-Based Point Cloud Compression

Yihao Fu;Liquan Shen;Tianyi Chen
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.17 no.2
- /
- pp.435-449
- /
- 2023
The state-of-the-art video-based point cloud compression(V-PCC) has a high efficiency of compressing 3D point cloud by projecting points onto 2D images. These images are then padded and compressed by High-Efficiency Video Coding(HEVC). Pixels in padded 2D images are classified into three groups including origin pixels, padded pixels and unoccupied pixels. Origin pixels are generated from projection of 3D point cloud. Padded pixels and unoccupied pixels are generated by copying values from origin pixels during image padding. For padded pixels, they are reconstructed to 3D space during geometry reconstruction as well as origin pixels. For unoccupied pixels, they are not reconstructed. The rate distortion optimization(RDO) used in HEVC is mainly aimed at keeping the balance between video distortion and video bitrates. However, traditional RDO is unreliable for padded pixels and unoccupied pixels, which leads to significant waste of bits in geometry reconstruction. In this paper, we propose a new RDO scheme which takes 3D-Distortion into account instead of traditional video distortion for padded pixels and unoccupied pixels. Firstly, these pixels are classified based on the occupancy map. Secondly, different strategies are applied to these pixels to calculate their 3D-Distortions. Finally, the obtained 3D-Distortions replace the sum square error(SSE) during the full RDO process in intra prediction and inter prediction. The proposed method is applied to geometry frames. Experimental results show that the proposed algorithm achieves an average of 31.41% and 6.14% bitrate saving for D1 metric in Random Access setting and All Intra setting on geometry videos compared with V-PCC anchor.
https://doi.org/10.3837/tiis.2023.02.008 인용 PDF HTML

Robust Radiometric and Geometric Correction Methods for Drone-Based Hyperspectral Imaging in Agricultural Applications

Hyoung-Sub Shin;Seung-Hwan Go;Jong-Hwa Park
- Korean Journal of Remote Sensing
- /
- v.40 no.3
- /
- pp.257-268
- /
- 2024
Drone-mounted hyperspectral sensors (DHSs) have revolutionized remote sensing in agriculture by offering a cost-effective and flexible platform for high-resolution spectral data acquisition. Their ability to capture data at low altitudes minimizes atmospheric interference, enhancing their utility in agricultural monitoring and management. This study focused on addressing the challenges of radiometric and geometric distortions in preprocessing drone-acquired hyperspectral data. Radiometric correction, using the empirical line method (ELM) and spectral reference panels, effectively removed sensor noise and variations in solar irradiance, resulting in accurate surface reflectance values. Notably, the ELM correction improved reflectance for measured reference panels by 5-55%, resulting in a more uniform spectral profile across wavelengths, further validated by high correlations (0.97-0.99), despite minor deviations observed at specific wavelengths for some reflectors. Geometric correction, utilizing a rubber sheet transformation with ground control points, successfully rectified distortions caused by sensor orientation and flight path variations, ensuring accurate spatial representation within the image. The effectiveness of geometric correction was assessed using root mean square error(RMSE) analysis, revealing minimal errors in both east-west(0.00 to 0.081 m) and north-south directions(0.00 to 0.076 m).The overall position RMSE of 0.031 meters across 100 points demonstrates high geometric accuracy, exceeding industry standards. Additionally, image mosaicking was performed to create a comprehensive representation of the study area. These results demonstrate the effectiveness of the applied preprocessing techniques and highlight the potential of DHSs for precise crop health monitoring and management in smart agriculture. However, further research is needed to address challenges related to data dimensionality, sensor calibration, and reference data availability, as well as exploring alternative correction methods and evaluating their performance in diverse environmental conditions to enhance the robustness and applicability of hyperspectral data processing in agriculture.
https://doi.org/10.7780/kjrs.2024.40.3.2 인용 PDF HTML

Robust Speaker Identification using Independent Component Analysis (독립성분 분석을 이용한 강인한 화자식별)

Jang, Gil-Jin;Oh, Yung-Hwan
- Journal of KIISE:Software and Applications
- /
- v.27 no.5
- /
- pp.583-592
- /
- 2000
This paper proposes feature parameter transformation method using independent component analysis (ICA) for speaker identification. The proposed method assumes that the cepstral vectors from various channel-conditioned speech are constructed by a linear combination of some characteristic functions with random channel noise added, and transforms them into new vectors using ICA. The resultant vector space can give emphasis to the repetitive speaker information and suppress the random channel distortions. Experimental results show that the transformation method is effective for the improvement of speaker identification system.
PDF

Teleology, Discontinuity and World History: Periodization and Some Creation Myths of Modernity

Pomeranz, Kenneth
- Asian review of World Histories
- /
- v.1 no.2
- /
- pp.189-226
- /
- 2013
Discussions of world history often focus on the pros and cons of thinking on large spatial scales. However, world history also tends to employ unusually large timescales, both for research and teaching; frequently it is framed around a teleology and a series of "revolutions" which mark milestones taking humans from a very distant past to "modernity". Moreover, world history usually rejects regionally specific period markers (e.g. Renaissance), making periodization within this long timespan especially difficult. This article surveys various approaches to these problems, and shows that any of them, if treated as sufficient by itself, introduces significant distortions. It argues for a world history that highlights this problem, rather than hiding it, and which uses the need to deploy multiple timescales simultaneously to clarify the distinctive intellectual contribution of historical thinking.
https://doi.org/10.12773/arwh.2013.1.2.189 인용 PDF

A Design of Matching Engine for a Practical Query-by-Singing/Humming System with Polyphonic Recordings

Lee, Seok-Pil;Yoo, Hoon;Jang, Dalwon
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.2
- /
- pp.723-736
- /
- 2014
This paper proposes a matching engine for a query-by-singing/humming (QbSH) system with polyphonic music files like MP3 files. The pitch sequences extracted from polyphonic recordings may be distorted. So we use chroma-scale representation, pre-processing, compensation, and asymmetric dynamic time warping to reduce the influence of the distortions. From the experiment with 28 hour music DB, the performance of our QbSH system based on polyphonic database is very promising in comparison with the published QbSH system based on monophonic database. It shows 0.725 in MRR(Mean Reciprocal Rank). Our matching engine can be used for the QbSH system based on MIDI DB also and that performance was verified by MIREX 2011.
https://doi.org/10.3837/tiis.2014.02.0024 인용 PDF KSCI KPUBS

Improvement of Maneuvering Feeling of Human-Mechanical Cooperative System and Its Application to Electric Power Steering System

Mukai, Yasuhiko;Ukai, Hiroyuki;Iwasaki, Makoto;Matsui, Nobuyuki;Hayashi, Jiro;Makino, Nobuhiko;Ishikawa, Hiroshi
- 제어로봇시스템학회:학술대회논문집
- /
- 2003.10a
- /
- pp.728-733
- /
- 2003
In human-mechanical cooperative systems, a significant issue is to improve the control performance and the maneuvering feeling of human operation. However, since it is not easy to evaluate the feeling of operators numerically, control engineers design controllers only through experience. Thus, in this paper, a new evaluation method for control performance of human-mechanical cooperative system is proposed based on the reserge waveform. Various distortions of waveform represent deteriorations of control performance and maneuvering feeling. In some cases, since there is a tradeoff between the control performance and the maneuvering feeling, it is difficult to compensate for both of them by usual feedback controllers. To overcome this situation, the two degrees of freedom control system is applied to human-mechanical cooperative system. Some numerical simulation results for an electric power steering system are shown to confirm the effectiveness of proposed control design method.
PDF

Affine-Invariant Image normalization for Log-Polar Images using Momentums

Son, Young-Ho;You, Bum-Jae;Oh, Sang-Rok;Park, Gwi-Tae
- 제어로봇시스템학회:학술대회논문집
- /
- 2003.10a
- /
- pp.1140-1145
- /
- 2003
Image normalization is one of the important areas in pattern recognition. Also, log-polar images are useful in the sense that their image data size is reduced dramatically comparing with conventional images and it is possible to develop faster pattern recognition algorithms. Especially, the log-polar image is very similar with the structure of human eyes. However, there are almost no researches on pattern recognition using the log-polar images while a number of researches on visual tracking have been executed. We propose an image normalization technique of log-polar images using momentums applicable for affine-invariant pattern recognition. We handle basic distortions of an image including translation, rotation, scaling, and skew of a log-polar image. The algorithm is experimented in a PC-based real-time vision system successfully.
PDF

Adaptive Predistortion Compensation for Nonlinearity of High Power Amplifiers

Ding, Yuanming;Ohmori, Hiromitsu;Sano, Akira
- 제어로봇시스템학회:학술대회논문집
- /
- 2003.10a
- /
- pp.122-127
- /
- 2003
In this paper, an adaptive predistortion scheme is proposed to compensate nonlinear distortions caused by high power amplifiers (HPA) in OFDM systems. A complex Wiener-Hammerstein model (WHM) is used to describe input-output relationship of HPA with linear dynamics. The predistorter is directly identified by complex power series model with memory, which is an approximate inverse of the HPA expressed by the WHM. The effectiveness of the proposed adaptive compensation scheme is validated by numerical simulation for 64QAM-OFDM systems.
PDF

Telephone Speech Recognition with Data-Driven Selective Temporal Filtering based on Principal Component Analysis

Jung Sun Gyun;Son Jong Mok;Bae Keun Sung
- Proceedings of the IEEK Conference
- /
- 2004.08c
- /
- pp.764-767
- /
- 2004
The performance of a speech recognition system is generally degraded in telephone environment because of distortions caused by background noise and various channel characteristics. In this paper, data-driven temporal filters are investigated to improve the performance of a specific recognition task such as telephone speech. Three different temporal filtering methods are presented with recognition results for Korean connected-digit telephone speech. Filter coefficients are derived from the cepstral domain feature vectors using the principal component analysis.
PDF

Adaptive Multi-Rate(AMR) Speech Coding Algorithm (Adaptive Multi-Rate(AMR) 음성부호화 알고리즘)

서정욱;배건성
- Proceedings of the IEEK Conference
- /
- 2000.06d
- /
- pp.92-97
- /
- 2000
An AMR(Adaptive Multi-Rate) speech coding algorithm has been adopted as a standard speech codec for IMT-2000. It is based on the algebraic CELP, and consists of eight speech coding modes having the bit rate from 4.75 kbit/s to 12.2 kbit/s. It also contains the VAD(Voice Activity Detector), SCR (Source Controlled Rate) operation, and error concealment scheme for robustness in a radio channel. The bit rate of AMR is changed on a frame basis depending on the channel condition. In this paper, we introduced AMR speech coding algorithm and performed the real-time implementation using TMS320C6201, i.e., a Texas Instrument's fixed-point DSP. With the ANSI C source code released from ETSI and 3GPP, we convert and optimize the program to make it run in real time using the C compiler and assembly language. It is verified that the decoded result of the implemented speech codec on the DSP is identical with the PC simulation result using ANSI C code for test sequences. Also, actual sound input/output test using microphone and speaker demonstrates its proper real-time operation without distortions or delays.
PDF

Search Result 801, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)