• Title/Abstract/Keywords: Adaptive Threshold Range

Search results: 29 items; processing time: 0.026 s

Study on OCR Enhancement of Homomorphic Filtering with Adaptive Gamma Value

  • Heeyeon Jo;Jeongwoo Lee;Hongrae Lee
    • 한국컴퓨터정보학회논문지
    • /
    • Vol. 29, No. 2
    • /
    • pp.101-108
    • /
    • 2024
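  • AI-OCR combines optical character recognition (OCR) with artificial intelligence (AI), improving on a shortcoming of OCR, which previously required human recognition. Raising AI-OCR performance requires training on diverse data. However, the recognition rate drops when the colors in an image have similar brightness, so homomorphic filtering (HF) is used as a preprocessing step to make color differences distinct and raise the text recognition rate. HF is well suited to text extraction because it uses gamma values to adjust the high- and low-frequency components of an image separately, but it has the drawback that the gamma values must be tuned manually. Through an experimental process, this study proposes a threshold range for gamma based on the contrast, brightness, and entropy of the image. The experimental results of HF with the proposed gamma range suggest strong potential for efficient AI-OCR.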
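To make the role of the two gamma values concrete, here is a minimal sketch of the gain curve often used in homomorphic filtering (a Gaussian high-frequency-emphasis form). The abstract does not give the paper's filter or its proposed gamma range, so the function name, the `cutoff` and `sharpness` parameters, and the sample values are all assumptions for illustration.

```python
import math

def hf_gain(u, v, rows, cols, gamma_low, gamma_high, cutoff, sharpness=1.0):
    """Gain applied at frequency index (u, v) of a centered log-spectrum.

    Assumed form (not taken from the paper):
        H = (gamma_high - gamma_low) * (1 - exp(-c * D^2 / D0^2)) + gamma_low
    where D is the distance from the spectrum center, D0 is `cutoff`
    and c is `sharpness`.
    """
    du = u - rows / 2
    dv = v - cols / 2
    d2 = du * du + dv * dv
    emphasis = 1.0 - math.exp(-sharpness * d2 / (cutoff * cutoff))
    return (gamma_high - gamma_low) * emphasis + gamma_low

# At the spectrum center (low frequencies, illumination) the gain is gamma_low;
# far from the center (high frequencies, edges and text) it approaches gamma_high.
center_gain = hf_gain(4, 4, 8, 8, gamma_low=0.5, gamma_high=2.0, cutoff=2.0)
edge_gain = hf_gain(0, 0, 1000, 1000, gamma_low=0.5, gamma_high=2.0, cutoff=2.0)
```

With `gamma_low < 1 < gamma_high`, slowly varying illumination is suppressed and fine detail is boosted, which is why the choice of these two values matters for text extraction.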

Range image reconstruction based on multiresolution surface parameter estimation

  • 장인수;박래홍
    • 전자공학회논문지S
    • /
    • Vol. 34S, No. 6
    • /
    • pp.58-66
    • /
    • 1997
  • This paper proposes a multiresolution surface parameter estimation method for range images. Based on robust estimation of surface parameters, it approximates a patch to a planar surface in a locally adaptive window. The resolution is selected pixelwise by comparing a locally computed homogeneity measure with the global threshold determined by the distribution of the approximation error. The proposed multiresolution surface parameter estimation method is applied to range image reconstruction. Computer simulation results with noisy range images contaminated by additive Gaussian noise and impulse noise show that the proposed multiresolution reconstruction method preserves step and roof edges well compared with conventional methods. The segmentation method based on the estimated surface parameters is also shown to be robust to noise.

Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating

  • Lin, Chunyu;Zhao, Yao;Xiao, Jimin;Tillo, Tammam
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 9, No. 3
    • /
    • pp.1121-1139
    • /
    • 2015
  • In the texture-plus-depth format, depth map compression is an important task. Unlike normal texture images, depth maps have less texture information but contain many homogeneous regions separated by sharp edges. This paper exploits this feature to form an efficient depth map coding scheme. First, the histogram of the depth map is analyzed to find an appropriate threshold that segments the depth map into foreground and background regions, allowing the edge between these two kinds of regions to be obtained. Second, the two regions are encoded through rate-distortion optimization with a shape-adaptive wavelet transform, while the edges are losslessly encoded with JBIG2. Finally, a depth-updating algorithm based on the threshold and the depth range is applied to enhance the quality of the decoded depth maps. Experimental results demonstrate effective performance in both depth map quality and synthesized view quality.
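The histogram-based segmentation step can be illustrated with a short sketch. The abstract does not say how the threshold is chosen, so Otsu's method is used here purely as a stand-in, and the function names and sample depth values are hypothetical.

```python
def otsu_threshold(depths, levels=256):
    """Return the level t that maximizes between-class variance (Otsu's method).

    Values <= t fall in one region and values > t in the other.
    This is a stand-in; the paper's own threshold rule is not given.
    """
    hist = [0] * levels
    for d in depths:
        hist[d] += 1
    total = len(depths)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_low, sum_low = 0, 0
    for t in range(levels):
        w_low += hist[t]
        sum_low += t * hist[t]
        w_high = total - w_low
        if w_low == 0:
            continue  # no samples at or below t yet
        if w_high == 0:
            break  # every sample is at or below t
        mean_low = sum_low / w_low
        mean_high = (sum_all - sum_low) / w_high
        between_var = w_low * w_high * (mean_low - mean_high) ** 2
        if between_var > best_var:
            best_t, best_var = t, between_var
    return best_t

def split_regions(depths, threshold):
    """Boolean mask: True where the depth value lies above the threshold."""
    return [d > threshold for d in depths]

# Hypothetical 8-bit depth samples with two well-separated clusters.
samples = [10] * 50 + [12] * 50 + [200] * 50 + [202] * 50
t = otsu_threshold(samples)
mask = split_regions(samples, t)
```

Whether values above the threshold count as foreground or background depends on the depth convention of the data, which the abstract does not specify.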

Self-Adaptive Performance Improvement of Novel SDD Equalization Using Sigmoid Estimate and Threshold Decision-Weighted Error

  • 오길남
    • 한국산학기술학회논문지
    • /
    • Vol. 17, No. 8
    • /
    • pp.17-22
    • /
    • 2016
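  • For self-adaptive equalization of high-order QAM systems, this paper proposes a new SDD algorithm that, applied in the initial stage of equalization when the eye pattern is completely closed, not only opens the eye quickly but also greatly lowers the steady-state error level. In M-QAM applications, the proposed method bases its estimate on the two symbols closest to the observation, thereby minimizing the computational complexity of conventional SDD and greatly simplifying the soft decision regardless of QAM order. In addition, symbol estimation uses a sigmoid function, which avoids decision errors better than a threshold function, raising the reliability of the estimate. Furthermore, when the error used to update the equalizer is generated, the symbol decision value from the threshold function is applied as a weight on the error, extending the range of error variation and thereby improving the initialization performance of the proposed self-adaptive equalizer. As a result, the proposed method markedly improves on the computational complexity and on the initialization and convergence characteristics of conventional SDD. Simulations for 64-QAM and 256-QAM under multipath channel conditions with additive noise compare the performance of CMA with the two proposed forms, 2-SDD and weighted 2-SDD, and confirm the usefulness of the proposed method.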
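The contrast between a threshold (hard) decision and a sigmoid (soft) estimate over the two nearest symbols can be sketched for one axis of a square QAM constellation. This is only an illustration under assumptions: the odd-integer level spacing, the slope `beta`, and all function names are hypothetical, and the paper's actual estimator and error weighting are not reproduced here.

```python
import math

def nearest_two_levels(z, levels_per_axis=8):
    """Two adjacent levels that bracket z on one axis.

    Levels are assumed to be the odd integers -(L-1), ..., -1, 1, ..., L-1
    (L = 8 per axis corresponds to 64-QAM).
    """
    max_level = levels_per_axis - 1
    lo = 2 * math.floor((z + 1) / 2) - 1  # largest odd integer <= z
    lo = min(max(lo, -max_level), max_level - 2)  # stay inside the constellation
    return lo, lo + 2

def hard_decision(z, levels_per_axis=8):
    """Threshold function: snap to whichever of the two levels is closer."""
    lo, hi = nearest_two_levels(z, levels_per_axis)
    return lo if z < (lo + hi) / 2 else hi

def soft_estimate(z, beta=4.0, levels_per_axis=8):
    """Sigmoid estimate: blend the two levels by a smooth weight."""
    lo, hi = nearest_two_levels(z, levels_per_axis)
    mid = (lo + hi) / 2
    weight_hi = 1.0 / (1.0 + math.exp(-beta * (z - mid)))
    return lo + (hi - lo) * weight_hi

hard = hard_decision(2.1)  # snaps to 3 even though 2.1 is close to the boundary
soft = soft_estimate(2.1)  # stays between 1 and 3, reflecting the uncertainty
```

Near a decision boundary the hard decision commits fully to one symbol, while the sigmoid estimate remains between the two candidates; larger `beta` makes the sigmoid approach the threshold function.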

Robust 3D Object Detection through Distance based Adaptive Thresholding

  • 이은호;정민우;김종호;이경수;김아영
    • 로봇학회논문지
    • /
    • Vol. 19, No. 1
    • /
    • pp.106-116
    • /
    • 2024
  • Ensuring robust 3D object detection is a core challenge for autonomous driving systems operating in urban environments. To tackle this issue, various 3D representations, including point clouds, voxels, and pillars, have been widely adopted, making use of LiDAR, camera, and radar sensors. These representations have improved 3D object detection performance, but real-world urban scenarios with unexpected situations can still lead to numerous false positives, posing a challenge for robust 3D models. This paper presents a post-processing algorithm that dynamically adjusts object detection thresholds based on the distance from the ego-vehicle. Conventional perception algorithms typically employ a single threshold in post-processing, yet 3D models perform well in detecting nearby objects and may exhibit suboptimal performance for distant ones. The proposed algorithm tackles this issue by employing adaptive thresholds based on the distance from the ego-vehicle, minimizing false negatives and reducing false positives in the 3D model. The results show performance enhancements in the 3D model across a range of scenarios, encompassing not only typical urban road conditions but also adverse weather conditions.
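A distance-dependent score threshold of this kind can be sketched as a small post-processing filter. The linear schedule, the specific threshold values, the 60 m range, and the `Detection` type are assumptions made for illustration; the abstract does not describe how the paper maps distance to a threshold.

```python
import math
from dataclasses import dataclass

@dataclass
class Detection:
    x: float      # position relative to the ego-vehicle, metres
    y: float
    score: float  # detector confidence in [0, 1]

def threshold_for(distance, near_thr=0.5, far_thr=0.3, max_range=60.0):
    """Interpolate linearly from near_thr at 0 m to far_thr at max_range.

    Assumed schedule: stricter for nearby objects, where the detector is
    reliable, and more lenient far away, where scores tend to be lower.
    """
    ratio = min(max(distance / max_range, 0.0), 1.0)
    return near_thr + (far_thr - near_thr) * ratio

def filter_detections(detections, **schedule):
    """Keep detections whose score clears the threshold at their distance."""
    kept = []
    for det in detections:
        distance = math.hypot(det.x, det.y)
        if det.score >= threshold_for(distance, **schedule):
            kept.append(det)
    return kept

candidates = [Detection(5.0, 0.0, 0.4), Detection(60.0, 0.0, 0.4)]
kept = filter_detections(candidates)
```

With a single fixed threshold of 0.5 both candidates would be dropped; with the distance-dependent schedule the distant one survives while the nearby low-confidence one is still rejected.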

Coronary Artery Lumen Segmentation Using Location-Adaptive Threshold in Coronary Computed Tomographic Angiography: A Proof-of-Concept

  • Cheong-Il Shin;Sang Joon Park;Ji-Hyun Kim;Yeonyee Elizabeth Yoon;Eun-Ah Park;Bon-Kwon Koo;Whal Lee
    • Korean Journal of Radiology
    • /
    • Vol. 22, No. 5
    • /
    • pp.688-698
    • /
    • 2021
  • Objective: To compare the lumen parameters measured by the location-adaptive threshold method (LATM), in which the inter- and intra-scan attenuation variabilities of coronary computed tomographic angiography (CCTA) were corrected, and the scan-adaptive threshold method (SATM), in which only the inter-scan variability was corrected, with the reference standard measurement by intravascular ultrasonography (IVUS). Materials and Methods: The Hounsfield unit (HU) values of whole voxels and the centerline in each of the cross-sections of the 22 target coronary artery segments were obtained from 15 patients between March 2009 and June 2010, in addition to the corresponding voxel size. Lumen volume was calculated mathematically as the voxel volume multiplied by the number of voxels with HU within a given range, defined as the lumen for each method, and compared with the IVUS-derived reference standard. Subgroup analysis of the lumen area was performed to investigate the effect of lumen size on the studied methods. Bland-Altman plots were used to evaluate the agreement between the measurements. Results: The lumen volume measured by SATM was significantly smaller than that measured by IVUS (mean difference, 14.6 mm³; 95% confidence interval [CI], 4.9 to 24.3 mm³); the lumen volumes measured by LATM and IVUS were not significantly different (mean difference, -0.7 mm³; 95% CI, -9.1 to 7.7 mm³). The lumen area measured by SATM was significantly smaller than that measured by LATM in the smaller lumen area group (mean difference, 1.07 mm²; 95% CI, 0.89 to 1.25 mm²) but not in the larger lumen area group (mean difference, -0.07 mm²; 95% CI, -0.22 to 0.08 mm²). In the smaller lumen group, the mean difference was lower in the Bland-Altman plot of IVUS and LATM (0.46 mm²; 95% CI, 0.27 to 0.65 mm²) than in that of IVUS and SATM (1.53 mm²; 95% CI, 1.27 to 1.79 mm²). Conclusion: SATM underestimated the lumen parameters for computed lumen segmentation in CCTA, and this may be overcome by using LATM.
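The volume calculation described in Materials and Methods (voxel volume multiplied by the number of voxels whose HU falls within the lumen range) can be written out directly. The HU bounds and sample values below are hypothetical; the abstract does not state how LATM or SATM derive their ranges.

```python
def lumen_volume(hu_values, voxel_volume_mm3, hu_min, hu_max):
    """Volume in mm^3 of voxels whose HU lies within [hu_min, hu_max]."""
    lumen_voxels = sum(1 for hu in hu_values if hu_min <= hu <= hu_max)
    return lumen_voxels * voxel_volume_mm3

# Hypothetical voxel HU values and lumen range, for illustration only.
hu_samples = [100, 250, 300, 480, 600]
volume = lumen_volume(hu_samples, voxel_volume_mm3=0.5, hu_min=200, hu_max=500)
```

Because the count depends entirely on the HU range, a range that is too narrow for a given location leaves out true lumen voxels and lowers the computed volume, which is the kind of underestimation the abstract reports for SATM.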

Multiple Templates and Weighted Correlation Coefficient-based Object Detection and Tracking for Underwater Robots

  • 김동훈;이동화;명현;최현택
    • 로봇학회논문지
    • /
    • Vol. 7, No. 2
    • /
    • pp.142-149
    • /
    • 2012
  • Cameras suffer from poor visibility in underwater environments due to the limited light source and the medium noise of the environment. However, their usefulness at close range has been proved in many studies, especially for navigation. Thus, in this paper, vision-based object detection and tracking techniques using artificial objects for underwater robots have been studied. We employed template matching and mean shift algorithms for the object detection and tracking methods. We also propose adaptive threshold-based weighted correlation coefficient and color-region-aided approaches to enhance the object detection performance in various illumination conditions. The color information is incorporated into the template-matched area, and the features of the template are used to robustly calculate correlation coefficients. The objects are then recognized using a multi-template matching approach. Finally, water basin experiments have been conducted to demonstrate the performance of the proposed techniques using an underwater robot platform, yShark, made by KORDI.

Compression of Electrocardiogram Using MPE-LPC

  • 이태진;김원기;차일환;윤대희
    • 전자공학회논문지B
    • /
    • Vol. 28B, No. 11
    • /
    • pp.866-875
    • /
    • 1991
  • In this paper, multi-pulse excited linear predictive coding (MPE-LPC), where the correlation-eliminated residual signal is modeled by a few pulses, is shown to be effective for the compression of electrocardiogram (ECG) data, and a more efficient scheme for a faithful reconstruction of ECG is proposed. The reconstruction characteristic of QRS complexes and P and T waves is improved using adaptive pulse allocation (APA), and the compression ratio (CR) can be changed by controlling the number of modeling pulses. The performance of the proposed method was evaluated using 10 normal and 10 abnormal ECG data sets. The proposed method performed better than the variable threshold amplitude zone time epoch coding (AZTEC) algorithm and the scan-along polygonal approximation (SAPA) algorithm at the same CR. With the CR in the range of 8:1 to 14:1, ECG data could be compressed efficiently.

A Fast Inter Prediction Encoding Algorithm for Real-time Compression of H.264/AVC with High Complexity

  • 김영현;최현준;서영호;김동욱
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2006 Summer Conference
    • /
    • pp.411-412
    • /
    • 2006
  • In this paper, we propose a fast algorithm for inter prediction, which accounts for most of the complexity in H.264/AVC. It decides the search range according to the direction of the predicted motion vector and then performs an adaptive candidate spiral search. Simultaneously, it performs variable-loop motion estimation with a threshold for each variable block size. It is implemented in the high-complexity JM FME with rate-distortion optimization applied. Experimental results show that a significant complexity reduction is achieved while the degradation in video quality is negligible.

A Fast Inter Prediction Encoding Technique for Real-time Compression of H.264/AVC

  • 김영현;최현준;서영호;김동욱
    • 한국통신학회논문지
    • /
    • Vol. 31, No. 11C
    • /
    • pp.1077-1084
    • /
    • 2006
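  • This paper proposes a method for performing inter prediction, which accounts for the largest share of computation in H.264/AVC, at high speed. Targeting the FME (Fast Motion Estimation) of the JM (Joint Model) with rate-distortion optimization (RDO) applied, the proposed method determines the search area by considering the direction of the predicted motion vector and then performs an adaptive candidate spiral search. At the same time, it determines a threshold on the cost function for each variable block size and then performs a variable-interval motion search, reducing the encoding complexity of inter prediction. Experiments on a variety of video sequences confirmed that computation can be reduced by up to 80% compared with the existing prediction method. The resulting degradation in picture quality is only 0.05 dB to 0.19 dB on average, and the compression ratio shows a negligible average decrease of 0.58%, confirming that the proposed method is highly efficient as a fast inter prediction algorithm.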