Integrated Search | Korea Science

Post-processing Algorithm Based on Edge Information to Improve the Accuracy of Semantic Image Segmentation

김정환;김선혁;김주희;최형일
- The Journal of the Korea Contents Association / Vol. 21, No. 3 / pp. 23-32 / 2021
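In computer vision, semantic image segmentation partitions an image at the pixel level and assigns each pixel to a class. Its performance is improving rapidly through machine learning, and its ability to exploit pixel-level information makes it widely applicable. However, from its early days until recently, the technique has repeatedly been criticized for segmentation that is not fine enough. Because this problem arises from repeatedly enlarging the label map, we expected that it could be mitigated by correcting the label map with the edge map of the original image, which retains detailed edge information. This paper therefore keeps learning-based semantic segmentation as it is and proposes a post-processing algorithm that corrects the resulting label map based on the edge map of the original image. Comparing accuracy before and after applying the algorithm to existing methods, pixel accuracy improved by about 1.74% and IoU (Intersection over Union) by about 1.35% on average, and analysis of the results showed that the algorithm improved fine segmentation as originally intended.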
https://doi.org/10.5392/JKCA.2021.21.03.023
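The two metrics reported in the abstract above can be sketched as follows. This is a minimal illustration of the standard definitions of pixel accuracy and mean IoU, not the authors' evaluation code; the function names and the nested-list label-map representation are assumptions made for the example.

```python
def pixel_accuracy(pred, truth):
    """Fraction of pixels whose predicted class equals the ground-truth class."""
    total = 0
    correct = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            total += 1
            if p == t:
                correct += 1
    return correct / total


def mean_iou(pred, truth):
    """Mean over classes of |pred AND truth| / |pred OR truth|."""
    inter = {}  # class -> pixels where prediction and truth both have this class
    union = {}  # class -> pixels where prediction or truth has this class
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == t:
                inter[p] = inter.get(p, 0) + 1
                union[p] = union.get(p, 0) + 1
            else:
                # A mismatched pixel belongs to the union of both classes.
                union[p] = union.get(p, 0) + 1
                union[t] = union.get(t, 0) + 1
    return sum(inter.get(c, 0) / union[c] for c in union) / len(union)


# Hypothetical 2x2 label maps (class ids 0 and 1) for illustration only.
pred = [[0, 0], [1, 1]]
truth = [[0, 1], [1, 1]]
print(pixel_accuracy(pred, truth), mean_iou(pred, truth))
```

Comparing these two numbers before and after a post-processing step is one way to obtain the kind of improvement figures quoted in the abstract.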

A Method for the Increasing Efficiency of the Watershed Based Image Segmentation using Haar Wavelet Transform

김종배;김항준
- Journal of the Institute of Electronics Engineers of Korea SP / Vol. 40, No. 2 / pp. 1-10 / 2003
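The watershed algorithm, which has been studied in the field of morphology, has been widely applied to image segmentation by treating the brightness of each pixel in the gradient image of a simplified image as an altitude. However, when an image corrupted by noise is segmented, the many local minima cause over-segmentation, and the computation time needed to merge the segmented regions increases. To address these problems, this paper proposes a method for increasing the efficiency of watershed-based image segmentation using the wavelet transform. The proposed method consists of four steps: pyramid representation, a hierarchical representation of the image obtained with the wavelet transform; image segmentation using the watershed algorithm; region merging using wavelet coefficients; and region projection using the inverse wavelet transform. The proposed method not only reduces the over-segmentation that occurs when segmenting noisy, corrupted images but also improves segmentation performance.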
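The pyramid representation and inverse-transform steps mentioned above rest on the Haar wavelet transform. The sketch below shows one level of a 2D Haar decomposition and its inverse, using the averaging (unnormalized) convention. It is an illustration under stated assumptions, not the paper's implementation: the function names, the nested-list image format, and the choice of normalization are all hypothetical.

```python
def haar_2d(image):
    """One-level 2D Haar transform of an image with even height and width.

    Returns four half-size subbands: the approximation (block average) and
    three detail bands (left-right, top-bottom, and diagonal differences).
    """
    h, w = len(image), len(image[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    approx = [[0.0] * (w // 2) for _ in range(h // 2)]
    horiz = [[0.0] * (w // 2) for _ in range(h // 2)]
    vert = [[0.0] * (w // 2) for _ in range(h // 2)]
    diag = [[0.0] * (w // 2) for _ in range(h // 2)]
    for i in range(0, h, 2):
        for j in range(0, w, 2):
            a, b = image[i][j], image[i][j + 1]
            c, d = image[i + 1][j], image[i + 1][j + 1]
            r, s = i // 2, j // 2
            approx[r][s] = (a + b + c + d) / 4  # coarse image for the next pyramid level
            horiz[r][s] = (a - b + c - d) / 4   # left column minus right column
            vert[r][s] = (a + b - c - d) / 4    # top row minus bottom row
            diag[r][s] = (a - b - c + d) / 4    # diagonal difference
    return approx, horiz, vert, diag


def inverse_haar_2d(approx, horiz, vert, diag):
    """Rebuild the full-size image from the four subbands of haar_2d."""
    h, w = 2 * len(approx), 2 * len(approx[0])
    image = [[0.0] * w for _ in range(h)]
    for r in range(h // 2):
        for s in range(w // 2):
            ll, hd, vd, dd = approx[r][s], horiz[r][s], vert[r][s], diag[r][s]
            image[2 * r][2 * s] = ll + hd + vd + dd
            image[2 * r][2 * s + 1] = ll - hd + vd - dd
            image[2 * r + 1][2 * s] = ll + hd - vd - dd
            image[2 * r + 1][2 * s + 1] = ll - hd - vd + dd
    return image
```

Applying `haar_2d` repeatedly to the approximation band yields progressively smaller and smoother images, which is the general reason a watershed run on a coarse level sees fewer noise-induced local minima.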

A Study on the Segmentation of Speech Signal into Phonemic Units

이의천;이강성;김순협
- The Journal of the Acoustical Society of Korea / Vol. 10, No. 4 / pp. 5-11 / 1991
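This study proposes a method for segmenting speech signals into phonemic units. The proposed segmentation system is speaker-independent and can segment speech into phonemic units without prior information about the speech signal. Segmentation uses a two-stage algorithm: the input speech signal is first separated into purely voiced intervals and intervals that are not purely voiced, and each interval is then divided into finer phonemic units. The parameters used are a voicing detection parameter, a time-variation parameter of the zeroth-order LPC cepstrum coefficient, and a ZCR parameter. To demonstrate the usefulness of the proposed algorithm, a vocabulary consisting of isolated words and continuous speech was used, and the segmentation rate for the 507 phonemes contained in the whole vocabulary was 91.7%.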
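Of the parameters listed above, the ZCR (zero-crossing rate) is the simplest to illustrate. The sketch below computes a frame-wise ZCR; it shows only the general definition, and the function names, frame length, and hop size are assumptions rather than values from the paper.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (zero counts as non-negative)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for prev, cur in zip(frame, frame[1:]) if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(frame) - 1)


def framewise_zcr(signal, frame_len, hop):
    """ZCR of each full frame of length frame_len, advancing by hop samples."""
    return [
        zero_crossing_rate(signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


# Hypothetical toy signal: a rapidly alternating part followed by a constant part.
signal = [1, -1, 1, -1, 1, 1, 1, 1]
print(framewise_zcr(signal, frame_len=4, hop=4))
```

Noise-like sounds such as unvoiced fricatives generally produce a higher ZCR than voiced sounds, which is why the measure is commonly used alongside voicing cues.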

FINE SEGMENTATION USING GEOMETRIC ATTRACTION-DRIVEN FLOW AND EDGE-REGIONS

Hahn, Joo-Young;Lee, Chang-Ock
- Journal of the Korean Society for Industrial and Applied Mathematics / Vol. 11, No. 2 / pp. 41-47 / 2007
A fine segmentation algorithm is proposed for extracting objects in an image that have both weak boundaries and highly non-convex shapes. The image has simple background colors or simple object colors. Two concepts, geometric attraction-driven flow (GADF) and edge-regions, are combined to detect object boundaries at sub-pixel resolution. The main strategy is to construct initial curves close to the objects by using edge-regions and then to evolve the curves in GADF. Since the initial curves are close to the objects regardless of their shapes, highly non-convex shapes are easily detected, and the dependence on initial curves found in boundary-based segmentation algorithms is naturally removed. Weak boundaries are also detected because the orientation of GADF is obtained regardless of boundary strength. For fine segmentation, we additionally propose a local region competition algorithm to detect perceptible boundaries, which are used to extract objects without visual loss of detailed shapes. We have successfully accomplished fine segmentation of objects from images taken in a studio and of aphids from images of soybean leaves.

Improvement of an Automatic Segmentation for TTS Using Voiced/Unvoiced/Silence Information

김민제;이정철;김종진
- MALSORI (Journal of the Korean Society of Phonetic Sciences and Speech Technology) / No. 58 / pp. 67-81 / 2006
For a large corpus of time-aligned data, HMM-based approaches are the most widely used for automatic segmentation, providing a consistent and accurate phone labeling scheme. There are two methods for training an HMM. The flat-start method minimizes human intervention but has low accuracy. The bootstrap method has high accuracy but requires manual segmentation. In this paper, a new algorithm is proposed to minimize manual work and improve the performance of automatic segmentation. In the first phase, each speech frame is classified as voiced, unvoiced, or silence. In the second phase, the phoneme sequence is dynamically aligned to the voiced/unvoiced/silence sequence according to acoustic-phonetic rules. Finally, using the segmented speech data as a bootstrap, HMM-based phoneme model parameters are trained. For the performance test, the hand-labeled ETRI speech DB was used. The experimental results showed that the algorithm improved segmentation accuracy by 10% within a 20 ms error tolerance. For unvoiced consonants in particular, it showed a 30% improvement.
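The accuracy figure above is stated relative to a 20 ms error tolerance. One common way to compute such a figure is sketched below. This is an assumption about the metric's form, not the authors' scoring script: the function name is hypothetical, boundaries are assumed to be paired one-to-one with the hand labels, and a boundary exactly 20 ms away is counted as correct.

```python
def boundary_accuracy(auto_ms, manual_ms, tolerance_ms=20.0):
    """Fraction of automatic boundaries within tolerance_ms of the hand label.

    Both lists hold boundary times in milliseconds for the same phone
    sequence, so the i-th automatic boundary pairs with the i-th manual one.
    """
    if len(auto_ms) != len(manual_ms):
        raise ValueError("boundary lists must have the same length")
    if not manual_ms:
        raise ValueError("at least one boundary is required")
    hits = sum(
        1 for a, m in zip(auto_ms, manual_ms) if abs(a - m) <= tolerance_ms
    )
    return hits / len(manual_ms)


# Hypothetical boundary times (ms) for illustration only.
auto = [100, 250, 400, 610]
manual = [110, 245, 430, 600]
print(boundary_accuracy(auto, manual))
```

Restricting the two lists to boundaries of one phone class, for example unvoiced consonants, gives a per-class figure of the kind reported at the end of the abstract.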

Segmentation of the Glottis and Quantitative Measurement of the Vocal Cord Mucosal Morphology in the Laryngoscopic Image

이선민;오석;김영재;우주현;김광기
- Journal of Korea Multimedia Society / Vol. 25, No. 5 / pp. 661-669 / 2022
The purpose of this study is to compare and analyze deep learning (DL) and digital image processing (DIP) techniques, using each method's glottis segmentation results followed by quantification of the degree of asymmetry of the vocal cord mucosa. The data consist of 40 normal and abnormal images. The DL model is based on the Deeplab V3 architecture, and the DIP technique uses the Canny edge detector and morphological operations. The average segmentation accuracy was 97.5% for the DL model and 94.7% for DIP. The quantification results showed high correlation coefficients for both the DL experiment (r=0.8512, p<0.0001) and the DIP experiment (r=0.7784, p<0.0001). In conclusion, the DL model showed relatively higher segmentation accuracy than DIP. In this paper, we propose the clinical applicability of this technique, which applies the segmentation and asymmetry quantification algorithm to the glottal area in laryngoscopic images.
https://doi.org/10.9717/kmms.2022.25.5.661

Motion Segmentation for Layer Decomposition of Image Sequences

장정진;오정수;홍현기;최종수
- Institute of Electronics Engineers of Korea (IEEK): Conference Proceedings / Proceedings of the IEEK 2000 Fall Conference (4) / pp. 29-32 / 2000
This paper proposes a motion segmentation algorithm for layer decomposition of image sequences. The proposed algorithm segments an image into initial regions using color and texture and computes a motion model for each initial region. Each pixel is then assigned either one of the motions represented by the models or a motion not covered by them, which segments the image into motion regions. The proposed algorithm is applied to image sequences, and the segmented motion is shown.

Parallel Synthesis Algorithm for Layer-based Computer-generated Holograms Using Sparse-field Localization

Park, Jongha;Hahn, Joonku;Kim, Hwi
- Current Optics and Photonics / Vol. 5, No. 6 / pp. 672-679 / 2021
We propose a high-speed layer-based algorithm for synthesizing computer-generated holograms (CGHs), featuring sparsity-based image segmentation and computational parallelism. The sparsity-based image segmentation of layer-based three-dimensional scenes leads to considerable improvement in the efficiency of CGH computation. The efficiency enhancement of the proposed algorithm is ascribed to the field localization of the fast Fourier transform (FFT), and the consequent reduction of FFT computational complexity.
https://doi.org/10.3807/COPP.2021.5.6.672

A New Connected Coherence Tree Algorithm For Image Segmentation

Zhou, Jingbo;Gao, Shangbing;Jin, Zhong
- KSII Transactions on Internet and Information Systems (TIIS) / Vol. 6, No. 4 / pp. 1188-1202 / 2012
In this paper, we propose a new multi-scale connected coherence tree algorithm (MCCTA) by improving the connected coherence tree algorithm (CCTA). In contrast to many multi-scale image processing algorithms, MCCTA works on the multi-scale space of an image and can adaptively change its parameters to capture both coarse and fine details. Furthermore, we design a Multi-scale Connected Coherence Tree algorithm plus Spectral graph partitioning (MCCTSGP) by combining MCCTA and spectral graph partitioning into a new framework. Specifically, the graph nodes are the regions produced by CCTA together with the image pixels, and the weights are the affinities between nodes. We then run a spectral graph partitioning algorithm on this graph, which takes into account information from both pixels and regions to improve the quality of the resulting segments. Experimental results on the Berkeley image database demonstrate the accuracy of our algorithm compared with existing popular methods.
https://doi.org/10.3837/tiis.2012.04.014

Incremental EM algorithm with multiresolution kd-trees and cluster validation and its application to image segmentation

이경미
- Journal of the Korean Institute of Intelligent Systems / Vol. 25, No. 6 / pp. 523-528 / 2015
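This paper proposes a new EM algorithm with multiresolution and dynamic properties for efficient image segmentation. The EM algorithm is the most widely used clustering method and performs well. However, the conventional EM algorithm has difficulty handling multiresolution data and requires prior knowledge of the number of clusters. To address these problems, this paper applies a multiresolution kd-tree in the E-step to handle multiresolution data and allows clusters to be assigned as data arrive sequentially. A cluster-merging principle is used to check the validity of the clusters. The proposed algorithm was applied to texture image segmentation and showed excellent performance.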
https://doi.org/10.5391/JKIIS.2015.25.6.523
