• Title/Summary/Keyword: error segmentation

MR 영상에서 정규화된 기울기 크기 영상을 이용한 자동 간 분할 기법 (Automatic Liver Segmentation Method on MR Images using Normalized Gradient Magnitude Image)

  • 이정진;김경원;이호
    • 한국멀티미디어학회논문지 / Vol. 13, No. 11 / pp. 1698-1705 / 2010
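  • In this paper, we propose a fast liver segmentation method for magnetic resonance (MR) images. The proposed method efficiently divides an MR image into objects and boundaries based on normalized gradient magnitude information. Next, the objects corresponding to the liver region are detected by 2D seeded region growing, using seed points extracted from the liver region of the previously segmented slice. Finally, a rolling ball algorithm and connected component analysis are used to minimize false positive errors near the liver boundary. Accuracy was verified by comparing the results of the proposed method with manual segmentations for data from 20 patients. The average volumetric overlap error was 5.2%, and the average absolute volumetric measurement error was 1.9%. The average time needed to segment one patient's data with the proposed method was about 3 seconds. The proposed method can be used in computer-aided liver diagnosis techniques that require fast and accurate liver segmentation.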
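
The object/boundary split and the seeded growth described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the central-difference gradient, the normalization by the maximum magnitude, the `boundary_threshold` value, and the function names are all assumptions made for illustration, and the rolling ball and connected component steps are omitted.

```python
from collections import deque
from math import hypot

def normalized_gradient_magnitude(img):
    """Central-difference gradient magnitude, scaled to [0, 1] by its maximum (assumed normalization)."""
    h, w = len(img), len(img[0])
    mag = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            # Clamp neighbor indices at the image border.
            gx = (img[r][min(c + 1, w - 1)] - img[r][max(c - 1, 0)]) / 2.0
            gy = (img[min(r + 1, h - 1)][c] - img[max(r - 1, 0)][c]) / 2.0
            mag[r][c] = hypot(gx, gy)
    peak = max(max(row) for row in mag)
    if peak == 0:
        return mag
    return [[v / peak for v in row] for row in mag]

def grow_region(img, seeds, boundary_threshold=0.5):
    """4-connected growth from the seeds over pixels that are not boundary pixels."""
    h, w = len(img), len(img[0])
    ngm = normalized_gradient_magnitude(img)
    # Seeds that fall on a boundary pixel are discarded.
    queue = deque(s for s in seeds if ngm[s[0]][s[1]] < boundary_threshold)
    region = set(queue)
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < h and 0 <= nc < w
            if inside and (nr, nc) not in region and ngm[nr][nc] < boundary_threshold:
                region.add((nr, nc))
                queue.append((nr, nc))
    return region

# Toy "slice": a bright 5x5 block on a dark 9x9 background (hypothetical data).
image = [[100 if 2 <= r <= 6 and 2 <= c <= 6 else 0 for c in range(9)] for r in range(9)]
# In the paper the seeds come from the previously segmented slice; here one is given by hand.
liver = grow_region(image, seeds=[(4, 4)])
```

In this toy case the growth stops at the high-gradient ring around the block, so `liver` covers only the block's interior; a real pipeline would still need a step that recovers the boundary pixels.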

A Hybrid Semantic-Geometric Approach for Clutter-Resistant Floorplan Generation from Building Point Clouds

  • Kim, Seongyong;Yajima, Yosuke;Park, Jisoo;Chen, Jingdao;Cho, Yong K.
    • International Conference Proceedings / The 9th International Conference on Construction Engineering and Project Management / pp. 792-799 / 2022
  • Building Information Modeling (BIM) technology is a key component of modern construction engineering and project management workflows. As-is BIM models that represent the spatial reality of a project site can offer crucial information to stakeholders for construction progress monitoring, error checking, and building maintenance purposes. Geometric methods for automatically converting raw scan data into BIM models (Scan-to-BIM) often fail to make use of higher-level semantic information in the data. Semantic segmentation methods, in contrast, only output labels at the point level without creating the object-level models that are necessary for BIM. To address these issues, this research proposes a hybrid semantic-geometric approach for clutter-resistant floorplan generation from laser-scanned building point clouds. The input point clouds are first pre-processed by normalizing the coordinate system and removing outliers. Then, a semantic segmentation network based on PointNet++ is used to label each point as ceiling, floor, wall, door, stair, or clutter. The clutter points are removed, whereas the wall, door, and stair points are used for 2D floorplan generation. A region-growing segmentation algorithm paired with geometric reasoning rules is applied to group the points into individual building elements. Finally, a 2-fold Random Sample Consensus (RANSAC) algorithm is applied to parameterize the building elements into 2D lines, which are used to create the output floorplan. The proposed method is evaluated using the metrics of precision, recall, Intersection-over-Union (IOU), Betti error, and warping error.
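
To make the line-parameterization step concrete, here is a minimal sketch of plain single-pass RANSAC fitting one 2D line to a set of points. It is not the paper's 2-fold variant, whose details the abstract does not give; the function name `ransac_line`, the tolerance, the iteration count, and the sample points are assumptions.

```python
import random
from math import hypot

def ransac_line(points, iterations=200, tolerance=0.1, seed=0):
    """Fit a line a*x + b*y + c = 0 (with a^2 + b^2 = 1) by random sampling."""
    rng = random.Random(seed)
    best_line, best_inliers = None, []
    for _ in range(iterations):
        # Hypothesize a line through two randomly chosen points.
        (x1, y1), (x2, y2) = rng.sample(points, 2)
        a, b = y2 - y1, x1 - x2
        norm = hypot(a, b)
        if norm == 0:
            continue  # identical points define no line
        a, b = a / norm, b / norm
        c = -(a * x1 + b * y1)
        # Points within `tolerance` of the line count as inliers.
        inliers = [(x, y) for x, y in points if abs(a * x + b * y + c) <= tolerance]
        if len(inliers) > len(best_inliers):
            best_line, best_inliers = (a, b, c), inliers
    return best_line, best_inliers

# Hypothetical wall points on y = 2x + 1 plus a few leftover clutter points.
wall = [(i, 2 * i + 1) for i in range(20)]
clutter = [(3, 20), (10, -5), (15, 0), (7, 40)]
line, inliers = ransac_line(wall + clutter)
```

The returned inliers are the points explained by the fitted line; a fuller pipeline would remove them and repeat to extract further elements.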

이자 분할을 위한 노이즈 제거 알고리즘 기반 기존 임계값 기법 대비 U-Net 모델의 대체 가능성 (Substitutability of Noise Reduction Algorithm based Conventional Thresholding Technique to U-Net Model for Pancreas Segmentation)

  • 임세원;이영진
    • 한국방사선학회논문지 / Vol. 17, No. 5 / pp. 663-670 / 2023
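  • This study aimed to compare, using quantitative evaluation metrics, the performance of a region-growing-based segmentation method that applies conventional noise reduction algorithms with that of a segmentation method using U-Net. First, a median filter, a median modified Wiener filter, and a fast non-local means algorithm were modeled and applied to computed tomography (CT) images, after which region-growing-based segmentation was performed. A U-Net-based segmentation model was then trained and used to perform segmentation. To compare the segmentation performance obtained with the noise reduction algorithms and with U-Net, the root mean square error (RMSE), peak signal-to-noise ratio (PSNR), universal quality image index (UQI), and Dice similarity coefficient (DSC) were measured. The experimental results showed that segmentation performance improved the most when U-Net was used. The RMSE, PSNR, UQI, and DSC values were approximately 0.063, 72.11, 0.864, and 0.982, respectively, which are improvements of 1.97, 1.09, 5.30, and 1.99 times over the noisy image. In conclusion, U-Net was shown to be more effective than the noise reduction algorithms at improving segmentation performance in CT images.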
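
Three of the evaluation metrics named in this abstract have standard textbook definitions, sketched below on flat lists of pixel values. The `peak` value of 255 is an assumption (the abstract does not state the intensity range used), the sample inputs are made up, and UQI is left out for brevity.

```python
from math import inf, log10, sqrt

def rmse(a, b):
    """Root mean square error between two equally long pixel sequences."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))

def psnr(a, b, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` is the assumed maximum intensity."""
    err = rmse(a, b)
    return inf if err == 0 else 20.0 * log10(peak / err)

def dice(mask_a, mask_b):
    """Dice similarity coefficient of two binary masks: 2|A and B| / (|A| + |B|)."""
    overlap = sum(1 for x, y in zip(mask_a, mask_b) if x and y)
    total = sum(1 for x in mask_a if x) + sum(1 for y in mask_b if y)
    return 1.0 if total == 0 else 2.0 * overlap / total

# Hypothetical inputs, for illustration only.
example_rmse = rmse([0, 0], [3, 4])
example_dice = dice([1, 1, 0, 0], [1, 0, 1, 0])
```

Lower RMSE and higher PSNR and DSC indicate better agreement with the reference.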

평균이동 분할을 이용한 임펄스 잡음제거 (Cleaning Method of Impulse Noise Using Mean Shift Segmentation)

  • 권영만;임명재
    • 한국인터넷방송통신학회논문지 / Vol. 9, No. 6 / pp. 163-168 / 2009
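  • In this paper, we propose an effective method for removing impulse noise using mean shift segmentation. Unlike conventional methods, which filter every pixel in the image, this method uses mean shift segmentation to estimate the locations of impulse noise and filters only at those locations. Experiments measuring the sum of squared errors of the resulting images confirmed that image quality improved and that impulse noise was effectively removed.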
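
The "locate first, then filter only there" idea can be sketched as follows. Note the substitution: the detector below is a simple stand-in that flags pixels deviating strongly from their 3x3 local median, not the mean shift segmentation used in the paper. The threshold, the function names, and the test image are assumptions.

```python
from statistics import median

def window(img, r, c):
    """Values in the 3x3 neighborhood of (r, c), clipped at the image border."""
    h, w = len(img), len(img[0])
    return [img[i][j]
            for i in range(max(0, r - 1), min(h, r + 2))
            for j in range(max(0, c - 1), min(w, c + 2))]

def locate_impulses(img, threshold=50):
    """Stand-in detector: pixels far from their local median are treated as impulses."""
    return {(r, c)
            for r in range(len(img)) for c in range(len(img[0]))
            if abs(img[r][c] - median(window(img, r, c))) > threshold}

def filter_at(img, positions):
    """Median-filter only the given positions; every other pixel is copied unchanged."""
    out = [row[:] for row in img]
    for r, c in positions:
        out[r][c] = median(window(img, r, c))
    return out

def sum_squared_error(a, b):
    """Sum of squared pixel differences between two images."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))

# Hypothetical flat image corrupted by one "salt" and one "pepper" pixel.
clean = [[100] * 5 for _ in range(5)]
noisy = [row[:] for row in clean]
noisy[2][2], noisy[0][0] = 255, 0
spots = locate_impulses(noisy)
restored = filter_at(noisy, spots)
```

Because uncorrupted pixels are never touched, the approach avoids the blurring that a full-image median filter introduces.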

Comparison of Active Contour and Active Shape Approaches for Corpus Callosum Segmentation

  • Adiya, Enkhbolor;Izmantoko, Yonny S.;Choi, Heung-Kook
    • 한국멀티미디어학회논문지 / Vol. 16, No. 9 / pp. 1018-1030 / 2013
  • The corpus callosum is the largest connective structure in the brain, and its shape and size are correlated with sex, age, brain growth and degeneration, handedness, musical ability, and neurological diseases. Manually segmenting the corpus callosum from brain magnetic resonance (MR) images is time-consuming, error-prone, and operator-dependent. In this paper, two semi-automatic segmentation methods are presented: the active contour model-based approach and the active shape model-based approach. We tested these methods on an MR image of the human brain and found that the active contour approach had better segmentation accuracy but was slower than the active shape approach.

영상 영역화를 이용한 영상 부호화 기법 (An Image Coding Technique Using the Image Segmentation)

  • 정철호;이상욱;박래홍
    • 대한전자공학회논문지 / Vol. 24, No. 5 / pp. 914-922 / 1987
  • This paper investigates a segmentation-based image coding technique that uses a simplified description of the regions composing an image. The proposed coding technique consists of 3 stages: segmentation, contour coding, and texture coding. Emphasis is given to texture coding in order to improve image quality. The split-and-merge method was employed for segmentation. In texture coding, linear predictive coding (LPC), along with an approximation technique based on a two-dimensional polynomial function, was used to encode the texture components. The appropriate texture coding technique was selected depending on the size of the region and the mean square error between the original and the reconstructed image. Computer simulation on natural images indicates that acceptable image quality can be obtained at compression ratios as high as 15-25. In comparison with discrete cosine transform coding, the most typical first-generation coding technique, the proposed scheme yields better quality at compression ratios higher than 15-20.
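
As a rough illustration of the split half of split-and-merge segmentation, the sketch below recursively divides a square image into quadrants until each block is homogeneous. The homogeneity test (intensity range within `max_range`), the power-of-two image size, and the function name are assumptions; the merge step and the coding stages are not shown.

```python
def split(img, r, c, size, max_range=0, min_size=1):
    """Return homogeneous square blocks as (row, col, size) tuples.

    A block is split into four quadrants while its intensity range exceeds
    `max_range` and it is larger than `min_size`. Assumes `size` is a power of two.
    """
    values = [img[i][j] for i in range(r, r + size) for j in range(c, c + size)]
    if size <= min_size or max(values) - min(values) <= max_range:
        return [(r, c, size)]
    half = size // 2
    blocks = []
    for dr in (0, half):
        for dc in (0, half):
            blocks += split(img, r + dr, c + dc, half, max_range, min_size)
    return blocks

# Hypothetical 4x4 image whose top-left quadrant is brighter than the rest.
picture = [[10, 10, 0, 0],
           [10, 10, 0, 0],
           [0, 0, 0, 0],
           [0, 0, 0, 0]]
blocks = split(picture, 0, 0, 4)
```

A merge pass would then join adjacent blocks with similar statistics, here the three dark quadrants, into a single region.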

Development of ResNet-based WBC Classification Algorithm Using Super-pixel Image Segmentation

  • Lee, Kyu-Man;Kang, Soon-Ah
    • 한국컴퓨터정보학회논문지 / Vol. 23, No. 4 / pp. 147-153 / 2018
  • In this paper, we propose an efficient WBC 14-Diff classification method based on WBC-ResNet-152, a type of CNN model. The main idea is to use super-pixels to segment the WBC images and ResNet to classify the WBCs. A total of 136,164 blood image samples (224x224) were grouped for image segmentation, training, training verification, and final test performance analysis. Because super-pixel image segmentation yields a different number of images for each class, a weighted average was applied, and the resulting image segmentation error was low at 7.23%. After training 50 times on the training data-set with a soft-max classifier, an average TPR of 80.3% was achieved for the training set of 8,827 images. On this basis, using a verification data-set of 21,437 images, the average 14-Diff classification TPR was 93.4% for normal WBCs and 83.3% for abnormal WBCs. The results and methodology of this research demonstrate the usefulness of artificial intelligence technology in the blood cell image classification field. The WBC-ResNet-152-based morphology approach is shown to be a meaningful and worthwhile method. Based on stored medical data, in-depth diagnosis and early detection of curable diseases are expected to improve the quality of treatment.
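
The two summary statistics mentioned in this abstract can be sketched as below: a count-weighted average of per-class error rates, and the true positive rate (TPR). The exact weighting used by the authors is not stated, so weighting by per-class image count is an assumption, and all numbers are hypothetical.

```python
def weighted_average(values, weights):
    """Average of `values` weighted by `weights` (for example, images per class)."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

def true_positive_rate(true_positives, false_negatives):
    """TPR (recall): fraction of actual positives that were classified correctly."""
    return true_positives / (true_positives + false_negatives)

# Hypothetical per-class segmentation error rates (%) and class sizes.
class_errors = [10.0, 5.0]
class_counts = [1, 3]
overall_error = weighted_average(class_errors, class_counts)
example_tpr = true_positive_rate(true_positives=8, false_negatives=2)
```

Weighting by class size keeps a small class with a high error rate from dominating the overall figure.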

유/무성/묵음 정보를 이용한 TTS용 자동음소분할기 성능향상 (Improvement of an Automatic Segmentation for TTS Using Voiced/Unvoiced/Silence Information)

  • 김민제;이정철;김종진
    • 대한음성학회지:말소리 / No. 58 / pp. 67-81 / 2006
  • For a large corpus of time-aligned data, HMM-based approaches are the most widely used for automatic segmentation, providing a consistent and accurate phone labeling scheme. There are two methods for HMM training. The flat-start method minimizes human intervention but has low accuracy. The bootstrap method has high accuracy but requires manual segmentation. In this paper, a new algorithm is proposed to minimize manual work and to improve the performance of automatic segmentation. In the first phase, each speech data frame is classified as voiced, unvoiced, or silence. In the second phase, the phoneme sequence is dynamically aligned to the voiced/unvoiced/silence sequence according to acoustic-phonetic rules. Finally, using these segmented speech data as a bootstrap, HMM-based phoneme model parameters are trained. For the performance test, the hand-labeled ETRI speech DB was used. The experimental results showed that our algorithm achieved a 10% improvement in segmentation accuracy within a 20 ms tolerable error range. For unvoiced consonants in particular, it showed a 30% improvement.
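
Segmentation accuracy within a tolerance is commonly computed as the share of automatic phone boundaries that fall within the tolerance of the corresponding hand-labeled boundaries. The sketch below assumes the two boundary lists are already paired one-to-one and given in integer milliseconds; the function name and the sample boundaries are hypothetical.

```python
def boundary_accuracy(auto_ms, reference_ms, tolerance_ms=20):
    """Fraction of automatic boundaries within `tolerance_ms` of the reference.

    Assumes both lists have the same length and are paired by index.
    """
    hits = sum(1 for a, r in zip(auto_ms, reference_ms) if abs(a - r) <= tolerance_ms)
    return hits / len(reference_ms)

# Hypothetical boundaries in milliseconds: deviations are 10, 30, 0 and 20 ms.
reference = [100, 250, 400, 620]
automatic = [110, 280, 400, 600]
accuracy = boundary_accuracy(automatic, reference)
```

Here three of the four boundaries are within 20 ms, so the accuracy is 0.75.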

인근지역 범위 설정이 공간회귀모형 적합에 미치는 영향 (The Effects of Neighborhood Segmentation on the Adequacy of a Spatial Regression Model)

  • 이창로;박기호
    • 대한지리학회지 / Vol. 48, No. 6 / pp. 978-993 / 2013
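  • Spatial regression models have a distinct strength over other models in that they explicitly quantify spatial relationships through a spatial weight matrix, but they also have the weakness that constructing the spatial weight matrix involves arbitrary choices. Using Incheon as a case study, this study empirically examined how model fit changes with the construction of the spatial weight matrix. It also examined whether the spatial lag model or the spatial error model performs better depending on how the neighborhood extent is defined. The analysis confirmed that, in estimating land prices, model fit generally improved as the spatial weight matrix was constructed to define the neighborhood more narrowly. In addition, the spatial error model was found to fit better in areas with severe spatial heterogeneity. Subdividing such areas into sub-neighborhoods with homogeneous characteristics mitigated the heterogeneity, and as a result the spatial lag model could show a better fit than the spatial error model.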
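
How the neighborhood extent enters a spatial regression can be seen in a minimal sketch of a distance-band spatial weight matrix and the spatially lagged variable Wy that the spatial lag model uses. The distance-band rule, the row standardization, and the toy coordinates and prices are assumptions for illustration; the study's actual weighting scheme is not given in the abstract.

```python
from math import dist

def distance_band_weights(coords, band):
    """Row-standardized weights: units within `band` of each other are neighbors."""
    n = len(coords)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        neighbors = [j for j in range(n)
                     if j != i and dist(coords[i], coords[j]) <= band]
        for j in neighbors:
            w[i][j] = 1.0 / len(neighbors)  # each row with neighbors sums to 1
    return w

def spatial_lag(w, y):
    """Wy: for each unit, the weighted average of y over its neighbors."""
    return [sum(wij * yj for wij, yj in zip(row, y)) for row in w]

# Hypothetical parcels on a line, with land prices y.
parcels = [(0, 0), (1, 0), (3, 0)]
prices = [10, 20, 30]
narrow = spatial_lag(distance_band_weights(parcels, band=1.5), prices)
wide = spatial_lag(distance_band_weights(parcels, band=2.5), prices)
```

With the narrow band the third parcel has no neighbors and its lag is 0, while the wider band links it to the second parcel; this is the kind of sensitivity to neighborhood definition that the study examines.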

The Role of Post-lexical Intonational Patterns in Korean Word Segmentation

  • Kim, Sa-Hyang
    • 음성과학 / Vol. 14, No. 1 / pp. 37-62 / 2007
  • The current study examines the role of post-lexical tonal patterns of a prosodic phrase in word segmentation. In a word-spotting experiment, native Korean listeners were asked to spot a disyllabic or trisyllabic word in a twelve-syllable speech stream composed of three Accentual Phrases (AP). Words occurred with various post-lexical intonation patterns. The results showed that listeners spotted more words in phrase-initial than in phrase-medial position, suggesting that the AP-final H tone from the preceding AP helped listeners to segment the phrase-initial word in the target AP. Results also showed that listeners' error rates were significantly lower when words occurred with an initial rising tonal pattern, which is the most frequent intonational pattern imposed upon multisyllabic words in Korean, than with non-rising patterns. This result was observed in both AP-initial and AP-medial positions, regardless of the frequency and legality of the overall AP tonal patterns. Tonal cues other than the initial rising tone did not positively influence the error rate. These results not only indicate that a rising tone in AP-initial and AP-final position is a reliable cue for word boundary detection for Korean listeners, but further suggest that phrasal intonation contours serve as a possible word boundary cue in languages without lexical prominence.
