• Title/Abstract/Keywords: Perceptual evaluation

Search results: 248 (processing time: 0.026 s)

음성 향상을 위한 최소값 제어 음성 존재 부정확성의 추적기법 (Minima Controlled Speech Presence Uncertainty Tracking Method for Speech Enhancement)

  • 이우정;장준혁
    • 한국음향학회지 / Vol. 28, No. 7 / pp.668-673 / 2009
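  • This paper proposes a speech enhancement technique based on minima-controlled speech presence uncertainty estimation. Conventional speech presence uncertainty estimation determines a separate a priori speech absence probability for each frame and channel from a simple a posteriori SNR and applies it to the speech absence probability computation. Unlike the conventional speech presence uncertainty tracking method, the proposed algorithm uses a minima-controlled method to obtain a robust estimate of the a priori speech absence probability from the minimum of each frequency component, and applies this estimate to the speech absence probability to enhance the speech. The proposed technique was evaluated with ITU-T P.862 perceptual evaluation of speech quality (PESQ) and gave better results than the conventional speech presence uncertainty tracking method.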
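The minima-controlled idea in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the function name `track_speech_absence_prior`, the smoothing constant `alpha`, the window length, and the thresholds `lo` and `hi` are all hypothetical. Each frequency bin's smoothed power is compared with its recent minimum, and the ratio is mapped linearly to an a priori speech absence probability.

```python
from collections import deque


def track_speech_absence_prior(power_frames, window=8, alpha=0.8, lo=1.0, hi=3.0):
    """Per frame and bin, return a hypothetical a priori speech absence probability.

    power_frames: list of frames, each a list of per-bin power values.
    All parameter values are illustrative assumptions.
    """
    num_bins = len(power_frames[0])
    smoothed = list(power_frames[0])
    # Keep the last `window` smoothed spectra to search for per-bin minima.
    history = deque([list(smoothed)], maxlen=window)
    priors = []
    for frame in power_frames:
        # First-order recursive smoothing of the power in each bin.
        smoothed = [alpha * s + (1.0 - alpha) * p for s, p in zip(smoothed, frame)]
        history.append(list(smoothed))
        row = []
        for k in range(num_bins):
            minimum = min(h[k] for h in history)
            ratio = smoothed[k] / max(minimum, 1e-12)
            # Ratio near 1: power sits at the noise floor, speech is likely absent.
            # Ratio at or above `hi`: speech is likely present.
            q = (hi - ratio) / (hi - lo)
            row.append(min(1.0, max(0.0, q)))
        priors.append(row)
    return priors


frames = [[1.0, 1.0]] * 10 + [[100.0, 1.0]]
priors = track_speech_absence_prior(frames)
print(priors[-1])  # the bin with the burst gets a low speech absence prior
```

The point of the minimum is that it follows the noise floor of each bin separately, so the prior does not depend on a single global SNR threshold.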

최소 통계법과 Short-Term 예측계수 코드북을 이용한 Non-Stationary/Mixed 배경잡음 추정 기법 (Non-Stationary/Mixed Noise Estimation Algorithm Based on Minimum Statistics and Codebook Driven Short-Term Predictor Parameter Estimation)

  • 이명석;노명훈;박성주;이석필;김무영
    • 한국음향학회지 / Vol. 29, No. 3 / pp.200-208 / 2010
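  • To design a noise reduction algorithm that is robust to background noise, this paper proposes combining the minimum statistics (MS) method with the codebook driven short-term predictor parameter estimation (CDSTP) method. MS is robust to stationary background noise but relatively vulnerable to non-stationary background noise. CDSTP is robust to non-stationary background noise but vulnerable to noise environments that are not in the codebook. The paper therefore combines CDSTP, which is robust to non-stationary background noise, with MS, which needs no separate codebook training, to obtain an algorithm that is robust to a variety of background noises. The proposed method achieved better overall perceptual evaluation of speech quality (PESQ) performance than either MS or CDSTP alone, and was particularly robust in mixed environments where stationary and non-stationary background noise occur together.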
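The weakness of MS under non-stationary noise can be illustrated with a small sketch. It assumes the simplest possible form of minimum statistics, a sliding-window minimum over one bin's power, and leaves out the smoothing and bias compensation of a full implementation; the function name and window length are hypothetical.

```python
def minimum_statistics_floor(power, window=5):
    """Sliding-window minimum of a per-frame power sequence for one bin."""
    floor = []
    for t in range(len(power)):
        start = max(0, t - window + 1)
        floor.append(min(power[start:t + 1]))
    return floor


# The noise level jumps from 1.0 to 4.0 at frame 3 (a non-stationary change).
power = [1.0, 1.0, 1.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0]
print(minimum_statistics_floor(power, window=5))
# [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 4.0, 4.0]
```

The estimate only reaches the new level once the old values have left the window, four frames after the jump in this example. That lag is the gap a codebook-based estimator is meant to cover.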

문화기반 산업유산 공간의 장소성 평가 연구 - 북경 798 예술지구를 중심으로 - (A Study on the Evaluation of Placeness of Industrial Heritages Space with Cultural Characteristics - Focused on Beijing 798 Art Zone in China -)

  • 왕발부;장징위;윤지영
    • 한국실내디자인학회논문집 / Vol. 26, No. 1 / pp.101-113 / 2017
  • This study builds a framework and direction for analyzing the evaluation of placeness of industrial heritage spaces, and then evaluates Beijing 798. First, from a review of references, 15 elements in 6 dimensions were derived for evaluating the placeness of industrial heritage and applied to the evaluation of placeness of the Beijing 798 art zone. Second, the changes of the Beijing 798 art zone can be classified into 4 steps (latency, quickening, growth, and union), over which it has grown from artists' studios into a complex cultural art place based on studios and galleries. Third, the place characteristics of the 798 art zone were analyzed from morphological, perceptual, social, visual, functional, and temporal points of view. Fourth, a survey was conducted to evaluate the placeness of the Beijing 798 art zone. In conclusion, the evaluation of placeness of industrial heritage spaces through the Beijing 798 art zone shows that uniqueness and indigenousness are highly valued, which verifies that differentiation from other places and uniqueness are the essential elements.

S-JND 모델을 사용한 주관적인 율 제어 알고리즘 기반의 HEVC 부호화 방법 (A Perceptual Rate Control Algorithm with S-JND Model for HEVC Encoder)

  • 김재련;안용조;임웅;심동규
    • 방송공학회논문지 / Vol. 21, No. 6 / pp.929-943 / 2016
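  • This paper proposes a rate control algorithm based on the S-JND model to account for perceptual quality. To reflect the characteristics of the human visual system, the proposed rate control algorithm uses the S-JND (Saliency-Just Noticeable Difference) model, which is designed to reflect visual sensitivity and visual attention at the same time. During bit allocation in rate control, the S-JND threshold of each CTU (Coding Tree Unit) in the picture is computed. The threshold of each CTU is used to adaptively allocate an appropriate number of bits, so the proposed bit allocation model can improve perceptual quality. To verify its performance, the proposed method was implemented in HM 16.9 and tested on Class B and Class C sequences under the CTC (Common Test Condition) RA (Random Access), Low-delay B, and Low-delay P configurations. In the experiments, compared with the existing rate control algorithm, the proposed method reduced the bit rate by 2.3% on average, improved BD-PSNR by about 0.07 dB, and increased bit accuracy by about 0.06%. Measured with the DSCQS (Double Stimulus Continuous Quality Scale) method, the proposed method showed a 0.03 MOS (Mean Opinion Score) improvement over the existing method.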
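How a per-CTU threshold might steer bit allocation can be sketched as below. The abstract does not give the allocation formula, so the inverse-threshold weighting here is an assumption chosen for illustration, and `allocate_ctu_bits` is a hypothetical name: a CTU with a lower threshold (where distortion becomes visible sooner) receives a larger share of the picture's bit budget.

```python
def allocate_ctu_bits(picture_bits, jnd_thresholds):
    """Split a picture-level bit budget across CTUs.

    Assumption: the weight of each CTU is the inverse of its (positive)
    S-JND threshold. This is not the weighting used in the paper.
    """
    weights = [1.0 / t for t in jnd_thresholds]
    total = sum(weights)
    return [picture_bits * w / total for w in weights]


# Three CTUs; the first has the lowest threshold, so it gets the most bits.
print(allocate_ctu_bits(1200, [2.0, 4.0, 4.0]))  # [600.0, 300.0, 300.0]
```

Normalizing by the sum of weights keeps the total equal to the picture budget, which matters for a rate control scheme whose bit accuracy is being measured.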

CIECAM02에서의 밝기 분포 기반 모바일 디스플레이의 인지적 대비 (Perceptual Contrast based on Distribution of Brightness in CIECAM02 for Mobile Display)

  • 남의원;경왕준;하호건;하영호
    • 전자공학회논문지 / Vol. 52, No. 2 / pp.141-147 / 2015
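  • Display contrast is generally expressed as the ratio of the display's maximum brightness to its minimum brightness. However, this contrast ratio considers only the physical characteristics of the display and not the perceptual characteristics of human vision, so it does not match perceived contrast. This paper proposes a contrast measurement method that considers the perceptually distinguishable brightness levels within the display's brightness range. First, to measure the perceived brightness range of the display, the length between the maximum and minimum brightness is computed in the CIECAM02 color space. Next, based on the Weber-Fechner law, the range of brightness perceived as identical is determined at each brightness level, and the number of colors within each range is counted. Finally, the perceptually distinguishable brightness is computed from the ratio between the number of colors in each identically perceived brightness range and the perceptual contrast length. In subjective experiments, the proposed method agreed better with subjective perceived-contrast results than previous perceptual contrast measures.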
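The gap between a physical contrast ratio and a perceptual measure can be made concrete with a simplified sketch. It assumes raw luminance in cd/m² and a constant Weber fraction, whereas the paper works with CIECAM02 brightness; the function names and the 2% fraction are illustrative assumptions. Under Weber's law each just-distinguishable step multiplies luminance by (1 + k), so the number of steps between the minimum and maximum is ln(Lmax/Lmin) / ln(1 + k).

```python
import math


def contrast_ratio(l_max, l_min):
    """Physical contrast ratio: maximum over minimum luminance."""
    return l_max / l_min


def weber_steps(l_max, l_min, weber_fraction=0.02):
    """Count of distinguishable luminance steps, assuming a constant Weber fraction."""
    return math.floor(math.log(l_max / l_min) / math.log(1.0 + weber_fraction))


print(contrast_ratio(1000.0, 1.0))  # 1000.0
print(weber_steps(1000.0, 1.0))     # 348
```

A thousandfold physical ratio corresponds to only a few hundred distinguishable steps under this assumption, which is why a ratio of extremes alone is a poor proxy for perceived contrast.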

스펙트로그램을 이용한 내전형 연축성 발성 장애와 근긴장성 발성 장애의 감별 (Differentiation of Adductor-Type Spasmodic Dysphonia from Muscle Tension Dysphonia Using Spectrogram)

  • 노승호;김소연;조재경;이상혁;진성민
    • 대한후두음성언어의학회지 / Vol. 28, No. 2 / pp.100-105 / 2017
  • Background and Objectives: Adductor-type spasmodic dysphonia (ADSD) is a neurogenic disorder and a focal laryngeal dystonia, while muscle tension dysphonia (MTD) is a functional voice disorder. Both ADSD and MTD may be associated with excessive supraglottic contraction and compensation, resulting in a strained voice quality with spastic voice breaks. The aim of this study was to determine the utility of spectrogram analysis in differentiating ADSD from MTD. Materials and Methods: From 2015 through 2017, 17 patients with ADSD and 20 with MTD who underwent acoustic recording and phonatory function studies were enrolled. Jitter (frequency perturbation) and Shimmer (amplitude perturbation) were obtained using MDVP (Multi-dimensional Voice Program), and the GRBAS scale was used for perceptual evaluation. Two speech therapists evaluated a wide-band (11,250 Hz) spectrogram in a blind test using a 4-point scale (0-3 points) for four spectral findings: abrupt voice breaks, irregular wide-spaced vertical striations, well-defined formants, and high-frequency spectral noise. Results: Jitter, Shimmer, and GRBAS did not differ between the two groups, with no significant correlation (p>0.05). Abrupt voice breaks and irregular wide-spaced vertical striations were significantly higher in ADSD than in MTD, with strong correlation (p<0.01). High-frequency spectral noise was higher in MTD than in ADSD, with strong correlation (p<0.01). Well-defined formants did not differ between the two groups. Conclusion: The visual perceptual information provided by wide-band spectrograms can differentiate ADSD from MTD. Spectrogram analysis is a useful diagnostic tool for differentiating ADSD from MTD where perceptual analysis and clinical evaluation alone are insufficient.
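The two perturbation measures named in this abstract can be sketched as follows. This uses the common "local" definition (mean absolute difference between consecutive cycles, relative to the mean, in percent) and is not MDVP's implementation; the function name and the sample values are hypothetical.

```python
def local_perturbation(values):
    """Mean absolute cycle-to-cycle difference relative to the mean, in percent.

    Applied to pitch periods this gives local jitter; applied to
    peak amplitudes it gives local shimmer.
    """
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


periods_ms = [9.0, 11.0, 9.0, 11.0]  # hypothetical pitch periods
amplitudes = [0.5, 0.5, 0.5, 0.5]    # hypothetical peak amplitudes
print(local_perturbation(periods_ms))  # jitter: 20.0
print(local_perturbation(amplitudes))  # shimmer: 0.0
```

Both measures average over the whole recording, so they summarize overall irregularity rather than localized events such as abrupt voice breaks.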

구개열 화자의 과다비성 감소를 위한 CPAP 치료 효과 연구 (Efficacy of CPAP (Continuous Positive Airway Pressure) Therapy on Reducing the Degree of Hypernasality in Speakers with Repaired Cleft Palate)

  • 하승희;정승은;고경석
    • 말소리와 음성과학 / Vol. 4, No. 3 / pp.171-177 / 2012
  • The purpose of this study was to investigate whether CPAP therapy is effective for reducing the degree of hypernasality in individuals with repaired cleft palate and whether its efficacy is maintained. Five individuals with cleft palate participated in an 8-week home-based CPAP program. Results from perceptual evaluation of hypernasality and nasalance scores before and after CPAP therapy and at the follow-up speech evaluation were compared. The results showed that responses to CPAP therapy varied among individuals. Three individuals exhibited reductions in the degree of perceived hypernasality, while nasalance scores decreased in all individuals after the therapy. The effect of CPAP therapy was generally maintained until approximately three months after the completion of therapy.

아파트 입면계획에서 시각적 디자인 이미지에 관한 연구 (A Study on the Visual Elevation Image of Apartment Buildings)

  • 손세욱;구시온
    • 한국주거학회논문집 / Vol. 10, No. 2 / pp.247-257 / 1999
  • This study aims to propose an evaluation model for forecasting the visual quality of apartment buildings, which would be a useful tool for developing architectural concepts that correspond to user needs. The study was carried out through four experimental steps. The first step was to identify the user's visual evaluation constructs; to do this, 22 adjective phrases applicable to all apartment buildings were extracted. The second step was to analyze the user's visual preference, measured as the user's psychological quantity for the apartment buildings by the S.D. (Semantic Differential) method. The third step was to analyze the five psychological factors obtained from factor analysis; the perceptual images of the 41 experimental subjects were checked by evaluating and analyzing the factor scores of each subject for each psychological factor. The fourth step was to analyze the similarity of various characteristics of a building, as mirrored in the user's psychological quantity, and how buildings are grouped by it.

임상 신경심리학적 평가 (Clinical Neuropsychological Evaluation)

  • 오병훈
    • 생물정신의학 / Vol. 2, No. 1 / pp.28-37 / 1995
  • Clinical neuropsychology, which belongs to the field of neuroscience, is concerned with the relationship between human behavior and brain structure. It has grown into a specialized, separate field within psychology over the last twenty years. Clinical neuropsychology offers an objective methodology for considering the mind-body interaction and for evaluating the behavioral consequences and functional deficits associated with brain lesions. Clinical neuropsychological assessment covers cognitive, perceptual, motor, and emotional function through various neuropsychological examinations, such as the Halstead-Reitan and Luria-Nebraska batteries, and computerized neuropsychological tests, such as the PCIS Vienna Test System and Stim. The goals of neuropsychological evaluation are to identify neuropsychological dysfunctions, to develop, execute, and monitor treatment plans, and to design rehabilitation programs. Recently, the number of neuropsychiatric patients has been increasing, and 15-20% of acute psychiatric patients suffer from organic mental problems. Moreover, clinical neuropsychology has an increasingly important role in both neurobehavioral foundations and clinical application. Psychiatrists must therefore play a major role in the development of clinical neuropsychology in psychiatry.

VoIP 코더들의 프레임손실은닉 알고리즘 성능평가 (Performance Evaluation of Frame Erasure Concealment Algorithms in VoIP Coders)

  • 한승호;문광;한민수
    • 대한음성학회:학술대회논문집 / Proceedings of the 2004 Spring Conference of 대한음성학회 / pp.235-238 / 2004
  • Frame erasures cause speech quality degradation in wireless communication networks and packet networks. The degradation becomes worse when consecutive frame erasures occur. Speech coders have a frame erasure concealment (FEC) mechanism to compensate for frame erasures, and it is meaningful to evaluate how FEC mechanisms perform under the frame erasures that occur in communication networks. In this paper, various frame erasure patterns are designed, and the FEC algorithms of speech coders are evaluated and analyzed with the Perceptual Evaluation of Speech Quality (PESQ). The performance is found to vary with frame erasure type, frame erasure rate, and utterance length.
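One way to design erasure patterns that differ in type and rate can be sketched as below. The abstract does not describe how the paper's patterns were built, so this generator is an assumption for illustration: `erasure_pattern`, its slot-based placement, and the fixed seed are all hypothetical. It marks isolated erasures when `burst_length` is 1 and consecutive erasures otherwise, at a target overall rate.

```python
import random


def erasure_pattern(num_frames, erasure_rate, burst_length=1, seed=0):
    """Return a list of booleans; True marks an erased frame.

    Bursts are placed on a grid of burst_length-sized slots, so they never
    overlap, although two adjacent slots can form a longer run.
    """
    rng = random.Random(seed)
    num_bursts = round(num_frames * erasure_rate / burst_length)
    slots = num_frames // burst_length
    pattern = [False] * num_frames
    for slot in rng.sample(range(slots), num_bursts):
        start = slot * burst_length
        for i in range(start, start + burst_length):
            pattern[i] = True
    return pattern


isolated = erasure_pattern(100, 0.1)                 # 10 single-frame erasures
bursty = erasure_pattern(100, 0.1, burst_length=2)   # 5 two-frame bursts
print(sum(isolated), sum(bursty))  # 10 10
```

Keeping the overall rate equal while changing the burst length separates the effect of erasure type from the effect of erasure rate when the concealed output is later scored.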