통합 검색 | Korea Science

고음질을 갖는 음색변경에 관한 연구 (A Study on the Voice Conversion Algorithm with High Quality)

박형빈;배명진
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
- /
- pp.157-160
- /
- 2000
In the generally a voice conversion has used VQ(Vector Quantization) for partitioning the spectral feature and has performed by adding an appropriate offset vector to the source speaker's spectral vector. But there is not represented the target speaker's various characteristics because of discrete characteristics of transformed parameter. In this paper, these problems are solved by using the LMR(Linear Multivariate Regression) instead of the mapping codebook which is determined to the relationship of source and target speaker vocal tract characteristics. Also we propose the method for solved the discontinuity which is caused by applying to time aligned parameters using Dynamic Time Warping the time or pitch-scale modified speech. In our proposed algorithm for overcoming the transitional discontinuities, first of all, we don't change time or pitch scale and by using the LMR change a speaker's vocal tract characteristics in speech with non-modified time or pitch. Compared to existed methods based on VQ and LMR, we have much better voice quality in the result of the proposed algorithm.
PDF

연속음성중 키워드(Keyword) 인식을 위한 Binary Clustering Network (Binary clustering network for recognition of keywords in continuous speech)

최관선;한민홍
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1993년도 한국자동제어학술회의논문집(국내학술편); Seoul National University, Seoul; 20-22 Oct. 1993
- /
- pp.870-876
- /
- 1993
This paper presents a binary clustering network (BCN) and a heuristic algorithm to detect pitch for recognition of keywords in continuous speech. In order to classify nonlinear patterns, BCN separates patterns into binary clusters hierarchically and links same patterns at root level by using the supervised learning and the unsupervised learning. BCN has many desirable properties such as flexibility of dynamic structure, high classification accuracy, short learning time, and short recall time. Pitch Detection algorithm is a heuristic model that can solve the difficulties such as scaling invariance, time warping, time-shift invariance, and redundance. This recognition algorithm has shown recognition rates as high as 95% for speaker-dependent as well as multispeaker-dependent tests.
PDF

거리 행렬 연산 구조 최적화를 위한 확산 동적 시간 왜곡(Diffusive DTW) 알고리즘 (Diffusive DTW Algorithm for Optimizing Distance Matrix Computation Structure)

김영탁;진교홍
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2022년도 추계학술대회
- /
- pp.93-96
- /
- 2022
DTW는 길이가 서로 다른 시퀀스 사이의 간격을 제거하고 패턴의 유사성을 알아낼 수 있지만, 시공간 복잡성 때문에 대규모 데이터셋에서 많은 계산 비용이 필요로 한다. 본 논문에서는 계산 비용을 줄일 뿐만 아니라 결괏값의 오차도 없는 DDTW 알고리즘을 제안한다. 그리고 시퀀스의 길이에 따른 연산 시간을 측정하여 DTW와 DDTW의 알고리즘 복잡도를 비교한다. 시뮬레이션 결과 DTW에 비해 DDTW에서 연산 시간이 눈에 띄게 줄어듦을 확인하였다.
PDF

회전하는 복합재-VEM 박판보의 GHM 기법을 이용한 진동해석 (The Vibration Analysis of Composite-VEM Thin-Walled Rotating Beam Using GHM Methodology)

박재용;나성수
- 한국소음진동공학회:학술대회논문집
- /
- 한국소음진동공학회 2004년도 춘계학술대회논문집
- /
- pp.337-341
- /
- 2004
This paper concerns the analytical modeling and dynamic analysis of advanced rotating blade structure implemented by a dual approach based on structural tailoring and viscoelastic materials technology. Whereas structural tailoring uses the directionality properties of advanced composite materials, the passive materials technology exploits the damping capabilities of viscoelastic material(VEM) embedded into the host structure. The structure is modeled as a composite thin-walled beam incorporating a number of nonclassical features such as transverse shear, warping restraint, anisotropy of constituent materials, and warping and rotary inertias. The VEM layer damping treatment is modeled by using the Golla-Mushes-McTavish(GHM) method, which is employed to account for the frequency-dependent characteristic o the VEM. The displayed numerical results provide a comprehensive picture of the synergistic implications of the application of both techniques, namely, the tailoring and damping technology on vibration response of thin-walled beam structure exposed to external time-dependent excitations.
PDF

DSP Processor(TMS320C32)를 이용한 화자인증 보안시스템의 구현 (Implementation of Speaker Verification Security System Using DSP Processor(TMS320C32))

함영준;권혁재;최수영;정익주
- 산업기술연구
- /
- 제21권B호
- /
- pp.107-116
- /
- 2001
The speech includes various kinds of information : language information, speaker's information, affectivity, hygienic condition, utterance environment etc. when a person communicates with others. All technologies to utilize in real life processing this speech are called the speech technology. The speech technology contains speaker's information that among them and it includes a speech which is known as a speaker recognition. DTW(Dynamic Time Warping) is the speaker recognition technology that seeks the pattern of standard speech signal and the similarity degree in an inputted speech signal using dynamic programming. ln this study, using TMS320C32 DSP processor, we are to embody this DTW and to construct a security system.
PDF

온라인 서명자동인식을 위한 개선된 DTW (The Modified DTW Method for on-line Automatic Signature Verification)

조동욱;배영래
- 정보처리학회논문지B
- /
- 제10B권4호
- /
- pp.451-458
- /
- 2003
Dynamic Programming Matching(DPM)은 순차적으로 구성된 문제를 수학적으로 최적화 시키는 기술로서 패턴인식 분야에서 다년간 중요한 역할을 해왔다. 서명인식을 위한 대부분의 실제적 적용에서는 Sakoe and Chiba [9]의 실제구현 버전이 기반이 되어 왔는데, 일반적으로 slope constraint p = 0의 방법이 적용되어 왔다. 이 논문에서는 이 경우에는 전진탐색에 의한 휴리스틱한 방법을 적용한 MDPM이 상당한 처리 시간의 단축 뿐만 아니라 약간의 인식능력 향상을 가질 수 있음을 보여준다.
https://doi.org/10.3745/KIPSTB.2003.10B.4.451 인용 PDF KSCI

불변 모멘트를 이용한 DSTW 기반의 동적 손동작 인식 방법 (Recognition of Dynamic Hand Gestures based on DSTW using Invariant Moments)

지재영;장경현;박기태;문영식
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송공학회 2009년도 추계학술대회
- /
- pp.273-276
- /
- 2009
본 논문에서는 Dynamic Space Time Warping(DSTW) 알고리즘을 이용하여 손동작을 다양한 배경에서도 정확하게 인식할 수 있는 방법을 제안한다. DSTW 알고리즘을 이용한 기존의 손동작 인식 방법은 질의영상의 매 프레임 마다 검출된 다수의 손 후보영역을 사용하여 모델영상과 시간 축 상으로 비교하는 방법이다. 그러나 기존의 DSTW 알고리즘을 이용한 손동작 인식 방법은 손을 포함하지 않은 후보영역들(배경, 팔꿈치 등)에 의해 오인식될 수 있는 경로를 생성하며, 그 결과로 사용자가 의도하지 않은 손동작으로 인식될 수 있다. 이러한 단점을 해결하기 위해서, 본 논문에서는 손 후보영역의 불변 모멘트를 이용하여 질감 정보를 추출한 후 후보영역들 사이의 유사도를 비교하였다. 제안한 방법은 유사도를 모델과 질의의 매칭비용에 가중치로 적용하였고, 다양한 실험 결과 제안한 방법이 기존의 방법에 비해 사용자의 손동작을 정확하게 인식하는 것을 확인하였다.
PDF

점탄성-복합재 박판 블레이드 구조물의 진동 해석 (Dynamic Analysis of Viscoelastic Composite Thin-Walled Blade Structures)

신재현;나성수;박철휴
- 대한기계학회:학술대회논문집
- /
- 대한기계학회 2003년도 추계학술대회
- /
- pp.1684-1689
- /
- 2003
This paper concerns the analytical modeling and dynamic analysis of advanced cantilevered blade structure implemented by a dual approach based on structural tailoring and viscoelastic materials technology. Whereas structural tailoring uses the directionality properties of advanced composite materials, the passive materials technology exploits the damping capabilities of viscoelastic material(VEM) embedded into the host structure. The structure is modeled as a composite thin-walled beam incorporating a number of nonclassical features such as transverse shear, secondary warping, anisotropy of constituent materials, and rotary inertias. The case of VEM spreaded over the entire span of the structure is considered. The displayed numerical results provide a comprehensive picture of the synergisitic implications of the application of both techniques, namely, the tailoring and damping technology on vibration response of thin-walled beam structure exposed to external time-dependent excitations.
PDF

음성 질의 기반 디지털 사진 검색 기법 (A Query-by-Speech Scheme for Photo Albuming)

김태성;서영주;이용주;김회린
- 대한음성학회지:말소리
- /
- 제57호
- /
- pp.99-112
- /
- 2006
In this paper, we introduce two retrieval methods for photos with speech documents. We compare the pattern of speech query with those of speech documents recorded in digital cameras, and measure the similarities, and retrieve photos corresponding to the speech documents which have high similarity scores. As the first approach, a phoneme recognition scheme is used as the pre-processor for the pattern matching, and in the second one, the vector quantization (VQ) and the dynamic time warping (DTW) are applied to match the speech query with the documents in signal domain itself. Experimental results show that the performance of the first approach is highly dependent on that of phoneme recognition while the processing time is short. The second method provides a great improvement of performance. While the processing time is longer than that of the first method due to DTW, but we can reduce it by taking approximated methods.
PDF

다중 시계열 패턴인식을 이용한 반도체 생산장치의 지능형 감시시스템 (An Intelligent Monitoring System of Semiconductor Processing Equipment using Multiple Time-Series Pattern Recognition)

이중재;권오범;김계영
- 정보처리학회논문지D
- /
- 제11D권3호
- /
- pp.709-716
- /
- 2004
본 논문에서는 다중 시계열 패턴인식 사용하여 생산장치의 상태자료부터 공정결과를 예측하여 정상 또는 비정상을 판정하는 지능형 감시시스템에 관하여 기술한다. 제안하는 감시스템은 초기화, 학습 그리고 인식의 세 단계로 구성된다. 초기화 단계에서는 감시대상의 생산장치가 가지는 인사들 각각의 가중치와 각 인자들이 가지는 시계열 자료 중에서 학습과 인식에 유효단계를 설정한다. 학습단계에서는 LBG알고리즘을 사용하여 이 생산장치에 의하여 생성되고 수집된 패턴들을 군집화 한다. 각 패턴은 시계열 형태의 자료와 처리 완료 후 계측기에 의하여 측정된 ACI로 구성된다. 인식단계에서는 DTW를 사용하여 실시간으로 입력된 패턴과 군집화된 패턴들 사이의 대응을 수행하여 가장 잘 정합되는 패턴을 찾는다. 다음은 이 패턴이 가지는 ACI, 차 그리고 가중치들의 조합으로 예측된 ACI 값을 산출한다. 최종적으로 예측된 ACI가 정상으로 수용할 수 있는 값 범위에 없는지 여부를 결정한다. 제안하는 시스템의 성능평가를 위하여 식각장치로부터 획득된 자료를 대상으로 실험하였다. 실험결과에서는 학습횟수가 증가함에 따라 예측 ACI값과 실측ACI값 사이의 오차가 현저히 감소함을 볼 수 있다
https://doi.org/10.3745/KIPSTD.2004.11D.3.709 인용 PDF KSCI

검색결과 188건 처리시간 0.028초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)