Search | Korea Science

A Study on Noise-Robust Speaker Recognition Methods Based on Ensemble of Decision Scores (앙상블 기법을 이용한 잡음 환경에서의 화자인식 방법에 관한 연구)

Yang, Joon-Young;Chang, Joon-Hyuk
- Proceedings of the Korea Information Processing Society Conference / 2018.05a / pp.457-459 / 2018
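Speaker recognition is a technology that determines whether two given utterances come from the same speaker and thereby identifies the speaker of an arbitrary input utterance from a list of enrolled speakers. However, when background noise or reverberation is present, the speech signal is distorted and recognition performance can degrade, so a separate speech preprocessing algorithm may be used alongside it. This paper proposes a speaker verification score ensemble method using a parametric multi-channel Wiener filter (PMWF) for performing speaker recognition on speech collected through multiple microphones in an environment with background noise. The combination coefficient used for score fusion is set from the signal-to-noise ratio of the input signal, and the score obtained after denoising with a Wiener filter is combined by weighted sum with the score obtained after denoising with a minimum variance distortionless response (MVDR) beamformer. Measured by equal error rate, the combined score outperforms the scores computed with each preprocessing algorithm used independently.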
https://doi.org/10.3745/PKIPS.y2018m05a.457
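
The SNR-dependent weighted score combination described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the coefficient is derived from the SNR, so the logistic mapping, its `midpoint_db` and `slope` parameters, and the choice of which score is favored at high SNR are all assumptions.

```python
import math


def fusion_weight(snr_db, midpoint_db=10.0, slope=0.5):
    """Map an input SNR in dB to a weight in (0, 1).

    Hypothetical logistic mapping; the abstract only states that the
    coefficient is set from the SNR, not how.
    """
    return 1.0 / (1.0 + math.exp(-slope * (snr_db - midpoint_db)))


def fuse_scores(score_wiener, score_mvdr, snr_db):
    """Weighted sum of two verification scores for the same trial."""
    w = fusion_weight(snr_db)  # assumed: higher SNR favors the Wiener-filter score
    return w * score_wiener + (1.0 - w) * score_mvdr
```

At the assumed midpoint SNR the two scores are averaged equally; away from it, one preprocessing path dominates the fused score.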

A Method of Containing Additional Visual Information in Motioncode (모션코드를 위한 부가적 시각 정보 포함 방법)

Park, Hyungkun;Lee, Yillbyung
- Proceedings of the Korea Information Processing Society Conference / 2012.04a / pp.460-463 / 2012
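In a ubiquitous network environment, communication between objects is very important. This communication is generally carried out over online networks so that more information can be delivered. In contrast, data communication over offline networks in practice allows only a very small amount of information to be transferred between objects. At present, using previously proposed tag interfaces is almost the only way to transfer data between objects over an offline network, even if the amount is tiny. Among existing tag interfaces, the QR code, a two-dimensional image code, can additionally carry intuitive visual information by embedding it in the tag symbol itself. However, because of limited physical space and limited error-correction capability, what can be expressed when additional visual information is embedded inside the tag is extremely restricted. Motioncode is a new dynamic tag interface proposed to hold more data so that more information can be conveyed between objects over an offline network. This paper proposes methods and conditions for including various forms of additional visual information in a Motioncode without distorting the Motioncode itself.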
https://doi.org/10.3745/PKIPS.y2012m04a.460

Supervised learning framework using Web-Videos (Web-Videos를 사용한 Supervised Learning Framework)

Na, Seong-Won;Lee, Ye-Gi;Yoon, Kyoung-ro
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2019.06a / pp.95-97 / 2019
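This paper proposes a supervised learning framework that uses video data. Following their recent success, deep convolutional neural networks (DCNNs) are used in many fields. One important factor in DCNN model performance is building a large-scale dataset; if a model is trained on a small-scale dataset, overfitting and generalization error are hard to resolve. Approaches such as enlarging the dataset through image distortion or applying Dropout have been used to address this, but when the original data is scarce the model still struggles to generalize. To compensate, this paper proposes a method that extracts frames related to the target class from videos obtained from the Web, making it easier to expand the dataset and improve model performance.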

A Study on Improving Power Efficiency and Image Quality of LCD TVs (LCD TV 의 전력 효율 개선과 화질 왜곡 개선 방법에 관한 연구)

Jung, Hyedong;Choi, Beom Seok;Suh, Doug Young
- Proceedings of the Korea Information Processing Society Conference / 2010.04a / pp.482-485 / 2010
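Advances in LED technology have led to extensive research on local dimming in LCDs to improve contrast ratio and reduce power consumption. Existing implementations of local dimming do not consider the brightness contributed by neighboring (adjacent) blocks, so they over-control brightness and are inaccurate. Moreover, even with local dimming, the limited controllable range of the backlight unit (BLU) leaves much room to improve control efficiency. This study proposes a Cooperative Dimming method that takes surrounding brightness into account for accurate backlight control, and presents a control scheme that satisfies the maximum pixel brightness as far as possible, minimizing error while consuming the least power possible.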
https://doi.org/10.3745/PKIPS.y2010m04a.482

A Method of Performance Development in 3-path Underwater Channel (3-path 수중채널에서 통신 성능 향상 기법)

Kim, Nam-soo;Kim, Min-hyuk;Park, Tae-doo;Kim, Chul-seung;Jung, Ji-won
- Proceedings of the Korea Information Processing Society Conference / 2009.04a / pp.1303-1306 / 2009
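Underwater communication is hampered by multipath propagation, caused by reflections of the signal from the sea surface, the sea floor, and other boundaries, which distorts the signal. This paper therefore proposes a correction technique that uses the underwater channel transfer function to correct errors caused by multipath, and simulation results confirm that performance is better with the proposed technique than without it.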
https://doi.org/10.3745/PKIPS.y2009m04a.1303
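
To illustrate the idea of correcting multipath distortion with a known channel model, the sketch below simulates a 3-path channel as a sum of delayed, scaled copies of the signal and then undoes it with a recursive inverse filter. The abstract does not describe the authors' actual correction method, so this is a stand-in technique, and the delays and gains are hypothetical values chosen so that the inverse is stable (the echo gains sum to less than the direct-path gain).

```python
def three_path_channel(x, delays=(0, 3, 7), gains=(1.0, 0.5, 0.25)):
    """Sum of three delayed, scaled copies of x (e.g. direct, surface, bottom paths).

    Delays (in samples) and gains are hypothetical illustration values.
    """
    y = [0.0] * len(x)
    for d, g in zip(delays, gains):
        for n in range(d, len(x)):
            y[n] += g * x[n - d]
    return y


def invert_channel(y, delays=(0, 3, 7), gains=(1.0, 0.5, 0.25)):
    """Recursive inverse filter, assuming the first path has zero delay.

    Each echo is subtracted using samples that were already recovered.
    """
    x_hat = [0.0] * len(y)
    for n in range(len(y)):
        acc = y[n]
        for d, g in zip(delays[1:], gains[1:]):
            if n >= d:
                acc -= g * x_hat[n - d]
        x_hat[n] = acc / gains[0]
    return x_hat
```

In this noiseless setting with exactly known path parameters the transmitted sequence is recovered; a real system would have to estimate the channel and cope with noise.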

Sampling Bias of Discontinuity Orientation Measurements for Rock Slope Design in Linear Sampling Technique : A Case Study of Rock Slopes in Western North Carolina (선형 측정 기법에 의해 발생하는 불연속면 방향성의 왜곡 : 서부 North Carolina의 암반 사면에서의 예)

박혁진
- Journal of the Korean Geotechnical Society / v.16 no.1 / pp.145-155 / 2000
Orientation data of discontinuities are of paramount importance for rock slope stability studies because they control the possibility of unstable conditions or excessive deformation. Most orientation data are collected by using linear sampling techniques, such as borehole fracture mapping and the detailed scanline method (outcrop mapping). However, data acquired by these linear sampling techniques are subject to bias owing to the orientation of the sampling line. Even though a weighting factor is applied to orientation data in order to reduce this bias, the bias will not be significantly reduced when certain sampling orientations are involved. That is, if the linear sampling orientation is nearly parallel to the discontinuity orientation, most discontinuities that are parallel to the sampling line will be excluded from the survey result. This phenomenon can cause serious misinterpretation of discontinuity orientation data because critical information is omitted. In the case study, orientation data collected by using the borehole fracture mapping method (vertical scanline) were compared to those from the detailed scanline method (horizontal scanline). Differences in results for the two procedures raised the concern that a representative orientation of discontinuities was not obtained. Equal-area, polar stereonets were used to determine the distribution of dip angles and to compare the data distribution for the borehole method with that for the scanline method.
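
The weighting factor mentioned above is not specified in the abstract. A common choice for linear sampling is the correction attributed to Terzaghi, which weights each observation by 1/sin(α), where α is the angle between the sampling line and the discontinuity plane. The sketch below assumes that form; the `max_weight` cap near the blind zone is a hypothetical illustration value.

```python
import math


def terzaghi_weight(angle_deg, max_weight=10.0):
    """Weight 1/sin(angle) for one observed discontinuity.

    angle_deg is the angle between the scanline (or borehole axis) and the
    discontinuity plane. The weight is capped at max_weight (hypothetical
    value) because it diverges as the plane becomes parallel to the line.
    """
    s = abs(math.sin(math.radians(angle_deg)))
    if s < 1.0 / max_weight:
        return max_weight
    return 1.0 / s


def weighted_count(observed_count, angle_deg):
    """Bias-corrected frequency for a set observed at the given angle."""
    return observed_count * terzaghi_weight(angle_deg)
```

A set that is nearly parallel to the sampling line and was never intersected has an observed count of zero, so its weighted count stays zero whatever the weight. This is the omission the abstract warns about: weighting cannot restore data that were not sampled.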

Model Simulation for Assessment of Image Acquisition Errors Affecting Electron Tomography (영상 자료 획득시의 오류가 전자토모그래피 결과에 미치는 영향 고찰-모델 시뮬레이션을 중심으로)

Jou, Hyeong-Tae;Lee, Su-Jeong;Kim, Youn-Joong;Suk, Bong-Chool
- Applied Microscopy / v.38 no.1 / pp.51-61 / 2008
This simulation study examined the effect of data acquisition errors, including the data type of the TEM image and the incident beam intensity of the tilt series, on 3D tomograms. Simulations were performed with the 3D head phantom model of Kak and Slaney and with a slightly modified 3D head phantom model with enhanced differences in absorption coefficients. The tomogram reconstructed for the original head phantom model from 8-bit gray-scale images was distorted by an extremely high level of noise, while an acceptable result was obtained for the modified model. The results for the original model using an incorrect formulation of the transmitted beam intensity proved to be incorrect, and a high level of noise along the z direction was found for the modified model. A wrong value of the incident beam intensity gave distorted results for both models. To reconstruct an artifact-free 3D structure from projections with invisible features in electron tomography, 16-bit projection images should be used together with the correct incident beam intensity, as applied in Beer's law.
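
The role of bit depth and Beer's law can be illustrated with a small numerical sketch. Beer's law gives the transmitted intensity as I = I0·exp(-μt), and the projection value used for reconstruction is p = -ln(I/I0). The absorption coefficient, thickness, and quantization scheme below are hypothetical and are not the paper's phantom parameters.

```python
import math


def transmitted_intensity(i0, mu, thickness):
    """Beer's law: I = I0 * exp(-mu * thickness)."""
    return i0 * math.exp(-mu * thickness)


def quantize(value, full_scale, bits):
    """Round value to the nearest of 2**bits - 1 steps over [0, full_scale]."""
    levels = (1 << bits) - 1
    return round(value / full_scale * levels) / levels * full_scale


def recovered_projection(i0, mu, thickness, bits):
    """Quantize the transmitted intensity, then invert Beer's law."""
    i_q = quantize(transmitted_intensity(i0, mu, thickness), i0, bits)
    return -math.log(i_q / i0)


# Weakly absorbing feature (hypothetical values): true projection is mu * t = 0.001
p8 = recovered_projection(1.0, 0.001, 1.0, bits=8)
p16 = recovered_projection(1.0, 0.001, 1.0, bits=16)
```

With these assumed values the 8-bit intensity rounds to full scale, so the recovered projection is zero and the feature vanishes, while the 16-bit projection stays close to the true value. The same inversion also shows why a wrong I0 distorts every projection: it adds a constant offset of ln(I0_wrong / I0) to p.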

Teaching Democracy in Indonesian Civic Education Textbook (인도네시아 시민윤리교육 교과서에서의 민주주의 교육)

KIM, Hyun Kyoung
- The Southeast Asian review / v.27 no.3 / pp.1-47 / 2017
This paper examines how democracy is taught in Indonesian civic education at the secondary school level by analyzing textbook content concerning democracy. First, the study takes freedom, rights, unity, and stability as keywords and characterizes the description of democracy by looking at how each keyword is explained in the textbooks. The analysis shows that Indonesian democracy can be described as "Pancasila democracy" and that the textbooks tend to emphasize unity and stability, differentiating themselves from "liberal democracy" and "liberalism." Freedom in the textbooks can be interpreted in the context of organic statism, in which the state and its interests take precedence over individuals. This viewpoint is rooted in Indonesia's historical context. However, when the textbooks describe Indonesian democracy and its values, they cover democratic principles, "the freedom of opposition", "negative freedom", and natural rights. The study interprets the coexistence of these two contrasting concepts, a relative emphasis on the unity of the state and statements about the importance of individual rights and freedom, as a logical tension in the transitional process of traditional organic statism. Second, the study examines the educational content according to the method of description in the textbooks. It finds logical tension and fallacies in how the textbooks describe fundamental concepts and apply them to the Indonesian case. In addition, in describing the Marsinah and Munir cases, some facts are distorted or overlooked, and gaps between the textbook explanations and reality can be pointed out.
This study, which examined textbooks and their content on individual rights, is an introductory study on textbooks, education, and democracy for the development of Indonesia and its education.

Speech Transition Detection and approximate-synthesis Method for Speech Signal Compression and Recovery (음성신호 압축 및 복원을 위한 음성 천이구간 검출과 근사합성 방식)

Lee, Kwang-Seok;Kim, Bong-Gi;Kang, Seong-Soo;Kim, Hyun-Deok
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2008.05a / pp.763-767 / 2008
In a speech coding system that uses voiced and unvoiced excitation sources, speech quality is distorted when a voiced sound and an unvoiced consonant coexist in one frame. We therefore propose a method for searching for and extracting transition segments (TS) that contain unvoiced consonants, so that voiced sounds and unvoiced consonants do not coexist in a frame. This work presents a new TS approximate-synthesis method using least mean squares and frequency band division. The method obtains high-quality approximate-synthesis waveforms within the TS by using frequency information below 0.547 kHz and above 2.813 kHz. Importantly, the maximum error signal can be produced with a low-distortion approximate-synthesis waveform within the TS. The method can be applied to new Voiced/Silence/TS speech coding, speech analysis, and speech synthesis.

High-Resolution Image Reconstruction Considering the Inaccurate Sub-Pixel Motion Information (부정확한 부화소 단위의 움직임 정보를 고려한 고해상도 영상 재구성 연구)

Park, Jin-Yeol;Lee, Eun-Sil;Gang, Mun-Gi
- Journal of the Institute of Electronics Engineers of Korea SP / v.38 no.2 / pp.169-178 / 2001
The demand for high-resolution images is gradually increasing, whereas many imaging systems have been designed to allow a certain level of aliasing during image acquisition. Thus, digital image processing approaches have recently been investigated to reconstruct a high-resolution image from aliased low-resolution images. However, since the sub-pixel motion information is assumed to be accurate in most conventional approaches, the satisfactory high-resolution image cannot be obtained when the sub-pixel motion information is inaccurate. Therefore, in this paper we propose a new algorithm to reduce the distortion in the reconstructed high-resolution image due to the inaccuracy of sub-pixel motion information. For this purpose, we analyze the effect of inaccurate sub-pixel motion information on a high-resolution image reconstruction, and model it as zero-mean additive Gaussian errors added respectively to each low-resolution image. To reduce the distortion we apply the modified multi-channel image deconvolution approach to the problem. The validity of the proposed algorithm is both theoretically and experimentally demonstrated in this paper.

Search Result 154, Processing Time 0.025 seconds
