Search | Korea Science

A Study on Music Summarization (음악요약 생성에 관한 연구)

Kim Sung-Tak;Kim Sang-Ho;Kim Hoi-Rin;Choi Ji-Hoon;Lee Han-Kyu;Hong Jin-Woo
- Journal of Broadcast Engineering
- /
- v.11 no.1 s.30
- /
- pp.3-14
- /
- 2006
Music summarization means a technique which automatically generates the most importantand representative a part or parts ill music content. The techniques of music summarization have been studied with two categories according to summary characteristics. The first one is that the repeated part is provided as music summary and the second provides the combined segments which consist of segments with different characteristics as music summary in music content In this paper, we propose and evaluate two kinds of music summarization techniques. The algorithm using multi-level vector quantization which provides a repeated part as music summary gives fixed-length music summary is evaluated by overlapping ration between hand-made repeated parts and automatically generated summary. As results, the overlapping ratios of conventional methods are 42.2% and 47.4%, but that of proposed method with fixed-length summary is 67.1%. Optimal length music summary is evaluated by the portion of overlapping between summary and repeated part which is different length according to music content and the result shows that automatically-generated summary expresses more effective part than fixed-length summary with optimal length. The cluster-based algorithm using 2-D similarity matrix and k-means algorithm provides the combined segments as music summary. In order to evaluate this algorithm, we use MOS test consisting of two questions(How many similar segments are in summarized music? How many segments are included in same structure?) and the results show good performance.
PDF KSCI

Design and Implementation of AR Model based Automatic Identification and Restoration Scheme for Line Scratches in Old Films (AR 모델 기반의 고전영화의 긁힘 손상의 자동 탐지 및 복원 시스템 설계와 구현)

Han, Ngoc-Soc;Kim, Seong-Whan
- The KIPS Transactions:PartB
- /
- v.17B no.1
- /
- pp.47-54
- /
- 2010
Old archived film shows two major defects: line scratch and blobs. In this paper, we present a design and implementation of an automatic video restoration system for line scratches observed in archived film. We use autoregressive (AR) image model because we can make stochastic and specifically autoregressive image generation process with our PAST-PRESENT model and Sampling Pattern. We designed locality maximizing scanning pattern, which can generate nearly stationary time-like series of pixels, which is a strong requirement for a stochastic series to be autoregressive. The sampled pixel series undergoes filtering and model fitting using Durbin-Levinson algorithm before interpolation process. We designed three-stage film restoration system, which includes (1) film acquisition from VHS tapes, (2) simple line scratch detection and restoration, and (3) manual blob identification and sophisticated inpainting scheme. We implemented film acquisition and simple inpainting scheme on Texas Instruments DSP board TMS320DM642 EVM, and implemented our AR inpainting scheme on PC for sophisticated restoration. We experimented our scheme with two old Korean films: "Viva Freedom" and "Robot Tae-Kwon-V", and the experimental results show that our scheme improves Bertalmio's scheme for subjective quality (MOS), objective quality (PSNR), and especially restoration ratio (RR), which reflects how much similar to the manual inpainting results.
https://doi.org/10.3745/KIPSTB.2010.17B.1.047 인용 PDF KSCI

On a Cleaning of COVID-19 Prevention Masks with Electrolytic Decomposition Water (전기분해수로 코로나방역용 마스크의 세정에 관한연구)

Tian, Zhixing;Bae, Myung-Jin
- The Journal of the Convergence on Culture Technology
- /
- v.8 no.1
- /
- pp.591-596
- /
- 2022
Various COVID-19 quarantine guidelines and measures are being taken by country at the WHO, but the number of confirmed cases has not decreased significantly. In order to prevent the inflow and outflow of COVID-19 through individual droplets, it is mandatory to wear a mask anytime, anywhere. However, as virus bacteria entering the mask amplify, it pollutes the mask and causes a disgusting smell. In this paper, a new method of preventing the spread of COVID-19 was proposed by sterilizing the mask with a dental gait spray introduced into the mask that has been used for a long time. Dental gargle water is usually produced by electrolysis of tap water, and the unstable ion water (HOCl) dissolved in water penetrates the cell barrier of various viruses and fails to act in its nucleus, causing water to self-purify. As a result of the experiment, when the mask used for a long time was washed with gargle water spray, the washed mask was dried after 10 minutes, and the smell of virus droplets or saliva almost disappeared. In particular, as a result of MOS testing the fit of the subjects who participated in the mask cleaning, it was excellent at 4.4 on average. Therefore, the mask was disposable, but if the spray was washed in the proposed method more than twice a day, the mask could be used in a comfortable state for more than a week.
https://doi.org/10.17703/JCCT.2022.8.1.591 인용 PDF KSCI

Design of eFuse OTP IP for Illumination Sensors Using Single Devices (Single Device를 사용한 조도센서용 eFuse OTP IP 설계)

Souad, Echikh;Jin, Hongzhou;Kim, DoHoon;Kwon, SoonWoo;Ha, PanBong;Kim, YoungHee
- Journal of IKEEE
- /
- v.26 no.3
- /
- pp.422-429
- /
- 2022
A light sensor chip requires a small capacity eFuse (electrical fuse) OTP (One-Time Programmable) memory IP (Intellectual Property) to trim analog circuits or set initial values of digital registers. In this paper, 128-bit eFuse OTP IP is designed using only 3.3V MV (Medium Voltage) devices without using 1.8V LV (Low-Voltage) logic devices. The eFuse OTP IP designed with 3.3V single MOS devices can reduce a total process cost of three masks which are the gate oxide mask of a 1.8V LV device and the LDD implant masks of NMOS and PMOS. And since the 1.8V voltage regulator circuit is not required, the size of the illuminance sensor chip can be reduced. In addition, in order to reduce the number of package pins of the illumination sensor chip, the VPGM voltage, which is a program voltage, is applied through the VPGM pad during wafer test, and the VDD voltage is applied through the PMOS power switching circuit after packaging, so that the number of package pins can be reduced.
https://doi.org/10.7471/ikeee.2022.26.3.422 인용 PDF KSCI

A Performance Improvement Method using Variable Break in Corpus Based Japanese Text-to-Speech System (가변 Break를 이용한 코퍼스 기반 일본어 음성 합성기의 성능 향상 방법)

Na, Deok-Su;Min, So-Yeon;Lee, Jong-Seok;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.2
- /
- pp.155-163
- /
- 2009
In text-to-speech systems, the conversion of text into prosodic parameters is necessarily composed of three steps. These are the placement of prosodic boundaries. the determination of segmental durations, and the specification of fundamental frequency contours. Prosodic boundaries. as the most important and basic parameter. affect the estimation of durations and fundamental frequency. Break prediction is an important step in text-to-speech systems as break indices (BIs) have a great influence on how to correctly represent prosodic phrase boundaries, However. an accurate prediction is difficult since BIs are often chosen according to the meaning of a sentence or the reading style of the speaker. In Japanese, the prediction of an accentual phrase boundary (APB) and major phrase boundary (MPB) is particularly difficult. Thus, this paper presents a method to complement the prediction errors of an APB and MPB. First, we define a subtle BI in which it is difficult to decide between an APB and MPB clearly as a variable break (VB), and an explicit BI as a fixed break (FB). The VB is chosen using the classification and regression tree, and multiple prosodic targets in relation to the pith and duration are then generated. Finally. unit-selection is conducted using multiple prosodic targets. In the MOS test result. the original speech scored a 4,99. while proposed method scored a 4.25 and conventional method scored a 4.01. The experimental results show that the proposed method improves the naturalness of synthesized speech.
https://doi.org/10.7776/ASK.2009.28.2.155 인용 PDF KSCI

Search Result 35, Processing Time 0.02 seconds

A Study on Music Summarization (음악요약 생성에 관한 연구)

Design and Implementation of AR Model based Automatic Identification and Restoration Scheme for Line Scratches in Old Films (AR 모델 기반의 고전영화의 긁힘 손상의 자동 탐지 및 복원 시스템 설계와 구현)

On a Cleaning of COVID-19 Prevention Masks with Electrolytic Decomposition Water (전기분해수로 코로나방역용 마스크의 세정에 관한연구)

Design of eFuse OTP IP for Illumination Sensors Using Single Devices (Single Device를 사용한 조도센서용 eFuse OTP IP 설계)

A Performance Improvement Method using Variable Break in Corpus Based Japanese Text-to-Speech System (가변 Break를 이용한 코퍼스 기반 일본어 음성 합성기의 성능 향상 방법)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)