• Title/Summary/Keyword: VQ

Verification and estimation of a posterior probability and probability density function using vector quantization and neural network (신경회로망과 벡터양자화에 의한 사후확률과 확률 밀도함수 추정 및 검증)

  • 고희석;김현덕;이광석
    • The Transactions of the Korean Institute of Electrical Engineers / v.45 no.2 / pp.325-328 / 1996
  • In this paper, we propose a method for estimating a posterior probability and a PDF (probability density function) using a feed-forward neural network and VQ (vector quantization) codebooks. We estimate the posterior probability and the probability density function, which are combined with the well-known Mel cepstrum to compose a new parameter, and verify its performance on five vowels taken from syllables using an NN (neural network) and a PNN (probabilistic neural network). The new parameter gave the best result with the probabilistic neural network, with an average recognition rate of 83.02%.

Cardio-Angiographic Sequence Coding Using Neural Network Adaptive Vector Quantization (신경회로망 적응 VQ를 이용한 심장 조영상 부호화)

  • 주창희;최종수
    • The Transactions of the Korean Institute of Electrical Engineers / v.40 no.4 / pp.374-381 / 1991
  • The use of digital images for hospital diagnosis is steadily increasing. Image coding is indispensable for storing and compressing an enormous amount of diagnostic images economically and effectively. In this paper, an adaptive two-stage vector quantization based on Kohonen's neural network is presented for compressing cardioangiography, a typical kind of angiographic radiographic image sequence, and the performance of the coding scheme is compared and examined. To exploit the known characteristics of changes in cardioangiography, relatively large image blocks are quantized in the first stage; in the second stage, the blocks subdivided according to a quantization-error threshold are vector quantized using a neural network trained with frequency-sensitive competitive learning. The scheme is employed because changes in cardioangiography arise from two types of motion, that of the heart itself and that of the body, and from the injected contrast dye. Computer simulation shows that good image reproduction can be obtained at a bit rate of 0.78 bits/pixel.
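The frequency-sensitive competitive learning step mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the learning rate, the epoch count, the win-count weighting of the squared distance, and the simple threshold test for sending a block to the second stage are all assumptions made for the example.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_fscl(vectors, codebook_size, epochs=20, learning_rate=0.1, seed=0):
    """Train a VQ codebook with frequency-sensitive competitive learning.

    All parameter values here are illustrative assumptions.
    """
    rng = random.Random(seed)
    # Initialise codewords from randomly chosen training vectors.
    codebook = [list(v) for v in rng.sample(vectors, codebook_size)]
    win_counts = [1] * codebook_size
    for _ in range(epochs):
        order = list(vectors)
        rng.shuffle(order)
        for v in order:
            # Scale the distortion by the win count so that rarely used
            # codewords get a better chance of winning.
            winner = min(
                range(codebook_size),
                key=lambda i: win_counts[i] * squared_distance(v, codebook[i]),
            )
            win_counts[winner] += 1
            # Move only the winning codeword toward the input vector.
            codebook[winner] = [
                c + learning_rate * (x - c) for c, x in zip(codebook[winner], v)
            ]
    return codebook, win_counts


def encode(vector, codebook):
    """Return the index of the nearest codeword (plain VQ encoding)."""
    return min(range(len(codebook)), key=lambda i: squared_distance(vector, codebook[i]))


def needs_second_stage(block, codebook, threshold):
    """Hypothetical test: is the first-stage quantization error above the threshold?"""
    return squared_distance(block, codebook[encode(block, codebook)]) > threshold
```

In a two-stage coder of the kind described, blocks for which `needs_second_stage` returns true would be subdivided and re-quantized with a second codebook; how the subdivision is done is not specified in the abstract.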

Image Coding Using DCT and Block Hierarchical Segmentation Finite-State Vector Quantization (DCT와 블록 계층 분할 유한상태 벡터 양자화를 이용한 영상 부호화)

  • Jo, Seong-Hwan;Kim, Eung-Seong
    • The Transactions of the Korea Information Processing Society / v.7 no.3 / pp.1013-1020 / 2000
  • In this paper, we propose an algorithm that hierarchically segments image blocks using the discrete cosine transform (DCT) and performs finite-state vector quantization (FSVQ) on each block. Using DCT coefficient features, the image is segmented hierarchically into large smooth blocks and small edge blocks, and the block hierarchy information is then transmitted. A codebook is constructed for each hierarchical block type, and the encoder transmits codeword indices using FSVQ to reduce the number of encoded bits under hierarchical segmentation. Compared with the side-match VQ (SMVQ) and hierarchical FSVQ (HFSVQ) algorithms on the Zelda and Boat images, the new algorithm shows better picture quality, by 1.97 dB and 2.85 dB over SMVQ and by 1.78 dB and 1.85 dB over HFSVQ, respectively.

A Study on Angiography Coding (심장조영상 부호화에 관한 연구)

  • Park, Sang-Hui;Han, Young-Oh;Park, Hyun-Soo;Kim, Hyung-Suk;Shin, Joong-In
    • Journal of Biomedical Engineering Research / v.14 no.2 / pp.177-183 / 1993
  • High-resolution medical images are coded for archiving and communication in MPACS. In this paper, we study the coding of cardio-angiography. Our coding technique is subband vector quantization, an irreversible coding method. Its advantages are the removal of blocking artifacts and edge degradation, adaptation to the drastic image changes caused by dye injection, and fast decoding. We achieved good results on cardio-angiography data, but more sophisticated motion estimation and VQ techniques remain to be studied.

KOREAN DIGIT RECOGNITION IN NOISE ENVIRONMENT USING SPECTRAL MAPPING TRAINING

  • Ki Young Lee
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.1015-1020 / 1994
  • This paper presents a Korean digit recognition method for noisy environments using spectral mapping training based on a static supervised adaptation algorithm. In the presented method, mapping from the spectral space of noisy speech to that of noise-free speech reduces the spectral distortion of the noisy speech, and the recognition rate is higher than that of the conventional method using VQ and DTW without noise processing; even at an SNR of 0 dB, the recognition rate is 10 times that of the conventional method. It has been confirmed that spectral mapping training can improve recognition performance for speech in noisy environments.

A Two-Layer Steganography for Mosaic Images

  • Horng, Ji-Hwei;Chang, Chin-Chen;Sun, Kun-Sheng
    • KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.9 / pp.3298-3321 / 2021
  • Many data hiding schemes have been proposed to embed secret data in plain cover images or compressed images of various formats, including JPEG, AMBTC, and VQ. In this paper, we propose a production process for mosaic images based on three regular images of coffee beans. A primary image is first mimicked by the process to produce a mosaic cover image. A two-layer steganography is then applied to hide secret data in the mosaic image. Because the visual quality of the mosaic cover image is low, its PSNR can be improved by about 1.5 dB after embedding 3 bpp. This is achieved by leveraging the newly proposed polarized search mask and the concepts of strong embedding and weak embedding. Applying steganography to mosaic cover images is a completely new and promising idea.

Speaker-Adaptive Speech Synthesis based on Fuzzy Vector Quantizer Mapping and Neural Networks (퍼지 벡터 양자화기 사상화와 신경망에 의한 화자적응 음성합성)

  • Lee, Jin-Yi;Lee, Gwang-Hyeong
    • The Transactions of the Korea Information Processing Society / v.4 no.1 / pp.149-160 / 1997
  • This paper is concerned with a speaker-adaptive speech synthesis method using a mapped codebook designed by fuzzy mapping on FLVQ (Fuzzy Learning Vector Quantization). FLVQ is used to design both the input speaker's and the reference speaker's codebooks. The algorithm incorporates a fuzzy membership function into LVQ (learning vector quantization) networks. Unlike the LVQ algorithm, it minimizes the network output errors, which are the differences between the class membership targets and the actual membership values, and as a result minimizes the distances between training patterns and competing neurons. Speaker adaptation in speech synthesis is performed as follows: the input speaker's codebook is mapped to the reference speaker's codebook in fuzzy terms. The fuzzy VQ mapping replaces a codevector while preserving its fuzzy membership function. The codevector correspondence histogram is obtained by accumulating the vector correspondences along the DTW optimal path. We use the fuzzy VQ mapping to design a mapped codebook, which is defined as a linear combination of the reference speaker's vectors, using each fuzzy histogram as a weighting function with membership values. In the adaptive speech synthesis stage, input speech is fuzzy vector-quantized by the mapped codebook, and FCM arithmetic is then used to synthesize speech adapted to the input speaker. The speaker adaptation experiments use speech from males in their thirties as the input speaker's speech and from a female in her twenties as the reference speaker's speech. The speech used in the experiments consists of the sentences /anyoung hasim nika/ and /good morning/. As a result of the experiments, we obtained synthesized speech adapted to the input speaker.
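Two ideas from the abstract above, fuzzy vector quantization and a mapped codebook built as a histogram-weighted linear combination of reference codevectors, can be illustrated with a short sketch. The membership formula used here is the standard fuzzy c-means one, which is an assumption about what the paper means; the function names and the `fuzziness` parameter are hypothetical, and the correspondence histogram (accumulated along the DTW path in the paper) is simply taken as an input.

```python
def fuzzy_memberships(vector, codebook, fuzziness=2.0):
    """Fuzzy c-means style memberships of `vector` in each codeword.

    Memberships are non-negative and sum to 1. `fuzziness` must be > 1.
    """
    d2 = [sum((x - c) ** 2 for x, c in zip(vector, cw)) for cw in codebook]
    # If the vector coincides with one or more codewords, share full
    # membership among them to avoid division by zero.
    zeros = [i for i, d in enumerate(d2) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0 for i in range(len(codebook))]
    exponent = 1.0 / (fuzziness - 1.0)
    inverse = [(1.0 / d) ** exponent for d in d2]
    total = sum(inverse)
    return [w / total for w in inverse]


def mapped_codebook(histogram, reference_codebook):
    """Build one mapped codevector per input codeword.

    histogram[i][j] is the accumulated (fuzzy) correspondence between input
    codeword i and reference codeword j; each mapped vector is the
    histogram-weighted average of the reference codevectors.
    """
    dim = len(reference_codebook[0])
    mapped = []
    for weights in histogram:
        total = sum(weights)
        mapped.append([
            sum(w * ref[k] for w, ref in zip(weights, reference_codebook)) / total
            for k in range(dim)
        ])
    return mapped
```

For example, a histogram row `[1.0, 3.0]` over reference codevectors `[0.0, 0.0]` and `[4.0, 8.0]` yields the mapped codevector `[3.0, 6.0]`, three quarters of the way toward the second reference vector.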

Clinical Application of Model of Human Occupation on Goals Setting of Elderly person With Dementia: A Case Study (치매노인의 목표수립을 위한 인간작업모델 (Model of Human Occupation)의 적용: 사례연구)

  • Lee, Yu-Na;Jung, Min-Ye
    • Therapeutic Science for Rehabilitation / v.2 no.1 / pp.66-76 / 2013
  • Objective: Based on the Model of Human Occupation (MOHO), this study aimed to identify goal setting for an elderly person with dementia. Methods: For 5 weeks from May to June 2012, MOHO-based interviews and the Occupational Questionnaire (OQ), Interest Checklist (IC), Volitional Questionnaire (VQ), Assessment of Communication and Interaction Skills (ACIS), Occupational Performance History Interview-II (OPHI-II), Occupational Self Assessment (OSA), and Occupational Circumstances Assessment Interview and Rating (OCAIR) assessments were conducted with an elderly person with dementia. Further interviews were conducted with the case manager and family. Results: The interviews and assessments showed limitations in social and family relationships, financial issues, and general well-being. Conclusion: Applying the MOHO forms a basis for judging the subject in a holistic and general way, changes how the subject is viewed, and suggests various strategies. It may also help improve the professionalism and quality of occupational therapy services and expand the range of other relevant areas.

Segmentation of Seabed Points from Airborne Bathymetric LiDAR Point Clouds Using Cloth Simulation Filtering Algorithm (항공수심라이다 데이터 해저면 포인트 클라우드 분리를 위한 CSF 알고리즘 적용에 관한 연구)

  • Lee, Jae Bin;Jung, Jae Hoon;Kim, Hye Jin
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.38 no.1 / pp.1-9 / 2020
  • ABL (Airborne Bathymetric LiDAR) is an advanced survey technology that uses green lasers to simultaneously measure water depths and oceanic topography in coastal and river areas. Seabed point cloud extraction is an essential prerequisite to further utilizing ABL data for various geographic data processing and applications. Conventional seabed detection approaches often use return waveforms. However, the limited accessibility of waveforms often restricts the broad use of bathymetric LiDAR (Light Detection And Ranging) data. Further, it is often questioned whether waveform-based seabed extraction is reliable enough. Therefore, there is a high demand to extract the seabed from the point cloud using other sources of information, such as geometric information. This study aimed to assess the feasibility of applying a ground filtering method, CSF (Cloth Simulation Filtering), to seabed extraction from geo-referenced point cloud data. We conducted a preliminary experiment with RIEGL VQ-880 bathymetric data, and the results show that the CSF algorithm can be effectively applied to seabed point segmentation.

A Study On Male-To-Female Voice Conversion (남녀 음성 변환 기술연구)

  • Choi Jung-Kyu;Kim Jae-Min;Han Min-Su
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.115-118 / 2000
  • Voice conversion technology is essential for TTS systems because constructing a speech database takes much effort. In this paper, male-to-female voice conversion in a Korean LPC TTS system is studied. In general, the parameters for voice color conversion are categorized into acoustic and prosodic parameters. This paper adopts LSF (Line Spectral Frequency) as the acoustic parameter, and pitch period and duration as the prosodic parameters. For the voice conversion, the pitch period is shortened by half, the duration is shortened by 25%, and the LSFs are shifted linearly. The synthesized speech is then post-filtered by a bandpass filter. The proposed algorithm is simpler than other algorithms, for example VQ- and neural-net-based methods, and it does not even require estimating formant information. The MOS (Mean Opinion Score) test shows 2.25 for naturalness and 3.2 for female closeness. In conclusion, by using the proposed algorithm, a male-to-female voice conversion system can be simply implemented with relatively successful results.
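The parameter changes listed in the abstract above (pitch period halved, duration shortened by 25%, LSFs shifted linearly) can be written down as a small sketch. The abstract does not say what form the linear LSF shift takes, so the map `scale * f + offset` and both of its parameters are assumptions, as are the function name and the units; the bandpass post-filter is not modelled.

```python
def convert_male_to_female(pitch_period, duration, lsfs, lsf_scale, lsf_offset):
    """Apply the prosodic and acoustic parameter changes to one frame.

    pitch_period and duration are in whatever units the synthesizer uses;
    lsfs is a list of line spectral frequencies. lsf_scale and lsf_offset
    are hypothetical parameters of an assumed linear map f -> scale*f + offset.
    """
    new_pitch_period = pitch_period * 0.5   # pitch period shortened by half
    new_duration = duration * 0.75          # duration shortened by 25%
    new_lsfs = [lsf_scale * f + lsf_offset for f in lsfs]
    return new_pitch_period, new_duration, new_lsfs
```

A real implementation would also need to keep the shifted LSFs ordered and inside the valid frequency range so that the resulting LPC filter stays stable; that check is omitted here.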