• Title/Summary/Keyword: two microphone method

Search Result 126, Processing Time 0.024 seconds

Optimization of the packet size to enhance the voice quality of the VOIP system (VOIP 음질 개선을 위한 패킷 크기의 최적화)

  • 임강빈;정기현;최경희
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.40 no.9
    • /
    • pp.373-383
    • /
    • 2003
  • In this paper we discuss the effect of the delay limit and the packet size related to the quality of service on a VoIP system using the Internet. We also provide a guideline to determining the optimal packet size of the voice data for a given delay limit. Empirical studies are done with two personal computers connected through the packet switched public IP network. The sender encodes the voice signal from the microphone to get PCM and ADPCM data and sends the data to the receiver using UDP packets. The receiver plays the reconstructed voice from the stream with lost and delayed packets. The quality of the reconstructed voice is evaluated offline by the MNB (Measuring Normal Block) method using the data acquired from the both sides. The result shows that under the delay limit of 100ms for 40Kbps, 32Kbps and l6Kbps of ADPCM data, the minimum packet size should be 300bytes, 400bytes and 600bytes respectively and the maximum packet size should be l200bytes commonly for the best quality of voice.

A study of prosodic features of patients with idiopathic Parkinson's disease (파킨슨병 환자와 정상노인 간의 문장 읽기에 나타난 운율 특성 비교)

  • Kang, Young-Ae;Seong, Cheol-Jae;Yoon, Kyu-Chul
    • Phonetics and Speech Sciences
    • /
    • v.3 no.1
    • /
    • pp.145-151
    • /
    • 2011
  • In view of the hypothesis that the effects of Parkinson's disease on voice production can be detected before pharmacological intervention, the prosodic features of patients with idiopathic Parkinson's disease (IPD) and a healthy aging group were diagnostically analyzed with the long term object of establishing, for clinical purposes, early disease-progression biomarkers. Twenty patients (male 8; female 12) with IPD (prior to pharmacological intervention) and a healthy control group of 22 (male 10; female 12) were selected. Ten sentences were recorded with a head-worn microphone. One sentence was chosen for the analysis of this paper. Relevant parameters, i.e. 3-dimensional model (F0, intensity, duration) and pitch and intensity related slopes (maxEnergy, maxF0, meanAbS, semiT, meanEnergy, meanF0), were analyzed by two-group discriminant analysis. The stepwise estimation method of discriminant analysis was performed by gender. The discriminant functions predicted 83.9% of the male test data correctly while the prediction rate was 93.1% for the female group. The results showed that meanF0_slope and semiT_slope were more important parameters than the others for the male group. For the female group, the meanEnergy_slope and maxEnergy_slope were the important ones. These findings indicate that significant parameters are different for the male and female group. Gender lifestyle may be responsible for this difference. Dysprosodic features of IPD show not simultaneously but progressively in terms of F0, intensity and duration.

  • PDF

Changes of Sound Absorption Capability of Wood by Organosolv Pretreatment (유기용매 전처리에 의한 목재의 흡음성능 변화)

  • Kang, Chun-Won;Choi, In-Gyu;Gwak, Ki-Seob;Yeo, Hwan-Myeong;Lee, Nam-Ho;Kang, Ho-Yang
    • Journal of the Korean Wood Science and Technology
    • /
    • v.40 no.4
    • /
    • pp.237-243
    • /
    • 2012
  • Sound absorption capability and anatomical features of the organosolv pretreated Japanese larch and yellow poplar wood were estimated by stereoscopic observation and two microphone transfer function method. Sound absorption capabilities of organosolv treated wood, in the entire estimated frequency range (50~6,400 Hz), were higher than those of control specimen. Especially, the treated wood's absorption capabilities measured in the frequency range of 2~4 kHz were about two times higher than those of control specimen. By the organosolv pretreatment (at $70{\sim}120^{\circ}C$), the weight loss of wood occurred in less than 1% of total weight of wood and the porosity of wood increased slightly. In addition, it was presupposed that microstructural changes of wood occurred during organosolv pretreatment and this structural changes cause the increasing of the sound absorption capability of wood.

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

  • Park, Sung-Jin;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine
    • /
    • v.16 no.1
    • /
    • pp.61-73
    • /
    • 2004
  • This study was prepared to investigate the correlation between Sasang constitutional groups and voice characteristics using voice analysis system(in this study, CSL). I focused on the voice characteristics in terms of harmonics, Formant frequency and Formant Bandwidth. The subjects were 71 males. I classified them into three groups, that is Soeumin group, Soyangin group and Taeumin group. The classification method of Constitution used two ways, QSCCII(Questionnarie for the Sasang Constitution Classification II) and Interview with a specialist in Sasang Constitution. So 71 people were categorized into 31 Soeumin(people), 18 Soyangin(people) and 22 Taeumin(people). Pitch is approximately similar to the fundamental frequency(F0) in voices. Shimmer in dB gives an evaluation of the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. FFT(Fast Fourier Transform) method in CSL can display sampled voices into harmonics. H1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference of h1 and h2(h1-h2) can be explained as the speaker's phonation type, And Formant frequency and bandwidth can be explained as the speaker's vocal tract. So I checked the harmonics and Formant frequency and Bandwidth as the voice parameters. First I have captured /e/ voices from all subjects using microphone. And then I analyzed /e/ voices with CSL. Power Spectrum and Formant History is the menu in the CSL which can display harmonics and Formant frequency and bandwidth. The results about the correlation between Sasang Constitutional Groups and voice parameters are as follows; 1. There is no significant amplitude difference of harmonics(h1-h2) among three groups. 2. There is the significant difference between Soeumin Group and Soyangin Group in Formant Frequency 1 and Formant Bandwidth 1(p<0.05). Any other parameters have no significance. I assume that Soyangin Group has clearer and brighter voice than Soeumin Group according to the Formant Bandwidth difference. And I think its result has coincidence with the context of "Dongyi Suse Bowon" and "Sasangimhejinam".

  • PDF

THE CURRENT STATUS OF BIOMEDICAL ENGINEERING IN THE USA

  • Webster, John G.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1992 no.05
    • /
    • pp.27-47
    • /
    • 1992
  • Engineers have developed new instruments that aid in diagnosis and therapy Ultrasonic imaging has provided a nondamaging method of imaging internal organs. A complex transducer emits ultrasonic waves at many angles and reconstructs a map of internal anatomy and also velocities of blood in vessels. Fast computed tomography permits reconstruction of the 3-dimensional anatomy and perfusion of the heart at 20-Hz rates. Positron emission tomography uses certain isotopes that produce positrons that react with electrons to simultaneously emit two gamma rays in opposite directions. It locates the region of origin by using a ring of discrete scintillation detectors, each in electronic coincidence with an opposing detector. In magnetic resonance imaging, the patient is placed in a very strong magnetic field. The precessing of the hydrogen atoms is perturbed by an interrogating field to yield two-dimensional images of soft tissue having exceptional clarity. As an alternative to radiology image processing, film archiving, and retrieval, picture archiving and communication systems (PACS) are being implemented. Images from computed radiography, magnetic resonance imaging (MRI), nuclear medicine, and ultrasound are digitized, transmitted, and stored in computers for retrieval at distributed work stations. In electrical impedance tomography, electrodes are placed around the thorax. 50-kHz current is injected between two electrodes and voltages are measured on all other electrodes. A computer processes the data to yield an image of the resistivity of a 2-dimensional slice of the thorax. During fetal monitoring, a corkscrew electrode is screwed into the fetal scalp to measure the fetal electrocardiogram. Correlations with uterine contractions yield information on the status of the fetus during delivery To measure cardiac output by thermodilution, cold saline is injected into the right atrium. A thermistor in the right pulmonary artery yields temperature measurements, from which we can calculate cardiac output. In impedance cardiography, we measure the changes in electrical impedance as the heart ejects blood into the arteries. Motion artifacts are large, so signal averaging is useful during monitoring. An intraarterial blood gas monitoring system permits monitoring in real time. Light is sent down optical fibers inserted into the radial artery, where it is absorbed by dyes, which reemit the light at a different wavelength. The emitted light travels up optical fibers where an external instrument determines O2, CO2, and pH. Therapeutic devices include the electrosurgical unit. A high-frequency electric arc is drawn between the knife and the tissue. The arc cuts and the heat coagulates, thus preventing blood loss. Hyperthermia has demonstrated antitumor effects in patients in whom all conventional modes of therapy have failed. Methods of raising tumor temperature include focused ultrasound, radio-frequency power through needles, or microwaves. When the heart stops pumping, we use the defibrillator to restore normal pumping. A brief, high-current pulse through the heart synchronizes all cardiac fibers to restore normal rhythm. When the cardiac rhythm is too slow, we implant the cardiac pacemaker. An electrode within the heart stimulates the cardiac muscle to contract at the normal rate. When the cardiac valves are narrowed or leak, we implant an artificial valve. Silicone rubber and Teflon are used for biocompatibility. Artificial hearts powered by pneumatic hoses have been implanted in humans. However, the quality of life gradually degrades, and death ensues. When kidney stones develop, lithotripsy is used. A spark creates a pressure wave, which is focused on the stone and fragments it. The pieces pass out normally. When kidneys fail, the blood is cleansed during hemodialysis. Urea passes through a porous membrane to a dialysate bath to lower its concentration in the blood. The blind are able to read by scanning the Optacon with their fingertips. A camera scans letters and converts them to an array of vibrating pins. The deaf are able to hear using a cochlear implant. A microphone detects sound and divides it into frequency bands. 22 electrodes within the cochlea stimulate the acoustic the acoustic nerve to provide sound patterns. For those who have lost muscle function in the limbs, researchers are implanting electrodes to stimulate the muscle. Sensors in the legs and arms feed back signals to a computer that coordinates the stimulators to provide limb motion. For those with high spinal cord injury, a puff and sip switch can control a computer and permit the disabled person operate the computer and communicate with the outside world.

  • PDF

A Study on Skin Status with Acoustic Measurements of Skin Friction Noise (피부 마찰 소음 측정을 통한 피부 상태 연구)

  • Chang, Yun Hee;Seo, Dae Hoon;Koh, A Rum;Kim, Sun Young;Lim, Jun Man;Han, Jong Seup;Lee, Sang Hwa;Park, Sun Gyoo;Kim, Yang Han
    • Journal of the Society of Cosmetic Scientists of Korea
    • /
    • v.42 no.2
    • /
    • pp.103-109
    • /
    • 2016
  • Efficacy of cosmetics has been mainly evaluated by qualitative and quantitative methods based on visual sense, tactile sense and skin structure until now. In this study, we suggested a novel evaluation method for skin status based on sound; measuring and analyzing the rubbing noise generated by applying cosmetics. First, the rubbing noise was measured at a close range by a high-sensitivity microphone in anechoic environment, and the noises were analyzed by 1/3 octave band analysis in frequency-domain. Three conditions, 1) before washing, 2) after washing and 3) after application of cosmetics, were compared. As a result, sound pressure level (SPL) of rubbing noise after washing was larger than that of before washing, and the SPL of rubbing noise after cosmetic application was the smallest. Furthermore, the energy of rubbing noise after application was higher than that of the before and after washing conditions in a low frequency band (lower than 2 kHz region). Conversely, the energy of rubbing noise after application was much lower than the others in a high-frequency band (upper than 2 kHz region). This change of energy distribution was described as a balloon-skin model. High SPL in the low frequency region after the cosmetic applications was due to the increase of "flexibility index", while SPL in the high frequency region significantly decreased because of the attenuation which is related to "softness index". Therefore, we developed two indices based on the spectrum-energy difference for evaluating skin conditions. This proposed method and indices were verified via skin flexibility and roughness measurement using cutometer and primos respectively. These results suggest that acoustic measurement of skin friction noise may be a new skin status evaluation method.