• Title/Summary/Keyword: background noise

Search Result 964, Processing Time 0.028 seconds

An Efficient Character Image Enhancement and Region Segmentation Using Watershed Transformation (Watershed 변환을 이용한 효율적인 문자 영상 향상 및 영역 분할)

  • Choi, Young-Kyoo;Rhee, Sang-Burm
    • The KIPS Transactions:PartB
    • /
    • v.9B no.4
    • /
    • pp.481-490
    • /
    • 2002
  • Off-line handwritten character recognition is in difficulty of incomplete preprocessing because it has not dynamic information has various handwriting, extreme overlap of the consonant and vowel and many error image of stroke. Consequently off-line handwritten character recognition needs to study about preprocessing of various methods such as binarization and thinning. This paper considers running time of watershed algorithm and the quality of resulting image as preprocessing for off-line handwritten Korean character recognition. So it proposes application of effective watershed algorithm for segmentation of character region and background region in gray level character image and segmentation function for binarization by extracted watershed image. Besides it proposes thinning methods that effectively extracts skeleton through conditional test mask considering routing time and quality of skeleton, estimates efficiency of existing methods and this paper's methods as running time and quality. Average execution time on the previous method was 2.16 second and on this paper method was 1.72 second. We prove that this paper's method removed noise effectively with overlap stroke as compared with the previous method.

The Development of Topographic Feature Extraction Method by use of the Seafloor Curvature Measurement (곡률 계산에 의한 해저면 지형요소 추출 기법 개발)

  • Kim, Hyun-Sub;Jung, Mee-Sook;Park, Cheong-Kee
    • Geophysics and Geophysical Exploration
    • /
    • v.10 no.3
    • /
    • pp.163-172
    • /
    • 2007
  • A seafloor curvature measurement method was developed to extract redundant topographic features from the multi-beam bathymetry data, and then applied to the data of abyssal plain area in the Pacific. Any seafloor might be modeled to a quadratic surface determined in a linear least squares sense, and its curvature could be derived from the eigen values related with quadratic model parameters. The curvature's magnitude as well as polarity showed distinct relationship with geometric characteristics of the seafloor like as ridge and valley. From the investigation of curvature's variation with the number of data in the quadratic surface, the optimal size of data aperture could be applied to real bathymetry data. The application to real data also required the determination of the accompanying threshold values to cope with corresponding topographic features. The calculation method of previous studies were reported to be sensitive to the background noise. The improved curvature measurement method, incorporating the sum of eigen values has reduced unwanted artifacts and enhanced ability to extract lineament features along strike direction. The result of application shows that the curvature measurement method is effective tool for the estimation of a possible mining area in the seamount free abyssal hill area.

Progress Report on NISS onboard NEXTSat-1

  • Jeong, Woong-Seob;Park, Sung-Joon;Park, Kwijong;Moon, Bongkon;Lee, Dae-Hee;Pyo, Jeonghyun;Park, Youngsik;Kim, Il-Joong;Park, Won-Kee;Lee, Duk-Hang;Park, Chan;Ko, Kyeongyeon;Nam, Ukwon;Han, Wonyong;Im, Myungshin;Lee, Hyung Mok;Lee, Jeong-Eun;Shin, Goo-Hwan;Chae, Jangsoo;Matsumoto, Toshio
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.39 no.1
    • /
    • pp.49.1-49.1
    • /
    • 2014
  • The NISS (Near-infrared Imaging Spectrometer for Star formation history) onboard NEXTSat-1 is the near-infrared instrument onboard NEXTSat-1 which is being developed by KASI. The imaging low-resolution spectroscopic observation in the near-infrared range for nearby galaxies, low background regions, star-forming regions and so on will be performed on orbit. After the System Requirement Review, the optical design is changed from on-axis to the off-axis telescope which has a wide field of view (2 deg. ${\times}$ 2 deg.) as well as the wide wavelength range from 0.95 to $3.8{\mu}m$. The mechanical structure is considered to endure the launching condition as well as the space environment. The design of relay optics is optimized to maintain the uniform optical performance in the required wavelength range. The stray light analysis is being made to evade a light outside a field of view. The dewar is designed to operate the infrared detector at 80K stage. From the thermal analysis, we confirmed that the telescope can be cooled down to around 200K in order to reduce the large amount of thermal noise. Here, we report the current status of the NISS development.

  • PDF

Near-Infrared Imaging Spectrometer onboard NEXTSat-1

  • Jeong, Woong-Seob;Lee, Dae Hee;Moon, Bongkon;Park, Kwijong;Park, Sung-Joon;Pyo, Jeonghyun;Park, Youngsik;Kim, Il-Joong;Park, Won-Kee;Kim, Mingyu;Lee, Duk-Hang;Nam, Ukwon;Han, Wonyong;Im, Myungshin;Lee, Hyung Mok;Lee, Jeong-Eun;Shin, Goo-Hwan;Chae, Jangsoo
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.38 no.1
    • /
    • pp.70.1-70.1
    • /
    • 2013
  • New space program for "Next-Generation Small Satellite (NEXTSat)" launched last year after the success of the series of Science & Technology Satellite (STSAT). KASI proposed the near-infrared imaging spectrometer as a scientific payload onboard NEXTSat-1. It was selected as one of two scientific payloads. The approved scientific payload is the near-infrared imaging spectrometer for the study of star formation history (NISS). The efficient near-infrared observation can be performed in space by evading the atmospheric emission as well as other thermal noise. The observation of cosmic near-infrared background enables us to reveal the early Universe in an indirect way through the measurement of absolute brightness and spatial fluctuation. The detection of near-infrared spectral lines in nearby galaxies, cluster of galaxies and star forming regions give us less biased information on the star formation. In addition, the NISS will be expected to demonstrate our technologies related to the development of the Korea's leading near-infrared instrument for the future large infrared telescope, SPICA.

  • PDF

Fingerprint Segmentation and Ridge Orientation Estimation with a Mobile Camera for Fingerprint Recognition (모바일 카메라를 이용한 지문인식을 위한 지문영역 추출 및 융선방향 추출 알고리즘)

  • Lee Chulhan;Lee Sanghoon;Kim Jaihie;Kim Sung-Jae
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.6
    • /
    • pp.89-98
    • /
    • 2005
  • Fingerprint segmentation and ridge orientation estimation algorithms with images from a mobile camera are proposed. The fingerprint images from a mobile camera are quite different from those from conventional sensor, called touch based sensor such as optical, capacitive, and thermal. For example, the images from a mobile camera are colored and the backgrounds or non-finger regions are very erratic depending on how the image capture time and place. Also the contrast between ridge and valley of a mobile camera image are lower than that of touch based sensor image. To segment fingerprint region, we first detect the initial region using color information and texture information. The LUT (Look Up Table) is used to model the color distribution of fingerprint images using manually segmented images and frequency information is extracted to discriminate between in focused fingerprint regions and out of focused background regions. With the detected initial region, the region growing algerian is executed to segment final fingerprint region. In fingerprint orientation estimation, the problem of gradient based method is very sensitive to outlier that occurred by scar and camera noise. To solve this problem, we propose a robust regression method that removes the outlier iteratively and effectively. In the experiments, we evaluated the result of the proposed fingerprint segmentation algerian using 600 manually segmented images and compared the orientation algorithms in terms of recognition accuracy.

Effects of Cognitive Impairment on Self-reported Hearing Handicap in Older Adults with Early-stage Presbycusis (초기 노인성 난청자에서 인지장애가 일상생활 듣기 어려움에 미치는 영향)

  • Lee, Soo Jung
    • 한국노년학
    • /
    • v.38 no.1
    • /
    • pp.1-14
    • /
    • 2018
  • Everyday hearing handicap caused by presbycusis ultimately reduces quality of life in older adults. The aim of this study was to explore effects of cognitive impairment on self-reported hearing handicap in older adults with early-stage presbycusis. We compared K-HHIE scores between 40 elderly subjects with mild cognitive impairment (MCI) and age- and hearing-threshold matched 40 cognitively normal elderly (CNE) subjects. The results are as follows: 1) The MCI group scored significantly higher than the CNE group on the social/situational and emotional sections, and in total. 2) The MCI group scored significantly higher than the CNE group on all four subscales, and the most significant group difference was on the first subscale relating to interpersonal relationships and social handicaps. 3) Both groups scored highest on the item 8 (problems hearing whispering sounds) and item 15 (problems hearing TV or radio sounds). Besides those two items, the MCI group also scored high on the item 21 (problems hearing in a restaurant), item 6 (problems hearing when attending a party), item 3 (avoiding groups of people), and item 20 (personal or social restrictions). Our findings suggest that, among older adults with early-stage presbycusis, older adults with cognitive impairment tend to report greater everyday hearing handicap than their peers with normal cognitive function. Especially, they show significant problems hearing in background noise or multi-talker situations, which cause social restrictions and social/emotional loneliness.

Reliability of OperaVOXTM against Multi-Dimensional Voice Program to Assess Voice Quality before and after Laryngeal Microsurgery in Patient with Vocal Polyp (성대 용종 환자의 후두미세수술 전후 음성 평가에서 OperaVOXTM와 Multi-Dimensional Voice Program 간의 신뢰도 연구)

  • Kim, Sun Woo;Kim, So Yean;Cho, Jae Kyung;Jin, Sung Min;Lee, Sang Hyuk
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.31 no.2
    • /
    • pp.71-77
    • /
    • 2020
  • Background and Objectives OperaVOXTM (Oxford Wave Research Ltd.) is a portable voice analysis software package designed for use with iOS devices. As a relatively cheap, portable and easily accessible form of acoustic analysis, OperaVOXTM may be more clinically useful than laboratory-based software in many situations. The aim of this study was to evaluate the agreement between OperaVOXTM and Multi-Dimensional Voice Program (MDVP; Computerized Speech Lab) to assess voice quality before and after laryngeal microsurgery in patient with vocal polyp. Materials and Method Twenty patients who had undergone laryngeal microsurgery for vocal polyp were enrolled in this study. Preoperative and postoperative voices were assessed by acoustic analysis using MDVP and OperaVOXTM. A five-seconds recording of vowel /a/ was used to measure fundamental frequency (F0), jitter, shimmer and noise-to-harmonic ratio (NHR). Results Several acoustic parameters of MDVP and OperaVOXTM related to short-term variability showed significant improvement. While pre-operative value of F0, jitter, shimmer, NHR was 155.75 Hz (male: 125.37 Hz, female: 183.37 Hz), 2.20%, 6.28%, 0.16, post-operative values of these parameter was 164.34 Hz (male: 129.42 Hz, female: 199.26 Hz), 2.15%, 5.18%, 0.14 Hz in MDVP. While pre-operative value of F0, jitter, shimmer, NHR was 168.26 Hz (male: 135.16 Hz, female: 201.37 Hz), 2.27%, 6.95%, 0.26, post-operative values of these parameters was 162.72 Hz (male: 128.267 Hz, female: 197.18 Hz), 1.71%, 5.36%, 0.20 in OperaVOXTM. There was high intersoftware agreement for F0, jitter, shimmer with intraclass correlation coefficient. Conclusion Our results showed that the short-term variability of acoustic parameters in both MDVP and OperaVOXTM were useful for the objective assessment of voice quality in patients who received laryngeal microsurgery. OperaVOXTM is comparable to MDVP and has high intersoftware reliability with MDVP in measuring the F0, jitter, and shimmer

Effects of Voice Therapy Using Gliding and Humming in Dysphonic Patients With Glottal Gap (활창과 허밍을 이용한 음성치료가 성문틈 환자의 음성 개선에 미치는 효과)

  • Jung, Dae-Yong;Shim, Mi-Ran;Hwang, Yeon-Shin;Kim, Geun-Jeon;Sun, Dong-Il
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.2
    • /
    • pp.81-86
    • /
    • 2021
  • Background and Objectives Therapies have been reported to treat the glottal gap previously. However, these voice therapies showed the limits because many techniques focused only on one among breathing, resonance and phonation. In addition patients often have difficulties visiting hospital frequently. 'Gliding and humming' is vocal training technique that readjusts total vocal patterns such as breathing, resonance and phonation. This technique can be easily applied during short term sessions. The purpose of this study is to evaluate the efficiency of voice therapy with 'gliding and humming' for patients with glottic gap during short-term treatment sessions. Materials and Method Twenty-three patients with glottal gap were selected. Of all patients, 14 patients had sulcus vocalis and 12 patients had muscle tension dysphonia (MTD). Voice therapies were performed 1.9 sessions in average. GRBAS, jitter, shimmer, noise to harmonic ratio, semitone range, closed quotient_vowel and maximum phonation time were compared before and after the therapies. In addition, changes of glottal gap and MTD severity were evaluated. Results Statistically significant improvement was observed. MTD improvement was observed only among the patients with glottal gap improvement. Also sulcus vocalis group showed the statistically significant improvement. Conclusion 'Gliding and humming' was effective to the patients with glottic gap and sulcus vocalis. Also, among patients who have both glottic gap and MTD, the data suggests that voice therapy for glottic gap also makes improvement in MTD.

Comparison of the Voice Outcome After Injection Laryngoplasty: Unilateral Vocal Fold Paralysis Due to Cancer Nerve Invasion and Iatrogenic Injury (성대주입술 후 음향학적 분석결과 비교: 암의 신경 침윤으로 인한 일측성 성대마비 환자와 수술 후 발생한 일측성 성대마비 환자)

  • Yongmin, Cho;Hyunseok, Choi;Kyoung Ho, Oh;Seung-Kuk, Baek;Jeong-Soo, Woo;Soon Young, Kwon;Kwang-Yoon, Jung;Jae-Gu, Cho
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.33 no.3
    • /
    • pp.172-178
    • /
    • 2022
  • Background and Objectives Injection laryngoplasty is a common method for treatment of unilateral vocal fold paralysis. Unilateral vocal fold paralysis has various causes, including idiopathic, infection, stroke, neurologic condition, surgery and nerve invasion by cancer. To the knowledge of the authors, there was no study on the relationship between the causes of vocal cord paralysis and the outcome of injection laryngoplasty. Therefore, we tried to investigate the difference in the outcomes of injection laryngoplasty between vocal cord paralysis after surgery group and nerve invasion by cancer group. Materials and Method A retrospective analysis was performed for 24 patients who underwent vocal cord injection due to unilateral vocal cord paralysis caused by surgery or nerve invasion by cancer. The objective quality of the voice was assessed by acoustic voice analysis with the Multi-Dimensional Voice Program. Results Both group showed an improvement of fundamental frequemcy (F0), jitter percent, shimmer (percent), and noise to hearmonic ratio (NHR) after injection laryngoplasty. The vocal cord paralysis due to nerve invasion group showed more improvement in both the mean and median value of F0, shimmer percent and NHR than the vocal cord paralysis due to surgery group, but there was not statistically significant. Conclusion Our study did not show a statistically significant difference in outcome between vocal cord paralysis due to cancer invasion group and surgery group, but statistically tendency was suggested. The vocal cord paralysis due to nerve invasion group showed more improvement in both the mean and median value of acoustic voice analysis than surgery group.

Speaker verification with ECAPA-TDNN trained on new dataset combined with Voxceleb and Korean (Voxceleb과 한국어를 결합한 새로운 데이터셋으로 학습된 ECAPA-TDNN을 활용한 화자 검증)

  • Keumjae Yoon;Soyoung Park
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.2
    • /
    • pp.209-224
    • /
    • 2024
  • Speaker verification is becoming popular as a method of non-face-to-face identity authentication. It involves determining whether two voice data belong to the same speaker. In cases where the criminal's voice remains at the crime scene, it is vital to establish a speaker verification system that can accurately compare the two voice evidence. In this study, to achieve this, a new speaker verification system was built using a deep learning model for Korean language. High-dimensional voice data with a high variability like background noise made it necessary to use deep learning-based methods for speaker matching. To construct the matching algorithm, the ECAPA-TDNN model, known as the most famous deep learning system for speaker verification, was selected. A large dataset of the voice data, Voxceleb, collected from people of various nationalities without Korean. To study the appropriate form of datasets necessary for learning the Korean language, experiments were carried out to find out how Korean voice data affects the matching performance. The results showed that when comparing models learned only with Voxceleb and models learned with datasets combining Voxceleb and Korean datasets to maximize language and speaker diversity, the performance of learning data, including Korean, is improved for all test sets.