• 제목/요약/키워드: Feature enhancement

검색결과 258건 처리시간 0.027초

EAR: Enhanced Augmented Reality System for Sports Entertainment Applications

  • Mahmood, Zahid;Ali, Tauseef;Muhammad, Nazeer;Bibi, Nargis;Shahzad, Imran;Azmat, Shoaib
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권12호
    • /
    • pp.6069-6091
    • /
    • 2017
  • Augmented Reality (AR) overlays virtual information on real world data, such as displaying useful information on videos/images of a scene. This paper presents an Enhanced AR (EAR) system that displays useful statistical players' information on captured images of a sports game. We focus on the situation where the input image is degraded by strong sunlight. Proposed EAR system consists of an image enhancement technique to improve the accuracy of subsequent player and face detection. The image enhancement is followed by player and face detection, face recognition, and players' statistics display. First, an algorithm based on multi-scale retinex is proposed for image enhancement. Then, to detect players' and faces', we use adaptive boosting and Haar features for feature extraction and classification. The player face recognition algorithm uses boosted linear discriminant analysis to select features and nearest neighbor classifier for classification. The system can be adjusted to work in different types of sports where the input is an image and the desired output is display of information nearby the recognized players. Simulations are carried out on 2096 different images that contain players in diverse conditions. Proposed EAR system demonstrates the great potential of computer vision based approaches to develop AR applications.

강인한 음성인식을 위한 켑스트럼 거리와 로그 에너지 기반 묵음 특징 정규화 (Cepstral Distance and Log-Energy Based Silence Feature Normalization for Robust Speech Recognition)

  • 신광호;정현열
    • 한국음향학회지
    • /
    • 제29권4호
    • /
    • pp.278-285
    • /
    • 2010
  • 훈련 환경과 인식 환경의 차이가 음성인식 성능저하의 주요요인이다. 이러한 환경의 불일치를 줄이기 위한 방법으로 다양한 묵음특징 정규화 방법이 제안되고 있다. 기존의 묵음특징 정규화 방법은 낮은 SNR (Signal-to-Noise Ratio)에서 묵음구간의 에너지 레벨이 증가하여 음성/묵음 분류의 정확도가 떨어짐으로 인해 인식성능이 저하되는 문제점이 있었다. 본 논문에서는 로그 에너지와 음성/묵음(또는잡음)의 켑스트럼 특징의 분포 특성의 차이를 나타내는 켑스트럼 유클리디언(Euclidean) 거리를 결합하여 음성/묵음을 분류하는 묵음특징 정규화 방법 (Cepstral distance and Log-energy based Silence Feature Normalization)을 제안하였다. 제안한 방법은 높은 SNR에서는 로그 에너지 특징이 잡음의 영향을 적게 받는 특성을 반영하여 기존의 묵음 특징 정규화 (Silence Feature Normalization)방법의 우수성을 그대로 유지하는 반면, 낮은 SNR에서는 로그 에너지 대신 음성/묵음 분류의 분별력이 우수한 켑스트럼 거리 정보를 이용함으로써 인식성능을 향상시킬 수 있다. 인식실험결과 기존의 SFN-I/II, CSFN 방법에 비해 전반적으로 향상된 인식성능을 얻을 수 있어 그 유효성을 확인할 수 있었다.

Validation of CT-Based Risk Stratification System for Lymph Node Metastasis in Patients With Thyroid Cancer

  • Yun Hwa Roh;Sae Rom Chung;Jung Hwan Baek;Young Jun Choi;Tae-Yon Sung;Dong Eun Song;Tae Yong Kim;Jeong Hyun Lee
    • Korean Journal of Radiology
    • /
    • 제24권10호
    • /
    • pp.1028-1037
    • /
    • 2023
  • Objective: To evaluate the computed tomography (CT) features for diagnosing metastatic cervical lymph nodes (LNs) in patients with differentiated thyroid cancer (DTC) and validate the CT-based risk stratification system suggested by the Korean Thyroid Imaging Reporting and Data System (K-TIRADS) guidelines. Materials and Methods: A total of 463 LNs from 399 patients with DTC who underwent preoperative CT staging and ultrasound-guided fine-needle aspiration were included. The following CT features for each LN were evaluated: absence of hilum, cystic changes, calcification, strong enhancement, and heterogeneous enhancement. Multivariable logistic regression analysis was performed to identify independent CT features associated with metastatic LNs, and their diagnostic performances were evaluated. LNs were classified into probably benign, indeterminate, and suspicious categories according to the K-TIRADS and the modified LN classification proposed in our study. The diagnostic performance of both classification systems was compared using the exact McNemar and Kosinski tests. Results: The absence of hilum (odds ratio [OR], 4.859; 95% confidence interval [CI], 1.593-14.823; P = 0.005), strong enhancement (OR, 28.755; 95% CI, 12.719-65.007; P < 0.001), and cystic changes (OR, 46.157; 95% CI, 5.07-420.234; P = 0.001) were independently associated with metastatic LNs. All LNs showing calcification were diagnosed as metastases. Heterogeneous enhancement did not show a significant independent association with metastatic LNs. Strong enhancement, calcification, and cystic changes showed moderate to high specificity (70.1%-100%) and positive predictive value (PPV) (91.8%-100%). The absence of the hilum showed high sensitivity (97.8%) but low specificity (34.0%). The modified LN classification, which excluded heterogeneous enhancement from the K-TIRADS, demonstrated higher specificity (70.1% vs. 62.9%, P = 0.016) and PPV (92.5% vs. 90.9%, P = 0.011) than the K-TIRADS. Conclusion: Excluding heterogeneous enhancement as a suspicious feature resulted in a higher specificity and PPV for diagnosing metastatic LNs than the K-TIRADS. Our research results may provide a basis for revising the LN classification in future guidelines.

Classification of Infant Crying Audio based on 3D Feature-Vector through Audio Data Augmentation

  • JeongHyeon Park;JunHyeok Go;SiUng Kim;Nammee Moon
    • 한국컴퓨터정보학회논문지
    • /
    • 제28권9호
    • /
    • pp.47-54
    • /
    • 2023
  • 영아는 비언어적 의사 소통 방식인 울음이라는 수단을 사용한다[1]. 하지만 영아의 울음소리를 파악하는 것에는 어려움이 따른다. 영아의 울음소리를 해석하기 위해 많은 연구가 진행되었다[2,3]. 이에 본 논문에서는 다양한 음성 데이터 증강을 통한 3D 특징 벡터를 이용한 영아의 울음소리 분류를 제안한다. 연구에서는 총 5개의 클래스 복통, 하품, 불편함, 배고픔, 피곤함(belly pain, burping, discomfort, hungry, tired)로 분류된 데이터 세트를 사용한다. 데이터들은 5가지 기법(Pitch, Tempo, Shift, Mixup-noise, CutMix)을 사용하여 증강한다. 증강 기법 중에서 Tempo, Shift, CutMix 기법을 적용하였을 때 성능의 향상을 보여주었다. 최종적으로 우수한 데이터 증강 기법들을 동시 적용한 결과 단일 특징 벡터와 오리지널 데이터를 사용한 모델보다 17.75%의 성능 향상을 도출하였다.

특징 추출과 분석 기법에 기반한 단백질 상호작용 데이터 신뢰도 향상 시스템 (Protein-Protein Interaction Reliability Enhancement System based on Feature Selection and Classification Technique)

  • 이민수;박승수;이상호;용환승;강성희
    • 정보처리학회논문지B
    • /
    • 제13B권7호
    • /
    • pp.679-688
    • /
    • 2006
  • 대용량 실험으로부터 산출된 단백질 상호작용 데이터는 위양성(false positive) 데이터의 비율이 높다는 단점을 가지고 있다. 본 논문에서는 오류가 섞여있는 단백질 상호작용 데이터를 입력으로 받아 각 단백질 상호작용의 신뢰도를 검증하는 시스템을 제안하고 구현하였다. 제안 시스템은 단백질 상호작용 데이터에 상호작용의 근거로서 사용될 수 있는 다양한 생물학적 특징들에 관한 데이터를 통합하고 특징 선택 방법을 사용하여 통합된 속성들 중 위양성 여부를 판별하는데 가장 적합한 특징들을 선택한 후 데이터 마이닝 분류 알고리즘을 적용하여 대용량 실험으로부터 산출된 단백질 상호작용 데이터의 신뢰도를 평가한다. 특징 선택의 결과와 분류 기법의 성능은 데이터 특성에 매우 의존하므로, 제안시스템에 가장 적합한 속성 부분집합과 가장 좋은 성능을 내는 분류 알고리즘을 찾기 위해 다양한 특징 선택 방법과 데이터 마이닝 분류 알고리즘들을 적용하고 그 성능을 다각적으로 비교분석 하였다. 실험 결과, 특징 선택 방법과 분류 알고리즘을 결합시킨 제안 시스템은 오류 데이터가 섞여있는 단백질 상호작용 데이터에서 실제로 상호작용하는 단백질 쌍을 골라내는 작업에 있어 기존 연구들에 비해 매우 뛰어난 성능을 보여줬다. 또한 본 연구를 통해 단백질 상호작용 데이터의 신뢰도를 검증함에 있어서 다양한 특징 선택 방법들과 분류 알고리즘들이 성능에 미치는 영향에 관해서도 정리할 수 있었다.

Deep Window Detection in Street Scenes

  • Ma, Wenguang;Ma, Wei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권2호
    • /
    • pp.855-870
    • /
    • 2020
  • Windows are key components of building facades. Detecting windows, crucial to 3D semantic reconstruction and scene parsing, is a challenging task in computer vision. Early methods try to solve window detection by using hand-crafted features and traditional classifiers. However, these methods are unable to handle the diversity of window instances in real scenes and suffer from heavy computational costs. Recently, convolutional neural networks based object detection algorithms attract much attention due to their good performances. Unfortunately, directly training them for challenging window detection cannot achieve satisfying results. In this paper, we propose an approach for window detection. It involves an improved Faster R-CNN architecture for window detection, featuring in a window region proposal network, an RoI feature fusion and a context enhancement module. Besides, a post optimization process is designed by the regular distribution of windows to refine detection results obtained by the improved deep architecture. Furthermore, we present a newly collected dataset which is the largest one for window detection in real street scenes to date. Experimental results on both existing datasets and the new dataset show that the proposed method has outstanding performance.

심초음파도내에서의 심장 판막 운동 추적을 위한 동영상 처리 기술에 대한 기초 연구 (I) (A study on the development of an image processing technique for tracing the movement of heart valves in echocardiograms (I))

  • 육인수;김재익;최홍호
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1997년도 춘계학술대회
    • /
    • pp.88-91
    • /
    • 1997
  • One of the most significant feature of diagnostic ultrasonic instrument is to display information on the soft tissues in the body in real time. In this paper we carried out basic study on the digital moving image processing for tracing the movement of heart valves in echocardiograms. Digital moving image file was made from analog echocardiograms and it was remade as 256 gray-level images on each frame. The ROI(Region of interest) was placed on a heart valve region to process images efficiently. Images were processed by the use of image enhancement filters and morphology filters. The result shows that the processed images were more enhanced than original images. When a moving image is reconstructed by using these enhanced images, we can trace the movement of heart valves more easily. In this study we proposed the availability of the moving image reconstruction using enhancement images.

  • PDF

Mitochondrial Myopathy 환자에서 과제지향적 상지운동과 탄성밴드를 이용한 기능적 근력증진 프로그램이 상지근력과 일상생활활동에 미치는 영향 -단일사례연구- (The Effect of Task-oriented Arm Movements and Muscle Enhancement Program Using Elastic Bands on Upper Limb Muscle Strength and Activities of Daily Living of Mitochondrial Myopathy Patient -Single subject design-)

  • 박형기;이강성
    • PNF and Movement
    • /
    • 제8권1호
    • /
    • pp.11-19
    • /
    • 2010
  • Purpose : The purpose of this study was to the effect of task-oriented arm movements and muscle enhancement program using elastic bands on limb muscle strength and activities of daily living of mitochondrial myopathy patient. Method : Single-subject experimental research design was applied to. AB Design was adopted. The study period was approximately four weeks. A baseline period of the three sessions of the experiment, the treatment period B, 3 sessions were conducted. Baseline period to observe the patient's daily life bardel index was measured as an independent feature, MMT as a limb muscle strength was assessed by measuring early. During the period of treatment with serabaendeu limb strength training 30 minutes after the break five minutes after the treatment using MMT limb muscle strength were evaluated. Task-oriented exercise program, and who exercise a week as a treatment was carried out in 30 minutes. Result : All of the scores for each sessional period of treatment when compared to base line and upper limb muscle strengthening exercises on the subjects that did not change significantly. Conclusion : If the muscles and nervous system involvement in patients with symptoms such as muscle weakness and paralysis of upper extremity functional use is difficult.

  • PDF

유성음/무성음 분리를 이용한 잡음처리 (Speech Enhancement Based on Voice/Unvoice Classification)

  • 유창동
    • 한국음향학회지
    • /
    • 제21권4호
    • /
    • pp.374-379
    • /
    • 2002
  • 본 논문에서는 유성음/무성음 분리를 이용하여 잡음처리를 한다. 유성음과 무성음은 음성의 하나의 중요한 특징으로 유성음과 무성음 부분에 각각 같은 잡음처리기법을 삼는 것이 아니라 각각의 성질을 고려하여 잡음처리를 하였다. 유성음/무성음의 분리는 영 교차율과 에너지를 이용하여 구해 졌으며, 유성음/무성음 분리정보를 토대로 하여 변형된 음성/잡음우세결정방법을 제안하였다. 제안된 방법은 백색 잡음과 비행기 잡음에 오염된 음성문장에 대해 성능평가가 이루어졌다. 그리고 다양한 입력 신호대잡음비 (SNR)로 오염된 문장에 대해 세그멘탈 신호대잡음비를 구하고, 듣기 평가를 통해 기존의 방법보다 향상된 성능을 가짐을 알 수 있다.

Fundamental Output Voltage Enhancement of Half-Bridge Voltage Source Inverter with Low DC-link Capacitance

  • Elserougi, Ahmed;Massoud, Ahmed;Ahmed, Shehab
    • Journal of Power Electronics
    • /
    • 제18권1호
    • /
    • pp.116-128
    • /
    • 2018
  • Conventionally, in order to reduce the ac components of the dc-link capacitors of the two-level Half-Bridge Voltage Source Inverter (HB-VSI), high dc-link capacitances are required. This necessitates the employment of short-lifetime and bulky electrolytic capacitors. In this paper, an analysis for the performance of low dc-link capacitances-based HB-VSI is presented to elucidate its ability to generate an enhanced fundamental output voltage magnitude without increasing the voltage rating of the involved switches. This feature is constrained by the load displacement factor. The introduced enhancement is due to the ac components of the capacitors' voltages. The presented approach can be employed for multi-phase systems through using multi single-phase HB-VSI(s). Mathematical analysis of the proposed approach is presented in this paper. To ensure a successful operation of the proposed approach, a closed loop current controller is examined. An expression for the critical dc-link capacitance, which is the lowest dc-link capacitance that can be employed for unipolar capacitors' voltages, is derived. Finally, simulation and experimental results are presented to validate the proposed claims.