• Title/Summary/Keyword: Data Normalization

검색결과 481건 처리시간 0.033초

Building Hybrid Stop-Words Technique with Normalization for Pre-Processing Arabic Text

  • Atwan, Jaffar
    • International Journal of Computer Science & Network Security
    • /
    • 제22권7호
    • /
    • pp.65-74
    • /
    • 2022
  • In natural language processing, commonly used words such as prepositions are referred to as stop-words; they have no inherent meaning and are therefore ignored in indexing and retrieval tasks. The removal of stop-words from Arabic text has a significant impact in terms of reducing the size of a cor- pus text, which leads to an improvement in the effectiveness and performance of Arabic-language processing systems. This study investigated the effectiveness of applying a stop-word lists elimination with normalization as a preprocessing step. The idea was to merge statistical method with the linguistic method to attain the best efficacy, and comparing the effects of this two-pronged approach in reducing corpus size for Ara- bic natural language processing systems. Three stop-word lists were considered: an Arabic Text Lookup Stop-list, Frequency- based Stop-list using Zipf's law, and Combined Stop-list. An experiment was conducted using a selected file from the Arabic Newswire data set. In the experiment, the size of the cor- pus was compared after removing the words contained in each list. The results showed that the best reduction in size was achieved by using the Combined Stop-list with normalization, with a word count reduction of 452930 and a compression rate of 30%.

조명변화에 강인한 눈 검출을 위한 조명 정규화 방법 (Illumination Normalization Method for Robust Eye Detection in Lighting Changing Environment)

  • 허성철;이흐테샴울이슬람;김인택
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2008년도 하계종합학술대회
    • /
    • pp.955-956
    • /
    • 2008
  • This paper presents a new method for illumination normalization in eye detection. Based on the retinex image formation model, we employ the discrete wavelet transform to remove the lighting effect in face image data. The final result based on the proposed method shows the better performance in detecting eyes compared with previous work.

  • PDF

On-Line Blind Channel Normalization for Noise-Robust Speech Recognition

  • Jung, Ho-Young
    • IEIE Transactions on Smart Processing and Computing
    • /
    • 제1권3호
    • /
    • pp.143-151
    • /
    • 2012
  • A new data-driven method for the design of a blind modulation frequency filter that suppresses the slow-varying noise components is proposed. The proposed method is based on the temporal local decorrelation of the feature vector sequence, and is done on an utterance-by-utterance basis. Although the conventional modulation frequency filtering approaches the same form regardless of the task and environment conditions, the proposed method can provide an adaptive modulation frequency filter that outperforms conventional methods for each utterance. In addition, the method ultimately performs channel normalization in a feature domain with applications to log-spectral parameters. The performance was evaluated by speaker-independent isolated-word recognition experiments under additive noise environments. The proposed method achieved outstanding improvement for speech recognition in environments with significant noise and was also effective in a range of feature representations.

  • PDF

Adaptive Channel Normalization Based on Infomax Algorithm for Robust Speech Recognition

  • Jung, Ho-Young
    • ETRI Journal
    • /
    • 제29권3호
    • /
    • pp.300-304
    • /
    • 2007
  • This paper proposes a new data-driven method for high-pass approaches, which suppresses slow-varying noise components. Conventional high-pass approaches are based on the idea of decorrelating the feature vector sequence, and are trying for adaptability to various conditions. The proposed method is based on temporal local decorrelation using the information-maximization theory for each utterance. This is performed on an utterance-by-utterance basis, which provides an adaptive channel normalization filter for each condition. The performance of the proposed method is evaluated by isolated-word recognition experiments with channel distortion. Experimental results show that the proposed method yields outstanding improvement for channel-distorted speech recognition.

  • PDF

독립성분해석과 정규화를 이용한 영상분류 방법 (Image Classification Method using Independent Component Analysis and Normalization)

  • 홍준식;유정웅
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제28권9호
    • /
    • pp.629-633
    • /
    • 2001
  • 본 논문에서는 독립 성분 해석(Independent Component Analysis, ICA) 기법과 정규화를 이용한 영상분류 방법을 제안한다. 이 제안된 방법은 전처리 없이 ICA나 주성분 해석(Principal Component Analysis, PCA)을 이용한 것에 비해 잡음에 대한 강인성을 증가시킨다. 영상에 잡음이 인가된 경우, CPA는 N(0, 0.4), ICA는 N(0.53)까지이 분류가 가능함을 보이는 반면에 비해, 제안된 정규화 전처리는 N(0, 0.75)까지 영상분류가 됨을 실험에서 보이고 있다.

  • PDF

내부 최적화를 이용한 화학 센서의 단기 드리프트 분석 및 보정 (Short Term Sensor's Drift Analysis and Compensation Using Internal Normalization)

  • 전진영;백종현;변형기
    • 센서학회지
    • /
    • 제24권4호
    • /
    • pp.270-273
    • /
    • 2015
  • One of the main problems when working the chemical sensor is the lack of repeatability and reproducibility of the sensor response. If the problem is not properly taken into consideration, the stability and reliability of the system using chemical sensors would be decreased. In this paper we analyzed the sensor's drift of short term and proposed a compensation method for reducing the effects of the drift in order to improve the stability and the reliability of the chemical sensor. The sensor drift was analyzed by a trend line graph and CV(coefficient of variation) was used to quantify. And we compensated for the drift by using the internal normalization. As a result it was found that the value of CV was decreased after compensation.

금강하구둑 배수갑문 조작에 의한 상류수역의 수위변동 (Variation of Water Level on the Upstream Gauging Station by Operation of the Drainage Sluice Gate of Geumgang Estuary Dam)

  • 박승기
    • 한국농공학회논문집
    • /
    • 제47권6호
    • /
    • pp.15-24
    • /
    • 2005
  • The normalization on the characteristics of water level change at the upstream gauging station was attempted according to the operation of drainage sluice gate of the Geumgang estuary dam. The characteristics were normalized by the analysis of water level change and by the linear-regression of the water level data measured at the inner station of Geumgang estuary dam and upstream gauging station. The results of normalization may be referred to the management of Geumgang estuary lake, the operation of pumping and drainage stations in the shore of the lake. The mean response time of water level change on Ibpo, Ganggyeong and Gyuam water level station were 39,81 and 160 minutes, when sluice gate was opened respectively. The mean velocity of surface wave, the mean displacement of water level change, the mean time of water level change and the mean rate of water level change varied largely depending on the location of gauging station and the characteristics of stream section of the water level gauging station.

CNN기초로 세 가지 방법을 이용한 감정 표정 비교분석 (Comparative Analysis for Emotion Expression Using Three Methods Based by CNN)

  • 양창희;박규섭;김영섭;이용환
    • 반도체디스플레이기술학회지
    • /
    • 제19권4호
    • /
    • pp.65-70
    • /
    • 2020
  • CNN's technologies that represent emotional detection include primitive CNN algorithms, deployment normalization, and drop-off. We present the methods and data of the three experiments in this paper. The training database and the test database are set up differently. The first experiment is to extract emotions using Batch Normalization, which complemented the shortcomings of distribution. The second experiment is to extract emotions using Dropout, which is used for rapid computation. The third experiment uses CNN using convolution and maxpooling. All three results show a low detection rate, To supplement these problems, We will develop a deep learning algorithm using feature extraction method specialized in image processing field.

Skeleton 정보와 LSTM을 이용한 작업자 동작인식 (Motion Recognition of Workers using Skeleton and LSTM)

  • 전왕수;이상용
    • 한국멀티미디어학회논문지
    • /
    • 제25권4호
    • /
    • pp.575-582
    • /
    • 2022
  • In the manufacturing environment, research to minimize robot collisions with human beings have been widespread, but in order to interact with robots, it is important to precisely recognize and predict human actions. In this research, after enhancing performance by applying group normalization to the Hourglass model to detect the operator motion, the skeleton was estimated and data were created using this model. And then, three types of operator's movements were recognized using LSTM. As results of the experiment, the accuracy was enhanced by 1% using group normalization, and the recognition accuracy was 99.6%.

RNAseq 빅데이터에서 유전자 선택을 위한 밀집도-의존 정규화 기반의 서포트-벡터 머신 병합법 (Combining Support Vector Machine Recursive Feature Elimination and Intensity-dependent Normalization for Gene Selection in RNAseq)

  • 김차영
    • 인터넷정보학회논문지
    • /
    • 제18권5호
    • /
    • pp.47-53
    • /
    • 2017
  • 고처리 시퀀싱과 빅데이터 및 크라우드 컴퓨팅에 혁신이 일어나면서, RNA 시퀀싱도 획기적인 변화가 일어, RNAseq가 기존의 DNA 마이크로어레이를 대체하여, 빅-데이터를 형성하고 있다. 현재, RANseq 이용한 유전자 조절망(GRN) 까지 연구가 활성화 되고 있는데, 그 중 한 분야가 GRN의 기본 요소인 특징 유전자를 빅-데이터에서도 구별하고 기존에 알려진 것 외에 새로운 역할을 찾는 것이다. 그러나, 이러한 연구 방향에 부합하는 빅-데이터를 처리할 수 있는 컴퓨테이션 방법이 아직까지 매우 부족하다. 따라서 본 논문에서는 RNAseq 빅-데이터를 처리할 수 있도록 기존의 SVM-RFE알고리즘을 밀집도-의존 정규화에 병합하여, NCBI-GEO와 같은 빅-데이터에서 공개된 일부의 데이터에 개선된 알고리즘을 적용하고 해당 알고리즘에 의해 나온 결과의 성능을 평가한다.