• Title/Summary/Keyword: Computer science and engineering

Search Result 16,057, Processing Time 0.048 seconds

α-feature map scaling for raw waveform speaker verification (α-특징 지도 스케일링을 이용한 원시파형 화자 인증)

  • Jung, Jee-weon;Shim, Hye-jin;Kim, Ju-ho;Yu, Ha-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.5
    • /
    • pp.441-446
    • /
    • 2020
  • In this paper, we propose the α-Feature Map Scaling (α-FMS) method which extends the FMS method that was designed to enhance the discriminative power of feature maps of deep neural networks in Speaker Verification (SV) systems. The FMS derives a scale vector from a feature map and then adds or multiplies them to the features, or sequentially apply both operations. However, the FMS method not only uses an identical scale vector for both addition and multiplication, but also has a limitation that it can only add a value between zero and one in case of addition. In this study, to overcome these limitations, we propose α-FMS to add a trainable parameter α to the feature map element-wise, and then multiply a scale vector. We compare the performance of the two methods: the one where α is a scalar, and the other where it is a vector. Both α-FMS methods are applied after each residual block of the deep neural network. The proposed system using the α-FMS methods are trained using the RawNet2 and tested using the VoxCeleb1 evaluation set. The result demonstrates an equal error rate of 2.47 % and 2.31 % for the two α-FMS methods respectively.

A Resource Reservation Protocol for Mobile Hosts in Wireless Mobile Networks (무선 이동망에서의 이동 호스트를 지원하기 위한 자원 예약 프로토콜)

  • Kim, Min-Sun;Suh, Young-Joo;An, Syung-Og
    • Journal of KIISE:Information Networking
    • /
    • v.29 no.4
    • /
    • pp.428-436
    • /
    • 2002
  • Providing a mobile host with its required QoS is highly influenced by its mobility. The resource ReSerVation Protocol(RSVP) establishes and maintains a reservation state to ensure a given QoS level along the path from the sender to the receiver. However, RSVP is designed for use in fixed networks and thus it is inadequate in the mobile networking environment where a host changes its point of attachment. In this paper, we propose a new resource reservation protocol, RSVP-RA(RSVP by RSVP Agent) for mobile hosts. Our protocol assumes IETF Mobile IP as a mobility support mechanism. The proposed protocol introduce a new protocol entity - RSVP agent - to manage reservations in a mobile host's current visiting network. RSVP Agent is located in a local network and makes resource reservations in neighboring cells that the mobile host is expected to visit in the future. Thus, the proposed Protocol can provide a seamless QoS to the mobile host and significantly improve the scalability problem of RSVP by reducing the end-to-end signalling messages acrossing the backbone networks. The proposed protocols reduce packet delay, bandwidth overhead and the number of RSVP messages to maintain reservation states. We compared the performance of our proposed protocol with other proposed protocols in terms of signalling overhead, packet delay by simulation.

A study on combination of loss functions for effective mask-based speech enhancement in noisy environments (잡음 환경에 효과적인 마스크 기반 음성 향상을 위한 손실함수 조합에 관한 연구)

  • Jung, Jaehee;Kim, Wooil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.3
    • /
    • pp.234-240
    • /
    • 2021
  • In this paper, the mask-based speech enhancement is improved for effective speech recognition in noise environments. In the mask-based speech enhancement, enhanced spectrum is obtained by multiplying the noisy speech spectrum by the mask. The VoiceFilter (VF) model is used as the mask estimation, and the Spectrogram Inpainting (SI) technique is used to remove residual noise of enhanced spectrum. In this paper, we propose a combined loss to further improve speech enhancement. In order to effectively remove the residual noise in the speech, the positive part of the Triplet loss is used with the component loss. For the experiment TIMIT database is re-constructed using NOISEX92 noise and background music samples with various Signal to Noise Ratio (SNR) conditions. Source to Distortion Ratio (SDR), Perceptual Evaluation of Speech Quality (PESQ), and Short-Time Objective Intelligibility (STOI) are used as the metrics of performance evaluation. When the VF was trained with the mean squared error and the SI model was trained with the combined loss, SDR, PESQ, and STOI were improved by 0.5, 0.06, and 0.002 respectively compared to the system trained only with the mean squared error.

Reversible data hiding technique applying triple encryption method (삼중 암호화 기법을 적용한 가역 데이터 은닉기법)

  • Jung, Soo-Mok
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.15 no.1
    • /
    • pp.36-44
    • /
    • 2022
  • Reversible data hiding techniques have been developed to hide confidential data in the image by shifting the histogram of the image. These techniques have a weakness in which the security of hidden confidential data is weak. In this paper, to solve this drawback, we propose a technique of triple encrypting confidential data using pixel value information and hiding it in the cover image. When confidential data is triple encrypted using the proposed technique and hidden in the cover image to generate a stego-image, since encryption based on pixel information is performed three times, the security of confidential data hidden by triple encryption is greatly improved. In the experiment to measure the performance of the proposed technique, even if the triple-encrypted confidential data was extracted from the stego-image, the original confidential data could not be extracted without the encryption keys. And since the image quality of the stego-image is 48.39dB or higher, it was not possible to recognize whether confidential data was hidden in the stego-image, and more than 30,487 bits of confidential data were hidden in the stego-image. The proposed technique can extract the original confidential data from the triple-encrypted confidential data hidden in the stego-image without loss, and can restore the original cover image from the stego-image without distortion. Therefore, the proposed technique can be effectively used in applications such as military, medical, digital library, where security is important and it is necessary to completely restore the original cover image.

Body Composition Factor Comparisons of the Intracellular Fluid(ICW), Extracellular Fluid(ECW) and Cell Membrane at Acupuncture Points and Non-Acupuncture Points by Inducing Multiple Ionic Changes (생체이온 변화 유발 후 경혈과 비경혈에서의 생체 구조 성분 분석 및 비교를 통한 경혈 특이성 고찰)

  • Kim, Soo-Byeong;Chung, Kyung-Yul;Jeon, Mi-Seon;Shin, Tae-Min;Lee, Yong-Heum
    • Korean Journal of Acupuncture
    • /
    • v.31 no.2
    • /
    • pp.66-78
    • /
    • 2014
  • Objectives : The specificity of acupuncture point has been a highly controversial subject. Existing researches said that ion-distribution differences are observed on the acupuncture point. This study was conducted under the assumption that multiple ionic changes induced by muscle fatigue would be different between the acupuncture point with non-acupuncture point. Methods : To induce the identical fatigue, twenty subjects performed the knee extension/flexion exercise using the Biodex System 3. ST32 and ST33 as well as adjacent non-acupuncture points were selected. We measured blood lactate and analyzed the median frequency(MF) and peak torque. To obtain the information on the extracellular fluid(ECW), intracellular fluid(ICW) and cell membrane indirectly, we used the multi-frequency bioelectrical impedance analysis(MF-BIA) method. Results : MF, peak torque and blood lactate level of all measurement sites were gradually returned to normal. Re resistance of ST32 had a stronger response, but a non-acupuncture point adjacent to ST33 had a larger response up to 20 minutes post exercise. Ri resistances were similar for both acupoints and non-acupoints. The $C_m$ capacitance of ST32 had a stronger response after inducing fatigue, but ST33 had a smaller response than a non-acupuncture point adjacent to it. Conclusions : In comparison with before and after inducing fatigue, the specificity of acupuncture points was not clearly observed. Hence, we concluded that the body composition factors extraction method had the limitation as a method of finding the specificity of acupuncture points by inducing fatigue.

Feedback Phenomenon in Technology Art (예술 공학의 피드백)

  • Kim Hyung-Gi
    • Science of Emotion and Sensibility
    • /
    • v.8 no.4
    • /
    • pp.423-433
    • /
    • 2005
  • The computer hardware development has provided many chances of emergence between art and technology. In many cases today's interactive artworks cannot be completed without audience's participation. The interactive production process with technical supplementation can be celled feedback. Mr. Nam Jun Paik showed 'Participant TV' that interacts with audience's response in real time. It means artwork changes with the constantly changing value from the data set from human visual perception. Dan Graham showed another feedback related work, which delays 5second playback in mirror that implies consequence of time. Today's media art has to sublimate coincidence, time ant audience into philosophical artwork through consonance that comes with video and sound as we can see from Bill Viola. Stelarc produced artworks. That use input data that is weak signals from brain, muscles. Through a terminal display with player, body expanded meaning of media. Jeffrey Shaw's 'Legible City' provided a fabrication of the reality with the interaction of bicycle's pedal speed and steering direction that is controlled by 4river. RE:MARK used microphone as input device as Edmond Couchot's 'Je same a la vent' and Nam Jun Paik's 'Participant TV' did. There is no communication without feedback between human being. The reality makes audience involved into artworks. That is the reason why feedback has to be natural. Through the feedback process, the originality of the idea is altered by audience. The feedback is not just part of flesh of artwork rather skeleton of it. Technological showoff cannot be art itself The perfection of technological application plan helps feedback that interacts with audience naturally in order that audience hoes not feel the feedback as artificial plan. Interactive media art has to be evolved into new media form with new integration feedback technology.

  • PDF

I-vector similarity based speech segmentation for interested speaker to speaker diarization system (화자 구분 시스템의 관심 화자 추출을 위한 i-vector 유사도 기반의 음성 분할 기법)

  • Bae, Ara;Yoon, Ki-mu;Jung, Jaehee;Chung, Bokyung;Kim, Wooil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.5
    • /
    • pp.461-467
    • /
    • 2020
  • In noisy and multi-speaker environments, the performance of speech recognition is unavoidably lower than in a clean environment. To improve speech recognition, in this paper, the signal of the speaker of interest is extracted from the mixed speech signals with multiple speakers. The VoiceFilter model is used to effectively separate overlapped speech signals. In this work, clustering by Probabilistic Linear Discriminant Analysis (PLDA) similarity score was employed to detect the speech signal of the interested speaker, which is used as the reference speaker to VoiceFilter-based separation. Therefore, by utilizing the speaker feature extracted from the detected speech by the proposed clustering method, this paper propose a speaker diarization system using only the mixed speech without an explicit reference speaker signal. We use phone-dataset consisting of two speakers to evaluate the performance of the speaker diarization system. Source to Distortion Ratio (SDR) of the operator (Rx) speech and customer speech (Tx) are 5.22 dB and -5.22 dB respectively before separation, and the results of the proposed separation system show 11.26 dB and 8.53 dB respectively.

Scheduling Method of Real-Time Mobile Transaction Manager considering Value of Transactions and Validity of Real-Time Data (트랜잭션의 중요도와 데이터의 유효성을 고려한 실시간 이동 트랜잭션 관리자의 스케줄링 기법)

  • Jo, Suk-Gyeong;Kim, Gyeong-Bae;Lee, Sun-Jo;Bae, Hae-Yeong
    • The KIPS Transactions:PartD
    • /
    • v.8D no.5
    • /
    • pp.533-542
    • /
    • 2001
  • In this paper, we present a scheduling method for real-time mobile transaction manager in mobile computing environment. The proposed method checks whether a transaction is executable or not. It is able to by considering not only the deadline of real-time data in mobile hosts. And then, it schedules the real-time mobile transactions by making optimal execution window based on the priority queue, while considering transaction value and deadline. Disconnection with mobile hosts is monitored in selecting the transaction for execution. Using the proposed method reduces the number of restarting times after transaction aborts which is caused by the violation of the validity constraints of real-time data. And in has merits of maximizing the sum of values of real-time mobile transactions which meet the deadline. The performance evaluation demonstrates that the number of committed real-time transactions within the deadline is improved by 20%. This method can be used in real-time mobile transaction manager is such environments as cellular communications, emergency medicine information system and so on.

  • PDF

A Big Data Based Random Motif Frequency Method for Analyzing Human Proteins (인간 단백질 분석을 위한 빅 데이타 기반 RMF 방법)

  • Kim, Eun-Mi;Jeong, Jong-Cheol;Lee, Bae-Ho
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.6
    • /
    • pp.1397-1404
    • /
    • 2018
  • Due to the technical difficulties and high cost for obtaining 3-dimensional structure data, sequence-based approaches in proteins have not been widely acknowledged. A motif can be defined as any segments in protein or gene sequences. With this simplicity, motifs have been actively and widely used in various areas. However, the motif itself has not been studied comprehensively. The value of this study can be categorized in three fields in order to analyze the human proteins using artificial intelligence method: (1) Based on our best knowledge, this research is the first comprehensive motif analysis by analyzing motifs with all human proteins in Protein Data Bank (PDB) associated with the database of Enzyme Commission (EC) number and Structural Classification of Proteins (SCOP). (2) We deeply analyze the motif in three different categories: pattern, statistical, and functional analysis of clusters. (3) At the last and most importantly, we proposed random motif frequency(RMF) matric that can efficiently distinct the characteristics of proteins by identifying interface residues from non-interface residues and clustering protein functions based on big data while varying the size of random motif.

Ultrasound-optical imaging-based multimodal imaging technology for biomedical applications (바이오 응용을 위한 초음파 및 광학 기반 다중 모달 영상 기술)

  • Moon Hwan Lee;HeeYeon Park;Kyungsu Lee;Sewoong Kim;Jihun Kim;Jae Youn Hwang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.5
    • /
    • pp.429-440
    • /
    • 2023
  • This study explores recent research trends and potential applications of ultrasound optical imaging-based multimodal technology. Ultrasound imaging has been widely utilized in medical diagnostics due to its real-time capability and relative safety. However, the drawback of low resolution in ultrasound imaging has prompted active research on multimodal imaging techniques that combine ultrasound with other imaging modalities to enhance diagnostic accuracy. In particular, ultrasound optical imaging-based multimodal technology enables the utilization of each modality's advantages while compensating for their limitations, offering a means to improve the accuracy of the diagnosis. Various forms of multimodal imaging techniques have been proposed, including the fusion of optical coherence tomography, photoacoustic, fluorescence, fluorescence lifetime, and spectral technology with ultrasound. This study investigates recent research trends in ultrasound optical imaging-based multimodal technology, and its potential applications are demonstrated in the biomedical field. The ultrasound optical imaging-based multimodal technology provides insights into the progress of integrating ultrasound and optical technologies, laying the foundation for novel approaches to enhance diagnostic accuracy in the biomedical domain.