• 제목/요약/키워드: voice search

검색결과 90건 처리시간 0.03초

HSI 컬러 공간과 신경망을 이용한 내용 기반 이미지 검색 (Content-based Image Retrieval Using HSI Color Space and Neural Networks)

  • 김광백;우영운
    • 한국전자통신학회논문지
    • /
    • 제5권2호
    • /
    • pp.152-157
    • /
    • 2010
  • 컴퓨터와 인터넷의 발달로 정보의 형태가 다양화 되어 문서 위주의 자료들로부터 이미지, 오디오, 비디오, 음성 등의 모습으로 혼합되어 가고 있다. 하지만 대부분의 검색은 문서 위주로 하기 때문에 이미지, 오디오, 비디오 등은 파일의 이름이 명확하게 설정되어 있지 않을 경우에는 검색을 할 수 없다. 이러한 문제점을 해결하기 위해 문서가 아닌 내용을 기반으로 검색하는 방법을 내용 기반 검색이라고 한다. 그리고 이미지의 내용을 기반으로 검색하는 방법을 내용 기반 이미지 검색이라고 한다. 본 논문에서는 HSI 컬러 공간, ART2 알고리즘, SOM 알고리즘을 이용한 내용 기반 이미지 검색 방법을 제안한다. 제안하는 방법은 학습 대상을 선정하기 위해 원 영상의 특징을 분할한다. 그리고 사용자가 학습 대상을 선정하도록 하기 위해 분할된 특징을 SOM 알고리즘에 적용하여 비슷한 특징을 가지는 영상들로 군집화 한다. 군집화된 영상들에 대해 사용자가 학습 대상을 선정하여 ART2 알고리즘에 적용하여 학습한다. 제안한 방법을 적용하여 이미지 검색을 실험한 결과 제안된 방법은 하나의 이미지가 여러 개의 키워드를 가질 수 있기 때문에 이미지에 포함된 정보를 효과적으로 검색하는 것을 확인하였다.

Speaker Detection and Recognition for a Welfare Robot

  • Sugisaka, Masanori;Fan, Xinjian
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2003년도 ICCAS
    • /
    • pp.835-838
    • /
    • 2003
  • Computer vision and natural-language dialogue play an important role in friendly human-machine interfaces for service robots. In this paper we describe an integrated face detection and face recognition system for a welfare robot, which has also been combined with the robot's speech interface. Our approach to face detection is to combine neural network (NN) and genetic algorithm (GA): ANN serves as a face filter while GA is used to search the image efficiently. When the face is detected, embedded Hidden Markov Model (EMM) is used to determine its identity. A real-time system has been created by combining the face detection and recognition techniques. When motivated by the speaker's voice commands, it takes an image from the camera, finds the face inside the image and recognizes it. Experiments on an indoor environment with complex backgrounds showed that a recognition rate of more than 88% can be achieved.

  • PDF

잡음 환경에서의 전송율 감소를 위한 G.723.1 VAD 성능개선에 관한 연구 (The Research of Reducing the Fixed Codebook Search Time of G.723.1 MP-MLQ)

  • 김정진;박영호;배명진
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 하계종합학술대회 논문집(4)
    • /
    • pp.98-101
    • /
    • 2000
  • On CELP type Vocoders G.723.1 6.3kbps/5.3kbps Dual Rate Speech Codec, which is developed for Internet Phone and videoconferencing, uses VAD(Voice Activity Detection)/CNG (Comfort Noise Generator) in order to reduce the bit rate in a silence period. In order to reduce the bit rate effectively in this paper, we first set the boundary condition of the energy threshold to prevent the consumption of unnecessary processing time, and use three decision rules to detect an active frame by energy, pitch gain and LSP distance. To evaluate the performance of the proposed algorithm we use silence-inserted speech data with 0, 5, 10, 20dB of SNR. As a result when SNR is over 5dB, the bit rate is reduced up to about 40% without speech degradation and the processing time is additionally decreased.

  • PDF

연안 어선에서 어선원 인명피해 최소화를 위한 통신 체계 구축 (Telecommunication System Construction to minimize the Casualty of Fisher in the coastal Fishing Boat)

  • 김석재;김욱성;이유원
    • 수산해양교육연구
    • /
    • 제25권3호
    • /
    • pp.580-586
    • /
    • 2013
  • For telecommunication system construction to minimize the casualty of fisher, we investigated the usability of TRS communication system and performance of GPS automatic position transmitter (APT) which can be utilized for the survival, search and rescue of the victims. The trial experiments were conducted at sea with TRS and CDMA in the East, West and South Sea of Korea from October to December. As a result, the usability of the TRS as an emergency communication system device was verified since it provided stable position and voice information to the boundary of 50km far from the coast. Therefore the system is expected to contribute to minimization of victims.

BcN 적합형 액세스네트워크 구조 (The Access Network Architecture for BcN Adapted)

  • 이상문
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 한국정보통신설비학회 2007년도 학술대회
    • /
    • pp.121-124
    • /
    • 2007
  • This article describes a function and structure of access network equipment under BcN environment. Access network until now have constructed separately to offer voice, data service. However, simplifies network structure, function that can do traffic concentration, subscriber certification, individual charging, QoS according to service and routing is required in BcN. In this paper, compare method offering by separate system with existing access network and method that offer integrating function inside system for structure of suitable access network to BcN and search structure of access network equipment for desirable access network of hereafter. Composition of this paper is as following. In Chapter 2, establishment history and structure of access network until present. In Chaprte 3, define suitable requirement and functions to BcN. And compare structure for access net work that is new with present. Last Chapter 4, suggests direction of structure of BcN access network and concludes conclusion.

  • PDF

통신시장 유무선 서비스 도입과 주요국의 규제현황

  • 송영화
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회 2001년도 추계학술대회 논문집
    • /
    • pp.358-361
    • /
    • 2001
  • Fixed-mobile convergence services can be defined as the combination of previously separate fixed and mobile services, and networks and commercial practices. Examples of fixed-mobile convergence services include single voicemail box, single number and unified messaging across fixed and mobile networks. Recently as more voice is transferred to mobile networks, convergence services between fixed and mobile become more important. In Korea convergence services are only starting to become established, and are likely to become an important part of any operator's offering. In this paper, I search the different levels of fixed-mobile convergence services and the trends and regulations for fixed-mobile convergence services in major countries. And at the same time I also try to find a new direction of future regulatory principles related to fixed-mobile convergence services.

  • PDF

시각장애자의 보행지원에 관한 연구 (A study on walking aids for the blind)

  • 함광근;한상휘;양승열;김현규;허웅
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1997년도 춘계학술대회
    • /
    • pp.131-135
    • /
    • 1997
  • We implementated an ultrasonic wave cane for the blind. The cane detect walking obstacle and provide a walking direction. The cane used time of flight method of ultrasonic-wave for a measurement of obstacle distance and fluxgate geomagnetic sensor for guidance of walking direction. This system can detect an obstacle of upward, forward, downward and that warn to the blind with vibration, pitch sound. And the blind can know walking direction to voice output. As a result, the blind could efficiently avoid a exposed obstacle, obstacles beyond knee, an exposed street obstacle, a branch of tree person's height and it is usable search for surrounding land mark.

  • PDF

저전송률 음성부호화기의 DUAL-TONE MULTIFREQUENCY(DTMF) SIGNALLING (Detection of DTMF Signalling for Low Bit Rate Vocoder)

  • 손상목
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
    • /
    • pp.159-164
    • /
    • 1998
  • We proposes a new detecting algorithm of DTMF tone for low bit ate vocoder so that we use DTMF tones for signalling inthe digital network. Using DTMF tones for signalling, we could not change the conventional IS-95 protocol and control the mobile phone. We apply the root finding to detection of formants and bandwidth to search whether DTMF tones or voice and moreover to find what's kinds of DTMF tones, for instance 1, 2, 3, ......., #, *, A, B, ...., etc. Consequently, proposed method has a good result which is 0.000944% average error rate. It is satisfied with rcommended error rate in ITU-T($\pm$1.8%).

  • PDF

G.723.1 보코더에서 주파수 간격 정보조절을 통한 계산량 감소에 관한 연구 (A Study on Reduction of Computation Time through Adjustment the Frequency Interval Information in the G.723.1 Vocoder)

  • 민소연;김영규;배명진
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.405-408
    • /
    • 2002
  • LSP(Line Spectrum Pairs) Parameter is used for speech analysis in vocoders or recognizers since it has advantages of constant spectrum sensitivity. low spectrum distortion and easy linear interpolation. However the method of transforming LPC(Linear Predictive Coding) into LSP is so complex that it takes much time to compute. Among conventional methods, the real root method is considerably simpler than others, but nevertheless, it still suffers from its jndeterministic computation time because the root searching is processed sequentially in frequency region. We suggest a method of reducing the LSP transformation time using voice characteristics The proposed method is to apply search order and interval differently according to the distribution of LSP parameters. in comparison with the conventional real root method, the proposed method results in about 46.5% reduction. And, the total computation time is reduce to about 5% in the G.723.1 vocoder.

  • PDF

컴퓨터 알고리즘 교육을 위한 온라인 알고리즘 뱅크 구현 (Design and Implementation of Online Algorithm Bank for Algorithm E-learning)

  • 박우창
    • 컴퓨터교육학회논문지
    • /
    • 제7권4호
    • /
    • pp.1-6
    • /
    • 2004
  • 온라인상에서 교육 내용의 전달은 많은 방법들이 개발되어 있지만 컴퓨터 언어 및 알고리즘의 e-learning과 실습은 웹상에서 프로그램 실습의 어려움으로 인하여 이론과 실습이 병행되지 못하여 왔다. 본 논문에서는 알고리즘을 검색하고 관리할 수 있는 뱅크를 구축하고 실행 인터페이스를 만들어, 학생들이 직접 웹상에서 각각의 프로그램들을 실행시킬 뿐 아니라 프로그램을 수정하여 실행할 수 있도록 하였다. 웹상에서 실습을 통한 알고리즘 뱅크 시스템은 실습 환경 구축과 적응에 대한 어려움을 없앰으로써 컴퓨터 알고리즘 학습에 대한 거리감을 없애는 효과가 있다.

  • PDF