• Title/Summary/Keyword: Voice Search

Content-based Image Retrieval Using HSI Color Space and Neural Networks (HSI 컬러 공간과 신경망을 이용한 내용 기반 이미지 검색)

  • Kim, Kwang-Baek;Woo, Young-Woon
    • The Journal of the Korea institute of electronic communication sciences / v.5 no.2 / pp.152-157 / 2010
  • The development of computers and the Internet has added various types of media, such as image, audio, video, and voice, to traditional text-based information. However, most information retrieval systems are based only on text and therefore cannot use all of the available information. Using the available media can improve search performance; this approach is commonly called content-based retrieval, and a content-based image retrieval system specifically tries to incorporate image analysis into search. In this paper, a content-based image retrieval system using the HSI color space, the ART2 algorithm, and the SOM algorithm is introduced. First, images are analyzed in the HSI color space to generate several sets of features describing the images, and an SOM algorithm is used to provide candidate training features to a user. The features selected by the user are fed to the training part of the search system, which uses an ART2 algorithm. The proposed system can handle the case in which an image belongs to several groups, and it showed better performance than other systems.
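The abstract does not say how pixels are mapped into the HSI color space. As a minimal sketch, assuming the common textbook geometric conversion (the function name `rgb_to_hsi` and the choice of hue 0 for gray and black pixels are assumptions of this example, not details from the paper), one pixel could be converted like this:

```python
import math

def rgb_to_hsi(r, g, b):
    """Convert RGB components in [0, 1] to (hue in degrees, saturation, intensity).

    Textbook geometric formula; not necessarily the variant used in the paper.
    """
    intensity = (r + g + b) / 3.0
    if intensity == 0.0:
        # Black: hue and saturation are undefined, so report 0 for both.
        return 0.0, 0.0, 0.0
    saturation = 1.0 - min(r, g, b) / intensity
    den = math.sqrt((r - g) ** 2 + (r - b) * (g - b))
    if den == 0.0:
        # Gray (r == g == b): hue is undefined, so report 0.
        return 0.0, saturation, intensity
    cos_theta = 0.5 * ((r - g) + (r - b)) / den
    # Clamp to guard against rounding slightly outside [-1, 1].
    theta = math.degrees(math.acos(max(-1.0, min(1.0, cos_theta))))
    hue = theta if b <= g else 360.0 - theta
    return hue, saturation, intensity
```

Features such as hue histograms could then be built from the converted pixels, but which features the system actually extracts is not stated in the abstract.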

Speaker Detection and Recognition for a Welfare Robot

  • Sugisaka, Masanori;Fan, Xinjian
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2003.10a / pp.835-838 / 2003
  • Computer vision and natural-language dialogue play an important role in friendly human-machine interfaces for service robots. In this paper we describe an integrated face detection and face recognition system for a welfare robot, which has also been combined with the robot's speech interface. Our approach to face detection combines a neural network (NN) and a genetic algorithm (GA): the NN serves as a face filter, while the GA is used to search the image efficiently. Once a face is detected, an embedded Hidden Markov Model (EHMM) is used to determine its identity. A real-time system has been created by combining the face detection and recognition techniques. When triggered by the speaker's voice commands, it takes an image from the camera, finds the face in the image, and recognizes it. Experiments in an indoor environment with complex backgrounds showed that a recognition rate of more than 88% can be achieved.

The Research of Reducing the Fixed Codebook Search Time of G.723.1 MP-MLQ (잡음 환경에서의 전송율 감소를 위한 G.723.1 VAD 성능개선에 관한 연구)

  • 김정진;박영호;배명진
    • Proceedings of the IEEK Conference / 2000.06d / pp.98-101 / 2000
  • Among CELP-type vocoders, the G.723.1 6.3 kbps/5.3 kbps dual-rate speech codec, which was developed for Internet telephony and videoconferencing, uses VAD (Voice Activity Detection) and CNG (Comfort Noise Generator) to reduce the bit rate during silence periods. To reduce the bit rate effectively, in this paper we first set a boundary condition on the energy threshold to avoid unnecessary processing time, and then use three decision rules to detect an active frame based on energy, pitch gain, and LSP distance. To evaluate the proposed algorithm, we use silence-inserted speech data with SNRs of 0, 5, 10, and 20 dB. As a result, when the SNR is over 5 dB, the bit rate is reduced by up to about 40% without speech degradation, and the processing time is additionally decreased.
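The abstract names the ingredients of the decision (an energy boundary condition plus rules on energy, pitch gain, and LSP distance) but not how they are combined. The sketch below is one hypothetical arrangement: every threshold value, the function names, and the two-out-of-three vote are assumptions made for illustration and are not taken from the paper or from G.723.1.

```python
def frame_energy(samples):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in samples) / len(samples)

def is_active_frame(energy, pitch_gain, lsp_distance,
                    energy_floor=1e-4, energy_ceiling=1e-2,
                    energy_thr=1e-3, pitch_gain_thr=0.5, lsp_dist_thr=0.05):
    """Classify a frame as active speech (True) or silence (False).

    All thresholds are hypothetical placeholders.
    """
    # Boundary condition (assumed form): frames that are clearly silent or
    # clearly loud are decided on energy alone, skipping the other rules.
    if energy < energy_floor:
        return False
    if energy > energy_ceiling:
        return True
    # Otherwise apply the three rules; the majority vote is an assumption.
    votes = ((energy > energy_thr)
             + (pitch_gain > pitch_gain_thr)
             + (lsp_distance > lsp_dist_thr))
    return votes >= 2
```

The early exits illustrate how a boundary condition on energy can save processing time: the pitch gain and LSP distance only matter for frames whose energy falls in the ambiguous middle range.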

Telecommunication System Construction to minimize the Casualty of Fisher in the coastal Fishing Boat (연안 어선에서 어선원 인명피해 최소화를 위한 통신 체계 구축)

  • Kim, Seok-Jae;Kim, Wook-Sung;Lee, Yoo-Won
    • Journal of Fisheries and Marine Sciences Education / v.25 no.3 / pp.580-586 / 2013
  • To construct a telecommunication system that minimizes casualties among fishers, we investigated the usability of the TRS communication system and the performance of a GPS automatic position transmitter (APT), which can be used for the survival, search, and rescue of victims. Trial experiments were conducted at sea with TRS and CDMA in the East, West, and South Seas of Korea from October to December. The results verified the usability of TRS as an emergency communication device, since it provided stable position and voice information up to 50 km from the coast. The system is therefore expected to help minimize the number of victims.

The Access Network Architecture for BcN Adapted (BcN 적합형 액세스네트워크 구조)

  • Lee, Sang-Moon
    • Korea Institute of Information and Telecommunication Facilities Engineering: Conference Proceedings (한국정보통신설비학회:학술대회논문집) / 2007.08a / pp.121-124 / 2007
  • This article describes the functions and structure of access network equipment in a BcN environment. Until now, access networks have been constructed separately to offer voice and data services. In BcN, however, a simplified network structure is required, along with functions for traffic concentration, subscriber authentication, per-subscriber charging, service-dependent QoS, and routing. To find an access network structure suitable for BcN, this paper compares the method of providing these functions through separate systems alongside the existing access network with the method of integrating the functions inside one system, and examines the structure of access network equipment that is desirable for future access networks. The paper is organized as follows. Chapter 2 covers the construction history and structure of access networks to date. Chapter 3 defines the requirements and functions suitable for BcN and compares the new access network structure with the present one. Finally, Chapter 4 suggests a direction for the structure of the BcN access network and concludes.

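Introduction of Fixed-Mobile Services in the Telecommunications Market and the Regulatory Status in Major Countries (통신시장 유무선 서비스 도입과 주요국의 규제현황)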

  • 송영화
    • Proceedings of the Korean Operations and Management Science Society Conference / 2001.10a / pp.358-361 / 2001
  • Fixed-mobile convergence services can be defined as the combination of previously separate fixed and mobile services, networks, and commercial practices. Examples include a single voicemail box, a single number, and unified messaging across fixed and mobile networks. As more voice traffic has recently moved to mobile networks, convergence services between fixed and mobile networks have become more important. In Korea, convergence services are only starting to become established and are likely to become an important part of any operator's offering. In this paper, I examine the different levels of fixed-mobile convergence services and the trends and regulations for these services in major countries. I also try to find a new direction for future regulatory principles related to fixed-mobile convergence services.

A study on walking aids for the blind (시각장애자의 보행지원에 관한 연구)

  • Ham, K.K.;Han, S.H.;Yang, S.Y.;Kim, H.G.;Huh, W.
    • Proceedings of the KOSOMBE Conference / v.1997 no.05 / pp.131-135 / 1997
  • We implemented an ultrasonic cane for the blind. The cane detects obstacles in the walking path and provides a walking direction. It uses the ultrasonic time-of-flight method to measure the distance to an obstacle and a fluxgate geomagnetic sensor to guide the walking direction. The system can detect obstacles above, in front of, and below the user, and it warns the user with vibration and a pitched sound. The user can also learn the walking direction through voice output. As a result, blind users could efficiently avoid exposed obstacles, obstacles above knee height, exposed street obstacles, and tree branches at a person's height, and the cane can also be used to search for surrounding landmarks.
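The time-of-flight measurement mentioned in the abstract rests on one relation: the pulse travels to the obstacle and back, so the distance is half the round-trip time multiplied by the speed of sound. A minimal sketch follows; the function names and the linear temperature approximation for the speed of sound in air are assumptions of this example, and the abstract does not say whether the cane compensates for temperature.

```python
def speed_of_sound_m_s(temp_c=20.0):
    """Approximate speed of sound in dry air (m/s) at temp_c degrees Celsius."""
    return 331.3 + 0.606 * temp_c

def echo_distance_m(round_trip_s, temp_c=20.0):
    """Distance to an obstacle from the ultrasonic round-trip time in seconds."""
    # The pulse covers the distance twice (out and back), hence the division by 2.
    return speed_of_sound_m_s(temp_c) * round_trip_s / 2.0
```

With these assumptions, an echo that returns after 10 ms corresponds to an obstacle roughly 1.7 m away.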

Detection of DTMF Signalling for Low Bit Rate Vocoder (저전송률 음성부호화기의 DUAL-TONE MULTIFREQUENCY(DTMF) SIGNALLING)

  • 손상목
    • Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.159-164 / 1998
  • We propose a new DTMF tone detection algorithm for low-bit-rate vocoders so that DTMF tones can be used for signalling in the digital network. By using DTMF tones for signalling, the mobile phone can be controlled without changing the conventional IS-95 protocol. We apply root finding to detect formants and bandwidths, both to decide whether a signal is a DTMF tone or voice and to identify which DTMF tone it is (for instance 1, 2, 3, ..., #, *, A, B, etc.). The proposed method achieves a good result, with an average error rate of 0.000944%, which satisfies the error rate recommended by ITU-T ($\pm$1.8%).
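For orientation, the sketch below shows what "deciding whether a frame is a DTMF tone and which key it is" involves. It deliberately uses the Goertzel algorithm, the conventional DTMF detector, and not the root-finding approach on formants and bandwidths that the paper proposes. The frequency table is the standard DTMF keypad layout; the 8 kHz sampling rate default, the 0.5 energy-ratio threshold, and the function names are assumptions of this example.

```python
import math

ROW_HZ = (697, 770, 852, 941)
COL_HZ = (1209, 1336, 1477, 1633)
KEYS = ("123A", "456B", "789C", "*0#D")

def goertzel_power(samples, freq, fs):
    """Squared magnitude of the signal's spectrum at freq (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / fs)
    s1 = s2 = 0.0
    for x in samples:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2

def detect_dtmf(samples, fs=8000, min_ratio=0.5):
    """Return the DTMF key in the frame, or None if it does not look like DTMF."""
    total = sum(x * x for x in samples)
    if total == 0.0:
        return None  # silence
    row_pow = [goertzel_power(samples, f, fs) for f in ROW_HZ]
    col_pow = [goertzel_power(samples, f, fs) for f in COL_HZ]
    row = max(range(4), key=lambda i: row_pow[i])
    col = max(range(4), key=lambda i: col_pow[i])
    # For a clean dual tone, the two strongest bins carry nearly all the energy,
    # so this ratio is close to 1; speech or other tones score much lower.
    ratio = (row_pow[row] + col_pow[col]) / (0.5 * len(samples) * total)
    return KEYS[row][col] if ratio >= min_ratio else None
```

A frame synthesized as the sum of 770 Hz and 1336 Hz sinusoids should be reported as key "5", while a single 1000 Hz tone should be rejected because most of its energy lies outside the DTMF bins.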

A Study on Reduction of Computation Time through Adjustment the Frequency Interval Information in the G.723.1 Vocoder (G.723.1 보코더에서 주파수 간격 정보조절을 통한 계산량 감소에 관한 연구)

  • 민소연;김영규;배명진
    • Proceedings of the IEEK Conference / 2002.06d / pp.405-408 / 2002
  • LSP (Line Spectrum Pairs) parameters are used for speech analysis in vocoders and recognizers because they have the advantages of constant spectrum sensitivity, low spectrum distortion, and easy linear interpolation. However, transforming LPC (Linear Predictive Coding) coefficients into LSP is so complex that it takes much time to compute. Among conventional methods, the real root method is considerably simpler than the others, but it still suffers from indeterministic computation time because the root search proceeds sequentially in the frequency domain. We suggest a method of reducing the LSP transformation time using voice characteristics. The proposed method applies a different search order and interval according to the distribution of the LSP parameters. Compared with the conventional real root method, the proposed method reduces the transformation time by about 46.5%, and the total computation time in the G.723.1 vocoder is reduced by about 5%.
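To make the cost of a sequential root search concrete, the sketch below converts LPC coefficients to line spectral frequencies by scanning a uniform frequency grid for sign changes and refining each one by bisection. It is a plain baseline written for illustration, not the authors' proposed search order and not necessarily the exact real root method they compare against. The function name `lpc_to_lsf`, the grid size, and the bisection depth are assumptions; the sketch also assumes that A(z) is minimum phase and that no root falls exactly on a grid point.

```python
import math

def lpc_to_lsf(a, grid=512):
    """LPC coefficients a = [1, a1, ..., ap] of A(z) = sum a[k] z^-k
    to line spectral frequencies in radians, ascending, inside (0, pi)."""
    p = len(a) - 1
    ext = list(a) + [0.0]                      # A(z) padded to order p + 1
    rev = ext[::-1]                            # z^-(p+1) * A(1/z)
    sym = [x + y for x, y in zip(ext, rev)]    # P(z): symmetric coefficients
    anti = [x - y for x, y in zip(ext, rev)]   # Q(z): antisymmetric coefficients
    half = (p + 1) / 2.0

    # On the unit circle, P and Q reduce to real functions of the frequency w
    # (up to a factor that never vanishes), so roots show up as sign changes.
    def f_sym(w):
        return sum(c * math.cos((half - k) * w) for k, c in enumerate(sym))

    def f_anti(w):
        return sum(c * math.sin((half - k) * w) for k, c in enumerate(anti))

    roots = []
    for f in (f_sym, f_anti):
        w_prev = math.pi / grid
        v_prev = f(w_prev)
        for i in range(2, grid):               # interior grid points only
            w = math.pi * i / grid
            v = f(w)
            if v_prev * v < 0.0:               # sign change: one root in between
                lo, hi, v_lo = w_prev, w, v_prev
                for _ in range(40):            # refine by bisection
                    mid = 0.5 * (lo + hi)
                    v_mid = f(mid)
                    if v_lo * v_mid <= 0.0:
                        hi = mid
                    else:
                        lo, v_lo = mid, v_mid
                roots.append(0.5 * (lo + hi))
            w_prev, v_prev = w, v
    return sorted(roots)
```

Every grid point costs one full evaluation of each polynomial, which is why changing the search order or the interval, as the abstract proposes, can shorten the transformation. For the trivial predictor `[1.0, 0.0, 0.0]` the sketch returns frequencies close to pi/3 and 2*pi/3.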

Design and Implementation of Online Algorithm Bank for Algorithm E-learning (컴퓨터 알고리즘 교육을 위한 온라인 알고리즘 뱅크 구현)

  • Park, Uchang
    • The Journal of Korean Association of Computer Education / v.7 no.4 / pp.1-6 / 2004
  • For e-learning classes, many voice and video techniques are used to enhance student-teacher interaction. For programming exercise courses, however, it is very difficult to add interactive components through a web browser. In this paper, we build an online algorithm bank to manage and search algorithms, together with a programming exercise interface on the web. Students can edit, compile, and execute the programs included in the online algorithm bank. Online program compilation and execution enhance the effectiveness of e-learning for programming courses and help students feel at ease with computer algorithms.
