• Title/Summary/Keyword: multi-frame detection

Implementation of Adaptive Multi Rate (AMR) Vocoder for the Asynchronous IMT-2000 Mobile ASIC (IMT-2000 비동기식 단말기용 ASIC을 위한 적응형 다중 비트율 (AMR) 보코더의 구현)

  • 변경진;최민석;한민수;김경수
    • The Journal of the Acoustical Society of Korea / v.20 no.1 / pp.56-61 / 2001
  • This paper presents a real-time implementation of an AMR (Adaptive Multi Rate) vocoder included in the asynchronous International Mobile Telecommunication (IMT)-2000 mobile ASIC. The implemented AMR vocoder is a multi-rate coder with 8 modes operating at bit rates from 12.2 kbps down to 4.75 kbps. In addition to the encoder and decoder, the basic functions of the vocoder, it implements VAD (Voice Activity Detection), SCR (Source Controlled Rate) operation, and frame structuring blocks for the system interface. The vocoder runs on a 16-bit fixed-point DSP that is based on the TeakLite core and consists of a memory block, a serial interface block, register files for the parallel interface with the CPU, and interrupt control logic. By managing the memory structure efficiently, we reduce the maximum operating complexity to 24 MIPS. The AMR vocoder is verified against all the test vectors provided by 3GPP, and stable operation on a real-time test board is also demonstrated.

Hybrid Super-Resolution Algorithm Robust to Cut-Change (컷 전환에 적응적인 혼합형 초고해상도 기법)

  • Kwon, Soon-Chan;Lim, Jong-Myeong;Yoo, Jisang
    • Journal of the Korea Institute of Information and Communication Engineering / v.17 no.7 / pp.1672-1686 / 2013
  • In this paper, we propose a hybrid super-resolution algorithm that is robust to cut changes. Existing single-frame super-resolution algorithms are usually fast, but the amount of information available for interpolation is limited. Existing multi-frame super-resolution algorithms are generally robust to this problem, but their performance depends strongly on the motion in the input video. Furthermore, they are of limited use at cut boundaries. In the proposed method, we detect cut boundaries using a cut-detection algorithm and then adaptively apply a single-frame super-resolution method at each detected cut. Additionally, we propose algorithms that normalize motion vectors and analyze edge patterns to solve various problems of existing super-resolution algorithms. The experimental results show that the proposed algorithm performs better than conventional interpolation methods.
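
The cut-adaptive switching described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which cut-detection algorithm is used, so a simple histogram-difference detector stands in for it, and the names `histogram`, `is_cut`, `choose_methods` and the threshold value are hypothetical.

```python
def histogram(frame, bins=16, max_value=256):
    """Normalized intensity histogram of a frame given as a 2-D list of ints."""
    counts = [0] * bins
    total = 0
    for row in frame:
        for pixel in row:
            counts[pixel * bins // max_value] += 1
            total += 1
    return [c / total for c in counts]


def is_cut(prev_frame, cur_frame, threshold=0.5):
    """Assumed detector: flag a cut when the L1 histogram distance is large."""
    h_prev = histogram(prev_frame)
    h_cur = histogram(cur_frame)
    distance = sum(abs(a - b) for a, b in zip(h_prev, h_cur))
    return distance > threshold


def choose_methods(frames, threshold=0.5):
    """Label each frame 'single' (at a cut) or 'multi' (previous frame usable)."""
    labels = ["single"]  # the first frame has no previous frame to borrow from
    for prev, cur in zip(frames, frames[1:]):
        labels.append("single" if is_cut(prev, cur, threshold) else "multi")
    return labels


# Tiny 2x2 grayscale frames: two dark frames, then a cut to two bright frames.
dark_a = [[10, 12], [11, 13]]
dark_b = [[11, 12], [10, 14]]
bright = [[240, 250], [245, 235]]
labels = choose_methods([dark_a, dark_b, bright, bright])
print(labels)
```

A real pipeline would replace the labels with calls to actual single-frame and multi-frame super-resolution routines; the sketch only shows where the switch happens.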

A Fast Sensing Method using Concurrent Driving and Sequential Sensing for Large Capacitance Touch Screens (동시구동 및 순차센싱을 이용한 대형 정전용량 터치스크린용 고속 센싱 기법)

  • Mohamed, Mohamed G.A.;Kim, HyungWon;Cho, Tae-Won
    • Journal of the Institute of Electronics and Information Engineers / v.52 no.4 / pp.62-70 / 2015
  • Demand for projected capacitance touch screens has recently grown sharply, especially for large screens in medical devices, PC monitors, and TVs. Large touch screens generally need a more complex controller. They usually have more driving and sensing lines, so scanning one frame for touch detection takes longer, which lowers the frame scan rate. This paper presents a novel touch screen control technique that scans each frame in two steps using simultaneous multi-channel driving. The first step drives all driving lines simultaneously and determines which sensing lines have any touch. The second step sequentially rescans only the touched sensing lines and determines the exact positions of the touches. This technique can substantially increase the frame scan rate. It has been implemented using an FPGA and an AFE board and tested on a commercial 23-inch touch screen panel. Experimental results show that the proposed technique improves the frame scan rate by 8.4 times over conventional methods for the 23-inch panel.
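
As a rough illustration of the two-step scan in this abstract, the sketch below models the panel as a boolean matrix and counts one measurement per sensing-line read. Both the matrix model and the cost model are assumptions made for illustration, `scan_frame` is a hypothetical name, and the counts it produces are not meant to reproduce the 8.4 times figure reported for the hardware.

```python
def scan_frame(touch_matrix):
    """Two-step scan of a panel.

    touch_matrix[d][s] is True when the node at driving line d and
    sensing line s is touched. Returns (touches, measurements).
    """
    num_drive = len(touch_matrix)
    num_sense = len(touch_matrix[0])
    measurements = 0

    # Step 1: drive all lines at once and read each sensing line once
    # to find which sensing lines see any touch at all.
    touched_sense = []
    for s in range(num_sense):
        measurements += 1
        if any(touch_matrix[d][s] for d in range(num_drive)):
            touched_sense.append(s)

    # Step 2: rescan only the touched sensing lines, one driving line
    # at a time, to locate the exact touch positions.
    touches = []
    for s in touched_sense:
        for d in range(num_drive):
            measurements += 1
            if touch_matrix[d][s]:
                touches.append((d, s))
    return touches, measurements


# 4 driving lines x 5 sensing lines with touches at (2, 0) and (1, 3).
panel = [[False] * 5 for _ in range(4)]
panel[2][0] = True
panel[1][3] = True
touches, measurements = scan_frame(panel)
full_scan = len(panel) * len(panel[0])  # one read per node in a plain scan
print(touches, measurements, full_scan)
```

Under this simplified cost model the saving grows as the panel gets larger and the number of touched sensing lines stays small.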

Efficient Harmonic-CELP Based Low Bit Rate Speech Coder (효율적인 하모닉-CELP 구조를 갖는 저 전송률 음성 부호화기)

  • 최용수;김경민;윤대희
    • The Journal of the Acoustical Society of Korea / v.20 no.5 / pp.35-47 / 2001
  • This paper describes an efficient harmonic-CELP speech coder that combines the advantages of harmonic and CELP coders. Depending on the frame voicing decision, the proposed harmonic-CELP coder uses the RP-VSELP coder as a fast CELP for an unvoiced frame, or an improved harmonic coder for a voiced frame. The main features of the proposed coder are as follows: simple pitch detection, fast harmonic estimation, variable-dimension harmonic vector quantization, perceptual weighting reflecting frequency resolution, fast harmonic synthesis, naturalness control using band voicing, and multi-mode operation. These features give the proposed coder very low complexity compared with the HVXC coder. To demonstrate the performance of the proposed coder, a 2.4 kbps coder has been implemented and compared with reference coders. In informal listening tests, the proposed coder showed good quality while requiring low delay and complexity.

A Real-time Motion Object Detection based on Neighbor Foreground Pixel Propagation Algorithm (주변 전경 픽셀 전파 알고리즘 기반 실시간 이동 객체 검출)

  • Nguyen, Thanh Binh;Chung, Sun-Tae
    • Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.1 / pp.9-16 / 2010
  • Moving object detection finds foreground objects that differ from the background scene in each incoming image frame, and it is an essential step in image processing applications such as intelligent visual surveillance, HCI, and object-based video compression. Most previous object detection algorithms are still so computationally heavy that it is difficult to build real-time multi-channel moving object detection on a workstation, or even one-channel real-time detection on an embedded system. Foreground mask correction, which is necessary for more precise object detection, is usually done with morphological operations such as opening and closing. Morphological operations are not computationally cheap, and they are difficult to run simultaneously with the subsequent connected component labeling routine because they require a quite different type of processing. In this paper, we first devise a fast and precise foreground mask correction algorithm, "Neighbor Foreground Pixel Propagation (NFPP)", which utilizes the neighbor pixel checking employed in connected component labeling. Next, we propose a novel moving object detection method based on NFPP, in which the connected component labeling routine can be executed simultaneously with the foreground mask correction. Experiments verify that, on the image frames and videos tested, the proposed method detects objects more precisely and runs more than 4 times faster than the previous moving object detection method that uses morphological operations.

A Personal Video Event Classification Method based on Multi-Modalities by DNN-Learning (DNN 학습을 이용한 퍼스널 비디오 시퀀스의 멀티 모달 기반 이벤트 분류 방법)

  • Lee, Yu Jin;Nang, Jongho
    • Journal of KIISE / v.43 no.11 / pp.1281-1297 / 2016
  • In recent years, personal videos have seen tremendous growth due to the substantial increase in the use of smart devices and networking services, with which users create and share video content easily and without many restrictions. Because videos generally have multiple modalities and their frame data varies over time, taking both properties into account would significantly improve event detection performance. This paper proposes an event detection method in which high-level features are first extracted from the multiple modalities in the videos and rearranged according to time sequence. The association between the modalities is then learned by a DNN to produce a personal video event detector. In the proposed method, audio and image data are first synchronized and then extracted. The result is input into GoogLeNet as well as a Multi-Layer Perceptron (MLP) to extract high-level features. The results are then rearranged in time sequence, and every video is processed to extract one feature each for DNN training.

Multi-Channel Data Acquisition System Design for Spiral CT Application

  • Yoo, Sun-Won;Kim, In-Su;Kim, Bong-Su;Yun Yi;Kwak, Sung-Woo;Cho, Kyu-Sung;Park, Jung-Byung
    • Proceedings of the Korean Society of Medical Physics Conference / 2002.09a / pp.468-470 / 2002
  • We have designed an X-ray detection system and a multi-channel data acquisition system for Spiral CT applications. The X-ray detection system consists of a scintillator, which converts X-rays into visible light, and a photodiode, which converts the visible light into an electrical signal. The multi-channel data acquisition system consists of analog, digital, master, and backplane boards. The analog board detects the electrical signal and amplifies it by 140 dB. The digital board consists of a MUX (multiplexer), which routes the multi-channel analog signals to the preamplifier, and an ADC (Analog to Digital Converter), which converts the analog signal into a digital signal. The master board supplies the synchronized clock and transmits the digital data to the image reconstructor. The backplane provides electrical power, analog output, and the clock signal. The system converts the projected X-ray signal over the detector array with a large gain, samples the data in each channel sequentially, and transmits the sampled data to the host computer within a given time frame. The system is implemented on an FPGA (Field Programmable Gate Array), which makes it flexible enough to meet the timing limitation. To obtain a high-resolution image, the system must operate at high speed with low noise, a high SNR (signal-to-noise ratio), and a wide dynamic range.

Automatic Matching of Multi-Sensor Images Using Edge Detection Based on Thinning Algorithm (세선화 알고리즘 기반의 에지검출을 이용한 멀티센서 영상의 자동매칭)

  • Shin, Sung-Woong;Kim, Jun-Chul;Oh, Kum-Hui;Lee, Young-Ran
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.26 no.4 / pp.407-414 / 2008
  • This study introduces an automatic image matching algorithm that can be applied to image pairs of different scales consisting of satellite pushbroom images and aerial frame images. To enhance matching accuracy and processing speed, the proposed method combines several image processing techniques: pre-processing, filtering, edge thinning, interest point extraction, and key-descriptor matching. The method exploits characteristics of the multi-sensor images such as their different image acquisition geometry and different radiometric characteristics. In addition, it uses the sensor model to minimize the search area and to eliminate false matching points automatically.

Speaker Detection System for Video Conference (영상회의를 위한 화자 검출 시스템)

  • Lee, Byung-Sun;Ko, Sung-Won;Kwon, Heak-Bong
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers / v.17 no.5 / pp.68-79 / 2003
  • In this paper, we propose a system that detects the current speaker in a multi-speaker video conference by using lip motion. First, the system detects the face and lip area of each speaker using face color and shape information. Then, to detect the current speaker, it calculates the change between the current frame and the previous frame. The system uses two CCD cameras: a general CCD camera and a PTZ camera controlled through an RS-232C serial port. The result is a system capable of detecting the face of the current speaker in a video feed with more than three people, regardless of the orientation of the faces. It takes only 4 to 5 seconds to zoom in on the speaker from the initial image. It is also a more efficient image transmission system for applications such as video conferencing and internet broadcasting, because it offers a face area screen at a resolution of 320×240 while also providing a whole background screen.
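
The frame-to-frame change step in this abstract might look something like the sketch below. It assumes the lip regions have already been located and are given as bounding boxes, and it uses the mean absolute intensity difference as the change measure. The abstract does not specify the measure or any threshold, so `region_change`, `detect_speaker`, and `min_change` are hypothetical choices.

```python
def region_change(prev_frame, cur_frame, box):
    """Mean absolute intensity difference inside box = (top, left, bottom, right)."""
    top, left, bottom, right = box
    total = 0
    count = 0
    for y in range(top, bottom):
        for x in range(left, right):
            total += abs(cur_frame[y][x] - prev_frame[y][x])
            count += 1
    return total / count


def detect_speaker(prev_frame, cur_frame, lip_boxes, min_change=5.0):
    """Return the index of the lip box with the largest change, or None.

    None is returned when there are no boxes or when no lip region
    changes by at least min_change (an assumed threshold).
    """
    if not lip_boxes:
        return None
    changes = [region_change(prev_frame, cur_frame, b) for b in lip_boxes]
    best = max(range(len(changes)), key=lambda i: changes[i])
    return best if changes[best] >= min_change else None


# Two 2x4 grayscale frames; the right half changes, the left half does not.
prev_frame = [[100, 100, 100, 100], [100, 100, 100, 100]]
cur_frame = [[100, 100, 130, 130], [100, 100, 130, 130]]
lip_boxes = [(0, 0, 2, 2), (0, 2, 2, 4)]  # one lip region per participant
speaker = detect_speaker(prev_frame, cur_frame, lip_boxes)
print(speaker)
```

In practice the change would be accumulated over several frames before steering a camera, since a single frame difference is sensitive to noise and head movement.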

Performance-based and damage assessment of SFRP retrofitted multi-storey timber buildings

  • Vahedian, Abbas;Mahini, Seyed Saeed;Glencross-Grant, Rex
    • Structural Monitoring and Maintenance / v.2 no.3 / pp.269-282 / 2015
  • Civil structures should be designed with the lowest cost and longest lifetime possible and without service failure. The efficient and sustainable use of materials in building design and construction has always been at the forefront for civil engineers and environmentalists. Timber is one of the best contenders for these purposes, particularly in terms of aesthetics, fire protection, strength-to-weight ratio, acoustic properties, and seismic resistance. In recent years, timber has been used in commercial and taller buildings because of these significant advantages. Since the launch of modern building standards and codes, a number of different structural systems have been developed to stabilise steel or concrete multi-storey buildings. However, the structural behaviour of high-rise and multi-storey timber frame buildings subjected to lateral loads is not yet fully understood. Additionally, timber degradation can occur as a result of biological decay of the elements and overloading, which can result in structural damage. In such structures, the deficient members and joints require strengthening in order to satisfy new code requirements, achieve an acceptable level of safety, and avoid brittle failure following earthquake actions. This paper investigates performance assessment and damage assessment of older multi-storey timber buildings. One approach is to retrofit the beams in order to increase the ductility of the frame. Experimental studies indicate that Sprayed Fibre Reinforced Polymer (SFRP) repairing/retrofitting not only updates the integrity of the joint, but also increases its strength, stiffness, and ductility in such a way that the joint remains elastic. Non-linear finite element analysis ('pushover') is carried out to study the behaviour of the structure subjected to simulated gravity and lateral loads. A new global index is re-assessed for damage assessment of the plain and SFRP-retrofitted frames using capacity curves obtained from pushover analysis. This study shows that the proposed method is suitable for structural damage assessment of aged timber buildings. SFRP retrofitting can also potentially improve the performance and load-carrying capacity of the structure.