Search | Korea Science

A Speaker Detection System based on Stereo Vision and Audio (스테레오 시청각 기반의 화자 검출 시스템)

An, Jun-Ho;Hong, Kwang-Seok
- Journal of Internet Computing and Services / v.11 no.6 / pp.21-29 / 2010
In this paper, we propose a system that detects the current speaker among a number of users. The proposed speaker detection system, based on stereo vision and audio, consists mainly of the following: position estimation of speaker candidates using a stereo camera and microphones, detection of the current speaker, and acquisition of speaker information on a mobile device. We use Haar-like features and the AdaBoost algorithm to detect the faces of speaker candidates with the stereo camera, and the position of each candidate is estimated by triangulation. Next, the Time Delay Of Arrival (TDOA) is estimated by Cross Power Spectrum Phase (CPSP) analysis to find the direction of the sound source with two microphones. Finally, we acquire the speaker's information, including position, voice, and face, by comparing the information from the stereo camera with that from the two microphones. Furthermore, the proposed system includes a TCP client/server connection method for mobile service.
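The final comparison step in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes rectified parallel cameras, pixel columns measured from each image center, a far-field sound source, a microphone pair centered on the camera baseline, and a sign convention in which a positive TDOA points toward +X. The names `triangulate`, `tdoa_to_angle`, and `pick_speaker` are hypothetical, and the TDOA is taken as given rather than estimated by CPSP.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def triangulate(x_left, x_right, focal_px, baseline_m):
    """Return (X, Z) in meters, relative to the midpoint between the cameras.

    x_left and x_right are the pixel columns of the same face in the left
    and right images, measured from each image center (assumption).
    """
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("disparity must be positive")
    z = focal_px * baseline_m / disparity
    x = x_left * z / focal_px - baseline_m / 2
    return x, z


def tdoa_to_angle(tdoa_s, mic_spacing_m):
    """Convert a time delay of arrival to a bearing in radians (far field)."""
    s = SPEED_OF_SOUND * tdoa_s / mic_spacing_m
    s = max(-1.0, min(1.0, s))  # clamp against measurement noise
    return math.asin(s)


def pick_speaker(candidates, tdoa_s, mic_spacing_m):
    """Return the index of the (X, Z) candidate whose visual bearing is
    closest to the bearing implied by the audio delay."""
    audio_angle = tdoa_to_angle(tdoa_s, mic_spacing_m)
    return min(
        range(len(candidates)),
        key=lambda i: abs(math.atan2(candidates[i][0], candidates[i][1]) - audio_angle),
    )
```

Matching on bearing alone is a simplification: two candidates standing in the same direction at different depths would not be separated by a single microphone pair.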

Geometric LiveWire and Geometric LiveLane for 3D Meshes (삼차원 메쉬에 대한 기하학 라이브와이어와 기하학 라이브레인)

Yoo Kwan-Hee
- The KIPS Transactions:PartA / v.12A no.1 s.91 / pp.13-22 / 2005
Similar to the edges defined in a 2D image, we can define geometric features that represent the boundaries of distinctive parts appearing on 3D meshes. Such geometric features have been used as basic primitives in several applications, such as mesh simplification, mesh deformation, and mesh editing. In this paper, we propose geometric livewire and geometric livelane for extracting geometric features in a 3D mesh, which are extensions of the livewire and livelane methods for images. In these methods, approximate curvatures are adopted to represent the geometric features in a 3D mesh, and the 3D mesh itself is represented as a weighted directed graph in which cost functions define the edge weights. Using a well-known shortest-path algorithm on the weighted directed graph, we extract geometric features in the 3D mesh between points selected by a user. We also visualize the results of applying the techniques to extract geometric features in general meshes modeled after human faces, cows, shoes, and single teeth.
https://doi.org/10.3745/KIPSTA.2005.12A.1.013
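The graph formulation in the abstract above can be sketched as follows. The abstract does not name the shortest-path algorithm or give the cost function, so this sketch assumes Dijkstra's algorithm and a hypothetical cost, `edge_cost`, that makes edges leading to high-curvature vertices cheap. The mesh is reduced to an adjacency dictionary.

```python
import heapq


def edge_cost(length, curvature_at_target, eps=1e-6):
    """Hypothetical cost: short edges toward high-curvature vertices are cheap."""
    return length / (abs(curvature_at_target) + eps)


def shortest_path(graph, source, target):
    """Dijkstra on a weighted directed graph given as {u: [(v, cost), ...]}.

    Returns the list of vertices from source to target, or None if the
    target is unreachable. Costs must be non-negative.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            break
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, cost in graph.get(u, []):
            nd = d + cost
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if target not in dist:
        return None
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]
```

In a livewire-style tool, the two endpoints would be the user-selected mesh vertices, and the returned vertex sequence would be drawn as the extracted feature curve.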

A Method of Auto Photography Composition Suggestion (사진의 자동 구도 보정 제시 기법)

Choi, Yong-Sub;Park, Dae-Hyun;Kim, Yoon
- Journal of the Korea Society of Computer and Information / v.19 no.1 / pp.9-21 / 2014
In this paper, we propose an automatic composition correction technique for photography that draws the viewer's eye to the subject and yields a stable composition when a general user takes a picture. Because general users mostly photograph without background knowledge of photographic composition, the subject is often poorly placed, and the resulting unstable composition contrasts with the stable composition of pictures taken by experts. Therefore, we provide not a method that processes the image after it is taken, but a method that automatically presents a stable composition while the user is taking the photograph. The proposed method analyzes the subject through a saliency map, image segmentation, edge detection, etc., and displays the subject at a location that yields a stable composition, along with Rule of Thirds guidelines. The experimental results show that a good composition was presented to the user automatically.
https://doi.org/10.9708/jksci.2014.19.1.009
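The Rule of Thirds guidance mentioned in the abstract above can be illustrated with a small sketch. It assumes the subject has already been located (for example, as the centroid of a salient region) and only shows how a suggested shift toward the nearest thirds intersection might be computed. The function names are hypothetical, and the paper's actual placement rule may differ.

```python
def thirds_points(width, height):
    """The four intersections of the Rule of Thirds grid lines."""
    return [(width * i / 3, height * j / 3) for i in (1, 2) for j in (1, 2)]


def suggest_shift(subject_xy, width, height):
    """Return (dx, dy) that moves the subject to the nearest thirds intersection."""
    sx, sy = subject_xy
    tx, ty = min(
        thirds_points(width, height),
        key=lambda p: (p[0] - sx) ** 2 + (p[1] - sy) ** 2,
    )
    return tx - sx, ty - sy
```

For a 300 x 300 frame with the subject at (110, 190), the nearest intersection is (100, 200), so the suggested shift is 10 pixels left and 10 pixels down in image coordinates.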

A Study on Kidney Diseases Diagnosis System for Sensation Type Using Physiological Signal Analysis (생체 신호 분석을 이용한 감각형 신장 질환 진단 시스템 연구)

Cho, Dong-Uk;Kim, Bong-Hyun;Lee, Se-Hwan
- The Journal of Korean Institute of Communications and Information Sciences / v.31 no.10C / pp.964-972 / 2006
The kidney is closely related to the other internal organs: it filters waste from the blood and excretes it in the urine as old blood is replaced with new. When a problem arises in the kidney, there are no self-perceived symptoms until the illness becomes serious. This problem can be addressed by maintaining a systematic diagnosis method for kidney disorders, and the importance of diagnosing kidney disease is growing day by day. In this paper, we propose a method for diagnosing kidney disorders using two of the four major diagnostic methods, ocular inspection and auscultation. To do this, we extract color values from an input image and analyze the color of the facial region related to the kidney, and we aim to use the results to identify symptoms of kidney problems accurately. We also analyze and compare the relationship between the kidney and the voice signal in order to realize a human health verification system. Finally, we aim to demonstrate the usefulness of the proposed method.

Building Living Lab for Acquiring Behavioral Data for Early Screening of Developmental Disorders

Kim, Jung-Jun;Kwon, Yong-Seop;Kim, Min-Gyu;Kim, Eun-Soo;Kim, Kyung-Ho;Sohn, Dong-Seop
- Journal of the Korea Society of Computer and Information / v.25 no.8 / pp.47-54 / 2020
Developmental disorders are impairments of the brain and/or central nervous system and refer to disorders of brain function that affect language, communication skills, perception, sociality, and so on. In the diagnosis of developmental disorders, behavioral responses such as expressing emotions in the proper situation are among the observable indicators of whether or not an individual has a disorder. However, diagnosis by observation allows subjective evaluation, which can lead to erroneous conclusions. This research presents a technological environment and data acquisition system for AI-based screening of autism. The environment was built considering the activities of two screening protocols, namely the Autism Diagnostic Observation Schedule (ADOS) and Behavior Development Screening for Toddlers (BeDevel). The activities between therapist and baby during the screening are fully recorded. The proposed software was designed to support recording, monitoring, and data tagging for training AI algorithms.
https://doi.org/10.9708/jksci.2020.25.08.047

Facial Expression Recognition by Combining Adaboost and Neural Network Algorithms (에이다부스트와 신경망 조합을 이용한 표정인식)

Hong, Yong-Hee;Han, Young-Joon;Hahn, Hern-Soo
- Journal of the Korean Institute of Intelligent Systems / v.20 no.6 / pp.806-813 / 2010
Human facial expressions show emotion most accurately, so they can be used as the most efficient tool for delivering a person's intention to a computer. For fast and accurate recognition of facial expressions in a 2D image, this paper proposes a new method that integrates a Discrete AdaBoost classification algorithm and a neural-network-based recognition algorithm. First, the AdaBoost algorithm finds the position and size of a face in the input image. Second, the detected face image is fed into five AdaBoost strong classifiers, each trained for one facial expression. Finally, a neural-network-based recognition algorithm, trained on the outputs of the AdaBoost strong classifiers, determines the final facial expression. The proposed algorithm achieves real-time operation and improved accuracy by combining the speed and accuracy of the AdaBoost classification algorithm with the reliability of the neural-network-based recognition algorithm. The proposed algorithm recognizes five facial expressions (neutral, happiness, sadness, anger, and surprise) and achieves 86~95% accuracy, depending on the expression type, in real time.
https://doi.org/10.5391/JKIIS.2010.20.6.7.806

Digital Holographic Security Identification System (디지털 홀로그래픽 보안 인증 시스템)

Kim, Jung-Hoi;Kim, Nam;Jeon, Seok-Hee
- Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.2 / pp.89-98 / 2004
In this paper, we implement a digital holographic security card system that combines digital holographic memory using random-phase-encoded reference beams with electrical biometrics. Digitally encoded data, including a document, a face picture, and a fingerprint, are recorded by multiplexing in holographic memory. A random phase mask that encodes the reference beams is used as a decoding key to protect against illegal counterfeiting. As a result, we achieve a raw BER of $3.6 \times 10^{-4}$ and a shift selectivity of $4\,\mu\mathrm{m}$ using the 2D random phase mask. We also develop a recording pattern and image processing suitable for a low-cost reader without a position-sensing photodetector for real-time data extraction, and remove the danger of fraud by unauthorized persons by comparing the reconstructed holographic data with live fingerprint data.

Texture Classification Using Wavelet-Domain BDIP and BVLC Features With WPCA Classifier (웨이브렛 영역의 BDIP 및 BVLC 특징과 WPCA 분류기를 이용한 질감 분류)

Kim, Nam-Chul;Kim, Mi-Hye;So, Hyun-Joo;Jang, Ick-Hoon
- Journal of the Institute of Electronics Engineers of Korea SP / v.49 no.2 / pp.102-112 / 2012
In this paper, we propose a texture classification method using wavelet-domain BDIP (block difference of inverse probabilities) and BVLC (block variance of local correlation coefficients) features with a WPCA (whitened principal component analysis) classifier. In the proposed method, the wavelet transform is first applied to a query image. The BDIP and BVLC operators are next applied to the wavelet subbands. Global moments for each subband of BDIP and BVLC are then computed and fused into a feature vector. In classification, the WPCA classifier, which is usually adopted in face identification, searches for the training feature vector most similar to the query feature vector. Experimental results show that the proposed method yields excellent texture classification with a low feature dimension for test texture image DBs.

Implementation of Intelligent Moving Target Tracking and Surveillance System Using Pan/Tilt-embedded Stereo Camera System (팬/틸트 탑제형 스테레오 카메라를 이용한 지능형 이동표적 추적 및 감시 시스템의 구현)

고정환;이준호;김은수
- The Journal of Korean Institute of Communications and Information Sciences / v.29 no.4C / pp.514-523 / 2004
In this paper, a new intelligent moving target tracking and surveillance system based on a pan/tilt-embedded stereo camera system is proposed and implemented. In the proposed system, the face area of a target is first detected from the input stereo image using a YCbCr color model; then, using this data together with the geometric information of the tracking system, the distance and 3D information of the target are effectively extracted in real time. Based on these extracted data, the pan/tilt-embedded stereo camera system is adaptively controlled, and as a result the proposed system can track the target adaptively under various target conditions. In experiments using 80 frames of test input stereo images, the standard deviation of the target's position displacement after tracking is kept very low, at 1.82 and 1.11 in the horizontal and vertical directions, respectively, and the error ratio between the measured and computed 3D coordinate values of the target is also kept very low, at 0.5% on average. These experimental results suggest the possibility of implementing a new real-time intelligent stereo target tracking and surveillance system using the proposed scheme.
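The YCbCr-based face detection step in the abstract above can be sketched as a simple chrominance threshold. The conversion uses the full-range BT.601 (JPEG) coefficients. The Cb and Cr ranges are commonly cited skin-tone bounds and are an assumption here, not values taken from the paper. The functions `is_skin` and `skin_bounding_box` are hypothetical names.

```python
def rgb_to_ycbcr(r, g, b):
    """Full-range BT.601 (JPEG) RGB to YCbCr conversion for 8-bit values."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def is_skin(r, g, b, cb_range=(77, 127), cr_range=(133, 173)):
    """Classify a pixel as skin if its chrominance falls in the assumed ranges."""
    _, cb, cr = rgb_to_ycbcr(r, g, b)
    return cb_range[0] <= cb <= cb_range[1] and cr_range[0] <= cr <= cr_range[1]


def skin_bounding_box(image):
    """Return (x0, y0, x1, y1) enclosing all skin pixels, or None.

    image is a list of rows, each row a list of (r, g, b) tuples.
    """
    coords = [
        (x, y)
        for y, row in enumerate(image)
        for x, (r, g, b) in enumerate(row)
        if is_skin(r, g, b)
    ]
    if not coords:
        return None
    xs = [c[0] for c in coords]
    ys = [c[1] for c in coords]
    return min(xs), min(ys), max(xs), max(ys)
```

Thresholding on Cb and Cr while ignoring Y is what makes this kind of detector less sensitive to brightness changes. A real system would typically also filter the mask for size and shape before treating a region as a face.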

The Study about the Differential compression based on the ROI(Region Of Interest) (ROI(Region Of Interest)기반의 차등적 이미지 압축에 관한 연구)

Yun, Chi-Hwan;Ko, Sun-Woo;Lee, Geun-Ho
- Journal of the Korea Institute of Information and Communication Engineering / v.18 no.3 / pp.679-686 / 2014
Recently, users can obtain countless images and videos over networks, so image and video compression technology is being researched more and more. However, there are situations in which only part of an image is of interest. For instance, in an ATM environment the face region is more important than the background, so image compression technology based on the region of interest (ROI) is necessary. In this research, considering that the human visual system is not sensitive to illumination variations in very dark and very light regions of an image, we calculate the standard deviation of each block and use this value to define the ROI. In the encoding process, relatively high quality is obtained in the ROI and relatively low quality in the non-ROI. The proposed scheme thus demonstrates an encoding process that follows subjective image quality. Finally, the proposed scheme is applied to the JPEG standard. The experimental results demonstrate that the proposed scheme can achieve better image quality at high compression ratios.
https://doi.org/10.6109/jkiice.2014.18.3.679
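The block-based ROI definition in the abstract above can be sketched as follows. The abstract says only that the per-block standard deviation is used to define the ROI. The 8 x 8 block size (matching JPEG blocks), the threshold value, and the rule "higher deviation means ROI" are all assumptions made for illustration, and `roi_blocks` is a hypothetical name.

```python
from statistics import pstdev


def roi_blocks(image, block=8, threshold=10.0):
    """Return a 2D list of booleans, one per block, True where the block's
    pixel standard deviation exceeds the threshold.

    image is a list of rows of grayscale values. Partial blocks at the
    right and bottom edges are ignored in this sketch.
    """
    height, width = len(image), len(image[0])
    mask = []
    for by in range(0, height - block + 1, block):
        row = []
        for bx in range(0, width - block + 1, block):
            pixels = [
                image[y][x]
                for y in range(by, by + block)
                for x in range(bx, bx + block)
            ]
            row.append(pstdev(pixels) > threshold)
        mask.append(row)
    return mask
```

An encoder could then use the mask to choose a finer quantization for ROI blocks and a coarser one elsewhere, which is one way to realize the differential quality the abstract describes.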
