Search | Korea Science

Fast Intra-Mode Decision for H.264/AVC using Inverse Tree-Structure (H.264/AVC 표준에서 역트리 구조를 이용하여 고속으로 화면내 모드를 결정하는 방법)

Ko, Hyun-Suk;Yoo, Ki-Won;Seo, Jung-Dong;Sohn, Kwang-Hoon
- Journal of Broadcast Engineering / v.13 no.3 / pp.310-318 / 2008
The H.264/AVC standard achieves higher coding efficiency than previous video coding standards through the rate-distortion optimization (RDO) technique, which selects the best coding mode and reference frame for each macroblock. As a result, the complexity of the encoder has increased significantly. In this paper, a fast intra-mode decision algorithm based on an inverse tree-structure edge prediction algorithm is proposed to reduce the computational load of the intra-mode search. First, the dominant edge of each $4{\times}4$ block is obtained from local edge information, and the RDO process is performed only for the mode that corresponds to the dominant edge direction. Then, at the $8{\times}8$ (or $16{\times}16$) block stage, the dominant edge is calculated from the dominant edges of its four $4{\times}4$ (or $8{\times}8$) blocks without additional calculation, and the RDO process is again performed only for the mode related to the dominant edge direction. Experimental results show that the proposed scheme significantly improves the speed of intra prediction with a negligible loss in peak signal-to-noise ratio (PSNR) and a slight increase in bits.
https://doi.org/10.5909/JBE.2008.13.3.310
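The bottom-up ("inverse tree") propagation of dominant edges described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the edge detector or the rule for combining child edges, so the sketch assumes simple pixel differences classified into four coarse directions and a majority vote over the four child blocks. All function names and direction labels are hypothetical.

```python
import math
from collections import Counter

# Coarse edge classes (hypothetical labels; the mapping to H.264 intra
# prediction modes is not given in the abstract).
DIRECTIONS = ("vertical", "horizontal", "diag_down_right", "diag_down_left")


def dominant_edge_4x4(block):
    """Return the dominant edge direction of a 4x4 block given as 4 rows.

    Assumption: an edge running in some direction produces large pixel
    differences across that direction, so each class accumulates the
    differences measured perpendicular to it.
    """
    strength = dict.fromkeys(DIRECTIONS, 0.0)
    for y in range(3):
        for x in range(3):
            strength["vertical"] += abs(block[y][x + 1] - block[y][x])
            strength["horizontal"] += abs(block[y + 1][x] - block[y][x])
            # Diagonal neighbours are sqrt(2) apart, so scale accordingly.
            strength["diag_down_right"] += (
                abs(block[y][x + 1] - block[y + 1][x]) / math.sqrt(2))
            strength["diag_down_left"] += (
                abs(block[y + 1][x + 1] - block[y][x]) / math.sqrt(2))
    best = max(strength, key=strength.get)
    return best if strength[best] > 0 else "flat"


def merge_children(child_edges):
    """Assumed merge rule: majority vote over the four child edges."""
    return Counter(child_edges).most_common(1)[0][0]


def inverse_tree_edges(mb):
    """Compute edges for a 16x16 macroblock (list of 16 rows), bottom-up.

    Returns (edges4, edges8, edge16): a 4x4 grid of 4x4-block edges,
    a 2x2 grid of 8x8-block edges, and the single 16x16 edge.
    """
    edges4 = [[dominant_edge_4x4([row[bx * 4:bx * 4 + 4]
                                  for row in mb[by * 4:by * 4 + 4]])
               for bx in range(4)] for by in range(4)]
    # The upper levels reuse the child results; no pixels are touched again.
    edges8 = [[merge_children([edges4[2 * by][2 * bx],
                               edges4[2 * by][2 * bx + 1],
                               edges4[2 * by + 1][2 * bx],
                               edges4[2 * by + 1][2 * bx + 1]])
               for bx in range(2)] for by in range(2)]
    edge16 = merge_children([e for row in edges8 for e in row])
    return edges4, edges8, edge16
```

In an encoder, the direction returned at each level would be used to restrict the RDO search to the corresponding intra mode, which is where the speed-up reported in the abstract would come from.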

Real-Time Interested Pedestrian Detection and Tracking in Controllable Camera Environment (제어 가능한 카메라 환경에서 실시간 관심 보행자 검출 및 추적)

Lee, Byung-Sun;Rhee, Eun-Joo
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2007.10a / pp.293-297 / 2007
This paper proposes a new algorithm that detects multiple moving objects using the CMODE (Correct Multiple Object DEtection) method in color images acquired in real time and tracks the pedestrian of interest using motion and hue information. After multiple objects are detected, shaking trees and moving cars are removed using the structural characteristics and shape information of a person, so that the pedestrian of interest can be detected. The first similarity judgment for tracking the pedestrian of interest uses the distance between the centroid of the previous pedestrian of interest and the centroid of the present pedestrian. For the area where the first similarity is detected, three feature points are calculated using the k-means algorithm, and the second similarity is judged, and the pedestrian tracked, using the average hue value of the $3{\times}3$ area around each feature point. The camera zoom is adjusted so that a pedestrian of interest at a long distance can be tracked easily, and the camera FOV (field of view) is adjusted when the pedestrian is not within the fixed range of the screen. In experiments comparing the proposed CMODE method with the labeling method, the average approach rate is one fourth that of the labeling method, and the average detection time is three times faster. Even in complex backgrounds, such as areas where trees are shaking or cars are moving, or areas with shadows, detection of the pedestrian of interest shows a high average detection rate of 96.5%. Tracking of the pedestrian of interest shows a high average tracking rate of 95% using situation and hue information, and the pedestrian can be tracked continuously through camera FOV and zoom adjustment.
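The two-stage similarity check described in this abstract might look roughly like the sketch below. It is a simplified illustration, not the authors' code: the feature points are taken as given (the k-means step is omitted), hue is averaged arithmetically (which ignores hue wrap-around), and the thresholds `max_dist` and `max_hue_diff` as well as all function names are hypothetical.

```python
import math


def centroid(points):
    """Centroid of a list of (x, y) pixel coordinates of a detected object."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def first_similarity(prev_centroid, cur_centroid, max_dist):
    """Stage 1: the candidate's centroid must be close to the previous one."""
    return math.dist(prev_centroid, cur_centroid) <= max_dist


def mean_hue_3x3(hue, point):
    """Average hue over the 3x3 area around (x, y), clipped at image borders.

    `hue` is a 2-D list indexed as hue[y][x]. A plain arithmetic mean is
    used, which is a simplification because hue is a circular quantity.
    """
    x0, y0 = point
    height, width = len(hue), len(hue[0])
    values = [hue[y][x]
              for y in range(max(0, y0 - 1), min(height, y0 + 2))
              for x in range(max(0, x0 - 1), min(width, x0 + 2))]
    return sum(values) / len(values)


def second_similarity(prev_hues, hue, feature_points, max_hue_diff):
    """Stage 2: every feature point must keep a similar average hue."""
    cur_hues = [mean_hue_3x3(hue, p) for p in feature_points]
    return all(abs(a - b) <= max_hue_diff
               for a, b in zip(prev_hues, cur_hues))
```

A tracker built on this sketch would call `first_similarity` for each detected object and run `second_similarity` only on the candidates that pass, so the cheaper geometric test filters most objects before any hue values are read.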

A Study on the Architectural Environment as a Combination of Performance and Event (퍼포먼스.이벤트의 결합체로서 건축환경연구)

김주미
- Archives of design research / v.14 / pp.121-138 / 1996
The purpose of this study is to develop a new architectural language and design strategies that anticipate and incorporate new historical situations and new paradigms for understanding the world. It consists of four sections. First, it presents a new interpretation of space, the human body, and movement found in modern art and tries to combine that artistic insight with environmental design to provide a theoretical basis for performance-event architecture. Second, it conceives of the architectural environment as a combination of space, movement, and probabilistic situations rather than a mere conglomeration of material. It also perceives the environment as a stage for performance and the act of designing as a performance. Third, in this context, man is conceived of as an organic system that responds to, interacts with, and adapts to his environment through self-regulation. By the same token, architecture should be a dynamic system that undergoes constant transformation in its attempt to accommodate human actions and behaviors as people cope with a contemporary philosophy characterized by the principle of uncertainty, a fast-changing society, and new developments in technology. Fourth, the relativistic and organic viewpoint that forms the background for all this is radically different from the causalistic and mechanistic view that characterized the forms and functions of modernist design. The present study places great emphasis on a dematerialistic conception of the environment and puts forth a disprogramming method that accommodates interchangeability over the passage of time and the intertextuality of form and function. Ultimately, performance-event architecture is a strategy based on the systems world-view that enables the recovery of man's autonomy and the reconception of his environment as an object of art.

Fixed Pattern Noise Reduction in Infrared Videos Based on Joint Correction of Gain and Offset (적외선 비디오에서 Gain과 Offset 결합 보정을 통한 고정패턴잡음 제거기법)

Kim, Seong-Min;Bae, Yoon-Sung;Jang, Jae-Ho;Ra, Jong-Beom
- Journal of the Institute of Electronics Engineers of Korea SP / v.49 no.2 / pp.35-44 / 2012
Most recent infrared (IR) sensors have a focal-plane array (FPA) structure. Spatial non-uniformity of an FPA structure, however, introduces unwanted fixed pattern noise (FPN) into images. Non-uniformity correction (NUC) of an FPA can be categorized into target-based and scene-based approaches. In a target-based approach, FPN can be separated by using a uniform target such as a black body. However, since the detector response drifts randomly over time, several scene-based algorithms that operate on a video sequence have been proposed. Among those algorithms, the state-of-the-art one, based on a Kalman filter, uses one-directional warping for motion compensation and compensates only for the offset non-uniformity of IR camera detectors. The system model using one-directional warping cannot correct the boundary region where a new scene is introduced in the next video frame. Furthermore, offset-only correction approaches may not completely remove the FPN in images if it is considerably affected by gain non-uniformity. Therefore, for FPN reduction in IR videos, we propose a joint gain and offset correction algorithm based on bi-directional warping. Experimental results using simulated and real IR videos show that the proposed scheme provides better FPN reduction than the state of the art.
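The claim that offset-only correction leaves residual FPN when gain is non-uniform can be illustrated with a toy example. The sketch assumes the commonly used per-pixel linear detector model y = g·x + o, which the abstract does not state explicitly, and it assumes the gain and offset maps are already known; the Kalman filtering and bi-directional warping used to estimate them are not shown. All function names are hypothetical.

```python
def apply_fpn(scene, gain, offset):
    """Simulate a detector with per-pixel gain g and offset o: y = g*x + o."""
    return [[g * x + o for x, g, o in zip(s_row, g_row, o_row)]
            for s_row, g_row, o_row in zip(scene, gain, offset)]


def correct_offset_only(observed, offset):
    """Offset-only correction: x_hat = y - o (gain is implicitly taken as 1)."""
    return [[y - o for y, o in zip(y_row, o_row)]
            for y_row, o_row in zip(observed, offset)]


def correct_joint(observed, gain, offset):
    """Joint correction: x_hat = (y - o) / g."""
    return [[(y - o) / g for y, g, o in zip(y_row, g_row, o_row)]
            for y_row, g_row, o_row in zip(observed, gain, offset)]


def max_abs_error(a, b):
    """Largest per-pixel absolute difference between two images."""
    return max(abs(p - q)
               for row_a, row_b in zip(a, b)
               for p, q in zip(row_a, row_b))


# Toy 2x2 example with non-uniform gain.
scene = [[10.0, 20.0], [30.0, 40.0]]
gain = [[1.0, 1.1], [0.9, 1.0]]
offset = [[2.0, -1.0], [0.5, 0.0]]

observed = apply_fpn(scene, gain, offset)
err_offset_only = max_abs_error(correct_offset_only(observed, offset), scene)
err_joint = max_abs_error(correct_joint(observed, gain, offset), scene)
```

With these values the offset-only result still differs from the scene wherever the gain is not 1, and the residual scales with scene intensity, whereas the joint correction recovers the scene up to floating-point rounding.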

Image Mosaicking Using Feature Points Based on Color-invariant (칼라 불변 기반의 특징점을 이용한 영상 모자이킹)

Kwon, Oh-Seol;Lee, Dong-Chang;Lee, Cheol-Hee;Ha, Yeong-Ho
- Journal of the Institute of Electronics Engineers of Korea SP / v.46 no.2 / pp.89-98 / 2009
In the field of computer vision, image mosaicking is a common method for effectively increasing the restricted field of view of a camera by combining a set of separate images into a single seamless image. Image mosaicking based on feature points has recently been a focus of research because the geometric transformation can be estimated simply, regardless of the distortions and intensity differences generated by camera motion in consecutive images. Yet, since most feature-point matching algorithms extract feature points using gray values, identifying corresponding points becomes difficult under changing illumination and in images with similar intensity. Accordingly, to solve these problems, this paper proposes a method of image mosaicking based on feature points that uses the color information of images. Essentially, the digital values acquired from a digital color camera are converted to values of a virtual camera with distinct narrow bands. Values that are based on the surface reflectance and invariant to the chromaticity of various illuminations are then derived from the virtual camera values and defined as color-invariant values. The validity of these color-invariant values is verified in a test using a Macbeth Color-Checker under simulated illuminations. The test also compares the proposed method using the color-invariant values with the conventional SIFT algorithm. The accuracy of matching between the feature points extracted using the proposed method is increased, and image mosaicking using color information is also achieved.

A Content-based Video Rate-control Algorithm Interfaced to Human-eye (인간과 결합한 내용기반 동영상 율제어)

황재정;진경식;황치규
- The Journal of Korean Institute of Communications and Information Sciences / v.28 no.3C / pp.307-314 / 2003
In a general multiple video object coder, objects of greater interest, such as a speaker or a moving object, are consistently coded with higher priority. Since the priority of each object may not be fixed over the whole sequence and may vary from frame to frame, it must be adjusted on a frame basis. In this paper, we analyze the independent rate control algorithm and the global algorithm, in which the QP value is controlled by static parameters: object importance or priority, target PSNR, and weighted distortion. The priority among the static parameters is analyzed and converted into dynamic parameters according to the visual interest or importance obtained through a camera interface. Target PSNR and weighted distortion are derived proportionally using magnitude, motion, and distortion. We apply these parameters to weighted distortion control and priority-based control, resulting in efficient bit-rate distribution. As a result, fewer bits are allocated to video objects of lower importance and more bits to those of higher visual importance. The time needed for visual quality to stabilize is reduced to fewer than 15 frames of the coded sequence. In terms of PSNR, the proposed scheme achieves more than 2 dB higher quality than conventional schemes. Thus, the coding scheme interfaced to the human eye proves to be an efficient video coder for handling multiple video objects.

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

배철수
- Journal of the Korea Institute of Information and Communication Engineering / v.3 no.1 / pp.59-68 / 1999
Recently, research and development of communication systems has moved toward using voice data together with face images of the speaker to achieve a higher recognition rate than with voice data alone. Therefore, we present a method of lipreading in speech image sequences using a 3-D facial shape model. The method uses feature information from the face image such as the opening level of the lips, the movement of the jaw, and the projection height of the lips. First, we fit the 3-D face model to the image sequence of the speaking face. Then, to obtain feature information, we compute the amount of variation from the fitted 3-D shape model over the image sequence and use it as the recognition parameters. We use the intensity gradient values obtained from the variation of the 3-D feature points to separate recognition units in the image sequence. We then use a discrete HMM algorithm in the recognition process, based on multiple observation sequences that fully account for the variation of the 3-D feature points. In a recognition experiment with 8 Korean vowels and 2 Korean consonants, we obtained a recognition rate of about 80% for the plosives and vowels. Because a recognition experiment on the recognition parameters with the 10 Korean vowels also yielded a high recognition rate, we suggest that the feature vector is usable as a visual distinguishing factor.

Influence of 3D Characteristics Perception on Presence, and Presence on Visual Fatigue and Perceived Eye Movement (3D 영상 특성 인식이 프레즌스, 그리고 프레즌스가 시각 피로도와 인지된 안구운동에 미치는 영향)

Yang, Ho-Cheol;Chung, Dong-Hun
- Journal of Broadcast Engineering / v.17 no.1 / pp.60-72 / 2012
After the movie "AVATAR" became a model cash cow for 3D movies, the profits of 3D movies declined significantly. One reason is a poor understanding of human factors, for instance how viewers become immersed but sometimes also tired. Although 3D images call for greater consideration of the human visual system, including the eye, most communication research has unfortunately ignored human factors. For these reasons, this study observed the effect of 3D video on viewers' psychological responses, specifically perceived eye movement, perceived characteristics, visual fatigue, and presence. With 90 participants, the results show that viewers' perceived characteristics affect their presence. In detail, first, materiality and tangibility are more important factors than clarity in 3D video, which means that materiality and tangibility should be considered more than any other factor when making 3D content or devices. Second, this study examined whether we perceive our eyes as media, and the results show that as viewers' presence level became higher, they perceived eye movements more, and as viewers' presence level became higher, perceived visual fatigue became lower. This result means that when we move our eyes we interact with the surrounding environment, so 3D content needs to provide vivid features to be more interactive. On the other hand, since the level of presence increases visual fatigue, it must be balanced when producing and playing content.
https://doi.org/10.5909/JEB.2012.17.1.60

Experimental Research for Traction force Sensor Development on Drawing Exercise Medical Instrument (재활 및 교정을 위한 견인운동치료기의 견인측정센서 개발에 관한 실험적 연구)

Lee, Sang-sik;Park, Won-yeop;Lee, Choong-ho
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.2 no.2 / pp.3-8 / 2009
Traction systems have mainly been used for the rehabilitation and correction of patients with spine or gait disorders, in orthopedic clinics or at home. Problems can occur in the human body when patients overexert themselves while training with a traction system. It is therefore necessary to measure the traction force and control the training time. However, most products on the market have no sensor for measuring traction force. We therefore designed and built a traction force sensor using a strain gauge, an amplifier for converting its output to a signal, and experimental devices for performance testing. We carried out experiments with the traction force sensor and measured its electrical response with respect to traction load. The maximum error was within about 1% in static experiments, and the average error was about 0.7% in dynamic experiments. We concluded that the developed sensor can be used to measure traction force, since the maximum output variation of the sensor was about 0.3% over the temperature range $0^{\circ}C-60^{\circ}C$.

MPEG Video Segmentation using Two-stage Neural Networks and Hierarchical Frame Search (2단계 신경망과 계층적 프레임 탐색 방법을 이용한 MPEG 비디오 분할)

Kim, Joo-Min;Choi, Yeong-Woo;Chung, Ku-Sik
- Journal of KIISE:Software and Applications / v.29 no.1_2 / pp.114-125 / 2002
In this paper, we propose a hierarchical segmentation method that first segments the video data into shots by detecting cuts and dissolves, and then determines the types of camera operations or object movements in each shot. In our previous work [1], each picture group is classified into one of three detailed categories, Shot (in case of a scene change), Move (in case of camera operation or object movement), and Static (in case of almost no change between images), by analyzing the DC (Direct Current) component of the I (Intra) frame. For this process, we designed a two-stage hierarchical neural network whose inputs combine multiple features. Then, the system detects the accurate shot position and the types of camera operations or object movements by searching the P (Predicted) and B (Bi-directional) frames of the current picture group selectively and hierarchically. Also, the statistical distributions of macroblock types in P or B frames are used for accurate detection of the cut position, and another neural network, with macroblock types and motion vectors together as inputs, is used to detect dissolves, types of camera operations, and object movements. The proposed method can reduce the processing time by using only the DC coefficients of I frames without decoding and by searching P and B frames selectively and hierarchically. The proposed method classified the picture groups with an accuracy of 93.9-100.0% and the cuts with an accuracy of 96.1-100.0% on three different types of video data. It also classified the types of camera movements or object movements with accuracies of 90.13% and 89.28% on two different types of video data.
