• Title/Summary/Keyword: Visual Scene

Search Result 369, Processing Time 0.038 seconds

Visual Search Model based on Saliency and Scene-Context in Real-World Images (실제 이미지에서 현저성과 맥락 정보의 영향을 고려한 시각 탐색 모델)

  • Choi, Yoonhyung;Oh, Hyungseok;Myung, Rohae
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.41 no.4
    • /
    • pp.389-395
    • /
    • 2015
  • According to much research on cognitive science, the impact of the scene-context on human visual search in real-world images could be as important as the saliency. Therefore, this study proposed a method of Adaptive Control of Thought-Rational (ACT-R) modeling of visual search in real-world images, based on saliency and scene-context. The modeling method was developed by using the utility system of ACT-R to describe influences of saliency and scene-context in real-world images. Then, the validation of the model was performed, by comparing the data of the model and eye-tracking data from experiments in simple task in which subjects search some targets in indoor bedroom images. Results show that model data was quite well fit with eye-tracking data. In conclusion, the method of modeling human visual search proposed in this study should be used, in order to provide an accurate model of human performance in visual search tasks in real-world images.

Functional Analysis of Music Used in Film

Scene-Based Video Watermarking Using Temporal Spread Spectrum in Com pressed Domain (압축 영역에서 시간축 확산 스펙트럼을 이용한 장면단위의 비디오 워터마킹)

  • 최윤희;강경표;최태선
    • Proceedings of the IEEK Conference
    • /
    • 2002.06d
    • /
    • pp.93-96
    • /
    • 2002
  • This paper presents robust and efficient scene-based video watermarking method using visual rhythm (spatio-temporal slice) in compressed domain. Scene change can be detected easily using visual rhythm and video sequences are conveniently edited at the scene boundaries. Therefore, scene-based watermark embedding Process it a natural choice. Temporal spread spectrum can be achieved by applying spread spectrum methods to visual rhythm. Additive Gaussian noise, low-pass filtering, median filtering and histogram equalization attack are simulated for all frames. Frame sub-sampling is also simulated as a typical video attack Simulation results show that proposed algorithm is robust and efficient in the presence of such kind of attacks.

  • PDF

A Rate Control Algorithm of MPEG-2 Video Encoding Based Target Bit Matching at Scene Changes (장면전환 발생시 예상 비트 조정을 통한 MPEG-2 비디오 부호화 비트율 제어 알고리즘)

  • Moon Ho-seok;Park Sang-sung;Sohn Myung-ho;Jang Dong-sik
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.12
    • /
    • pp.1621-1627
    • /
    • 2004
  • The decrease of visual quality at scene change occurs when the difference between the amount of target bits and actual coding is high. Especially, scene change at the P-Picture can lead to severely degrade visual qualities at itself and the pictures referencing it. In this paper, under the occurrence of scene change, we propose a new method, based on the analysis of existing inaccurate bits allocation, to improve the visual qualities of scene-changed and following pictures. The method allocates extra bits to scene-changed Picture and changes them upto the level of the complexity of intra picture. Also, the method changes target bits of following pictures upto the complexity of picture prior to the scene change. Computer simulation shows that the proposed method has improved 0.5-1.2dB higher than TM5 method in terms of PSNR.

On the Visual Scene Validity of the Microcomputer Aided Port Design Simulator (마이크로 컴퓨터를 이용한 항만설계 시뮬레이터의 영상정보 신뢰성에 관한 연구)

  • 김환수
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.3 no.2
    • /
    • pp.1-12
    • /
    • 1997
  • One of the main uses for ship simulators is in the field of port design, and an increasing number of simulators, of varying degrees of fidelity, are being used for this purpose. An essential feature of all such simulators is their visual scene, which must be of sufficent fidelity to convey the key visual cues adequately. This paper examines the ability of a number of experienced mariners to perceive speeds and distances correctly using Computer Generated Imagery visual scenes of different fidelity, compared with their performance at sea. From the results, it was found that the microcomputer based simulator might be considered, as far as its visual scene representation is concerned, to be as valid as the full mission ship simulator for the port design task.

  • PDF

A New MPEG-2 Rate Control Scheme Using Scene Change Detection

  • Park, Sang-Gyu;Lee, Young-Sun;Chang, Hyun-Sik
    • ETRI Journal
    • /
    • v.18 no.2
    • /
    • pp.61-74
    • /
    • 1996
  • We propose two new rate control schemes to improve MPEG-2 rate control in view of visual quality when scene changes happen. Two proposed schemes are characterized by real-time and non real-time improvement to reduce the impact of scene changes. We also propose a new target-bit prediction method using spatial activity of pictures and present a simple and efficient scene change detection scheme using signed difference of mean absolute difference (MAD). Computer simulation results show that the proposed real-time algorithm effectively alleviates visual quality degradation after scene changes. The proposed non real-time algorithm gives maximum 2 dB improvement in peak signal-to-noise ratio (PSNR) at a scene-changed picture, compared with MPEG-2 rate control scheme and it shows better quality than the real-time one.

  • PDF

Modeling the Visual Target Search in Natural Scenes

  • Park, Daecheol;Myung, Rohae;Kim, Sang-Hyeob;Jang, Eun-Hye;Park, Byoung-Jun
    • Journal of the Ergonomics Society of Korea
    • /
    • v.31 no.6
    • /
    • pp.705-713
    • /
    • 2012
  • Objective: The aim of this study is to predict human visual target search using ACT-R cognitive architecture in real scene images. Background: Human uses both the method of bottom-up and top-down process at the same time using characteristics of image itself and knowledge about images. Modeling of human visual search also needs to include both processes. Method: In this study, visual target object search performance in real scene images was analyzed comparing experimental data and result of ACT-R model. 10 students participated in this experiment and the model was simulated ten times. This experiment was conducted in two conditions, indoor images and outdoor images. The ACT-R model considering the first saccade region through calculating the saliency map and spatial layout was established. Proposed model in this study used the guide of visual search and adopted visual search strategies according to the guide. Results: In the analysis results, no significant difference on performance time between model prediction and empirical data was found. Conclusion: The proposed ACT-R model is able to predict the human visual search process in real scene images using salience map and spatial layout. Application: This study is useful in conducting model-based evaluation in visual search, particularly in real images. Also, this study is able to adopt in diverse image processing program such as helper of the visually impaired.

Multimodal Context Embedding for Scene Graph Generation

  • Jung, Gayoung;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • v.16 no.6
    • /
    • pp.1250-1260
    • /
    • 2020
  • This study proposes a novel deep neural network model that can accurately detect objects and their relationships in an image and represent them as a scene graph. The proposed model utilizes several multimodal features, including linguistic features and visual context features, to accurately detect objects and relationships. In addition, in the proposed model, context features are embedded using graph neural networks to depict the dependencies between two related objects in the context feature vector. This study demonstrates the effectiveness of the proposed model through comparative experiments using the Visual Genome benchmark dataset.

Imaging a scene from experience given verbal experssions

  • Sakai, Y.;Kitazawa, M.;Takahashi, S.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1995.10a
    • /
    • pp.307-310
    • /
    • 1995
  • In the conventional systems, a human must have knowledge of machines and of their special language in communicating with machines. In one side, it is desirable for a human but in another side, it is true that achieving it is very elaborate and is also a significant cause of human error. To reduce this sort of human load, an intelligent man-machine interface is desirable to exist between a human operator and machines to be operated. In the ordinary human communication, not only linguistic information but also visual information is effective, compensating for each others defect. From this viewpoint, problem of translating verbal expressions to some visual image is discussed here in this paper. The location relation between any two objects in a visual scene is a key in translating verbal information to visual information, as is the case in Fig.l. The present translation system advances in knowledge with experience. It consists of Japanese Language processing, image processing, and Japanese-scene translation functions.

  • PDF

An Analysis on the Visual Image and Harmony of the Construction Method in the Slope Scene -A Case on the Daejeon${\~}$Jinju Highway- (고속도로 비탈면 경관의 법면공법에 따른 시각적 이미지와 조화성 분석 - 대전${\~}$진주간 고속도로를 대상으로 -)

  • Lee Jeong
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.33 no.1 s.108
    • /
    • pp.33-48
    • /
    • 2005
  • The purpose of this study was to discover the landscape visual image of the slope scene and their harmony with surrounding sceneries. This research utilized the basic study tool of psycho-physics and processed the case study of ten types of slope construction scene along the highway. The analysis was performed by the data obtained from the questionnaires and the photos for the slope construction scene. The questionnaires for analysis the image of the slope construction scene and their harmony with surrounding sceneries were designed using semantic differential scale and 5 point Likert-scale. The major findings were as follows. 1. At the part of the visual preferences analysis, the slope revegetation methods showed high level of preferences generally than on the slope structure methods. While the slope revegetation methods were estimated friendly, continuity, harmonious, soft, light and wide, the slope revegetation methods were estimated unstable, female, static, simple, omnipresent, appeared as policeman of weak inclination. Also the slope structure methods were estimated stable, manly, complicated, steep and healthy but rough, unharmonious, unfamiliar and heavy. 2. Psychological factors, related to the satisfaction for the slope revegetation methods were composed of three factors, aesthetic, individuality and physical character. And the slope structure methods were composed of five factors, aesthetic, individuality, stability, physical character, and complexity. 3. At the part of harmony with surrounding landscapes, the slope revegetation methods were evaluated highly but the slope structure methods received the lowest evaluation. Also the harmony analysis with surrounding view on the slope revegetation methods showed degree of high more than average in all texture, form, color and scale but the slope structure methods showed degree of fewer than average degree in form, scale, color and texture.