• Title/Abstract/Keywords: Visual Scene

Visual Search Model based on Saliency and Scene-Context in Real-World Images

  • 최윤형;오형석;명노해
    • Journal of the Korean Institute of Industrial Engineers
    • /
    • Vol. 41, No. 4
    • /
    • pp.389-395
    • /
    • 2015
  • According to much research in cognitive science, the impact of scene context on human visual search in real-world images can be as important as that of saliency. This study therefore proposes a method for Adaptive Control of Thought-Rational (ACT-R) modeling of visual search in real-world images, based on saliency and scene context. The modeling method uses the utility system of ACT-R to describe the influences of saliency and scene context in real-world images. The model was validated by comparing its output with eye-tracking data from experiments on a simple task in which subjects searched for targets in images of indoor bedrooms. The results show that the model data fit the eye-tracking data quite well. In conclusion, the proposed method of modeling human visual search should be used to provide an accurate model of human performance in visual search tasks in real-world images.

Functional Analysis of Music Used in Film

Scene-Based Video Watermarking Using Temporal Spread Spectrum in Compressed Domain

  • 최윤희;강경표;최태선
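    • Institute of Electronics Engineers of Korea: Conference Proceedings
    • /
    • Proceedings of the Institute of Electronics Engineers of Korea 2002 Summer Conference (4)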
    • /
    • pp.93-96
    • /
    • 2002
  • This paper presents a robust and efficient scene-based video watermarking method using visual rhythm (a spatio-temporal slice) in the compressed domain. Scene changes can be detected easily using visual rhythm, and video sequences are conveniently edited at scene boundaries. Therefore, scene-based watermark embedding is a natural choice. Temporal spread spectrum can be achieved by applying spread-spectrum methods to the visual rhythm. Additive Gaussian noise, low-pass filtering, median filtering, and histogram equalization attacks are simulated for all frames. Frame sub-sampling is also simulated as a typical video attack. Simulation results show that the proposed algorithm is robust and efficient in the presence of such attacks.

A Rate Control Algorithm of MPEG-2 Video Encoding Based Target Bit Matching at Scene Changes

  • 문호석;박상성;손명호;장동식
    • Journal of KIISE: Software and Applications
    • /
    • Vol. 31, No. 12
    • /
    • pp.1621-1627
    • /
    • 2004
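  • Picture quality degradation at a scene change occurs when the target bit amount differs greatly from the actual coded amount. In particular, when a scene change occurs at a P-picture, serious degradation appears not only in that P-picture but also in the pictures that reference it. Based on the causes of inappropriate rate control at scene changes, this paper presents a method for improving the quality of the scene-change picture and of the pictures that follow it. For the scene-change picture, the paper applies both the existing method of allocating additional bits and a new method that allocates target bits at the coding level of an intra picture. For the pictures after the scene change, it proposes allocating target bits at the coding level of the pictures preceding the scene change, which improves picture quality. Experimental results show that the proposed algorithm improves picture quality over existing algorithms and yields a PSNR gain of 0.5 to 1.2 dB compared with TM5.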

On the Visual Scene Validity of the Microcomputer Aided Port Design Simulator

  • 김환수
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • Vol. 3, No. 2
    • /
    • pp.1-12
    • /
    • 1997
  • One of the main uses for ship simulators is in the field of port design, and an increasing number of simulators, of varying degrees of fidelity, are being used for this purpose. An essential feature of all such simulators is their visual scene, which must be of sufficient fidelity to convey the key visual cues adequately. This paper examines the ability of a number of experienced mariners to perceive speeds and distances correctly using Computer Generated Imagery visual scenes of different fidelity, compared with their performance at sea. The results indicate that the microcomputer-based simulator might be considered, as far as its visual scene representation is concerned, to be as valid as the full mission ship simulator for the port design task.

A New MPEG-2 Rate Control Scheme Using Scene Change Detection

  • Park, Sang-Gyu;Lee, Young-Sun;Chang, Hyun-Sik
    • ETRI Journal
    • /
    • Vol. 18, No. 2
    • /
    • pp.61-74
    • /
    • 1996
  • We propose two new rate control schemes to improve the visual quality of MPEG-2 rate control when scene changes happen. The two schemes are a real-time and a non-real-time improvement, each designed to reduce the impact of scene changes. We also propose a new target-bit prediction method using the spatial activity of pictures and present a simple and efficient scene change detection scheme using the signed difference of the mean absolute difference (MAD). Computer simulation results show that the proposed real-time algorithm effectively alleviates visual quality degradation after scene changes. The proposed non-real-time algorithm gives an improvement of up to 2 dB in peak signal-to-noise ratio (PSNR) at a scene-changed picture compared with the MPEG-2 rate control scheme, and it shows better quality than the real-time one.

Modeling the Visual Target Search in Natural Scenes

  • Park, Daecheol;Myung, Rohae;Kim, Sang-Hyeob;Jang, Eun-Hye;Park, Byoung-Jun
    • Journal of the Ergonomics Society of Korea
    • /
    • Vol. 31, No. 6
    • /
    • pp.705-713
    • /
    • 2012
  • Objective: The aim of this study is to predict human visual target search in real scene images using the ACT-R cognitive architecture. Background: Humans use bottom-up and top-down processes at the same time, drawing on the characteristics of the image itself and on knowledge about images. A model of human visual search therefore needs to include both processes. Method: Visual target object search performance in real scene images was analyzed by comparing experimental data with the results of an ACT-R model. Ten students participated in the experiment, and the model was simulated ten times. The experiment was conducted under two conditions, indoor images and outdoor images. The ACT-R model determines the first saccade region by calculating the saliency map and spatial layout. The proposed model uses this guidance for visual search and adopts visual search strategies according to it. Results: No significant difference in performance time was found between the model prediction and the empirical data. Conclusion: The proposed ACT-R model is able to predict the human visual search process in real scene images using the saliency map and spatial layout. Application: This study is useful for model-based evaluation of visual search, particularly in real images. It can also be applied in diverse image-processing programs, such as aids for the visually impaired.

Multimodal Context Embedding for Scene Graph Generation

  • Jung, Gayoung;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • Vol. 16, No. 6
    • /
    • pp.1250-1260
    • /
    • 2020
  • This study proposes a novel deep neural network model that can accurately detect objects and their relationships in an image and represent them as a scene graph. The proposed model utilizes several multimodal features, including linguistic features and visual context features, to accurately detect objects and relationships. In addition, in the proposed model, context features are embedded using graph neural networks to depict the dependencies between two related objects in the context feature vector. This study demonstrates the effectiveness of the proposed model through comparative experiments using the Visual Genome benchmark dataset.

Imaging a scene from experience given verbal expressions

  • Sakai, Y.;Kitazawa, M.;Takahashi, S.
    • Institute of Control, Robotics and Systems: Conference Proceedings
    • /
    • Institute of Control, Robotics and Systems, 1995: Proceedings of the Korea Automation Control Conference, 10th (KACC); Seoul, Korea; 23-25 Oct. 1995
    • /
    • pp.307-310
    • /
    • 1995
  • In conventional systems, a human must know about machines and their special languages in order to communicate with them. On the one hand this is desirable for a human, but on the other hand achieving it is very demanding and is also a significant cause of human error. To reduce this kind of human load, an intelligent man-machine interface is desirable between a human operator and the machines to be operated. In ordinary human communication, not only linguistic information but also visual information is effective, each compensating for the other's shortcomings. From this viewpoint, this paper discusses the problem of translating verbal expressions into a visual image. The positional relation between any two objects in a visual scene is key to translating verbal information into visual information, as is the case in Fig. 1. The present translation system advances in knowledge with experience. It consists of Japanese language processing, image processing, and Japanese-to-scene translation functions.

An Analysis on the Visual Image and Harmony of the Construction Method in the Slope Scene - A Case on the Daejeon-Jinju Highway -

  • 이정
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • Vol. 33, No. 1
    • /
    • pp.33-48
    • /
    • 2005
  • The purpose of this study was to identify the visual landscape image of slope scenes and their harmony with the surrounding scenery. The research used the basic tools of psychophysics and conducted a case study of ten types of slope construction scenes along the highway. The analysis was based on data obtained from questionnaires and from photographs of the slope construction scenes. The questionnaires, which assessed the image of each slope construction scene and its harmony with the surrounding scenery, were designed using a semantic differential scale and a 5-point Likert scale. The major findings were as follows. 1. In the visual preference analysis, the slope revegetation methods were generally preferred over the slope structure methods. The slope revegetation methods were rated as friendly, continuous, harmonious, soft, light, and wide, but also as unstable, feminine, static, simple, omnipresent, and gently sloped. The slope structure methods were rated as stable, masculine, complicated, steep, and healthy, but also as rough, unharmonious, unfamiliar, and heavy. 2. The psychological factors related to satisfaction with the slope revegetation methods comprised three factors: aesthetics, individuality, and physical character. Those for the slope structure methods comprised five factors: aesthetics, individuality, stability, physical character, and complexity. 3. For harmony with the surrounding landscape, the slope revegetation methods were rated highly, whereas the slope structure methods received the lowest ratings. In the analysis of harmony with the surrounding view, the slope revegetation methods scored above average in texture, form, color, and scale, while the slope structure methods scored below average in form, scale, color, and texture.