• Title/Summary/Keyword: visual understanding

Search Result 744, Processing Time 0.031 seconds

Trends in Video Visual Relationship Understanding (비디오 시각적 관계 이해 기술 동향)

  • Y.J. Kwon;D.H. Kim;J.H. Kim;S.C. Oh;J.S. Ham;J.Y. Moon
    • Electronics and Telecommunications Trends
    • /
    • v.38 no.6
    • /
    • pp.12-21
    • /
    • 2023
  • Visual relationship understanding in computer vision allows to recognize meaningful relationships between objects in a scene. This technology enables the extraction of representative information within visual content. We discuss the technology of visual relationship understanding, specifically focusing on videos. We first introduce visual relationship understanding concepts in videos and then explore the latest existing techniques. Next, we present benchmark datasets commonly used in video visual relationship understanding. Finally, we discuss future research directions in video visual relationship understanding.

Improving visual relationship detection using linguistic and spatial cues

  • Jung, Jaewon;Park, Jongyoul
    • ETRI Journal
    • /
    • v.42 no.3
    • /
    • pp.399-410
    • /
    • 2020
  • Detecting visual relationships in an image is important in an image understanding task. It enables higher image understanding tasks, that is, predicting the next scene and understanding what occurs in an image. A visual relationship comprises of a subject, a predicate, and an object, and is related to visual, language, and spatial cues. The predicate explains the relationship between the subject and object and can be categorized into different categories such as prepositions and verbs. A large visual gap exists although the visual relationship is included in the same predicate. This study improves upon a previous study (that uses language cues using two losses) and a spatial cue (that only includes individual information) by adding relative information on the subject and object of the extant study. The architectural limitation is demonstrated and is overcome to detect all zero-shot visual relationships. A new problem is discovered, and an explanation of how it decreases performance is provided. The experiment is conducted on the VRD and VG datasets and a significant improvement over previous results is obtained.

Image Understanding for Visual Dialog

  • Cho, Yeongsu;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • v.15 no.5
    • /
    • pp.1171-1178
    • /
    • 2019
  • This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important to generate correct answers to questions in visual dialogs followed by questions and answers regarding images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is beneficial. Hence, in the proposed model, by employing a separate person detector and an attribute recognizer in addition to visual features extracted from the entire input image at the encoding stage using a convolutional neural network, we emphasize attributes, such as gender, age, and dress concept of the people in the corresponding image and use them to generate answers. The results of the experiments conducted using VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well.

A Study on Process Analysis of Visual Understanding on accordance in Attention Time (주시시간에 따른 시각적 이해과정 분석에 관한 연구)

  • Kim, Jong-Ha
    • Korean Institute of Interior Design Journal
    • /
    • v.20 no.4
    • /
    • pp.101-108
    • /
    • 2011
  • When observing an object in a space, a part of it is remembered into our perception in the time for paying attention or conscious observation and it reaches to our visual understanding. In this study, it examined characteristics by each subject through the process of visual understanding by changes in such observation time. The results from this study are summarized as belows: First, through analysis of the observation data focused on the distance between the observed points, it was able to apply those visual theories organized before to the analysis of characteristics of the time for understanding by each subject. Second, there showed big differences in the time for visual understanding by each subject according to changes in the observation time so that it was found that there were big differences according to the characteristics of subject's intention or purpose of the observation of a space. Third, as the number of continuous observation gives an important clue in judgement of how well the space was understood, it was able to compare and organize the mutual characteristics of the time the attention was concentrated, the time observed intentionally and the time understood visually. Fourth, it was found that the shorter subjects gave the intentional observation in observing a space, the longer they spent the time for paying attention, while the less they could understand it visually.

Visual Verb and ActionNet Database for Semantic Visual Understanding (동영상 시맨틱 이해를 위한 시각 동사 도출 및 액션넷 데이터베이스 구축)

  • Bae, Changseok;Kim, Bo Kyeong
    • The Journal of Korean Institute of Next Generation Computing
    • /
    • v.14 no.5
    • /
    • pp.19-30
    • /
    • 2018
  • Visual information understanding is known as one of the most difficult and challenging problems in the realization of machine intelligence. This paper proposes deriving visual verb and construction of ActionNet database as a video database for video semantic understanding. Even though development AI (artificial intelligence) algorithms have contributed to the large part of modern advances in AI technologies, huge amount of database for algorithm development and test plays a great role as well. As the performance of object recognition algorithms in still images are surpassing human's ability, research interests shifting to semantic understanding of video contents. This paper proposes candidates of visual verb requiring in the construction of ActionNet as a learning and test database for video understanding. In order to this, we first investigate verb taxonomy in linguistics, and then propose candidates of visual verb from video description database and frequency of verbs. Based on the derived visual verb candidates, we have defined and constructed ActionNet schema and database. According to expanding usability of ActionNet database on open environment, we expect to contribute in the development of video understanding technologies.

Analysis of Types of Students' Visual Thinking and Instructional Effects in Elementary Science Classes (초등 과학수업에서 학생들이 구성한 비주얼 씽킹의 유형 및 수업 효과)

  • Hong, Minhae;Lim, Heejun
    • Journal of Korean Elementary Science Education
    • /
    • v.40 no.1
    • /
    • pp.100-112
    • /
    • 2021
  • Based on the importance of visual representation for scientific understanding, this study applied visual thinking in elementary science classes. This study analyzed elementary students' visual thinking and investigated the instructional influences. Students' perceptions on the class applying visual thinking were also investigated. The subject were 38 fourth grade students, 18 in experimental group and 20 in control group. For the unit of 'Shadow and mirror', on-line and off-line blended classes were applied in both group because of COVID-19. The experimental group student were asked to construct their own visual thinking, while the control group students used traditional workbook. The results were as follows. First, students' visual thinking can be classified into three different types, which are 'activity recall type', 'result summary type', and 'core concept representation type' based on what they represent rather than how they represent. Second, applying visual thinking in science class showed significant effects on science academic achievement, science related attitude, and creative academic efficacy. Third, students' perceptions on applying visual thinking in science classes were very positive. Students perceived visual thinking activities were interesting and helpful for understanding science. Educational implications of applying visual thinking in elementary science classes were discussed.

Elementary School Teachers' Use of Visual Representations and their Perceptions of the Functions of Visual Representations (초등교사의 시각적 표상 활용 실태 및 시각적 표상의 기능에 대한 인식)

  • Yoon, Hye-Gyoung;Park, Jisun
    • Journal of Korean Elementary Science Education
    • /
    • v.37 no.2
    • /
    • pp.219-231
    • /
    • 2018
  • This study surveyed the elementary school teachers' use of visual representations and their perceptions of the functions of visual representations in the teaching of electricity unit. A total of 110 elementary teachers who have experiences in teaching electricity unit responded to online survey. The result showed firstly that most of the teachers use visual representations in their teaching and it is mostly limited to those presented in textbooks or images that they can get easily from internet search. Secondly, elementary teachers thought that they have high ability in using visual representations and low ability in understanding students' visual presentation ability. Thirdly, visual representations are more often preferred to be used as teacher-centered ways than student-centered ways for motivating students and conceptual understanding. However, in case of scientific inquiry, both teacher-centered and student-centered ways were equally preferred. Lastly, the teachers' perceptions of the functions of visual representations were categorized into 'teaching-instrumental function', 'learning-instrumental function', 'communicative-instrumental function' and 8 subcategories were found. The most frequent function was the 'information delivery function' in the 'teaching-instrumental function' category. Implications for teacher education and further studies were discussed.

Persuasive Effects Depending on the Type of Creative Ads in Social Media and User Sensitivity and Empathy (SNS 미디어의 크리에이티브 유형과 사용자의 민감성 및 공감적 이해에 따른 설득 효과)

  • Kim, Jae-Young
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.145-154
    • /
    • 2022
  • The purpose of this study is to investigate the difference in advertising effect of visual rhetoric type of Facebook ads depending on the user sensitivity and level of empathy. The experiment was designed as a between-subjects factorial design (visual rhetoric type) × 2 (brand sensitivity) × 2 (level of empathic understanding). The results of the experiment performed to analyze the strategies of Facebook ads for ads effectiveness can be summarized as follows: a three-way interaction effect for persuasive effects was found among the type of visual rhetoric, brand sensitivity, and empathic understanding for both types of visual rhetoric. Breaking down it by type of rhetoric, no interacting effect was observed between brand sensitivity and empathic understanding levels for the visual simile ads in most of the dependent variables. For the visual metaphor ads, however, the brand sensitivity and empathic understanding levels were found to have interaction effect in all dependent variables.

An Analysis of Students' Understanding on Unit Fraction : Focusing on Teaching Context and Visual Representation (단위분수에 대한 초등학교 3학년 학생들의 이해 분석 : 지도 맥락과 시각적 표현의 관점에서)

  • Lim, Miin
    • The Mathematical Education
    • /
    • v.57 no.1
    • /
    • pp.37-54
    • /
    • 2018
  • Despite the significance of fraction in elementary mathematics education, it is not easy to teach it meaningfully in connection with real life in Korea. This study aims to investigate and analyze 3rd grade students' understanding on unit fraction concepts and on comparison of unit fractions and to identify the parts which need to be supplemented in relation to unit fraction. For these purposes, I reviewed previous studies and extracted chapters which cover unit fractions in elementary mathematics textbooks based on 2009 revised curriculums and analyzed teaching contexts and visual representations of unit fractions. From this point of view, I constructed a test which consists of three problems based on Chval et al(2013) to investigate students' understanding on unit fraction. To apply this test, I selected forty-one 3rd grade students and examined that students' aspects of understanding on unit fraction. The results were analyzed both qualitatively and quantitatively. In this study, I present the analysis results and provide implications and some didactical suggestions for teaching contexts and visual representations of unit fraction based on the discussion.

A study on the Visual Effects Based on Women Jacket Silhouettes (여성복 상의(Jacket)의 실루엣에 관한 감성공학적 접근)

  • 양지은;이연순
    • Proceedings of the Korean Society for Emotion and Sensibility Conference
    • /
    • 1998.11a
    • /
    • pp.235-249
    • /
    • 1998
  • The purpose of this study is to provide an understanding of the designs that aid in the production or selection of clothing that generally corresponds with the contours of the human body. The study looks at effective and attractive clothing design used in various situations with the goal of gaining understanding of the nuances of women's jackets. To achieve the goals of this jacket study, a sensuous test was employed and several horizontal sections, based on silhouette appearance, were compared and then analysed. The sensuous test was aimed at understanding the visual effects of the silhouette of the jacket.

  • PDF