• 제목/요약/키워드: Visual

검색결과 18,760건 처리시간 0.048초

지능형 이동 로봇에서 강인 물체 인식을 위한 영상 문맥 정보 활용 기법 (Utilization of Visual Context for Robust Object Recognition in Intelligent Mobile Robots)

  • 김성호;김준식;권인소
    • 로봇학회논문지
    • /
    • 제1권1호
    • /
    • pp.36-45
    • /
    • 2006
  • In this paper, we introduce visual contexts in terms of types and utilization methods for robust object recognition with intelligent mobile robots. One of the core technologies for intelligent robots is visual object recognition. Robust techniques are strongly required since there are many sources of visual variations such as geometric, photometric, and noise. For such requirements, we define spatial context, hierarchical context, and temporal context. According to object recognition domain, we can select such visual contexts. We also propose a unified framework which can utilize the whole contexts and validates it in real working environment. Finally, we also discuss the future research directions of object recognition technologies for intelligent robots.

  • PDF

오페라하우스의 객석음향평가에 대한 시지각의 영향 (The Effects of Visual Input on the Evaluation of the Acoustics in the Opera Houses)

  • 김수연;전진용
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2004년도 추계학술대회논문집
    • /
    • pp.772-777
    • /
    • 2004
  • Opera house acoustics were subjectively evaluated in order to investigate the effect of performance stage views on the audience's perception of the seat acoustics in an opera house. Nine seats from an existing opera house were selected for the auditory and/or visual experiments according to seating area distribution and acoustical parameters such as RT and $1-IACC_{E3}$. The recorded music, convolved from the impulse response, was presented with and without visual images of the stage. Subjects were asked to assess the auditory/visual descriptors and overall impression of the music at each seat. The results showed that good visual input helps produce a favorable impression of the acoustics, but a limited view degrades acoustical impression. The acoustical parameters in the tested seats were also investigated to find the relationship between the acoustical parameters and the visual/sound impression.

  • PDF

Image Understanding for Visual Dialog

  • Cho, Yeongsu;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • 제15권5호
    • /
    • pp.1171-1178
    • /
    • 2019
  • This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important to generate correct answers to questions in visual dialogs followed by questions and answers regarding images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is beneficial. Hence, in the proposed model, by employing a separate person detector and an attribute recognizer in addition to visual features extracted from the entire input image at the encoding stage using a convolutional neural network, we emphasize attributes, such as gender, age, and dress concept of the people in the corresponding image and use them to generate answers. The results of the experiments conducted using VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well.

동적 도시 환경에서 의미론적 시각적 장소 인식 (Semantic Visual Place Recognition in Dynamic Urban Environment)

  • 사바 아르샤드;김곤우
    • 로봇학회논문지
    • /
    • 제17권3호
    • /
    • pp.334-338
    • /
    • 2022
  • In visual simultaneous localization and mapping (vSLAM), the correct recognition of a place benefits in relocalization and improved map accuracy. However, its performance is significantly affected by the environmental conditions such as variation in light, viewpoints, seasons, and presence of dynamic objects. This research addresses the problem of feature occlusion caused by interference of dynamic objects leading to the poor performance of visual place recognition algorithm. To overcome the aforementioned problem, this research analyzes the role of scene semantics in correct detection of a place in challenging environments and presents a semantics aided visual place recognition method. Semantics being invariant to viewpoint changes and dynamic environment can improve the overall performance of the place matching method. The proposed method is evaluated on the two benchmark datasets with dynamic environment and seasonal changes. Experimental results show the improved performance of the visual place recognition method for vSLAM.

The Effects of Variety and Visual Cue on PerceivedQuantity and Consumer Attitude toward Participationinto Sales Promotion Events

  • Lee, Changhyun;Kim, Youngchan
    • Asia Marketing Journal
    • /
    • 제21권1호
    • /
    • pp.65-87
    • /
    • 2019
  • Most studies on how people perceive a given quantity of items were conducted with visual cues exclusively and only offered spatial area based explanations, such as spatial estimation and perceptual grouping theories. This article establishes how people perceive a given quantity when only a written description is provided without any visual cues. Across two studies we show that variety decreases perceived quantity when a variety cue is given, while variety increases perceived quantity when a visual cue is not given. This is because people tend to rely heavily on spatial areas when a visual cue is present and because people are prone to confirmation bias when they are provided with no visual cues but only written descriptions. Furthermore, we highlight that quantity perception has a mediation effect on consumers' attitude-the intention to participate in sales promotional events. Lastly, we summarize the article and discuss its contributions, implications, limitations, and suggestions for future research.

온라인 사이트 상품 배치에 따른 시각정보처리가 소비자 주의에 미치는 영향 (The Effect of Visual Information Processing of Online Product Display on Consumers' Attention)

  • 이주현;이동일
    • 한국프랜차이즈경영연구
    • /
    • 제6권2호
    • /
    • pp.17-30
    • /
    • 2015
  • Companies display the product information to the consumers in their online presences. As importance of online marketing activities are growing, most of the franchisers also uses the product images on their online sites to provide the vivid visual information of their merchandises. But the previous researches do not provide the rigorous understanding on the nature of how the consumers process the visual information. In this study, we explore the theoretical backgrounds of the visual information processing and set up the research proposition on the orders and directions of the online visitors' attentions. The visit data from experimental online shop for 81 days was analysed with repeated measure ANOVA. We found that the nature of visual information processing in the online environment is different from that of the ordinary text process. At the end of this study, implication and the limitations of the research were discussed.

The Effect of Visual and Verbal Scaffoldings on Web-Based Problem Solving Performance

  • RHA, Ilju;PARK, Soyoung
    • Educational Technology International
    • /
    • 제11권2호
    • /
    • pp.1-24
    • /
    • 2010
  • The study aimed to investigate the differential effects of visual and verbal scaffoldings on web-based problem solving performance. A quasi-experiment with 143 high school students in South Korea was administered. Each student's visualization tendency score was obtained at the beginning of the study. Based on the visualization tendency scores, students were divided into two groups; low and high level visualization tendency groups. Then each group was split in half and randomly assigned to one of the two lessons - one with visual scaffolding and the other with verbal scaffolding. The contents of the two lessons were the same. All students' performance was measured through an essay assignment for a problem solving at the end of the lesson. The result showed that the visual scaffolding group outperformed the verbal scaffolding group (F=22.54, p<.01), regardless of each student's visualization tendency level. The effect size was 0.81, indicating high practical significance. There was no statistically significant interaction effect between scaffolding modalities and students' visualization tendency levels. These findings imply that visual scaffolding is an effective strategy to promote students' problem solving performance.

Helping People with Visual Disability Using AI

  • Naif Al Otaibi;Tariq S Almurayziq
    • International Journal of Computer Science & Network Security
    • /
    • 제24권1호
    • /
    • pp.205-208
    • /
    • 2024
  • Artificial Intelligence (AI) technology has evolved rapidly in recent years and is used in everything from banking to email management to surgery, but without the help of the visible, most of the fun features of the Internet include visual impairment. It benefits people with disabilities. The main purpose of this study is to find ways to help people with visual impairments using AI technology. A visually impaired request is made for the visually impaired. For example, when a message arrives that the program will notify you by voice (reads the sender's name, read the message, and replies to it if necessary), this is a special program installed on your mobile phone. This program uses a customized algorithm developed in Python to convert written text to voice, read text, and convert voice to written text on a message when a visually impaired person wants to respond. Then it sends the response in the form of a text message. Therefore, the research should lead to programs for people with visual impairments. This program makes mobile phones easier and more comfortable to use and makes the daily life easier for visual impairments.

영상 기반 위치 인식을 위한 대규모 언어-이미지 모델 기반의 Bag-of-Objects 표현 (Large-scale Language-image Model-based Bag-of-Objects Extraction for Visual Place Recognition)

  • 정승운;박병재
    • 센서학회지
    • /
    • 제33권2호
    • /
    • pp.78-85
    • /
    • 2024
  • We proposed a method for visual place recognition that represents images using objects as visual words. Visual words represent the various objects present in urban environments. To detect various objects within the images, we implemented and used a zero-shot detector based on a large-scale image language model. This zero-shot detector enables the detection of various objects in urban environments without additional training. In the process of creating histograms using the proposed method, frequency-based weighting was applied to consider the importance of each object. Through experiments with open datasets, the potential of the proposed method was demonstrated by comparing it with another method, even in situations involving environmental or viewpoint changes.