• Title/Summary/Keyword: Visual Intelligence

Search Result 244, Processing Time 0.026 seconds

Hybrid Learning for Vision-and-Language Navigation Agents (시각-언어 이동 에이전트를 위한 복합 학습)

  • Oh, Suntaek;Kim, Incheol
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.9
    • /
    • pp.281-290
    • /
    • 2020
  • The Vision-and-Language Navigation(VLN) task is a complex intelligence problem that requires both visual and language comprehension skills. In this paper, we propose a new learning model for visual-language navigation agents. The model adopts a hybrid learning that combines imitation learning based on demo data and reinforcement learning based on action reward. Therefore, this model can meet both problems of imitation learning that can be biased to the demo data and reinforcement learning with relatively low data efficiency. In addition, the proposed model uses a novel path-based reward function designed to solve the problem of existing goal-based reward functions. In this paper, we demonstrate the high performance of the proposed model through various experiments using both Matterport3D simulation environment and R2R benchmark dataset.

Tracking Method of Dynamic Smoke based on U-net (U-net기반 동적 연기 탐지 기법)

  • Gwak, Kyung-Min;Rho, Young J.
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.21 no.4
    • /
    • pp.81-87
    • /
    • 2021
  • Artificial intelligence technology is developing as it enters the fourth industrial revolution. Active researches are going on; visual-based models using CNNs. U-net is one of the visual-based models. It has shown strong performance for semantic segmentation. Although various U-net studies have been conducted, studies on tracking objects with unclear outlines such as gases and smokes are still insufficient. We conducted a U-net study to tackle this limitation. In this paper, we describe how 3D cameras are used to collect data. The data are organized into learning and test sets. This paper also describes how U-net is applied and how the results is validated.

Compression Method for MPEG CDVA Global Feature Descriptors (MPEG CDVA 전역 특징 서술자 압축 방법)

  • Kim, Joonsoo;Jo, Won;Lim, Guentaek;Yun, Joungil;Kwak, Sangwoon;Jung, Soon-heung;Cheong, Won-Sik;Choo, Hyon-Gon;Seo, Jeongil;Choi, Yukyung
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.295-307
    • /
    • 2022
  • In this paper, we propose a novel compression method for scalable Fisher vectors (SCFV) which is used as a global visual feature description of individual video frames in MPEG CDVA standard. CDVA standard has adopted a temporal descriptor redundancy removal technique that takes advantage of the correlation between global feature descriptors for adjacent keyframes. However, due to the variable length property of SCFV, the temporal redundancy removal scheme often results in inferior compression efficiency. It is even worse than the case when the SCFVs are not compressed at all. To enhance the compression efficiency, we propose an asymmetric SCFV difference computation method and a SCFV reconstruction method. Experiments on the FIVR dataset show that the proposed method significantly improves the compression efficiency compared to the original CDVA Experimental Model implementation.

Implementation of Camera-Based Autonomous Driving Vehicle for Indoor Delivery using SLAM (SLAM을 이용한 카메라 기반의 실내 배송용 자율주행 차량 구현)

  • Kim, Yu-Jung;Kang, Jun-Woo;Yoon, Jung-Bin;Lee, Yu-Bin;Baek, Soo-Whang
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.4
    • /
    • pp.687-694
    • /
    • 2022
  • In this paper, we proposed an autonomous vehicle platform that delivers goods to a designated destination based on the SLAM (Simultaneous Localization and Mapping) map generated indoors by applying the Visual SLAM technology. To generate a SLAM map indoors, a depth camera for SLAM map generation was installed on the top of a small autonomous vehicle platform, and a tracking camera was installed for accurate location estimation in the SLAM map. In addition, a convolutional neural network (CNN) was used to recognize the label of the destination, and the driving algorithm was applied to accurately arrive at the destination. A prototype of an indoor delivery autonomous vehicle was manufactured, and the accuracy of the SLAM map was verified and a destination label recognition experiment was performed through CNN. As a result, the suitability of the autonomous driving vehicle implemented by increasing the label recognition success rate for indoor delivery purposes was verified.

The Effect of Characteristics of Social Intelligence Robots on Satisfaction and Intention to Use: Focused on User of Single Person Households (소셜 지능로봇의 특성이 만족과 사용의도에 미치는 영향: 1인 가구 소셜 지능로봇 사용자를 중심으로)

  • Jeon, Gyuri;Lee, Chaehyun;Jung, Sungmi;Choi, Jeongil
    • Journal of Korean Society for Quality Management
    • /
    • v.52 no.1
    • /
    • pp.95-113
    • /
    • 2024
  • Purpose: This study focused on the societal changes associated with the entry into an ultra-aged society and the increase in single-person households. The core objective of this research is to investigate how social intelligent robots can bring about positive changes in the lives of individuals in single-person households and how such changes influence user satisfaction and the intention to use these robots. Methods: The study employed a cross-sectional analysis using a structural equation model. A survey designed to assess the impact of social intelligent robots' characteristics, such as perceived encouragement, empathy, presence, appearance, and attachment, on user satisfaction and usage intentions was conducted. Data were collected from a total of 335 users and analyzed using the structural equation model. Results: In the characteristics of social intelligent robots for single-person households, it was found that empathy, presence, and attachment significantly influenced satisfaction, while perceived encouragement, empathy, and attachment significantly influenced usage intentions. The research results indicate differences between enhancing user satisfaction and increasing the intention to use social intelligent robots. The findings suggest the essential need for a user-centric approach in the design and development of social intelligent robots. Additionally, it was observed that emotional support plays a crucial role in users' experiences with social intelligent robots. Conclusion: This study verified the impact of social intelligent robots on satisfaction and usage intentions based on users' experiences. It examined the influence of linguistic, visual, and personal characteristics of robots on user experiences, providing insights into how technological and human aspects of social intelligent robots interact to shape user satisfaction and usage intentions. Consequently, the study confirmed that social intelligent robots can bring positive changes to human life, emphasizing the necessity for the advancement of robot technology in a human-centric direction.

An Interdisciplinary Approach to the Human/Posthuman Discourses Emerging From Cybernetics and Artificial Intelligence Technology (4차 산업혁명 시대의 사이버네틱스와 휴먼·포스트휴먼에 관한 인문학적 지평 연구)

  • Kim, Dong-Yoon
    • Journal of Broadcast Engineering
    • /
    • v.24 no.5
    • /
    • pp.836-848
    • /
    • 2019
  • This paper aims at providing a critical view over the cybernetics theory especially of first generation on which the artificial intelligence heavily depends nowadays. There has been a commonly accepted thought that the conception of artificial intelligence could not has been possible without being influenced by N. Wiener's cybernetic feedback based information system. Despite the founder of contemporary cybernetics' ethical concerns in order to avoid an increasing entropy phenomena(social violence, economic misery, wars) produced through a negative dynamics of the western modernity regarded as the most advanced form of humanism. In this civilizationally changing atmosphere, the newly born cybernetic technology was thus firmly believed as an antidote to these vices deeply rooted in humanism itself. But cybernetics has been turned out to be a self-organizing, self-controlling mechanical system that entails the possibility of telegraphing human brain (which are transformed into patterns) through the uploading of human brain neurons digitalized by the artificial intelligence embedded into computing technology. On this background emerges posthuman (or posthumanism) movement of which concepts have been theorized mainly by its ardent apostles like N. K. Hayles, Neil Bedington, Laurent Alexandre, Donna J. Haraway. The converging of NBIC Technologies leading to the opening of a much more digitalizing society has served as a catalyst to promote the posthuman representations and different narratives especially in the contemporary visual arts as well as in the study of humanities including philosophy and fictional literature. Once Bruno Latour wrote "Modernity is often defined in terms of humanism, either as a way of saluting the birth of 'man' or as a way of announcing his death. But this habit is itself modern, because it remains asymmetrical. It overlooks the simultaneous birth of 'nonhumaniy' - things, or objects, or beasts, - and the equally strange beginning of a crossed-out God, relegated to the sidelines."4) These highly suggestive ideas enable us to better understand what kind of human beings would emerge following the dazzlingly accelerating advancement of artificial intelligence technology. We wonder whether or not this newly born humankind would become essentially Homo Artificialis as a neuronal man stripping off his biological apparatus. However due to this unprecedented situation humans should deal with enormous challenges involving ethical, metaphysical, existential implications on their life.

The Analysis of K-WISC-IV Profiles in Children with High-Functioning Autism Spectrum Disorder (고기능 자폐 스펙트럼 장애 아동의 K-WISC-IV 프로파일 분석 및 융합적 적용)

  • Cho, Eun-Young;Kim, Hyun-Mi;Song, Dong-Ho;Cheon, Keun-Ah
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.7
    • /
    • pp.341-348
    • /
    • 2017
  • The aim of this study is to distinguish children with high-functioning autism spectrum disorder (ASD) from the norm group by identifying their Intelligence with Korean Wechsler Intelligence Scale for Children-Fourth Edition (K-WISC-IV) profile analysis. The article were administered to 90 children with high-functioning ASD (6-16) years and has surveyed the average of the Full scale IQ, index scores, and subtest scores of K-WISC-IV. Also, this study has conducted a single-subject T-test in order to verify whether Full scale IQ, index scores, subtest scores are different from those of the norm group. The results show that children with high-functioning ASD achieved significantly lower scores on Processing Speed Index, compared to the norm group. Furthermore, their scores in Comprehension, Picture Concept, Picture completion, Coding, and Symbol Search were significantly lower than those of the norm group. It is likely that what have turned out to be the cognitive weaknesses of high-functioning ASD children by K-WISC-IV analysis, including slow process speed, low social judgement, and difficulty in visual stimuli in everyday life are interrelated to their unique characters.

Development of Noise and AI-based Pavement Condition Rating Evaluation System (소음도·인공지능 기반 포장상태등급 평가시스템 개발)

  • Han, Dae-Seok;Kim, Young-Rok
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.1
    • /
    • pp.1-8
    • /
    • 2021
  • This study developed low-cost and high-efficiency pavement condition monitoring technology to produce the key information required for pavement management. A noise and artificial intelligence-based monitoring system was devised to compensate for the shortcomings of existing high-end equipment that relies on visual information and high-end sensors. From idea establishment to system development, functional definition, information flow, architecture design, and finally, on-site field evaluations were carried out. As a result, confidence in the high level of artificial intelligence evaluation was secured. In addition, hardware and software elements and well-organized guidelines on system utilization were developed. The on-site evaluation process confirmed that non-experts could easily and quickly investigate and visualized the data. The evaluation results could support the management works of road managers. Furthermore, it could improve the completeness of the technologies, such as prior discriminating techniques for external conditions that are not considered in AI learning, system simplification, and variable speed response techniques. This paper presents a new paradigm for pavement monitoring technology that has lasted since the 1960s.

The Cognitive Performance, Emotional and Behavioral Problems of the Children with ADHD Showing the Difference between Visual and Auditory Attention (시각 주의력과 청각 주의력의 차이를 보이는 주의력 결핍.과잉활동장애 아동의 인지기능과 정서 및 행동 문제)

  • Son, Jung Woo
    • Korean Journal of Biological Psychiatry
    • /
    • v.13 no.2
    • /
    • pp.70-81
    • /
    • 2006
  • Objective : The purpose of this study was to investigate the differences of the cognitive performance, emotional and behavioral problems among the attention-deficit/hyperactivity disorder(ADHD) groups that show the difference between visual and auditory attention. Method : Using 'ADHD Diagnostic System(ADS)', visual attention and auditory attention of 98 children diagnosed as ADHD were measured. According to the omission and commission error of ADS, they were divided into three groups ; 1) the group whose each visual omission and commission error scores were higher than each auditory omission and commission error scores(VV group), 2) the group whose each auditory omission and commission error scores were higher than each visual omission and commission error scores(AA group), 3) the group that was the rest of VV and AA group(M group). And the results of both the subscales of Korean Educational Development Institute-Wechsler Intelligence Scale for Children(KEDI-WISC) and the subscales of Korean Child Behavior Checklist(K-CBCL) among three groups were compared. Finally, the correlation between the visual omission, visual commission, auditory omission, auditory commission error and the results of KEDI-WISC, K-CBCL were investigated. Results : The results were as follows ; 1) In 98 ADHD children, the number of VV group(N=56) was higher than that of AA (N=10) and M group (N=32). 2) All mean scores of the subscales of KEDI-WISC of VV group were higher than those of M and AA group. The score of verbal IQ(p=.039) of VV group was significantly higher than that of AA group and the scores of block design(p=.015), Kaufman's factor 2(p=.045), performance IQ(p=.004) were significantly higher than those of M group. The score of full IQ(p=.004) were significantly higher than that of M and AA group. 3) The mean scores of all K-CBCL subscales of VV group were higher than those of M and AA group, except the score of Somatic complaint subscale. The score of Social subscale(p=.041) of VV group was significantly higher than that of AA group. The score of Withdrawn subscale(p=.021) of AA group was significantly higher than that of VV group. 4) There were no significant correlation between the scores of visual omission/commission error and those of each subscale of KEDI-WISC. But, there were many significant correlations between the scores of auditory omission/commission error and those of each subscale of KEDI-WISC. 5) There were significant correlation between the score of the visual omission error and that of Thought problem subscale(r=.205, p=.043) of K-CBCL. There were significant correlation between the scores of the auditory omission error and those of Social subscale(r=-.319, p=.001), Social problems subscale(r=.206, p=.042), Thought problem subscale(r=.235, p=.021). Finally, there were significant correlation between the scores of auditory commission error and those of Social subscale(r=-.241, p=.017), Thought problem subscale(r=.235, p=.020). Conclusion : The ADHD children whose auditory attention ability were higher than visual attention ability had relatively better cognitive performance and less emotional/behavioral problems than the others. The more comprehensive experiment will be needed about the cognitive performance, emotion and behavior problems of the ADHD children showing the difference between visual and auditory attention.

  • PDF

Embodied Conversational Agent Using a Virtual Character to Induce Children's Verbal Communication (가상 캐릭터를 활용하여 아동의 구어 대화를 유도하는 대화형 에이전트)

  • Choi, Jiyeong;Jung, Keechul
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.10
    • /
    • pp.1296-1306
    • /
    • 2020
  • Childhood verbal communication impacts children's language skills and has a positive effect as partners use more vocabulary. But reduction in family time, caused by lowered age for private education and so on, has reduced the chance for children to speak with partners who have a proficient language skill. This vacancy was naturally occupied by the media, which has become one of the cornerstones of the growth of kids' contents. Kids contents are making various attempts to expand the breadth of services. But most contents still focus on unilateral visual information delivery yet, so there is a limit to satisfy the vacancy of conversation partners. Therefore this paper suggests an ECA(Embodied conversational agent) to induce children's spoken conversation using a virtual character frequently used in kids contents. This system is implemented by the voice bot and agent model produced using an IBM assistant and Unity. As a result of using ECA for 66 children of 5-9 years old, it showed meaningful results in terms of induction of verbal communication.