Search | Korea Science

Interpretable Visual Question Answering via Explain Sentence Generation (설명 문장 생성을 통한 해석 가능한 시각적 질의응답 모델 분석)

Kim, Danil;Han, Bohyung
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2020.07a
- /
- pp.359-362
- /
- 2020
본 연구에서는 설명 문장 생성을 통한 해석 가능한 시각적 질의응답 모델을 설계하고 학습 방법을 제시한다. 설명 문장은 시각적 질의응답 모델이 응답을 예측하는 데에 필요한 이미지 및 질문 정보와 적절한 논리적인 정보의 조합 및 정답 추론 과정이 함의되어 있을 것으로 기대한다. 설명 문장 생성 과정이 포함된 시각적 질의응답의 기본적인 모델을 기반으로 여러 가지 학습방법을 통해 설명 문장 생성 과정과 응답 예측 과정간의 상호관계를 분석한다. 이러한 상호작용을 적극적으로 활용할 수 있는 보다 개선 시각적 질의응답 모델을 제안한다. 또한 학습한 결과를 바탕으로 설명 문장의 특성을 활용하여 시각적 질의응답 추론 과정을 개선함으로써 시각적 질의응답 모델의 발전 방향을 논의한다. 본 실험을 통해서 응답 예측에 적절한 설명 문장을 제시하는 해석 가능한 시각적 질의응답 모델을 제공한다.
PDF

An Analysis of Lessons to Teach Proportional Reasoning with Visual Models: Focused on Ratio table, Double Number Line, and Double Tape Diagram (시각적 모델을 활용한 비례 추론 수업 분석: 비표, 이중수직선, 이중테이프 모델을 중심으로)

Seo, Eunmi;Pang, JeongSuk;Lee, Jiyoung
- Journal of Educational Research in Mathematics
- /
- v.27 no.4
- /
- pp.791-810
- /
- 2017
This study explored the possibility of using visual models in teaching proportional reasoning based on the review of previous studies. Many studies on proportional reasoning emphasize that students tend to simply apply formal procedures without understanding the meaning behind them and that using visual models may be an alternative to help students develop informal strategies and proportional reasoning. Given these, we re-constructed and implemented the unit of a textbook to teach sixth graders proportional reasoning with ratio table, double number line, and double tape diagram. The results of this study showed that such visual models helped students understand the meaning of proportion, explore the properties of proportion, and solve proportional problems. However, several difficulties that students experienced in using the visual models led us to suggest cautionary notes when to teach proportional reasoning with visual models. As such, this study is expected to provide empirical information for textbook developers and teachers who teach proportional reasoning with visual models.
PDF KSCI

Visual Representations for Improving Proportional Reasoning in Solving Word Problems (비례 추론을 돕는 시각적 모델에 대하여: 초등 수학 교과서의 비례식과 비례배분 실생활 문제를 대상으로)

Yim, Jae Hoon;Lee, Hyung Sook
- Journal of Educational Research in Mathematics
- /
- v.25 no.2
- /
- pp.189-206
- /
- 2015
There has been a recurring call for using visual representations in textbooks to improve the teaching and learning of proportional reasoning. However, the quantity as well as quality of visual representations used in textbooks is still very limited. In this article, we analyzed visual representations presented in a Grade 6 textbook from two perspectives of proportional reasoning, multiple-batches perspective and variable-parts perspective, and discussed the potential of the double number line and the double tape diagram to help develop the idea 'things covary while something stays the same', which is critical to reason proportionally. We also classified situations that require proportional reasoning into five categories and provided ways of using the double number line and the double tape diagram for each category.
PDF KSCI

A Spatial Pyramid Matching LDA Model using Sparse Coding for Classification of Sports Scene Images (스포츠 이미지 분류를 위한 희소 부호화 기법을 이용한 공간 피라미드 매칭 LDA 모델)

Jeon, Jin;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.06a
- /
- pp.35-36
- /
- 2016
본 논문에서는 기존 Bag-of-Visual words (BoW) 접근법에서 반영하지 못한 이미지의 공간 정보를 활용하기 위해서 Spatial Pyramid Matching (SPM) 기법을 Latent Dirichlet Allocation (LDA) 모델에 결합하여 이미지를 분류하는 모델을 제안한다. BoW 접근법은 이미지 패치를 시각적 단어로 변환하여 시각적 단어의 분포로 이미지를 표현하는 기법이며, 기존의 방식이 이미지 패치의 위치정보를 활용하지 못하는 점을 극복하기 위하여 SPM 기법을 도입하는 연구가 진행되어 왔다. 또한 이미지 패치를 정확하게 표현하기 위해서 벡터 양자화 대신 희소 부호화 기법을 이용하여 이미지 패치를 시각적 단어로 변환하였다. 제안하는 모델은 BoW 접근법을 기반으로 위치정보를 활용하는 SPM 을 LDA 모델에 적용하여 시각적 단어의 토픽을 추론함과 동시에 multi-class SVM 분류기를 이용하여 이미지를 분류한다. UIUC 스포츠 데이터를 이용하여 제안하는 모델의 분류 성능을 검증하였다.
PDF

A Visual Model for the Perception of the Optical illusions from Discrete Dot Stimuli (이산 도트 자극에서 시각적 착시를 인식하는 시각 모델)

Jung, Eun-Hwa;Hong, Keong-Ho
- The KIPS Transactions:PartB
- /
- v.10B no.6
- /
- pp.639-646
- /
- 2003
This paper proposes a neural network model for extracting optical illusions produced by a sequence of discontinuous dot stimuli. The proposed model is based on visual cell's characters founded by visual information processing path. This study approaches on the basis of physiological observation of the perceptual phenomena that some simple ways of discrete dots are perceived as a continuous virtual contour rather than as separate dots. This paper presents the implementation of the optical illusions from discrete dot stimuli that are composed of virtual polygons from 6 to 10 dots. This experimental data are similar to those of Smith & Vos's physiological experiments. The proposed model shows that it can extract continuous illusion contours from discrete dot stimuli successfully.
https://doi.org/10.3745/KIPSTB.2003.10B.6.639 인용 PDF KSCI

Features Analysis of Character by Visual Types of Body and Temperament (체격기질유형의 시각적 모델에 의한 캐릭터의 유형특성 분석)

김남훈
- Proceedings of the Korea Multimedia Society Conference
- /
- 2003.11b
- /
- pp.613-620
- /
- 2003
애니메이션의 사전제작과정에서 캐릭터의 보다 효율적 제작, 분석, 평가을 위해 시각적 모델화및 데이터 베이스 구축이 필요했다. 캐릭터의 체격과 기질 관계를 조명하기 위해 W. 셀던의 체격기질유형 이론으로부터 정형화된 분석 틀을 구축하여 개념을 시각화하고 유형분석 모델을 만들었으며, 또한 케이스 스터디를 위해 세 애니메이션에 나타난 캐릭터의 유형들을 분석함으로써 모델의 적용 가능성과 나아가 애니메이션의 캐릭터에서 표현된 외적 형상과 내적 기질의 상관성을 추론하여 실제 적용이 가능한 기초적 데이터가 되도록 하였다.
PDF

A Study on Multiplication Expression Method by Visual Model (시각적 모델에 따른 곱셈식 표현 방법에 대한 연구)

Kim, Juchang;Lee, Kwnagho
- Education of Primary School Mathematics
- /
- v.22 no.1
- /
- pp.65-82
- /
- 2019
In this study, students' multiplication expression method according to visual model was analyzed through paper test and eye tracking test. As a result of the paper-pencil test, students were presented with multiplication formula. In the group model (number of individual pieces in a group) ${\times}$ (number of group) in the array model (column) ${\times}$ (row), but in the array model, the proportion of students who answered the multiplication formula in the (row) ${\times}$ (column). From these results, we derived the appropriate model presentation method for multiplication instruction and the multiplication expression method for visual model.
https://doi.org/10.7468/jksmec.2019.22.1.65 인용 PDF KSCI HTML

Intuitive Quasi-Eigenfaces for Facial Animation (얼굴 애니메이션을 위한 직관적인 유사 고유 얼굴 모델)

Kim, Ig-Jae;Ko, Hyeong-Seok
- Journal of the Korea Computer Graphics Society
- /
- v.12 no.2
- /
- pp.1-7
- /
- 2006
블렌드 쉐입 기반 얼굴 애니메이션을 위해 기저 모델(Expression basis)을 생성하는 방법을 크게 두 가지로 구분하면, 애니메이터가 직접 모델링을 하여 생성하는 방법과 통계적 방법에 기초하여 모델링하는 방법이 있다. 그 중 애니메이터에 의한 수동 모델링 방법으로 생성된 기저 모델은 직관적으로 표정을 인식할 수 있다는 장점으로 인해 전통적인 키프레임 제어가 가능하다. 하지만, 표정 공간(Expression Space)의 일부분만을 커버하기 때문에 모션데이터로부터의 재복원 과정에서 많은 오차를 가지게 된다. 반면, 통계적 방법을 기반으로 한 기저모델 생성 방법은 거의 모든 표정공간을 커버하는 고유 얼굴 모델(Eigen Faces)을 생성하므로 재복원 과정에서 최소의 오차를 가지지만, 시각적으로 직관적이지 않은 표정 모델을 만들어 낸다. 따라서 본 논문에서는 수동으로 생성한 기저모델을 유사 고유 얼굴 모델(Quasi-Eigen Faces)로 변형하는 방법을 제시하고자 한다. 결과로 생성되는 기저 모델은 시각적으로 직관적인 얼굴 표정을 유지하면서도 통계적 방법에 의한 얼굴표정 공간의 커버 영역과 유사하도록 확장할 수 있다.
PDF

Formulating the Landscape Preference Model Using a Mixed Conditional Logit (조건부 로짓함수를 이용한 경관선호 모델: 지리산 국립공원 방문자를 대상으로)

Lee, Deokjae
- Journal of Korean Society of Forest Science
- /
- v.95 no.6
- /
- pp.768-777
- /
- 2006
The purpose of this study lies in formulating the landscape preference model using a conditional logit that involves the effect of visual elements as well as landscape itself on landscape preferences. To measure landscape preferences, a photo-questionnaire composed of paired photographs of the Cairngorms National Park of Scotland and the Jirisan National Park of Korea was distributed to visitors to the Jirisan National Park of Korea. Visual elements of landscape quantitatively measured by photogrammetry were reduced to orthogonal principal components that were subsequently used as explanatory variables in a conditional logit. As a result, the mixed conditional logit including the effect of landscape itself satisfied the Independence of Irrelevant Alternatives (IIA) property and showed reliable goodness of fit (${\rho}^2=0.25$). It was concluded that the mixed conditional logit including the effect of landscape itself was appropriate for landscape preference model rather than usual conditional logit excluding the effect.
PDF KSCI

희소 부호화 기법과 토픽 모델링을 통한 이미지 분류 모델

Jeon, Jin;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.07a
- /
- pp.49-50
- /
- 2015
본 논문에서는 이미지를 시각적 단어로 표현하여 분석하는 기법인 bag-of-visual words (BoW) 모델을 기반으로 latent dirichlet allocation (LDA) 모델을 결합하여 시각적 단어의 구조를 파악하여 이미지를 분류할 수 있는 모델을 제안한다. 우선 이미지를 시각적 단어로 기존의 방법보다 정확하게 표현하기 위해서 희소 부호화(sparse coding) 기법을 적용한다. 기존의 BoW 모델은 하나의 이미지 패치를 하나의 단어로 표현하였지만, 희소 부호화 기법을 통해 하나의 이미지 패치를 여러 개의 단어로 표현할 수 있다. 제안하는 모델을 이용하여 이미지를 분류하기 위해서 분류 성능 측정에 많이 쓰이는 multi-class SVM 기법을 이용한다. UIUC 스포츠 데이터를 이용한 성능 측정을 통해 제안한 기법의 클래스 분류 성능을 검증하였다.
PDF

Search Result 1,212, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)