• Title/Abstract/Keywords: spatial cues

34 search results (processing time: 0.025 s)

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding

  • 이병화;백승권;서정일;한민수
    • 대한음성학회지:말소리
    • /
    • No. 53
    • /
    • pp.157-169
    • /
    • 2005
  • This paper proposes an efficient method of representing spatial cues in multichannel spatial audio coding. The recently introduced Binaural Cue Coding (BCC) method represents multichannel audio signals by means of the Inter-Channel Level Difference (ICLD) or the Source Index (SI). In this paper, we express ICLD and SI information more efficiently based on the Inter-Channel Correlation (ICC). We adopt different spatial cues according to the ICC and propose a method for compensating the empty spectra created by using SI. We performed a MOS test and measured spectral distortion. The results show that the proposed method can reduce the bitrate of the side information without a large degradation in audio quality.


Statistical Model-Based Voice Activity Detection Using Spatial Cues for Dual-Channel Noisy Speech Recognition

  • 신민화;박지훈;김홍국;이연우;이성로
    • 말소리와 음성과학
    • /
    • Vol. 2, No. 3
    • /
    • pp.141-148
    • /
    • 2010
  • This paper proposes a voice activity detection (VAD) method for dual-channel noisy speech recognition that employs spatial cues. In the proposed method, a probability model for speech presence/absence is constructed using spatial cues obtained from a dual-channel input signal, and speech activity intervals are detected through this probability model. Specifically, the spatial cues consist of the interaural time differences and interaural level differences of the dual-channel speech signals, and the probability model for speech presence/absence is based on a Gaussian kernel density. To evaluate the performance of the proposed VAD method, speech recognition is performed on speech segments that include only the speech intervals detected by the proposed method. Its performance is compared with that of several methods: an SNR-based method, a direction-of-arrival (DOA) based method, and a phase-vector-based method. The speech recognition experiments show that the proposed method outperforms these conventional methods, providing relative word error rate reductions of 11.68%, 41.92%, and 10.15% compared with the SNR-based, DOA-based, and phase-vector-based methods, respectively.

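The abstract above describes a speech presence/absence model built on a Gaussian kernel density over interaural time and level differences. The following is a minimal sketch of that general idea, not the authors' implementation: the function names, the product-Gaussian kernel, the bandwidths, the log-likelihood-ratio threshold, and all feature values are assumptions chosen for illustration.

```python
import math

def kde_log_density(point, samples, bandwidths):
    """Log-density of `point` under a product-Gaussian KDE built on `samples`."""
    d = len(point)
    log_norm = -0.5 * d * math.log(2 * math.pi) - sum(math.log(h) for h in bandwidths)
    log_kernels = [
        log_norm - 0.5 * sum(((p - s) / h) ** 2
                             for p, s, h in zip(point, sample, bandwidths))
        for sample in samples
    ]
    # Log-sum-exp keeps the average of the kernels numerically stable.
    peak = max(log_kernels)
    total = sum(math.exp(v - peak) for v in log_kernels)
    return peak + math.log(total) - math.log(len(samples))

def is_speech(frame, speech_samples, noise_samples, bandwidths, threshold=0.0):
    """Flag a frame as speech when its log-likelihood ratio exceeds `threshold`."""
    llr = (kde_log_density(frame, speech_samples, bandwidths)
           - kde_log_density(frame, noise_samples, bandwidths))
    return llr > threshold

# Hypothetical training features, one (ITD in ms, ILD in dB) pair per frame.
# Assumption: the talker is in front (cues near zero); noise is spread widely.
speech_samples = [(i * 0.01, j * 0.5) for i in range(-3, 4) for j in range(-2, 3)]
noise_samples = [(i * 0.1, j * 2.0) for i in range(-6, 7) for j in range(-5, 6)]
bandwidths = (0.05, 1.0)  # assumed kernel widths for ITD and ILD

for frame in [(0.0, 0.0), (0.5, 8.0)]:
    print(frame, is_speech(frame, speech_samples, noise_samples, bandwidths))
```

In this sketch a frame whose cues fall near the assumed talker direction is labeled speech, while a frame with strongly lateral cues is labeled noise; a real system would estimate the cues per frame from the two microphone signals.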

The Effects of Variety and Visual Cue on Perceived Quantity and Consumer Attitude toward Participation into Sales Promotion Events

  • Lee, Changhyun;Kim, Youngchan
    • Asia Marketing Journal
    • /
    • Vol. 21, No. 1
    • /
    • pp.65-87
    • /
    • 2019
  • Most studies on how people perceive a given quantity of items have been conducted exclusively with visual cues and have offered only spatial-area-based explanations, such as spatial estimation and perceptual grouping theories. This article establishes how people perceive a given quantity when only a written description is provided, without any visual cues. Across two studies, we show that variety decreases perceived quantity when a visual cue is given, while variety increases perceived quantity when a visual cue is not given. This is because people tend to rely heavily on spatial areas when a visual cue is present, and because people are prone to confirmation bias when they are given only written descriptions and no visual cues. Furthermore, we highlight that quantity perception mediates consumers' attitude, that is, their intention to participate in sales promotion events. Lastly, we summarize the article and discuss its contributions, implications, limitations, and suggestions for future research.

Improving visual relationship detection using linguistic and spatial cues

  • Jung, Jaewon;Park, Jongyoul
    • ETRI Journal
    • /
    • Vol. 42, No. 3
    • /
    • pp.399-410
    • /
    • 2020
  • Detecting visual relationships in an image is important for image understanding. It enables higher-level image understanding tasks, such as predicting the next scene and understanding what occurs in an image. A visual relationship comprises a subject, a predicate, and an object, and is related to visual, language, and spatial cues. The predicate explains the relationship between the subject and the object and can be grouped into categories such as prepositions and verbs. A large visual gap can exist even among visual relationships that share the same predicate. This study improves upon a previous study (that uses language cues using two losses) and a spatial cue (that only includes individual information) by adding relative information on the subject and object of the extant study. The architectural limitation is demonstrated and overcome so that all zero-shot visual relationships can be detected. A new problem is discovered, and an explanation of how it decreases performance is provided. The experiments are conducted on the VRD and VG datasets, and a significant improvement over previous results is obtained.

Detection of Forest Areas using Airborne LIDAR Data

  • 황세란;김성준;이임평
    • Spatial Information Research
    • /
    • Vol. 18, No. 3
    • /
    • pp.23-32
    • /
    • 2010
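  • LIDAR data acquired over forest areas can be used effectively in forest research, for example to generate DTMs of forest areas and to estimate tree height and forest biomass. As a key preprocessing step for such work, this study develops a method for effectively detecting forest areas from LIDAR data. We first present three perceptual cues that are effective for identifying forest areas in LIDAR data, based on multiple-return characteristics, height deviation, and spatial distribution. Candidate forest areas are detected from each cue, and binary morphological processing is then applied to remove misclassifications and refine the boundaries, yielding the final forest areas. Validation against reference data generated from aerial images showed that the methods based on all three cues achieve an accuracy of about 90% or higher. In particular, the method based on multiple-return characteristics is judged to be better than the others in terms of accuracy and simplicity. We also confirmed that combining the individual results from each cue improves classification accuracy.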
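The two-stage idea in the abstract above (threshold a cue to get candidate forest cells, then clean the mask with binary morphology) can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the per-cell multiple-return ratio, the 0.4 threshold, the 3x3 structuring element, the choice of a morphological opening, and the grid values are all hypothetical.

```python
def erode(mask):
    """3x3 binary erosion; cells outside the grid count as background."""
    rows, cols = len(mask), len(mask[0])
    def get(r, c):
        return mask[r][c] if 0 <= r < rows and 0 <= c < cols else 0
    return [[int(all(get(r + dr, c + dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)))
             for c in range(cols)] for r in range(rows)]

def dilate(mask):
    """3x3 binary dilation; cells outside the grid count as background."""
    rows, cols = len(mask), len(mask[0])
    def get(r, c):
        return mask[r][c] if 0 <= r < rows and 0 <= c < cols else 0
    return [[int(any(get(r + dr, c + dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)))
             for c in range(cols)] for r in range(rows)]

def detect_forest(multi_return_ratio, threshold=0.4):
    """Threshold the cue, then apply an opening (erosion followed by dilation)."""
    candidates = [[int(v > threshold) for v in row] for row in multi_return_ratio]
    return dilate(erode(candidates))

# Hypothetical 6x8 grid: fraction of laser pulses with multiple returns per cell.
ratio = [[0.1] * 8 for _ in range(6)]
for r in range(1, 5):
    for c in range(1, 5):
        ratio[r][c] = 0.6   # a 4x4 forest stand
ratio[2][7] = 0.7           # an isolated false candidate

forest = detect_forest(ratio)
for row in forest:
    print("".join("#" if v else "." for v in row))
```

The opening removes the isolated candidate cell while restoring the compact stand to its original extent, which is the kind of misclassification removal the abstract attributes to the morphological step.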

Memory-for-Object Location in Toddlers

  • 김미해
    • 아동학회지
    • /
    • Vol. 7, No. 1
    • /
    • pp.85-95
    • /
    • 1986
  • The purpose of the present research was to study the effects of experimental conditions and developmental trends in the use of external cues on memory-for-object location in toddlers. The research consisted of two experiments. In study 1, the subjects were 12 toddlers, 18 to 23 months old; in study 2, 30 toddlers, 24 to 41 months old. The findings showed that memory-for-object location in toddlers differed according to the experimental conditions; that is, memory-for-object location was significantly better in the natural condition than in the artificial condition. Effects of external cues were also found; that is, memory-for-object location was best in the spatial cue condition, next best in the picture cue condition, and worst in the no-cue condition.


The Effect of Spatial Attention in Hangul Word Recognition: Depending on Visual Factors

  • 이고은;이혜원
    • 인지과학
    • /
    • Vol. 34, No. 1
    • /
    • pp.1-20
    • /
    • 2023
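  • This study examined how the effect of spatial attention on Hangul word recognition depends on visual factors. To test whether the effect of spatial attention differs with visual factors, we manipulated the visual complexity of words (Experiment 1) and the luminance contrast of words (Experiment 2). Word complexity was divided into a condition with a final consonant (batchim) and a condition without one, and word contrast was divided into a high-contrast condition and a low-contrast condition. Using a lexical decision task, we computed the cueing effect as the difference in performance between trials in which a spatial cue was presented at the target location (valid trials) and trials in which it was not (invalid trials), and used it to assess the influence of attention. The results showed that the cueing effect was similar across levels of word complexity, which was interpreted to mean that the effect of spatial attention does not vary with complexity. For word contrast, the cueing effect was larger in the low-contrast condition than in the high-contrast condition. The larger effect of spatial attention at low contrast was explained by a mechanism in which spatial attention enhances the stimulus signal.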
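The cueing effect described above is a difference in performance between invalid and valid trials. A minimal sketch of that calculation follows, assuming performance is measured as mean reaction time in milliseconds; the function name, the trial format, and every number below are hypothetical and are not data from the study.

```python
from statistics import mean

def cueing_effect(trials):
    """Mean reaction time on invalid trials minus mean reaction time on valid trials."""
    valid = [t["rt_ms"] for t in trials if t["cue"] == "valid"]
    invalid = [t["rt_ms"] for t in trials if t["cue"] == "invalid"]
    return mean(invalid) - mean(valid)

# Made-up trials for illustration only, grouped by contrast condition.
trials_by_condition = {
    "high_contrast": [
        {"cue": "valid", "rt_ms": 600}, {"cue": "valid", "rt_ms": 620},
        {"cue": "invalid", "rt_ms": 630}, {"cue": "invalid", "rt_ms": 650},
    ],
    "low_contrast": [
        {"cue": "valid", "rt_ms": 700}, {"cue": "valid", "rt_ms": 720},
        {"cue": "invalid", "rt_ms": 780}, {"cue": "invalid", "rt_ms": 800},
    ],
}

effects = {name: cueing_effect(trials) for name, trials in trials_by_condition.items()}
print(effects)
```

A larger positive value means that a valid cue helped more in that condition; comparing the values across conditions mirrors the comparison the abstract makes between contrast levels.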

The Type of e-book's Visualization by the Narrative Space

  • 신승윤;정현선
    • 한국콘텐츠학회논문지
    • /
    • Vol. 14, No. 7
    • /
    • pp.103-114
    • /
    • 2014
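  • This study aims to propose a classification of presentation techniques in order to establish and develop research on the narrative visualization of e-books as an independent field. To this end, we analyzed e-books of Disney animations, which are recognized worldwide for both artistic quality and commercial success. First, through a theoretical review, we identified the visual spatial structure of e-books and the principles of perspectival perception. Next, we identified the agents that generate motion and observed the presentation elements that enable a spatial experience with a strong sense of presence through motion cues. In the analysis, the motion cues of on-screen elements, the medium, the camera, and the reader were classified into 13 types and defined as codes. Using these as criteria, we analyzed frequency of use in the works under study, classified the results into 46 combined motions, and defined them as 4 groups. These were then categorized into real-space experience, narrative-space experience, and character experience in order to analyze the characteristics of the motion cues. This study is meaningful as foundational research that classifies the types of narrative visualization in e-books and provides a framework for extending e-books into a visual language.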

Human Performance Evaluation of Virtual Object Moving Task in the Different Temporal, Spatial and Pictorial Resolution of a Stereoscopic Display

  • 박재희
    • 산업공학
    • /
    • Vol. 18, No. 1
    • /
    • pp.82-87
    • /
    • 2005
  • Most virtual reality systems ask users to control 3D objects or to navigate a 3D world using 3D controllers. To maximize human performance in such control tasks, the design of a virtual reality system and its input and output devices should be optimized. In this study, an experiment was designed to investigate the effects of three resolution factors of a virtual reality system on human performance. Six subjects took part in the experiment, which varied three factors: two frame rates, three spatial resolutions, and three pictorial contents. The results showed that the greater the spatial resolution, the higher the human performance. For temporal resolution, a fixed frame rate of 18 Hz was better than a varied, maximized frame rate. For pictorial content, the virtual space with orientation cues yielded better performance than the other two conditions: the virtual space without any orientation cue and the virtual space resembling the real world. These results could be applied to the design of virtual reality systems.

Automatic Person Identification using Multiple Cues

  • Swangpol, Danuwat;Chalidabhongse, Thanarat
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 ICCAS 2005
    • /
    • pp.1202-1205
    • /
    • 2005
  • This paper describes a method for vision-based person identification that can detect, track, and recognize a person from video using multiple cues: height and clothing colors. The method does not require a constrained target pose or a fully frontal face image to identify the person. First, the system, which is connected to a pan-tilt-zoom camera, detects the target using motion detection and a human cardboard model. The system keeps tracking the moving target while it tries to determine whether the target is a human and, if so, who it is among the persons registered in the database. To segment the moving target from the background scene, we employ a version of the background subtraction technique and some spatial filtering. Once the target is segmented, we align it with the generic human cardboard model to verify whether the detected target is a human. If the target is identified as a human, the cardboard model is also used to segment the body parts, such as the head, torso, and legs, to obtain salient features. The whole-body silhouette is also analyzed to obtain the target's shape information, such as height and slimness. We then use these multiple cues (at present, shirt color, trousers color, and body height) to recognize the target using a supervised self-organization process. We preliminarily tested the system on a set of 5 subjects with multiple sets of clothes. The recognition rate is 100% if the person is wearing clothes that were learned before. When a person wears new clothes, the system fails to identify the person, which means that height alone is not enough to classify persons. We plan to extend the work by adding more cues, such as skin color and face recognition that uses the zoom capability of the camera to obtain a high-resolution view of the face, and then to evaluate the system with more subjects.
