• 제목/요약/키워드: cue

검색결과 549건 처리시간 0.024초

밝기 변화를 고려한 색상과 채도의 확률 모델에 기반한 조명변화에 간인한 컬러분할 (Color Segmentation robust to Illumination Variations based on Statistical Methods of Hue and Saturation including Brightness)

  • 김치호;유범재;김학배
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제54권10호
    • /
    • pp.604-614
    • /
    • 2005
  • Color segmentation takes great attentions since a color is an effective and robust visual cue for characterizing one object from other objects. Color segmentation is, however, suffered from color variation induced from irregular illumination changes. This paper proposes a reliable color modeling approach in HSI (Hue-Saturation-Intensity) rotor space considering intensity information by adopting B-spline curve fitting to make a mathematical model for statistical characteristics of a color with respect to brightness. It is based on the fact that color distribution of a single-colored object is not invariant with respect to brightness variations even in HS (Hue-Saturation) plane. The proposed approach is applied for the segmentation of human skin areas successfully under various illumination conditions.

The Role of Prosodic Boundary Cues in Word Segmentation in Korean

  • Kim, Sa-Hyang
    • 음성과학
    • /
    • 제13권1호
    • /
    • pp.29-41
    • /
    • 2006
  • This study investigates the degree to which various prosodic cues at the boundaries of prosodic phrases in Korean contribute to word segmentation. Since most phonological words in Korean are produced as one Accentual Phrase (AP), it was hypothesized that the detection of acoustic cues at AP boundaries would facilitate word segmentation. The prosodic characteristics of Korean APs include initial strengthening at the beginning of the phrase and pitch rise and final lengthening at the end. A perception experiment utilizing an artificial language learning paradigm revealed that cues conforming to the aforementioned prosodic characteristics of Korean facilitated listeners' word segmentation. Results also indicated that duration and amplitude cues were more helpful in segmentation than pitch. Nevertheless, results did show that a pitch cue that did not conform to the Korean AP interfered with segmentation.

  • PDF

말속도와 강도 변조에 따른 경도 마비말장애 환자의 말 용인도 변화 (The Change of Acceptability for the Mild Dysarthric Speakers' Speech due to Speech Rate and Loudness Manipulation)

  • 김지연;성철재
    • 말소리와 음성과학
    • /
    • 제7권1호
    • /
    • pp.47-55
    • /
    • 2015
  • This study examined whether speech acceptability was changed under various conditions of prosodic manipulations. Both speech rate and voice loudness reportedly are associated with acceptability and intelligibility. Speech samples by twelve speakers with mild dysarthria were recorded. Speech rate and loudness changes were made by digitally manipulating habitual sentences. 3 different loudness levels (70, 75, & 80dB) and 4 different speech rates (normal, 20% rapidly, 20% slowly, & 40% slowly) were presented to 12 SLPs (speech language pathologists). SLPs evaluated sentence acceptability by 7-point Likert scale. Repeated ANOVA were conducted to determine if the prosodic type of resynthesized cue resulted in a significant change in speech acceptability. A faster speech rate (20% rapidly) rather than habitual and slower rates (20%, 40% slowly) resulted in significant improvement in acceptability ratings (p <.001). An increased vocal loudness (up to 80dB) resulted in significant improvement in acceptability ratings (p <.05). Speech rate and loudness changes in the prosodic properties of speech may contribute to improved acceptability.

Real time tracking of multiple humans for mobile robot application

  • Park, Joon-Hyuk;Park, Byung-Soo;Lee, Seok;Park, Sung-Kee;Kim, Munsang
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2002년도 ICCAS
    • /
    • pp.100.3-100
    • /
    • 2002
  • This paper presents the method for detection and tracking of multiple humans robustly in mobile platform. The perception of human is performed in real time through the processing of images acquired from a moving stereo vision system. We performed multi-cue integration such as human shape, skin color and depth information to detect and track each human in moving background scene. Human shape is measured by edge-based template matching on distance transformed image. Improving robustness for human detection, we apply the human face skin color in HSV color space. And we could increase the accuracy and the robustness in both detection and tracking by applying random sampling stochastic estimati...

  • PDF

Improving visual relationship detection using linguistic and spatial cues

  • Jung, Jaewon;Park, Jongyoul
    • ETRI Journal
    • /
    • 제42권3호
    • /
    • pp.399-410
    • /
    • 2020
  • Detecting visual relationships in an image is important in an image understanding task. It enables higher image understanding tasks, that is, predicting the next scene and understanding what occurs in an image. A visual relationship comprises of a subject, a predicate, and an object, and is related to visual, language, and spatial cues. The predicate explains the relationship between the subject and object and can be categorized into different categories such as prepositions and verbs. A large visual gap exists although the visual relationship is included in the same predicate. This study improves upon a previous study (that uses language cues using two losses) and a spatial cue (that only includes individual information) by adding relative information on the subject and object of the extant study. The architectural limitation is demonstrated and is overcome to detect all zero-shot visual relationships. A new problem is discovered, and an explanation of how it decreases performance is provided. The experiment is conducted on the VRD and VG datasets and a significant improvement over previous results is obtained.

Dual Effect of Price in E-Commerce Environment: Focusing on Trust and Distrust Building Processes

  • Lee, Jung
    • Asia pacific journal of information systems
    • /
    • 제24권3호
    • /
    • pp.393-415
    • /
    • 2014
  • This study examines the dynamics of trust and distrust at different price levels. We first note that trust and distrust are built with cognitive and affective foundations, and price is viewed as a financial burden or product quality information. Then, we relate price changes to trust and distrust, and hypothesize their interactions: price as a quality cue will positively moderate the cognitive dimension of trust, whereas price as financial burden will negatively moderate the affective dimensions of trust and distrust. We surveyed 263 online mall shoppers in Korea. Among our eight hypotheses, six are fully supported and two are partially supported. The result shows that price perception interacts with both the cognitive and affective dimensions of trust and distrust, but its specific impacts are distinguished by the price perceptions, whether it is financial burden or product quality information.

Sensory substitution in perceiving architectural surfaces

  • Kim, Young-Kil;Young, Rockefeller-S.L.
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 1992년도 춘계공동학술대회 발표논문 및 초록집; 울산대학교, 울산; 01월 02일 May 1992
    • /
    • pp.573-580
    • /
    • 1992
  • 인공건물의 평면특성에 대한 시각을 통한 인지를 청각으로 대체했을 경우의 인지능력을 측정하였다. 정상적으로 시각(visual)을 이용하겠으나, 시각 장애자의 경우는 청각(auditory) 또는 촉각(tactile) 또는 두가지 모두를 사용하게 된다. Psychophysical approach를 사용하여 모의평면에 대한 인지능력을 JND단위로 측정하였다. 청각적인 신호를 관찰자에게 제공하기 위해 전자장치(electronic ranging device)가 고안되었다. 이 장치는 목표물까지의 거리를 초음파의 이동시간으로 측정하여 음의세기(sound level)로 발생시켜 준다. 관찰자는 이 음의 세기를 듣고 거리를 추정하고 물표의 방향은 이 장비를 쥔 손의 방향, 즉, proprioceptive cue를 이용하게 된다. 세가지 task에 대한 실험은 평면의 slantness, 두 평면이 교차하는 모서리의 크기, 두 평면사이의 공간(aperture size)등에 대한 인지능력의 측정실험이다. 실험결과를 보면, 관찰자는 시각신호 대신에 청각신호를 사용할 수 있는 능력이 있는 것으로 나타났다. 세가지 task별 JND측정치는 slant angle 6도, 모서리의 concavity 10도, angular aperature size 3-5도로 나타났다. 이 결과는 정상인이 시각을 이용한 인지능력과 큰 차이가 없음을 보여주고 있다.

  • PDF

음운 구조가 한국어 단어 분절에 미치는 영향 (The role of prosodic phrasing in Korean word segmentation)

  • 김사향
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.114-118
    • /
    • 2007
  • The current study investigates the degree to which various prosodic cues at the boundaries of a prosodic phrase in Korean (Accentual Phrase) contributed to word segmentation. Since most phonological words in Korean are produced as one AP, it was hypothesized that the detection of acoustic cues at AP boundaries would facilitate word segmentation. The prosodic characteristics of Korean APs include initial strengthening at the beginning of the phrase and pitch rise and final lengthening at the end. A perception experiment revealed that the cues that conform to the above-mentioned prosodic characteristics of Korean facilitated listeners' word segmentation. Results also showed that duration and amplitude cues were more helpful in segmentation than pitch. Further, the results showed that a pitch cue that did not conform to the Korean AP interfered with segmentation.

  • PDF

한국어 폐쇄음 발음과 최근의 발음 변이: 발화 형태별 VOT와 f0를 중심으로 (Korean stop pronunciation and current sound change: Focused on VOT and f0 in different pronunciation types)

  • 김지은
    • 말소리와 음성과학
    • /
    • 제9권3호
    • /
    • pp.41-47
    • /
    • 2017
  • The purpose of this study is to examine how speakers use VOT and f0 to distinguish tense, lax, and aspirated stops in isolated sentence reading and paragraph readings. To do so, a total of 20 males between the ages of 20-25 years old were asked to read (1) isolated sentences, (2) information-oriented text and (3) emotional expressive texts in which the stop pronunciation's VOT value and f0 were measured thereafter. The main results are as follows. In the isolate sentence reading, lax stops, and aspirated stops were distinguished by both VOT and f0, but for the Korean men that read reading texts, VOT is not a cue to distinguish between lax and aspirated stops. In general, the VOT differences between lax stops and aspirated stops were smaller for information-oriented texts and emotional expressive texts than that of the isolate sentence reading. In the paragraph reading that induces a natural utterance, the f0 dependence is greater for the distinction between lax and aspirated stops.

From Exoscope into the Next Generation

  • Nishiyama, Kenichi
    • Journal of Korean Neurosurgical Society
    • /
    • 제60권3호
    • /
    • pp.289-293
    • /
    • 2017
  • An exoscope, high-definition video telescope operating monitor system to perform microsurgery has recently been proposed an alternative to the operating microscope. It enables surgeons to complete the operation assistance by visualizing magnified images on a display. The strong points of exoscope are the wide field of view and deep focus. It minimized the need for repositioning and refocusing during the procedure. On the other hand, limitation of magnifying object was an emphasizing weak point. The procedures are performed under 2D motion images with a visual perception through dynamic cue and stereoscopically viewing corresponding to the motion parallax. Nevertheless, stereopsis is required to improve hand and eye coordination for high precision works. Consequently novel 3D high-definition operating scopes with various mechanical designs have been developed according to recent high-tech innovations in a digital surgical technology. It will set the stage for the next generation in digital image based neurosurgery.