• Title/Summary/Keyword: cue

Search results: 549

Color Segmentation robust to Illumination Variations based on Statistical Methods of Hue and Saturation including Brightness (밝기 변화를 고려한 색상과 채도의 확률 모델에 기반한 조명변화에 강인한 컬러분할)

  • Kim, Chi-Ho;You, Bum-Jae;Kim, Hagbae
    • The Transactions of the Korean Institute of Electrical Engineers D / v.54 no.10 / pp.604-614 / 2005
  • Color segmentation has attracted considerable attention because color is an effective and robust visual cue for distinguishing one object from others. Color segmentation suffers, however, from color variation induced by irregular illumination changes. This paper proposes a reliable color modeling approach in HSI (Hue-Saturation-Intensity) color space that takes intensity information into account, using B-spline curve fitting to build a mathematical model of the statistical characteristics of a color with respect to brightness. It is based on the fact that the color distribution of a single-colored object is not invariant to brightness variations, even in the HS (Hue-Saturation) plane. The proposed approach is applied successfully to the segmentation of human skin areas under various illumination conditions.
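The idea in this abstract, that hue and saturation statistics of one color shift with brightness, can be illustrated with a minimal sketch. It is not the authors' method: it replaces the B-spline fit with a piecewise-constant model (one mean and standard deviation per brightness bin), uses the HSV value channel from Python's `colorsys` as a stand-in for HSI intensity, and ignores hue wrap-around. The function names, the bin count, and the thresholds `k` and `min_std` are all hypothetical.

```python
import colorsys
from statistics import mean, pstdev


def fit_brightness_model(pixels, bins=4):
    """Estimate hue/saturation statistics separately for each brightness bin.

    pixels: iterable of (r, g, b) samples of ONE target color, each in [0, 1].
    Returns a list with one entry per bin: (hue_mean, hue_std, sat_mean, sat_std),
    or None where a bin has too few samples. Assumption: a binned model stands
    in for the B-spline curve described in the abstract.
    """
    buckets = [[] for _ in range(bins)]
    for r, g, b in pixels:
        h, s, v = colorsys.rgb_to_hsv(r, g, b)
        buckets[min(int(v * bins), bins - 1)].append((h, s))

    model = []
    for bucket in buckets:
        if len(bucket) < 2:
            model.append(None)  # not enough data to estimate a spread
            continue
        hues = [h for h, _ in bucket]
        sats = [s for _, s in bucket]
        model.append((mean(hues), pstdev(hues), mean(sats), pstdev(sats)))
    return model


def matches_color(model, pixel, k=2.5, min_std=0.02):
    """Return True if the pixel lies within k standard deviations of the
    hue and saturation means learned for its own brightness bin."""
    h, s, v = colorsys.rgb_to_hsv(*pixel)
    entry = model[min(int(v * len(model)), len(model) - 1)]
    if entry is None:
        return False
    h_mean, h_std, s_mean, s_std = entry
    # min_std keeps the test from collapsing when training spread is tiny.
    return (abs(h - h_mean) <= k * max(h_std, min_std)
            and abs(s - s_mean) <= k * max(s_std, min_std))


# Hypothetical skin-like training samples at two brightness levels.
training = [(0.80, 0.60, 0.50), (0.82, 0.60, 0.50),
            (0.40, 0.30, 0.25), (0.42, 0.30, 0.25)]
skin_model = fit_brightness_model(training)
```

Because the thresholds are looked up per brightness bin, a pixel is compared only against samples seen under similar brightness; a smooth curve fit would additionally interpolate between bins.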

The Role of Prosodic Boundary Cues in Word Segmentation in Korean

  • Kim, Sa-Hyang
    • Speech Sciences / v.13 no.1 / pp.29-41 / 2006
  • This study investigates the degree to which various prosodic cues at the boundaries of prosodic phrases in Korean contribute to word segmentation. Since most phonological words in Korean are produced as one Accentual Phrase (AP), it was hypothesized that the detection of acoustic cues at AP boundaries would facilitate word segmentation. The prosodic characteristics of Korean APs include initial strengthening at the beginning of the phrase and pitch rise and final lengthening at the end. A perception experiment utilizing an artificial language learning paradigm revealed that cues conforming to the aforementioned prosodic characteristics of Korean facilitated listeners' word segmentation. Results also indicated that duration and amplitude cues were more helpful in segmentation than pitch. Nevertheless, results did show that a pitch cue that did not conform to the Korean AP interfered with segmentation.

  • PDF

The Change of Acceptability for the Mild Dysarthric Speakers' Speech due to Speech Rate and Loudness Manipulation (말속도와 강도 변조에 따른 경도 마비말장애 환자의 말 용인도 변화)

  • Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.7 no.1 / pp.47-55 / 2015
  • This study examined whether speech acceptability changed under various conditions of prosodic manipulation. Both speech rate and voice loudness are reportedly associated with acceptability and intelligibility. Speech samples from twelve speakers with mild dysarthria were recorded. Speech rate and loudness were changed by digitally manipulating habitual sentences. Three loudness levels (70, 75, and 80 dB) and four speech rates (normal, 20% faster, 20% slower, and 40% slower) were presented to 12 SLPs (speech language pathologists). The SLPs rated sentence acceptability on a 7-point Likert scale. Repeated-measures ANOVAs were conducted to determine whether the prosodic type of the resynthesized cue resulted in a significant change in speech acceptability. A faster speech rate (20% faster), rather than the habitual and slower rates (20% and 40% slower), resulted in a significant improvement in acceptability ratings (p < .001). Increased vocal loudness (up to 80 dB) resulted in a significant improvement in acceptability ratings (p < .05). Changes to speech rate and loudness in the prosodic properties of speech may contribute to improved acceptability.

Real time tracking of multiple humans for mobile robot application

  • Park, Joon-Hyuk;Park, Byung-Soo;Lee, Seok;Park, Sung-Kee;Kim, Munsang
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2002.10a / pp.100.3-100 / 2002
  • This paper presents a method for robustly detecting and tracking multiple humans on a mobile platform. Humans are perceived in real time by processing images acquired from a moving stereo vision system. We integrate multiple cues, namely human shape, skin color, and depth information, to detect and track each human against a moving background. Human shape is measured by edge-based template matching on a distance-transformed image. To improve the robustness of human detection, we apply human face skin color in HSV color space. And we could increase the accuracy and the robustness in both detection and tracking by applying random sampling stochastic estimati...

  • PDF

Improving visual relationship detection using linguistic and spatial cues

  • Jung, Jaewon;Park, Jongyoul
    • ETRI Journal / v.42 no.3 / pp.399-410 / 2020
  • Detecting visual relationships in an image is important for image understanding. It enables higher-level image understanding tasks, such as predicting the next scene and understanding what occurs in an image. A visual relationship comprises a subject, a predicate, and an object, and is related to visual, language, and spatial cues. The predicate explains the relationship between the subject and object and can fall into different categories, such as prepositions and verbs. A large visual gap can exist even among visual relationships that share the same predicate. This study improves upon a previous study, which uses language cues with two losses and a spatial cue that includes only individual information, by adding relative information on the subject and object. An architectural limitation is demonstrated and overcome so that all zero-shot visual relationships can be detected. A new problem is discovered, and an explanation of how it decreases performance is provided. Experiments are conducted on the VRD and VG datasets, and a significant improvement over previous results is obtained.

Dual Effect of Price in E-Commerce Environment: Focusing on Trust and Distrust Building Processes

  • Lee, Jung
    • Asia Pacific Journal of Information Systems / v.24 no.3 / pp.393-415 / 2014
  • This study examines the dynamics of trust and distrust at different price levels. We first note that trust and distrust are built with cognitive and affective foundations, and price is viewed as a financial burden or product quality information. Then, we relate price changes to trust and distrust, and hypothesize their interactions: price as a quality cue will positively moderate the cognitive dimension of trust, whereas price as financial burden will negatively moderate the affective dimensions of trust and distrust. We surveyed 263 online mall shoppers in Korea. Among our eight hypotheses, six are fully supported and two are partially supported. The result shows that price perception interacts with both the cognitive and affective dimensions of trust and distrust, but its specific impacts are distinguished by the price perceptions, whether it is financial burden or product quality information.

Sensory substitution in perceiving architectural surfaces

  • Kim, Young-Kil;Young, Rockefeller-S.L.
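    • Proceedings of the Korean Operations and Management Science Society Conference / 1992.04b / pp.573-580 / 1992
  • We measured how well people perceive the planar characteristics of man-made buildings when visual perception is replaced by auditory perception. Vision is normally used, but visually impaired people rely on auditory cues, tactile cues, or both. Using a psychophysical approach, we measured perceptual ability for simulated surfaces in JND (just-noticeable difference) units. An electronic ranging device was devised to provide auditory signals to the observer. The device measures the distance to a target from the travel time of an ultrasonic pulse and renders it as a sound level. The observer estimates distance by listening to this sound level and judges the direction of the target from the direction of the hand holding the device, that is, from a proprioceptive cue. Experiments on three tasks measured the ability to perceive the slant of a surface, the size of the corner where two surfaces intersect, and the space between two surfaces (aperture size). The results show that observers are able to use auditory signals in place of visual signals. The JNDs measured for the three tasks were 6 degrees for slant angle, 10 degrees for corner concavity, and 3-5 degrees for angular aperture size. These results show no large difference from the perceptual ability of sighted people using vision.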

  • PDF
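The ranging principle mentioned in this abstract (distance from ultrasonic travel time, rendered as a sound level) can be sketched as follows. The time-of-flight relation d = v·t/2 is standard physics; everything else is an assumption for illustration: the speed of sound value, the working range, the decibel limits, the linear mapping, and the choice that nearer targets sound louder. The abstract gives none of these details, and the function names are hypothetical.

```python
SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air near 20 °C


def echo_distance_m(round_trip_s, speed_m_s=SPEED_OF_SOUND_M_S):
    """Distance to the target from the round-trip time of an ultrasonic pulse.

    The pulse travels to the target and back, so the one-way distance is
    half of speed * time.
    """
    if round_trip_s < 0:
        raise ValueError("round-trip time must be non-negative")
    return speed_m_s * round_trip_s / 2.0


def distance_to_level_db(distance_m, near_m=0.5, far_m=5.0,
                         loud_db=80.0, quiet_db=50.0):
    """Map a distance to an output sound level (hypothetical linear mapping).

    Distances are clamped to [near_m, far_m]; near targets map to loud_db
    and far targets to quiet_db.
    """
    clamped = min(max(distance_m, near_m), far_m)
    fraction = (clamped - near_m) / (far_m - near_m)
    return loud_db + fraction * (quiet_db - loud_db)


# Example: a 10 ms echo corresponds to roughly 1.7 m.
example_distance = echo_distance_m(0.010)
example_level = distance_to_level_db(example_distance)
```

A linear mapping is only one option; a logarithmic mapping might match loudness perception more closely, but the source does not say which the device used.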

The role of prosodic phrasing in Korean word segmentation (음운 구조가 한국어 단어 분절에 미치는 영향)

  • Kim, Sa-Hyang
    • Proceedings of the KSPS conference / 2007.05a / pp.114-118 / 2007
  • The current study investigates the degree to which various prosodic cues at the boundaries of a prosodic phrase in Korean (the Accentual Phrase, AP) contribute to word segmentation. Since most phonological words in Korean are produced as one AP, it was hypothesized that the detection of acoustic cues at AP boundaries would facilitate word segmentation. The prosodic characteristics of Korean APs include initial strengthening at the beginning of the phrase and pitch rise and final lengthening at the end. A perception experiment revealed that cues conforming to these prosodic characteristics of Korean facilitated listeners' word segmentation. Results also showed that duration and amplitude cues were more helpful in segmentation than pitch. Further, the results showed that a pitch cue that did not conform to the Korean AP interfered with segmentation.

  • PDF

Korean stop pronunciation and current sound change: Focused on VOT and f0 in different pronunciation types (한국어 폐쇄음 발음과 최근의 발음 변이: 발화 형태별 VOT와 f0를 중심으로)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences / v.9 no.3 / pp.41-47 / 2017
  • The purpose of this study is to examine how speakers use VOT and f0 to distinguish tense, lax, and aspirated stops in isolated sentence reading and paragraph reading. To do so, 20 males aged 20-25 were asked to read (1) isolated sentences, (2) information-oriented text, and (3) emotionally expressive text, after which the VOT and f0 of the stops were measured. The main results are as follows. In isolated sentence reading, lax and aspirated stops were distinguished by both VOT and f0, but when the Korean men read the texts, VOT was not a cue for distinguishing lax from aspirated stops. In general, the VOT differences between lax and aspirated stops were smaller for the information-oriented and emotionally expressive texts than for isolated sentence reading. In paragraph reading, which induces natural utterances, the dependence on f0 is greater for the distinction between lax and aspirated stops.

From Exoscope into the Next Generation

  • Nishiyama, Kenichi
    • Journal of Korean Neurosurgical Society / v.60 no.3 / pp.289-293 / 2017
  • An exoscope, a high-definition video telescope operating monitor system for performing microsurgery, has recently been proposed as an alternative to the operating microscope. It assists surgeons by visualizing magnified images on a display. The strong points of the exoscope are its wide field of view and deep focus, which minimize the need for repositioning and refocusing during the procedure. On the other hand, its limited magnification of the object has been emphasized as a weak point. Procedures are performed under 2D motion images, with depth perceived through dynamic cues and stereoscopic viewing corresponding to motion parallax. Nevertheless, stereopsis is required to improve hand-eye coordination for high-precision work. Consequently, novel 3D high-definition operating scopes with various mechanical designs have been developed, following recent high-tech innovations in digital surgical technology. They will set the stage for the next generation of digital image-based neurosurgery.