• Title/Summary/Keyword: 영상 언어

Search Result 530, Processing Time 0.024 seconds

Texture Feature-Based Language Identification Using Gabor Feature and Wavelet-Domain BDIP and BVLC Features (Gabor 특징과 웨이브렛 영역의 BDIP와 BVLC 특징을 이용한 질감 특징 기반 언어 인식)

  • Jang, Ick-Hoon;Lee, Woo-Shin;Kim, Nam-Chul
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.4
    • /
    • pp.76-85
    • /
    • 2011
  • In this paper, we propose a texture feature-based language identification using Gabor feature and wavelet-domain BDIP (block difference of inverse probabilities) and BVLC (block variance of local correlation coefficients) features. In the proposed method, Gabor and wavelet transforms are first applied to a test image. The wavelet subbands are next denoised by Donoho's soft-thresholding. The magnitude operator is then applied to the Gabor image and the BDIP and BVLC operators to the wavelet subbands. Moments for Gabor magnitude image and each subband of BDIP and BVLC are computed and fused into a feature vector. In classification, the WPCA (whitened principal component analysis) classifier, which is usually adopted in the face identification, searches the training feature vector most similar to the test feature vector. Experimental results show that the proposed method yields excellent language identification with rather low feature dimension for a document image DB.

A Speech Recognition System based on a New Endpoint Estimation Method jointly using Audio/Video Informations (음성/영상 정보를 이용한 새로운 끝점추정 방식에 기반을 둔 음성인식 시스템)

  • 이동근;김성준;계영철
    • Journal of Broadcast Engineering
    • /
    • v.8 no.2
    • /
    • pp.198-203
    • /
    • 2003
  • We develop the method of estimating the endpoints of speech by jointly using the lip motion (visual speech) and speech being included in multimedia data and then propose a new speech recognition system (SRS) based on that method. The endpoints of noisy speech are estimated as follows : For each test word, two kinds of endpoints are detected from visual speech and clean speech, respectively Their difference is made and then added to the endpoints of visual speech to estimate those for noisy speech. This estimation method for endpoints (i.e. speech interval) is applied to form a new SRS. The SRS differs from the convention alone in that each word model in the recognizer is provided an interval of speech not Identical but estimated respectively for the corresponding word. Simulation results show that the proposed method enables the endpoints to be accurately estimated regardless of the amount of noise and consequently achieves 8 o/o improvement in recognition rate.

A Study on the Expression Analysis of Social Topics in Taiwan's New Wave Movies - Focused on Hou Hsiao-hsien and Yang Teh-chang (대만 뉴웨이브 영화의 사회의제 표현 분석 연구 - 허우 샤오시엔과 에드워드 양이 중심으로)

  • Lee, Tae-hoon;ZHANG, YIRAN
    • Journal of Digital Convergence
    • /
    • v.19 no.7
    • /
    • pp.349-358
    • /
    • 2021
  • In the 1980s, the rapid development of Hong Kong genre films began the myth of Hong Kong's New Wave films, which had a profound impact on Taiwanese films of the same period. Later, two leading film directors, Hou Xiaoxien and Edward Yang, appeared in the process of being influenced by Taiwanese film Ganyu Wave. In this paper, we conducted research on the art style, theme style, film language, and aesthetic narrative methods of films of Hou Xiaoxien and Edward Yang against the backdrop of Taiwan's New Wave era. In addition, the visual characteristics of Taiwan's New Wave films, and the two directors have drawn suggestions on Taiwan's new generation of directors and the Taiwanese film industry, and presented a colorful film creation scheme for the creation and innovation of the new generation of filmmakers.

A Comparative Study of Spatial Composition in East Asian Hanging Scrolls and Contemporary Digital Vertical Videos (동양의 전통 족자와 현대의 디지털 세로 영상의 공간 구성 비교 연구)

  • Sun Ling;Kim Yoojin
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.3
    • /
    • pp.289-298
    • /
    • 2024
  • As digital mobile technology has advanced, vertical videos have emerged as a prominent format in the contemporary media field, presenting a new visual language that challenges traditional horizontal-centric aesthetic norms. This study delves into the visual and structural parallels and distinctions between traditional East Asian Hanging scrolls and contemporary vertical videos by applying traditional spatial composition techniques such as the 'Three Distances', 'One River, Two Banks', 'Intended Blank', and 'Unity of Poetry, Calligraphy, and Painting' to the creation of modern vertical videos. Through this comparative analysis, the research examines how vertical layouts enhance depth and layering of the screen, deepen emotional expression, and offer creators new avenues for expression. By juxtaposing the spatial compositions of traditional East Asian Hanging scrolls with those prevalent in today's digital vertical videos, this study seeks to uncover new visual languages and aesthetic values within the evolving media field.

Language Education System with Structured Programming (구조적 프로그래밍을 위한 언어 학습 시스템)

  • Park, Kyoung-Wook;Ryu, Nam-Hoon;Kim, Eung-Kon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.5 no.5
    • /
    • pp.459-464
    • /
    • 2010
  • Computer programs are required from all areas in society including machine, space, aviation, and medicine. However, the programming curriculum is getting hard despite a lot of teaching materials and video lessons. Programming languages are very diverse, but most of them use the same structure, and they only have different expression methods. Therefore, if one learns one programming language, then it doesn't need to spend a lot of time and efforts to learn another programming langue. Most programming languages use the structure of sequence, selection, and repletion in general. The important thing for programming learners is the structure or algorithm of programming not the grammar of program. This study designed and implemented the language learning system to learn structured programming by using a flowchart.

A Study on High-speed Image Binarization Using SIMD (SIMD를 이용한 영상의 고속 이진화에 관한 연구)

  • Kim, Doo-Sik;Lee, Sang-Ho;Kim, Byeong-Geun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11a
    • /
    • pp.775-778
    • /
    • 2002
  • 영상 이진화란 명도 영상(gray-scaled image)을 이진 영상(bi-leveled image)으로 변환하는 것을 말한다. 영상 이진화는 문서 인식, 비디오 영상 분석 등과 같이 영상처리 분야에서 많이 사용되는 기본적인 영상 처리 과정에 해당한다. 본 논문은 Intel 사의 Pentium 계열 프로세서에서 지원하는 SIMD(Single-Instruction Multiple-Data) 기술을 이용하여 영상 이진화를 고속으로 수행하는 방법을 소개한다. 우편영상에 대하여 실험한 결과, SSE2 명령어로 구현된 프로그램은 기존의 C 언어로 구현된 프로그램에 비하여 4배 이상의 속도 향상을 보였다.

  • PDF

Two Languages in One Brain Shown by fMRI: Orthography Specific Effects in L2 (fMRI에 나타난 모국어와 외국어로서의 한국문자와 중국문자의 차이)

  • 이동훈;이홍재;문찬홍;유재욱;남기춘
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2002.05a
    • /
    • pp.216-221
    • /
    • 2002
  • 본 연구는 문자 규칙 심층성이 다른 문자체계인 한국어와 중국어의 차이가 이중언어화자의 모국어 처리와 외국어 처리에서 각각 어떤 대뇌 활성화의 차이를 가져오는지 fMRI (functional Magnetic Resonance Imaging)를 이용하여 살펴보았다. 중국어 (Ll)-한국어(L2) 이중언어화자 및 한국어(Ll)-중국어(L2) 이중언어화자를 제 2언어 습득시기에 따라 초기 및 후기 이중언어화자로 구분하여 모국어 차이와 습득시기에 따른 영향을 알아보았다. 실험 1에서는 어휘 판단 과제(lexical decision task)를 실시하였고, 실험 2에서는 의미 판단 과제(semantic decision task)를 각각 실시하였다. 어휘판단과제를 사용한 실험 1의 결과는 음운처리와 관련된 좌반구 SMG(supramarginal gyrus), 하두정소엽(inferior parietal lobule, BA 39, 40)에서 중국어-한국어 초기 및 후기이중언어화자의 경우, 한국어 조건에서 보다 많은 활성화를 보였으나, 한국어-중국어 화자의 경우 활성화가 나타나지 않았다. 철자처리에 관련된 방추상회(fusiform gyrus, BA 37, 19) 영역에서는 중국어-한국어 화자뿐만 아니라, 한국어-중국어 인중언어화자의 경우도 중국어 조건에서 보다 많은 활성화를 보였다. 실험 2에서 사용한 의미판단과제의 경우, 중국어-한국어 이중언어화자의 경우 어휘판단과제를 사용한 실험 1의 결과에서 보고된 한국어 특정적인 반응, 즉 SMG영역에서의 활성화의 증가가 실험 2에서는 나타나지 않았다. 그러나 한국어-중국어 이중언어화자의 경우, 실험 1에서 나타난 것과 같이 철자처리 혹은 의미처리와도 관련된다고 보고되는 방추상회(fusiform gyrus)등의 영역 유의미한 차이를 나타났다. 이는 어휘 판단과제와 의미판단과제가 유도하는 뇌 활성화 양상이 다름을 시사한다. 종합해 볼 때, 이중언어화자의 뇌 영상 연구에서 어휘수준에서는 거의 공통적인 활성화를 보인다는 개략적 수준의 연구 결과를 넘어, 음운처리 및 철자처리와 같은 어휘접근 수준에서는 이중언어화자들의 뇌 활성화가 다르게 일어남을 보여주고 있다. 따라서 이중언어 화자의 뇌 기전을 밝히기 위해서도 보다 개략적 수준을 넘어 언어처리의 세부적인 수준에 따른 접근이 필요함을 시사한다.

  • PDF

An fMRI study on the cerebellar lateralization during visuospatial and verbal tasks (공간 및 언어 과제 수행 시 소뇌의 편측화에 관한 뇌 기능 연구)

  • Chung, Soon-Cheol;Sohn, Jin-Hun;Choi, Mi-Hyun;Lee, Su-Jeong;Yang, Jae-Woong;Lee, Beob-Yi
    • Science of Emotion and Sensibility
    • /
    • v.12 no.4
    • /
    • pp.425-432
    • /
    • 2009
  • The purposes of the study were to examine cerebellar areas and lateralization responsible for visuospatial and verbal tasks using functional Magnetic Resonance Imaging(fMRI). Eight healthy male college students($21.5\;{\pm}\;2.3$ years) and eight male college students($23.3\;{\pm}\;0.5$ years) participated in this fMRI study of visuospatial and verbal tasks, respectively. Functional brain images were taken from 3T MRI using the single-shot EPI method. All functional images were aligned with anatomical images using affine transformation routines built into SPM99. The experiment consisted of four blocks. Each block included a control task(1 minute) and a cognitive task(1 minute). A run was 8 minutes long. Using the subtraction procedure, activated areas in the cerebellum during the visuospatial and verbal tasks were color-coded by t-score. A cerebellar lateralization index was calculated for both cognition tasks using number of activated voxels. The activated cerebellar regions during the both cognition tasks of this study agree with previous results. Since the number of activated voxels of the left and right cerebellar hemisphere was almost same, there was no cerebellar lateralization for both cognition tasks.

  • PDF

Functional MR Imaging of Language System : Comparative Study between Visual and Auditory Instructions in Word Generation Task (언어 중추 영역에 대한 기능적 자기공명영상: 시각적, 청각적 지시 과제에 관한 비교)

  • 구은회;권대철;김동성;송인찬
    • Journal of Biomedical Engineering Research
    • /
    • v.24 no.4
    • /
    • pp.241-246
    • /
    • 2003
  • To evaluate the usefulness if functional MR imaging(MRI) for the determination of language dominance system and to assess differences in the visual and auditory instrument language generation task according to activation task or activated area. Functional maps of the language area were obtained during visual and auditory instructions in word generation tasks in 6 healthy volunteer with right-handness were examined on a 1.5T scanner and the EPI BOLD technique, and three pulse sequence technique get of the true axial planes. Both task consisted of 96 phases including 6 activations and rests contents. Postprocessing were done on MRDx program by using cross correlation method. Two task compare the blain activation area surveyed of 1anguage lateralization index. To evaluated of the detection rates of Broca. Wernicke, pre-frontal lobe, Supplementary Motor Area (SMA) and pre-motor cortex areas and the differences of language lateraliaztion among two word generation task To lateralization index survey in 1anguage area on right and left in brain get to activation area pixel in brain. Compared to visual and auditory instrument task in the language areas get to the lateralization index. Two language generation task high detection rates of Broca and Wernicke areas. The visual instruction no detected in the auditory area, and auditory instruction no detected in the visual area. There was statistics significant different of them among language generation task. 1'his indicated that language area obtained image of the brain functional MR imaging usefulness in the visual and auditory task instrument.

Real-time Implementation of H.263 Encoder Using TMS320C6201 (TMS320C6201을 이용한 H.263 동영상 부호화기의 실시간 구현)

  • 김민성;정재호
    • Proceedings of the IEEK Conference
    • /
    • 2001.09a
    • /
    • pp.63-66
    • /
    • 2001
  • 본 논문에서는 TI사의 TMS320C6201 DSP를 이용하여 H.263 동영상 부호화기를 실시간 구현하고자 한다. 구현한 부호화기는 QCIF 형식의 영상을 사용하여 ITU-T H.263 권고안의 기본 모드를 따라 주로 C 언어와 intrinsics를 사용하여 구현하였다. 특히, 속도 향상을 위해서 고속 메모리의 사용을 극대화하는데 중점을 두었고, 연산량이 많은 모듈에 대한 최적화와 데이터의 병렬 처리 및 DMA (Direct Memory Access) 전송 등을 고려하여 구현하였다.

  • PDF