• Title/Summary/Keyword: image language recognition (영상언어인식)

Design and Implementation of a Language Identification System for Handwriting Input Data (필기 입력데이터에 대한 언어식별 시스템의 설계 및 구현)

  • Lim, Chae-Gyun;Kim, Kyu-Ho;Lee, Ki-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.10 no.1
    • /
    • pp.63-68
    • /
    • 2010
  • Recently, with the move toward ubiquitous computing, input interfaces for mobile devices have been actively researched. In addition to existing interfaces such as the keyboard and cursor (mouse), handwriting, voice, vision, and touch are being studied as new interfaces. Small mobile devices in particular increasingly need an efficient input interface despite their small screens, because their size strictly limits the installation of additional devices. Previous studies on handwriting recognition have generally been based either on two-dimensional images or on algorithms that identify handwritten data entered as vectors. Furthermore, previous studies have focused only on improving the accuracy of handwriting recognition algorithms. A remaining problem is that, when handwriting is actually entered, the user must first select the character class (e.g., upper- or lower-case English, Hangul (the Korean alphabet), or numbers). To solve this problem, this study presents a system that distinguishes languages by analyzing the shape of the entered handwritten characters. The proposed technique treats the handwritten data as sets of vector units; analyzing the correlation and directivity of these vector units makes more efficient language identification possible.
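
The abstract does not say how the directivity of the vector units is measured. As a minimal sketch of one possible approach, the Python below quantizes each segment of a stroke into one of eight direction codes and builds a normalized histogram that a classifier could compare across scripts. The function names, the eight-bin quantization, and the histogram feature are illustrative assumptions, not the authors' method.

```python
import math
from collections import Counter


def direction_codes(points, bins=8):
    """Quantize each segment between consecutive stroke points into a direction code.

    Assumption: `points` is a list of (x, y) pen samples; code 0 points along +x
    and codes increase with the angle returned by atan2.
    """
    codes = []
    sector = 2 * math.pi / bins
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = x1 - x0, y1 - y0
        if dx == 0 and dy == 0:
            continue  # repeated sample: no direction to record
        angle = math.atan2(dy, dx) % (2 * math.pi)
        # Shift by half a sector so each code is centered on its direction.
        codes.append(int((angle + sector / 2) // sector) % bins)
    return codes


def direction_histogram(points, bins=8):
    """Return the relative frequency of each direction code in a stroke."""
    codes = direction_codes(points, bins)
    counts = Counter(codes)
    total = len(codes) or 1  # avoid division by zero for empty strokes
    return [counts.get(b, 0) / total for b in range(bins)]


if __name__ == "__main__":
    stroke = [(0, 0), (1, 0), (1, 1), (0, 1)]  # hypothetical pen samples
    print(direction_codes(stroke))
    print(direction_histogram(stroke))
```

A histogram like this discards stroke order; the correlation between vector units mentioned in the abstract would need an additional feature, such as counts of code-to-code transitions.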

Speech Activity Decision with Lip Movement Image Signals (입술움직임 영상신호를 고려한 음성존재 검출)

  • Park, Jun;Lee, Young-Jik;Kim, Eung-Kyeu;Lee, Soo-Jong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.1
    • /
    • pp.25-31
    • /
    • 2007
  • This paper describes an attempt to prevent external acoustic noise from being misrecognized as a speech recognition target. To this end, the speech activity detection process for speech recognition checks the speaker's lip movement image signal in addition to the acoustic energy. First, successive images are obtained through a PC camera, and the presence or absence of lip movement is determined. The lip movement data is then stored in shared memory and shared with the recognition process. Meanwhile, the speech activity detection process, which is the preprocessing phase of speech recognition, consults the data stored in shared memory to verify whether the acoustic energy comes from the speaker's speech. The speech recognition processor and the image processor were connected and tested successfully. When the speaker faced the camera and spoke, processing proceeded normally to the output of the speech recognition result. When the speaker spoke without facing the camera, no speech recognition result was output. That is, if acoustic energy is input but no lip movement is identified, the input is regarded as acoustic noise.
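
The decision rule in this abstract (acoustic energy counts as speech only when lip movement is also detected) can be sketched in a few lines of Python. The mean-square energy measure, the threshold value, and the function names are assumptions for illustration; the paper's shared-memory exchange between the image and recognition processes is reduced here to a boolean argument.

```python
def frame_energy(samples):
    """Mean-square energy of one audio frame (samples assumed to be in [-1, 1])."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def is_speech(samples, lip_moving, energy_threshold=0.01):
    """Accept a frame as speech only if it is loud enough AND the lips are moving.

    `lip_moving` stands in for the flag the image process would publish;
    `energy_threshold` is a hypothetical value that would need tuning.
    """
    return bool(lip_moving) and frame_energy(samples) > energy_threshold


if __name__ == "__main__":
    loud = [0.5, -0.5, 0.4, -0.4]
    print(is_speech(loud, lip_moving=True))   # energetic frame with lip movement
    print(is_speech(loud, lip_moving=False))  # same energy, treated as noise
```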

Multimedia Tour Contents Service System Employing GPS-based Location Information (GPS 위치정보를 이용한 멀티미디어 관광 콘텐츠 제공 서비스)

  • Kim, Young-Cheol;Kim, Sang-Tae;Cha, Hyeon-Cheol;Kim, Hyun-Deok
    • Korea Society of IT Services: Conference Proceedings (한국IT서비스학회:학술대회논문집)
    • /
    • 2009.11a
    • /
    • pp.55-58
    • /
    • 2009
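  • Location-based services automatically recognize the user's location and provide customized services that take it into account, generally using GPS (Global Positioning System) to acquire location information. In this paper, the vehicle's location is recognized from information received through a GPS receiver and compared with the preset travel route of the tour vehicle, so that the distance and travel time to each tourist site requiring a guide announcement are determined in real time. Based on these results, the multimedia tour content for tourist sites set in advance or in real time is played back as video and audio for the tourists. In particular, even when users who speak different languages use the service at the same time, it is delivered over separate audio channels, and each user selects a channel on their receiver, which improves user convenience.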
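
The abstract does not specify how distance and travel time to each tourist site are computed. A minimal Python sketch under simple assumptions: great-circle (haversine) distance between the GPS fix and the site, and travel time estimated from a constant speed. A real system would measure distance along the preset route rather than in a straight line; the names and the constant-speed model are illustrative only.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (latitude, longitude) points in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def eta_seconds(distance_m, speed_mps):
    """Estimated travel time at constant speed; infinite if the vehicle is not moving."""
    if speed_mps <= 0:
        return float("inf")
    return distance_m / speed_mps


if __name__ == "__main__":
    # Hypothetical vehicle position and tourist-site coordinates.
    d = haversine_m(35.1796, 129.0756, 35.1587, 129.1604)
    print(round(d), "m,", round(eta_seconds(d, 10.0)), "s at 10 m/s")
```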

A feasibility study on new stimulation method in fMRI language examinations using custom designed images (기능적 자기공명영상의 언어기능검사 시 image를 이용한 자극방법의 타당성 연구)

  • Choi, Kwan-Woo;Son, Soon-Yong;Jeong, Mi-Ae;Min, Jung-Whan
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.11
    • /
    • pp.5005-5011
    • /
    • 2011
  • The purpose of this work is to assess the validity of a new stimulation method for cognitive functional imaging that uses custom-designed images corresponding to words or syllables, addressing the shortcomings of the existing text-based method. From March to May 2011, five subjects in need of language-related functional MRI scanning were selected, and scans using the text stimulation method and the image stimulation method were each carried out three times. Data were acquired with the EPI-BOLD technique using a 3.0T Philips MRI machine and Invivo's Eloquence system. Post-processing was performed with SPM 99, and activated signals were determined at the 95 percent confidence level. The number of activation clusters and the activation ratio inside the ROI were compared. As a result, all subjects showed activation inside Broca's area, but this was not statistically significant. In conclusion, the image stimulation method has potential because an image is itself a common means of recognition and can be recognized easily even across a language barrier. This stimulation method could replace the existing scanning method, especially for the elderly, infants, and foreigners who may not fully understand the examination.

A study on the lip shape recognition algorithm using 3-D Model (3차원 모델을 이용한 입모양 인식 알고리즘에 관한 연구)

  • 배철수
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.3 no.1
    • /
    • pp.59-68
    • /
    • 1999
  • Recently, research and development in communication systems has moved toward using voice data together with the speaker's face image to achieve a higher recognition rate than voice data alone. We therefore present a lipreading method for speech image sequences that uses a 3-D facial shape model. The method uses features of the face image such as the degree of lip opening, the movement of the jaw, and the protrusion height of the lips. First, we fit the 3-D face model to the image sequence of the speaking face. Then, to obtain the features, we compute the amount of variation in the fitted 3-D shape model across the image sequence and use it as the recognition parameters. We use intensity gradient values obtained from the variation of the 3-D feature points to separate recognition units within the image sequence. For recognition, we then use a discrete HMM algorithm with multiple observation sequences that fully reflect the variation of the 3-D feature points. In a recognition experiment with 8 Korean vowels and 2 Korean consonants, we obtained a recognition rate of about 80% for the plosives and vowels. Because the recognition experiment on the 10 Korean vowels yielded a high recognition rate, we propose that the feature vector is usable as a visual distinguishing factor.
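
For readers unfamiliar with discrete HMM recognition, the Python below sketches the standard forward algorithm and a classifier that picks the model with the highest likelihood for an observation sequence. It is a generic illustration only: the toy model parameters, the symbol alphabet, and the single-sequence scoring are assumptions, and it does not reproduce the paper's multiple-observation-sequence formulation or its 3-D features.

```python
def forward_likelihood(obs, start, trans, emit):
    """P(obs | model) for a discrete HMM via the forward algorithm.

    obs:   non-empty list of observation symbol indices
    start: start[s]     = probability of starting in state s
    trans: trans[p][s]  = probability of moving from state p to state s
    emit:  emit[s][o]   = probability of emitting symbol o in state s
    """
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][o]
            for s in range(n)
        ]
    return sum(alpha)


def classify(obs, models):
    """Return the name of the model that gives `obs` the highest likelihood."""
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))


if __name__ == "__main__":
    # Two hypothetical single-state models over a two-symbol alphabet.
    models = {
        "A": ([1.0], [[1.0]], [[0.9, 0.1]]),
        "B": ([1.0], [[1.0]], [[0.1, 0.9]]),
    }
    print(classify([0, 0, 1, 0], models))
```

For long sequences the forward probabilities underflow, so practical implementations scale `alpha` at each step or work in log space.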

A Shape Decomposition of Handwritten Hangul Patterns Using Convex Hull (볼록 헐을 이용한 필기 한글 패턴의 모양 분해)

  • 박정선;오일석
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.440-442
    • /
    • 2000
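  • Recognizing handwritten Hangul characters requires analyzing the stroke components that make up a pattern. The thinning methods used to extract stroke components have the drawback of distorting the input image. To overcome this, this paper proposes a method that segments the input image into meaningful parts without distorting it. A meaningful part is a region segmented so as to be nearly convex. The method first applies a convex hull operation to the input image to generate concavity regions. In each concavity region it detects a segmentation anchor point, finds a segmentation terminal point on the contour on the opposite side of the stroke, and builds a segmentation path between them to split the stroke. This process is repeated until every part satisfies the near-convexity condition. The proposed method has only two parameters and consists of simple procedures. It also has the advantage of being applicable not only to handwritten Hangul patterns but to a variety of languages.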
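
The first step described above, applying a convex hull to expose concavities, can be sketched in Python as follows. The code computes the hull of a contour with Andrew's monotone chain algorithm and reports the contour vertices that lie strictly inside the hull, which could serve as candidate anchor points. Working on a polygonal contour rather than a pixel image, and treating interior vertices as anchor candidates, are simplifying assumptions; the paper's actual anchor and terminal point rules are not given in the abstract.

```python
def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive when o -> a -> b turns left."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def strictly_inside(p, hull):
    """True if p lies strictly inside a counter-clockwise convex polygon."""
    n = len(hull)
    return n >= 3 and all(cross(hull[i], hull[(i + 1) % n], p) > 0 for i in range(n))


def concave_vertices(contour):
    """Contour vertices inside the hull, i.e. vertices sitting in a concavity."""
    hull = convex_hull(contour)
    return [p for p in contour if strictly_inside(p, hull)]


if __name__ == "__main__":
    # Hypothetical L-shaped stroke outline with one concave corner.
    l_shape = [(0, 0), (4, 0), (4, 2), (2, 2), (2, 4), (0, 4)]
    print(convex_hull(l_shape))
    print(concave_vertices(l_shape))
```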

Development of Handwritten Form Recognition System for Automated Database Construction (DB 자동 구축을 위한 필기 형식문서 인식 시스템의 개발)

  • Kim, Dong-Jun;Cho, Sung-Jung;Ryu, Sung-Ho;Rhee, Taik-Heon;Kim, Jin-Hyung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.04a
    • /
    • pp.1047-1050
    • /
    • 2000
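  • Form documents have long been widely used as a means of representing and storing information systematically. Recently, systems that convert such form documents into databases have become available. However, most are built on foreign systems and do not adequately reflect the particular characteristics of the domestic (Korean) environment, in which a variety of handwritten characters such as Hangul, English, digits, and Chinese characters are used. As a result, in most cases people still have to enter the data by hand. This paper proposes a system suited to these domestic conditions that combines handwritten character recognizers for multiple languages and automatically enters the information in form documents into a database. By recognizing the image and then verifying the result, the proposed system enters information more efficiently; in addition, it divides the overall work into stages that can run in parallel, which improves throughput.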

Design and Implementation of Face Recognition Security System for ATM based on extracting skin color using Java (Java로 구현한 피부색 추출 기반 ATM 안면 인식 보안 시스템의 설계 및 구현)

  • Kang, Bo-Gyung;Bae, Seok Chan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.04a
    • /
    • pp.373-376
    • /
    • 2010
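  • Crimes in which cash cards or credit cards are stolen, the PIN is obtained, and cash is withdrawn from an ATM (automated teller machine) are increasing; because most offenders cover their faces with sunglasses, glasses, masks, hats, and the like when withdrawing cash, bank CCTV is of almost no help in identifying them. All image processing in this paper is implemented in Java; after a skin color pre-extraction step, the implemented classifier recognizes the positions of the facial features. If an ATM user covers their face with sunglasses, glasses, a mask, or the like, the machine refuses service from the outset, which helps prevent crime. In addition, by storing the card's user information, the time of the attempted service, and a captured image, the system can greatly help in checking an offender's appearance, alibi, and so on; this paper proposes the feasibility of such an ATM face recognition security system.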
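
The abstract does not describe its skin color extraction rule. One commonly used approach, sketched below, converts RGB pixels to the Cb and Cr chroma components (BT.601 full-range conversion) and keeps pixels whose chroma falls inside a fixed box. The threshold ranges are widely cited illustrative values, not the paper's, and the sketch is in Python rather than the Java the authors used; treat every name and number here as an assumption.

```python
def rgb_to_cbcr(r, g, b):
    """Convert 8-bit RGB to the Cb and Cr chroma components (BT.601, full range)."""
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return cb, cr


def is_skin(r, g, b, cb_range=(77, 127), cr_range=(133, 173)):
    """Classify a pixel as skin if its chroma lies inside the assumed Cb/Cr box."""
    cb, cr = rgb_to_cbcr(r, g, b)
    return cb_range[0] <= cb <= cb_range[1] and cr_range[0] <= cr <= cr_range[1]


def skin_mask(pixels):
    """Build a boolean mask from a 2-D list of (r, g, b) tuples."""
    return [[is_skin(*px) for px in row] for row in pixels]


if __name__ == "__main__":
    # Hypothetical 1x3 image: a skin-like tone, pure blue, and white.
    image = [[(220, 170, 140), (0, 0, 255), (255, 255, 255)]]
    print(skin_mask(image))
```

Fixed chroma thresholds are sensitive to lighting and camera white balance, so a deployed system would likely calibrate the ranges or combine them with other cues before locating facial features.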

Building Living Lab for Acquiring Behavioral Data for Early Screening of Developmental Disorders

  • Kim, Jung-Jun;Kwon, Yong-Seop;Kim, Min-Gyu;Kim, Eun-Soo;Kim, Kyung-Ho;Sohn, Dong-Seop
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.8
    • /
    • pp.47-54
    • /
    • 2020
  • Developmental disorders are impairments of the brain and/or central nervous system and refer to disorders of brain function that affect language, communication skills, perception, sociality, and so on. In diagnosing developmental disorders, behavioral responses such as expressing emotions appropriately for the situation are among the observable indicators of whether an individual has a disorder. However, diagnosis by observation allows subjective evaluation, which can lead to erroneous conclusions. This research presents a technological environment and data acquisition system for AI-based screening of autism disorder. The environment was built to accommodate the activities of two screening protocols, namely Autism Diagnostic Observation Schedule (ADOS) and Behavior Development Screening for Toddler (BeDevel). The activities between therapist and baby during screening are fully recorded. The proposed software was designed to support recording, monitoring, and data tagging for training AI algorithms.

A Study on the Instructional Model utilizing Scratch for Introductory Programming Classes of SW-Major Students (SW전공자 프로그래밍 입문 수업의 스크래치 활용 수업 모형 연구)

  • KO, Kwangil
    • Convergence Security Journal
    • /
    • v.18 no.2
    • /
    • pp.59-67
    • /
    • 2018
  • Programming languages are a core area of software education, which is becoming increasingly important in the age of the fourth industrial revolution, but learning them requires mathematical knowledge and logical thinking skills, so many students with weak basic skills at local private universities and colleges have difficulty. This problem sometimes causes SW-major students to lose interest and confidence in their majors during the introductory programming course, leading them to change majors or give up their studies. In this study, we designed an instructional model that uses Scratch to teach C, a typical introductory programming language. To do this, we analyzed which of the programming concepts supported by C can be practiced in Scratch and developed Scratch examples for exercising those concepts. We then designed an instructional model in which programming concepts are first learned through Scratch and C is taught afterward, and conducted an experiment with SW-major freshmen at a local private university to verify the model's effectiveness. As SW education becomes widespread, we expect this study to help the programming language education of security IT students.
