• Title/Summary/Keyword: 텍스트 인식

Search Result 779, Processing Time 0.029 seconds

김욱동 지음 "탈춤의 미학"을 읽고

  • Im, Jae-Hae
    • The Korean Publising Journal, Monthly
    • /
    • s.153
    • /
    • pp.14-14
    • /
    • 1994
  • 오류 하나는 탈춤이 전승되고 연행하는 현장을 보지 않고 채록된 보고서만 텍스트로 삼아서 탈춤의 미학을 밝히겠다는 것이다. 그것은 마치 연극은 보지 않고 희곡만 봐야 연극미학을 제대로 연구할 수 있다는 것이나 다름없다. 오류 둘은 현장론이 무엇인지도 모른 채 다만 현장답사가 곧 현장론이라는 인식의 착각이다.

  • PDF

Speech Recognition based Message Transmission System for the Hearing Impaired Persons (청각장애인을 위한 음성인식 기반 메시지 전송 시스템)

  • Kim, Sung-jin;Cho, Kyoung-woo;Oh, Chang-heon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.22 no.12
    • /
    • pp.1604-1610
    • /
    • 2018
  • The speech recognition service is used as an ancillary means of communication by converting and visualizing the speaker's voice into text to the hearing impaired persons. However, in open environments such as classrooms and conference rooms it is difficult to provide speech recognition service to many hearing impaired persons. For this, a method is needed to efficiently provide it according to the surrounding environment. In this paper, we propose a system that recognizes the speaker's voice and transmits the converted text to many hearing impaired persons as messages. The proposed system uses the MQTT protocol to deliver messages to many users at the same time. The end-to-end delay was measured to confirm the service delay of the proposed system according to the QoS level setting of the MQTT protocol. As a result of the measurement, the delay between the most reliable Qos level 2 and 0 is 111ms, confirming that it does not have a great influence on conversation recognition.

Identification and Recovery of Elided Information for Text Animation (텍스트 애니메이션을 위한 생략 정보 파악 및 복원)

  • Chang, Eun-Young;Park, Jong-C.
    • Annual Conference on Human and Language Technology
    • /
    • 2004.10d
    • /
    • pp.205-213
    • /
    • 2004
  • 음성인식기술을 실제 생활에 적용할 때 발생하는 대표적인 문제로, 인식기의 낮은 인식률로 인한 오동작을 들 수 있다. 본 연구에서는. 텔레뱅킹 도메인에서의 HTK(Hidden Markov Model Toolkit) 연속 음성 인식 시스템과, 최대 엔트로피 기법에 기반한 사용자 발화에서의 핵심이 되는 단어(주로 고유 명사들)들에 대한 인식 신뢰도의 측정 방법을 제시한다. 음향특징과 언어특징들을 모두 고려하여 인식 신뢰도를 구하였으며 인식된 단어들에 대해 오인식 되었음을 약 86%의 정확도로 판단할 수 있음을 확인하였다. 본 인식신뢰도를 이용하여 차후에 음성인식의 확인대화(Clarification Dialog)모델을 개발하는데 활용하고자 한다.

  • PDF

Sign Language Shape Recognition Using SOFM Neural Network (SOFM 신경망을 이용한 수화 형상 인식)

  • Park, Kyung-Woo
    • Journal of Integrative Natural Science
    • /
    • v.3 no.1
    • /
    • pp.38-42
    • /
    • 2010
  • 인간은 정보전달을 위하여 언어 이외에 동작, 표정과 같은 비언어적인 수단을 이용한다. 이러한 비언어적인 수단을 정확히 분석 할 수 있다면 인간과 컴퓨터간의 자연스럽고 지적인 인터페이스를 구축할 수 있게 된다. 본 논문은 별도의 센서를 부착하지 않은 단일 카메라 환경에서 손 형상을 입력정보로 사용하여 손 영역만을 분할한 후 자기 조직화 특징 지도(SOFM: Self Organized Feature Map) 신경망 알고리즘을 이용하여 손 형상을 인식함으로서 수화인식을 위한 보다 안정적이며 강인한 인식 시스템을 구현하고자 한다. 제안 방법으로는 피부색 정보를 이용하여 배경으로부터 손 영역만을 추출한 후 추출된 손 영역의 형상을 인식한다(전처리과정으로 모델이미지의 사이즈와 압축 및 컬러에 대한 정보를 정규화 시켰다). 또한 인식 효율을 높이기 위해 SOFM 신경망 알고리즘을 적용함으로서 보다 안정적으로 손 형상을 인식할 수 있게 되었으며, 손 형상 인식률에 대한 안전성과 정확성을 향상시킬 수 있었다. 그리고 인식된 손 형상의 의미를 텍스트로 보여줌으로서 사용자의 의사를 정확하게 전달할 수 있다.

An Analysis of Incheon's Identity of Place through Movies (영화를 통한 인천의 장소 정체성 분석)

  • Ahn, Chong-Uk
    • Journal of the Korean association of regional geographers
    • /
    • v.11 no.6
    • /
    • pp.501-516
    • /
    • 2005
  • Although Incheon metropolitan city is the third largest city in Korea, it is called 'the gateway to Seoul', 'the second city of port', and 'the satellite city'. The people in Incheon as well as other regions unconsciously recognize this city as 'border' and 'periphery' of Seoul through those expressions. These perceptions also develop a negative sense of place about Incheon. This study starts with analysis about marginal landscape images of Incheon in texts such as movies, stories, and geography textbooks. The represented text as movie has a gap between real space and it. Nevertheless, Its strong point is a making problems clear about recognition of reality. I will inquire the origin of senses of place about Incheon through analysis of represented texts. Moreover, I will present the notion of 'flight' that stands on the basis of Deleuze's Nomadism. Here, 'flight' means that the active subject continuously challenges and reforms the nature of periphery and dependency, the capital, and the economic subordination, and it will have to be new identity and direction of Incheon.

  • PDF

A Named Entity Recognition Model in Criminal Investigation Domain using Pretrained Language Model (사전학습 언어모델을 활용한 범죄수사 도메인 개체명 인식)

  • Kim, Hee-Dou;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.2
    • /
    • pp.13-20
    • /
    • 2022
  • This study is to develop a named entity recognition model specialized in criminal investigation domains using deep learning techniques. Through this study, we propose a system that can contribute to analysis of crime for prevention and investigation using data analysis techniques in the future by automatically extracting and categorizing crime-related information from text-based data such as criminal judgments and investigation documents. For this study, the criminal investigation domain text was collected and the required entity name was newly defined from the perspective of criminal analysis. In addition, the proposed model applying KoELECTRA, a pre-trained language model that has recently shown high performance in natural language processing, shows performance of micro average(referred to as micro avg) F1-score 98% and macro average(referred to as macro avg) F1-score 95% in 9 main categories of crime domain NER experiment data, and micro avg F1-score 98% and macro avg F1-score 62% in 56 sub categories. The proposed model is analyzed from the perspective of future improvement and utilization.

Study on Participants' Perceptions of Sharing Economy Policies: A Text Ming Approach to Online Community Posts (공유경제 참여자의 비즈니스 등록정책에 대한 인식과 심적기재: 온라인 발화에 대한 텍스트마이닝)

  • Park, Soo Kyung
    • Journal of Digital Convergence
    • /
    • v.20 no.2
    • /
    • pp.47-56
    • /
    • 2022
  • With the advent of online platforms, individuals have been able to trade small resources, such as a room, in the market. However, as there is no clear regulation on these economic activities, various side effects have emerged. Accordingly, the government reestablished related policies to resolve the unintended consequences of these economic activities. However, the policy has not been implemented yet, and many participants do not comply with the policy. Therefore, this study intends to examine their perceptions in detail. For this purpose, a text mining technique was applied. Posts and comments from major online communities were collected. By applying the topic modeling technique, 5 topics were derived. Compliance with the government's policy is a voluntary decision. Therefore, it is necessary to carry out an in-depth understanding of the policy target. Therefore, based on this study, it is expected that in the future, methods to induce them to conform to policy can be discussed in detail.

Social perception of the Arduino lecture as seen in big data (빅데이터 분석을 통한 아두이노 강의에 대한 사회적 인식)

  • Lee, Eunsang
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.6
    • /
    • pp.935-945
    • /
    • 2021
  • The purpose of this study is to analyze the social perception of Arduino lecture using big data analysis method. For this purpose, data from January 2012 to May 2021 were collected using the Textom website as a keyword searched for 'arduino + lecture' in blogs, cafes, and news channels of NAVER website. The collected data was refined using the Textom website, and text mining analysis and semantic network analysis were performed by opening the Textom website, Ucinet 6, and Netdraw programs. As a result of text mining analysis such as frequency analysis, TF-IDF analysis, and degree centrality it was confirmed that 'education' and 'coding' were the top keywords. As a result of CONCOR analysis for semantic network analysis, four clusters can be identified: 'Arduino-related education', 'Physical computing-related lecture', 'Arduino special lecture', and 'GUI programming'. Through this study, it was possible to confirm various meaningful social perceptions of the general public in relation to Arduino lecture on the Internet. The results of this study will be used as data that provides meaningful implications for instructors preparing for Arduino lectures, researchers studying the subject, and policy makers who establish software education or coding education and related policies.

Implementation of Artificial Intelligence Speech Recognition Text Repository for Elementary Career Counseling (초등 진로 상담을 위한 인공지능 음성 인식 텍스트 레포지토리 구현)

  • Yu, Minjeong;Ma, Youngji;Koo, Dukhoi
    • 한국정보교육학회:학술대회논문집
    • /
    • 2021.08a
    • /
    • pp.327-333
    • /
    • 2021
  • Currently development of the Artificial Intelligence technology is rapidly progressing in the era of the Fourth Industrial Revolution. The government is trying to improve the education of Artificial Intelligence and cultivating human resources. However there are very few cases where A.I technology is actually used in public education classes. Therefore we designed a text repository by implementing A.I speech recognition to provide career counseling for elementary school students. In the meantime, there have been many difficulties in giving advance consultations required for students' career counseling. In this study we suggested A.I speech recognition technology which can solve addressed problem and we planned various ways to make the program more educational. To conclude we expect A.I technology implemented in this study provides effective solution to career counseling.

  • PDF

Scene Text Extraction in Natural Images using Hierarchical Feature Combination and Verification (계층적 특징 결합 및 검증을 이용한 자연이미지에서의 장면 텍스트 추출)

  • 최영우;김길천;송영자;배경숙;조연희;노명철;이성환;변혜란
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.4
    • /
    • pp.420-438
    • /
    • 2004
  • Artificially or naturally contained texts in the natural images have significant and detailed information about the scenes. If we develop a method that can extract and recognize those texts in real-time, the method can be applied to many important applications. In this paper, we suggest a new method that extracts the text areas in the natural images using the low-level image features of color continuity. gray-level variation and color valiance and that verifies the extracted candidate regions by using the high-level text feature such as stroke. And the two level features are combined hierarchically. The color continuity is used since most of the characters in the same text lesion have the same color, and the gray-level variation is used since the text strokes are distinctive in their gray-values to the background. Also, the color variance is used since the text strokes are distinctive in their gray-values to the background, and this value is more sensitive than the gray-level variations. The text level stroke features are extracted using a multi-resolution wavelet transforms on the local image areas and the feature vectors are input to a SVM(Support Vector Machine) classifier for the verification. We have tested the proposed method using various kinds of the natural images and have confirmed that the extraction rates are very high even in complex background images.