• 제목/요약/키워드: Image Recognition Technologies

검색결과 159건 처리시간 0.026초

딥러닝 기반 고성능 얼굴인식 기술 동향 (Research Trends for Deep Learning-Based High-Performance Face Recognition Technology)

  • 김형일;문진영;박종열
    • 전자통신동향분석
    • /
    • 제33권4호
    • /
    • pp.43-53
    • /
    • 2018
  • As face recognition (FR) has been well studied over the past decades, FR technology has been applied to many real-world applications such as surveillance and biometric systems. However, in the real-world scenarios, FR performances have been known to be significantly degraded owing to variations in face images, such as the pose, illumination, and low-resolution. Recently, visual intelligence technology has been rapidly growing owing to advances in deep learning, which has also improved the FR performance. Furthermore, the FR performance based on deep learning has been reported to surpass the performance level of human perception. In this article, we discuss deep-learning based high-performance FR technologies in terms of representative deep-learning based FR architectures and recent FR algorithms robust to face image variations (i.e., pose-robust FR, illumination-robust FR, and video FR). In addition, we investigate big face image datasets widely adopted for performance evaluations of the most recent deep-learning based FR algorithms.

기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항 (Considerations for Applying Korean Natural Language Processing Technology in Records Management)

  • 김학래
    • 한국기록관리학회지
    • /
    • 제22권4호
    • /
    • pp.129-149
    • /
    • 2022
  • 기록물은 과거와 현재를 포함하는 시간적 특성, 특정 언어에 제한되지 않는 언어적 특성, 기록물이 갖고 있는 다양한 유형을 복합적으로 갖고 있다. 기록물의 생성, 보존, 활용에 이르는 생애주기에서 텍스트, 영상, 음성으로 구성된 데이터의 처리는 많은 노력과 비용을 수반한다. 기계번역, 문서요약, 개체명 인식, 이미지 인식 등 자연어 처리 분야의 주요 기술은 전자기록과 아날로그 형태의 디지털화에 광범위하게 적용할 수 있다. 특히, 딥러닝 기술이 적용된 한국어 자연어 처리 분야는 다양한 형식의 기록물을 인식하고, 기록관리 메타데이터를 생성하는데 효과적이다. 본 논문은 한국어 자연어 처리를 기술을 소개하고, 기록 관리 분야에서 자연어 처리 기술을 적용하기 위한 고려사항을 논의한다. 기계번역, 광학문자인식과 같은 자연어 처리 기술이 기록물의 디지털 변환에 적용되는 과정은 파이썬 환경에서 구현한 사례로 소개한다. 한편, 자연어 처리 기술의 활용을 위해 기록관리 분야에서 자연어 처리 기술을 적용하기 위한 환경적 요소와 기록물의 디지털화 지침을 개선하기 위한 방안을 제안한다.

A Study on Image Labeling Technique for Deep-Learning-Based Multinational Tanks Detection Model

  • Kim, Taehoon;Lim, Dongkyun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제14권4호
    • /
    • pp.58-63
    • /
    • 2022
  • Recently, the improvement of computational processing ability due to the rapid development of computing technology has greatly advanced the field of artificial intelligence, and research to apply it in various domains is active. In particular, in the national defense field, attention is paid to intelligent recognition among machine learning techniques, and efforts are being made to develop object identification and monitoring systems using artificial intelligence. To this end, various image processing technologies and object identification algorithms are applied to create a model that can identify friendly and enemy weapon systems and personnel in real-time. In this paper, we conducted image processing and object identification focused on tanks among various weapon systems. We initially conducted processing the tanks' image using a convolutional neural network, a deep learning technique. The feature map was examined and the important characteristics of the tanks crucial for learning were derived. Then, using YOLOv5 Network, a CNN-based object detection network, a model trained by labeling the entire tank and a model trained by labeling only the turret of the tank were created and the results were compared. The model and labeling technique we proposed in this paper can more accurately identify the type of tank and contribute to the intelligent recognition system to be developed in the future.

영상처리 기반의 운전자 중심 정보처리 기술 개발 (A Driving Information Centric Information Processing Technology Development Based on Image Processing)

  • 양승훈;홍광수;김병규
    • 융합보안논문지
    • /
    • 제12권6호
    • /
    • pp.31-37
    • /
    • 2012
  • 오늘날 자동차 기술의 핵심은 IT 기반 융합 시스템기술로 변화하고 있다. 다양한 IT 기술을 접목하여 운전 중 다양한 상황에 대응하고 또한 운전자의 편의성을 지원하는 기술적 추세를 보이고 있다. 본 논문에서는 운전자의 안전성과 편의성을 증대하기 위해 영상 정보를 기반으로 도로 정보를 검출해 운전자에게 알려주고, 버튼을 직접 손으로 눌러야 하는 물리적 인터페이스를 대체할 비접촉식 인터페이스 기술을 융합한 Augmented Driving System (ADS) 기술을 제안한다. 본 기술은 카메라로부터 입력 받은 영상 정보를 제안된 알고리즘을 통해 앞차와의 거리, 차선, 교통 표지판을 검출하고 차량 내부를 주시하는 카메라와 운전자의 음성을 인식할 마이크를 기반으로 기본 음성인식과 동작인식이 융합된 인터페이스 기술을 제공한다. 이러한 요소 기술들은 운전자가 인지하지 못하더라도 운전자에게 현재의 주행상황을 인지하여 자동으로 알려줌으로써 교통사고 확률을 크게 낮출 수 있을 것이며, 또한 다양한 운전 중 기능 조작을 편리하게 지원함으로써 운전자의 전방 주시에 도움을 줄 수 있다. 본 논문에서 개발된 기술을 통해 테스트를 실시해 본 결과 표지판인식, 차선검출, 앞차와의 거리 검출 등의 인식률이 약 90% 이상이 되었다.

공간분석을 위한 정량적 분석 모델에 관한 연구 - 이미지 영상처리와 설문조사 데이터의 다중 회귀분석을 중심으로 - (A Study on Quantitative Analysis Model for Space Analysis - Focused on a Digital Image Processing and Multiple Regression Analysis of Recognition Amount -)

  • 이혁준
    • 한국실내디자인학회논문집
    • /
    • 제14권2호
    • /
    • pp.217-224
    • /
    • 2005
  • The lack of objective decisive criteria and the absence of analyzing tools accrued from the experiments on various types developed from space design process makes it difficult to select and execute alternatives for them. As an attempt of coping with these problems, the aims of this study is to establish space analysis' models and to propose possibility of analyzing models by utilizing the technology of image process. It is now under study in the field of artificial intelligence based on the accomplishment of digital images. This study focused on establishment an analysis model based on accomplished digital images and image processing framework. It helps utilize various processing technologies that are currently in use of image processes, and problems of the study can be supplemented through further follow-up studies. Finally, analysis model can be constructed gradually huge design data in the analogue data to the digital image database and be proposed with index in design or evaluation step.

개량 Douglas-Peucker 알고리즘 기반 고속 Shape Matching 알고리즘 (Fast Shape Matching Algorithm Based on the Improved Douglas-Peucker Algorithm)

  • 심명섭;곽주현;이창훈
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제5권10호
    • /
    • pp.497-502
    • /
    • 2016
  • Shape Contexts Recognition(SCR)은 도형이나 사물 등의 모양을 인식하는 기술로 문자인식, 모션인식, 얼굴인식, 상황인식 등의 기반이 되는 기술이다. 하지만 일반적인 SCR은 Shape의 모든 contour에 대해 히스토그램을 만들고 Shape A, B 비교를 위해 추출된 contour를 1:1 개수대로 매핑함으로써 처리속도가 느리다는 단점이 있다. 따라서 본 논문에서는 Shape 모양에 따라 윤곽선을 찾고 개량 DP 알고리즘 및 해리스코너 검출기를 이용하여 contour를 최적화시킴으로써 간략하면서도 더 효과적인 알고리즘을 만들었다. 이렇게 개선된 방법을 사용함으로써 기존방법보다 처리 수행속도가 빨라짐을 확인하였다.

A Memory-efficient Hand Segmentation Architecture for Hand Gesture Recognition in Low-power Mobile Devices

  • Choi, Sungpill;Park, Seongwook;Yoo, Hoi-Jun
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • 제17권3호
    • /
    • pp.473-482
    • /
    • 2017
  • Hand gesture recognition is regarded as new Human Computer Interaction (HCI) technologies for the next generation of mobile devices. Previous hand gesture implementation requires a large memory and computation power for hand segmentation, which fails to give real-time interaction with mobile devices to users. Therefore, in this paper, we presents a low latency and memory-efficient hand segmentation architecture for natural hand gesture recognition. To obtain both high memory-efficiency and low latency, we propose a streaming hand contour tracing unit and a fast contour filling unit. As a result, it achieves 7.14 ms latency with only 34.8 KB on-chip memory, which are 1.65 times less latency and 1.68 times less on-chip memory, respectively, compare to the best-in-class.

Multi-touch Detection Technology Using a Divergence IR Beam Profile for Large LCD Touch Solutions

  • Lee, Young-Joon;Lee, Won-Suk;Pushchin, Victor;Song, Moon-Bong
    • Journal of Information Display
    • /
    • 제11권4호
    • /
    • pp.169-172
    • /
    • 2010
  • This paper proposes a multi-touch detection technology that can be applied to large LCDs. To achieve this goal, a set of IR LEDs and sensors was used to construct an IR matrix, and a new algorithm based on Hough transform was applied. This approach reduced the "Ghost" response of the multi-touch detection technology to make it better than other IR touch recognition technologies, and showed robust performance in terms of multi-touch recognition.

Image Enhancement for Two-dimension bar code PDF417

  • Park, Ji-Hue;Woo, Hong-Chae
    • 한국정보기술응용학회:학술대회논문집
    • /
    • 한국정보기술응용학회 2005년도 6th 2005 International Conference on Computers, Communications and System
    • /
    • pp.69-72
    • /
    • 2005
  • As life style becomes to be complicated, lots of support technologies were developed. The bar code technology is one of them. It was renovating approach to goods industry. However, data storage ability in one dimension bar code came in limit because of industry growth. Two-dimension bar code was proposed to overcome one-dimension bar code. PDF417 bar code most commonly used in standard two-dimension bar codes is well defined at data decoding and error correction area. More works could be done in bar code image acquisition process. Applying various image enhancement algorithms, the recognition rate of PDF417 bar code is improved.

  • PDF

Medical Diagnosis Algorithm Based on Tongue Image on Mobile Device

  • Zhou, Zibo;Peng, Dongliang;Gao, Fumeng;Leng, Lu
    • Journal of Multimedia Information System
    • /
    • 제6권2호
    • /
    • pp.99-106
    • /
    • 2019
  • In traditional Chinese medical (TCM) science, tongue images can be observed for medical diagnosis; however, the tongue diagnosis of TCM is influenced by the subjective factors of doctors, and the diagnosis results vary from person to person. Quantitative TCM tongue diagnosis can improve the accuracy of diagnosis and increase the application value. In this paper, digital image processing and pattern recognition technologies are employed on mobile device to classify tongue images collected in different health states. First, through grayscale integral projection processing, the trough is found to localize the tongue body. Then the tongue body image is transferred from RGB color space to HSV color space, and the average H and S values are considered as the color features. Finally, the diagnosis results are obtained according to the relationship between the color characteristics and physical symptoms.