• Title/Summary/Keyword: visual and digital processing

Search Result 187, Processing Time 0.025 seconds

Improvement of Photogrammetry Image Merging in Satellite Image Processing (인공위성 영상처리를 위한 사진접합정확도 향상기법)

  • Kang, In-Joon;Choi, Chul-Ung
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.2 no.1 s.3
    • /
    • pp.93-98
    • /
    • 1994
  • This image of Kangseogu in Pusan, is a digital merge of aerial photos by scale of 1/1,200 map. The merge was carried out 2nd affine and bilinear interpolation. It can improve digital classification to help choose training sites and interprete classification results, and improve visual interpretation, as in this case, by adding detailed information to the multispectral TM data.

  • PDF

Efficient Object Classification Scheme for Scanned Educational Book Image (교육용 도서 영상을 위한 효과적인 객체 자동 분류 기술)

  • Choi, Young-Ju;Kim, Ji-Hae;Lee, Young-Woon;Lee, Jong-Hyeok;Hong, Gwang-Soo;Kim, Byung-Gyu
    • Journal of Digital Contents Society
    • /
    • v.18 no.7
    • /
    • pp.1323-1331
    • /
    • 2017
  • Despite the fact that the copyright has grown into a large-scale business, there are many constant problems especially in image copyright. In this study, we propose an automatic object extraction and classification system for the scanned educational book image by combining document image processing and intelligent information technology like deep learning. First, the proposed technology removes noise component and then performs a visual attention assessment-based region separation. Then we carry out grouping operation based on extracted block areas and categorize each block as a picture or a character area. Finally, the caption area is extracted by searching around the classified picture area. As a result of the performance evaluation, it can be seen an average accuracy of 83% in the extraction of the image and caption area. For only image region detection, up-to 97% of accuracy is verified.

Lip Reading Method Using CNN for Utterance Period Detection (발화구간 검출을 위해 학습된 CNN 기반 입 모양 인식 방법)

  • Kim, Yong-Ki;Lim, Jong Gwan;Kim, Mi-Hye
    • Journal of Digital Convergence
    • /
    • v.14 no.8
    • /
    • pp.233-243
    • /
    • 2016
  • Due to speech recognition problems in noisy environment, Audio Visual Speech Recognition (AVSR) system, which combines speech information and visual information, has been proposed since the mid-1990s,. and lip reading have played significant role in the AVSR System. This study aims to enhance recognition rate of utterance word using only lip shape detection for efficient AVSR system. After preprocessing for lip region detection, Convolution Neural Network (CNN) techniques are applied for utterance period detection and lip shape feature vector extraction, and Hidden Markov Models (HMMs) are then used for the recognition. As a result, the utterance period detection results show 91% of success rates, which are higher performance than general threshold methods. In the lip reading recognition, while user-dependent experiment records 88.5%, user-independent experiment shows 80.2% of recognition rates, which are improved results compared to the previous studies.

Edge Extraction Method Based on Color Image Model (컬러 영상 모델에 기반한 에지 추출기법)

  • Kim Tae-Eun
    • Journal of Digital Contents Society
    • /
    • v.4 no.1
    • /
    • pp.11-21
    • /
    • 2003
  • In computer vision, the goal of stereopsis is to determine the surface structure of real world form two or more perspective views of scene. It is similar to human visual system. We can avoid obstacles, recognize objects, and manipulate machine using three-dimensional information. Until recently, only gray-level images have been used as input to computation for depth determination, but the availability of color can further enhance the performance of computational stereopsis. There are many models to provide efficient color system. The simplest model, RGB model treats color as if it were composed of separate entities. Each color channel is processed individually by the same stereopsis module as used in the gray-level model. His Model decouples intensity component from color information. So it can deal with color properties without defect intensity information. Opponent color model is based on human visual system. In this model, the red-green-blue colors are combined into three opponent channels before further processing.

  • PDF

Depth estimation for surface-breaking cracks in steel-fiber reinforced concrete using ultrasonic surface waves

  • Ahmet S. Kirlangic;Zafer Iscan
    • Structural Monitoring and Maintenance
    • /
    • v.9 no.4
    • /
    • pp.373-388
    • /
    • 2022
  • A USW based diagnostic procedure is presented for estimating the depth of surface-breaking cracks. The diagnosis is demonstrated on seven lab-scale SFRC beam specimens, which are subjected to the CMOD controlled three-point bending test to create real bending cracks. Then, the recorded multiple ultrasonic signals are examined with the signal processing techniques, including wavelet transform and two-dimensional Fourier transform, to investigate the relationships between the crack depth and two diagnostic indices, namely the attenuation coefficient and dispersion index (DI). Finally, the reliabilities of these indices for depth estimation are verified with the visually measured crack depths as well as the crack features obtained with a digital image processing algorithm. It is found that the DI outperforms the attenuation coefficient in depth estimation, where this index displays good agreement with the visual inspection for 86% of the inspected specimens.

Voice Creator: A Vocal Customization Web Application Prototype (Voice Creator: 개인 맞춤형 목소리 생성 웹 어플리케이션 프로토타입)

  • Byeon, Hyeon Jeong;Yeo, Soohyun;Oh, Uran
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.567-569
    • /
    • 2021
  • Due to the important role of avatars in computer-mediated communication (CMC), a growing number of CMC-based services now support avatar customization options. However, in many cases, customization and personalization options are limited to visual features. In this paper, we propose and describe a prototype for a vocal customization web application. Titled Voice Creator, the app is designed for both able-bodied and speech- or hearing-impaired users who seek to communicate anonymously using digital voice identities.

A Study on Development of Responsive Web and Hybrid App using Bootstrap (부트스트랩을 이용한 반응형 웹 및 하이브리드 앱 개발)

  • Heo, Neung-Ho;Kim, Hyeong-Eun;Choi, Byung-Jun;Kim, Young-Jin;Kim, Yong-Hoon;Suh, Tae-Weon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.10a
    • /
    • pp.738-741
    • /
    • 2016
  • 다양한 크기의 모바일 기기가 등장함에 따라 기존 웹 어플리케이션으로는 개발자가 의도한대로 레이아웃을 구성하기 어려우며 네이티브 어플리케이션은 운영체제별로 각각 개발해야하기 때문에 많은 개발 시간을 필요로 한다는 단점이 있다. 본 논문에서는 부트스트랩을 이용하여 기기의 화면 크기에 가변적인 반응형 웹을 구축하고 모바일 운영체제의 웹뷰를 이용한 하이브리드 앱을 구현함으로써 개발과정 및 개발 시간 단축을 검증하였다.

Studying the Viewers' Acceptability on the Image Resolutions and Assessing the ROI-Based Scheme for Mobile Displays (이동형 단말기에서의 축구경기 시청을 위한 해상도 및 관심 영역 크기에 관한 사용자 만족도 조사)

  • Ko Jae-Seung;Ahn Il-Koo;Lee Jae-Ho;Seo Ki-Won;Kwon Jae-Hoon;Joo Young-Hun;Oh Yun-Je;Kim Chang-Ick
    • Journal of Broadcast Engineering
    • /
    • v.11 no.3 s.32
    • /
    • pp.336-348
    • /
    • 2006
  • The recent advances in multimedia signal coding and transmission technologies allow lots of users to watch videos on small LCD displays. In this paper, we briefly describe an intelligent display technique to provide small-display-viewers with comfortable experiences, and study the minimum image size tolerated and utility of displaying region of interest (ROI) only when needed. The study, with 111 participants, examines minimum image size to ensure viewers pleasant viewing experiences, and evaluates the degree of satisfaction when they are viewed with region of interest (ROI) only. The experimental results show that the ROI display enhances the viewers' satisfaction when the image size becomes less than $320{\times}240$, and thus it is useful to provide the intelligent display, if necessary, which can extract and display ROI only.

Efficient Browsing Method based on Metadata of Video Contents (동영상 컨텐츠의 메타데이타에 기반한 효율적인 브라우징 기법)

  • Chun, Soo-Duck;Shin, Jung-Hoon;Lee, Sang-Jun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.5
    • /
    • pp.513-518
    • /
    • 2010
  • The advancement of information technology along with the proliferation of communication and multimedia has increased the demand of digital contents. Video data of digital contents such as VOD, NOD, Digital Library, IPTV, and UCC are getting more permeated in various application fields. Video data have sequential characteristic besides providing the spatial and temporal information in its 3D format, making searching or browsing ineffective due to long turnaround time. In this paper, we suggest ATVC(Authoring Tool for Video Contents) for solving this issue. ATVC is a video editing tool that detects key frames using visual rhythm and insert metadata such as keywords into key frames via XML tagging. Visual rhythm is applied to map 3D spatial and temporal information to 2D information. Its processing speed is fast because it can get pixel information without IDCT, and it can classify edit-effects such as cut, wipe, and dissolve. Since XML data save key frame information via XML tag and keyword information, it can furnish efficient browsing.

A Study on the Web Application for Sailing Ship Location Information interface based by RIA (RIA기반의 선박항해정보를 위한 웹 애플리케이션 구축 "평택항 원양어선 항해정보현황 사례를 중심으로")

  • Jung, Hoe-Jun;Park, Dea-Woo;Han, Kyung-Don
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.613-616
    • /
    • 2009
  • Information present condition is using situation board by manual processing that is consisted of ship arrangement plan and letterpress and magnet etc. in Pyeongtaekhang's deep-sea fishing vessel company. Study that mark open sea far from land ship information of underway 37 ships that is accepted in every time in internet web application environment that is based on Ubiquitous Network in PC that is linked to internet. 3 through practical use of RIA of Flash technology base compose Digital Dash-Board in width grid structure only and do ship sailing addition that is operating in 6 oceans and latitude, hardness indication as well as various informations to do visual display do. Emphasized in dynamic Web Application construction because can heighten the convenience to operator and user, and take advantage of real time data.

  • PDF