• Title/Summary/Keyword: Image/Video Processing

Image Restoration Filter for Preserving High Frequency Components in Impulse Noise Environments (임펄스 잡음 환경에서 고주파 성분을 보존하기 위한 영상 복원 필터)

  • Cheon, Bong-Won;Kim, Nam-Ho
    • Journal of the Korea Institute of Information and Communication Engineering / v.23 no.4 / pp.394-400 / 2019
  • Noise removal is a required step in digital video processing, and many studies have developed algorithms suited to particular purposes and environments. However, existing impulse noise removal methods perform poorly at removing noise in edge and high-frequency regions. This study therefore extends the mask range according to noise density when identifying noise, so that high-frequency components are preserved. After impulse noise is removed, a pixel range is set from the median and standard deviation of the pixels inside the mask. The pixels within that range are then weighted and summed to produce the final output value. The proposed algorithm removes noise better than existing methods in regions with many edges and high-frequency components, and the methods are compared through simulation.
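
The steps summarized in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the impulse values (0 and 255), the mask growth rule (`max_half`, `min_valid`), the range factor `k`, and the inverse-distance weights are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import statistics

NOISE_VALUES = (0, 255)  # assumption: salt-and-pepper impulses at the extremes


def restore_pixel(img, r, c, max_half=3, k=1.0, min_valid=3):
    """Estimate one noisy pixel from the non-noise pixels around it."""
    h, w = len(img), len(img[0])
    valid = []
    for half in range(1, max_half + 1):  # grow the mask: 3x3, 5x5, 7x7
        valid = [
            (img[y][x], abs(y - r) + abs(x - c))  # (value, city-block distance)
            for y in range(max(0, r - half), min(h, r + half + 1))
            for x in range(max(0, c - half), min(w, c + half + 1))
            if (y, x) != (r, c) and img[y][x] not in NOISE_VALUES
        ]
        if len(valid) >= min_valid:  # enough clean pixels for this noise density
            break
    if not valid:
        return img[r][c]  # nothing to estimate from; leave the pixel unchanged
    values = [v for v, _ in valid]
    med = statistics.median(values)
    sd = statistics.pstdev(values)
    # Keep only pixels within median +/- k * std (fall back to all clean pixels).
    kept = [(v, d) for v, d in valid if abs(v - med) <= k * sd] or valid
    # Hypothetical weighting: closer pixels count more (inverse distance).
    weight_sum = sum(1.0 / d for _, d in kept)
    return sum(v / d for v, d in kept) / weight_sum


def restore_image(img, **kwargs):
    """Replace only the pixels flagged as impulse noise."""
    return [
        [restore_pixel(img, r, c, **kwargs) if img[r][c] in NOISE_VALUES else img[r][c]
         for c in range(len(img[0]))]
        for r in range(len(img))
    ]
```

Restricting the average to pixels near the median is what keeps an edge from being blurred: pixels from the other side of the edge fall outside the range and do not contribute.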

Design and Implementation of Early Warning Monitoring System for Cross-border Mining in Open-pit Mines (노천광산의 월경 채굴 조기경보 모니터링시스템의 설계 및 구현)

  • Li Ke;Byung-Won Min
    • Journal of Internet of Things and Convergence / v.10 no.2 / pp.25-41 / 2024
  • In China, open-pit mining is currently supervised mainly through periodic manual verification assisted by video surveillance, which requires continuous labor costs and has poor timeliness. To address this early warning and monitoring problem, this paper studies a spatialized algorithmic model and designs an early warning system for cross-border mining in open-pit mines. The system calculates the coordinates of the mining and extraction equipment and compares them in real time with the layer coordinates of the mine's approved range, thereby determining whether cross-border mining is occurring. A field experiment in the Pingxiang area of Jiangxi Province shows that the system runs stably and reliably and that its target tracking accuracy is high. The system can effectively improve early warning of open-pit mines overstepping their boundaries, improve the timeliness and accuracy of mine supervision, and reduce supervision costs.
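
The comparison of equipment coordinates against the approved range can be sketched as a point-in-polygon test. The abstract does not describe the paper's spatial model, so the following is only an assumed illustration using the standard ray-casting test on planar (already projected) coordinates; the function names and equipment ids are hypothetical.

```python
def inside_boundary(point, polygon):
    """Ray-casting test: True if the point lies inside the polygon."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does a horizontal ray from the point cross this edge?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def equipment_outside(positions, approved_area):
    """Return the ids of equipment located outside the approved mining area."""
    return [eid for eid, pos in positions.items()
            if not inside_boundary(pos, approved_area)]


# Hypothetical example: a square approved area and two tracked machines.
approved = [(0, 0), (10, 0), (10, 10), (0, 10)]
tracked = {"excavator-1": (5, 5), "truck-7": (12, 5)}
alerts = equipment_outside(tracked, approved)  # ids that should raise a warning
```

A real deployment would also need to handle coordinate projection and positioning error near the boundary, which this sketch ignores.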

Research on depth information based object-tracking and stage size estimation for immersive audio panning (이머시브 오디오 패닝을 위한 깊이 정보 기반 객체 추적 및 무대 크기 예측에 관한 연구)

  • Kangeun Lee;Hongjun Park;Sungyoung Kim
    • The Journal of the Acoustical Society of Korea / v.43 no.5 / pp.529-535 / 2024
  • This paper presents our research on automatic audio panning for media content production. Previously, audio objects were tracked manually. With the advent of the immersive audio era, the need for an automatic audio panning system has increased, yet no substantial research has been conducted to date. We therefore propose a computer vision-based human tracking and depth feature processing system that derives depth features from 2-dimensional coordinates and models a 3-dimensional view transformation for automatic audio panning, ensuring audiovisual congruence. The system also applies a stage size estimation model that takes an image as input and estimates the stage width and depth in meters. Because the system estimates stage sizes and applies them directly to the view transformation, no additional depth data training is required. To validate the proposed system, we conducted a pilot test with a Unity-based sample video. We expect the system to enable automated audio panning and to assist many audio engineers.

Multimedia Network Teaching System based on SMIL (SMIL을 기반으로 한 멀티미디어 네트워크 교육시스템)

  • Yu, Lei;Cao, Ke-Rang;Bang, Jin-Suk;Cho, Tae-Beom;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2008.10a / pp.524-527 / 2008
  • Recently, digital technology and the Internet have spread worldwide, and with the development of multimedia processing and information and communication technology, demand for Internet-based education is rapidly increasing. Information can also be used easily, with fewer restrictions of time and space. However, demand is also growing for ways to integrate and represent several kinds of multimedia data, such as audio and other media. In 1998, the W3C therefore presented an international standard, SMIL, to solve multimedia object representation and synchronization problems. With SMIL, various multimedia elements can be integrated into a multimedia document with a proper layout in space and time. Such a SMIL document makes it possible to create a new Internet radio broadcasting service that delivers not only audio data but also text, images, and video. With the system presented in this paper, teachers can easily create multimedia courseware and broadcast their lectures live over the network, and students can receive the teacher's audio and video along with the screen display of the teacher's computer. Students can also communicate with the teacher in real time through text editor windows and can order courseware after class.

Threat Situation Determination System Through AWS-Based Behavior and Object Recognition (AWS 기반 행위와 객체 인식을 통한 위협 상황 판단 시스템)

  • Ye-Young Kim;Su-Hyun Jeong;So-Hyun Park;Young-Ho Park
    • KIPS Transactions on Software and Data Engineering / v.12 no.4 / pp.189-198 / 2023
  • As street crime occurs frequently, CCTV deployment is increasing. However, because of the shortcomings of passively operated CCTV, the need for intelligent CCTV is attracting attention. Such intelligent CCTV systems are heavy and require high-performance devices, which makes replacing ordinary CCTV expensive. Solving this problem requires an intelligent CCTV system that recognizes low-quality images and runs even on low-performance devices. This paper therefore proposes the Saying CCTV system, which detects threats in real time by using the AWS cloud platform to lighten the system and by converting images into text. Based on data extracted with YOLO v4 and OpenPose, it determines the risk object, threat behavior, and threat situation, and calculates the risk using machine learning. As a result, the system can be operated anytime and anywhere as long as a network connection is available, and it can be used even with devices that have only the minimal performance needed for video capture and image upload. Furthermore, by analyzing the video and using the data stored as text, it can automate meaningful crime statistics and help prevent crime quickly.

3D Facial Animation with Head Motion Estimation and Facial Expression Cloning (얼굴 모션 추정과 표정 복제에 의한 3차원 얼굴 애니메이션)

  • Kwon, Oh-Ryun;Chun, Jun-Chul
    • The KIPS Transactions:PartB / v.14B no.4 / pp.311-320 / 2007
  • This paper presents a vision-based 3D facial expression animation technique and system that provide robust 3D head pose estimation and real-time facial expression control. Much research on 3D face animation has addressed facial expression control itself rather than 3D head motion tracking. However, head motion tracking is one of the critical issues to be solved in developing realistic facial animation. In this research, we developed an integrated animation system that performs 3D head motion tracking and facial expression control at the same time. The proposed system consists of three major phases: face detection, 3D head motion tracking, and facial expression control. For face detection, we use the non-parametric HT skin color model and template matching to detect the facial region efficiently from a video frame. For 3D head motion tracking, we exploit a cylindrical head model that is projected onto the initial head motion template. Given an initial reference template of the face image and the corresponding head motion, the cylindrical head model is created and the full head motion is traced using the optical flow method. For facial expression cloning, we use a feature-based method. The major facial feature points are detected from the geometric information of the face with template matching and traced by optical flow. Since the locations of the varying feature points combine head motion and facial expression information, the animation parameters that describe the variation of the facial features are acquired from the geometrically transformed frontal head pose image. Finally, facial expression cloning is done in two fitting processes: the control points of the 3D model are varied by applying the animation parameters to the face model, and the non-feature points around the control points are changed using a Radial Basis Function (RBF). The experiments show that the developed vision-based animation system can create realistic facial animation with robust head pose estimation and facial variation from input video.

Design and Implementation of AR Model based Automatic Identification and Restoration Scheme for Line Scratches in Old Films (AR 모델 기반의 고전영화의 긁힘 손상의 자동 탐지 및 복원 시스템 설계와 구현)

  • Han, Ngoc-Soc;Kim, Seong-Whan
    • The KIPS Transactions:PartB / v.17B no.1 / pp.47-54 / 2010
  • Old archived film shows two major defects: line scratches and blobs. In this paper, we present the design and implementation of an automatic video restoration system for line scratches observed in archived film. We use an autoregressive (AR) image model because our PAST-PRESENT model and Sampling Pattern make the image generation process stochastic and, specifically, autoregressive. We designed a locality-maximizing scanning pattern that generates a nearly stationary, time-like series of pixels; stationarity is a strong requirement for a stochastic series to be autoregressive. The sampled pixel series undergoes filtering and model fitting with the Durbin-Levinson algorithm before the interpolation process. We designed a three-stage film restoration system, which includes (1) film acquisition from VHS tapes, (2) simple line scratch detection and restoration, and (3) manual blob identification and a sophisticated inpainting scheme. We implemented film acquisition and the simple inpainting scheme on the Texas Instruments TMS320DM642 EVM DSP board, and implemented our AR inpainting scheme on a PC for sophisticated restoration. We tested our scheme on two old Korean films, "Viva Freedom" and "Robot Tae-Kwon-V". The experimental results show that our scheme improves on Bertalmio's scheme in subjective quality (MOS), objective quality (PSNR), and especially restoration ratio (RR), which reflects how similar the result is to manual inpainting.
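
The model-fitting step named in this abstract, the Durbin-Levinson algorithm, can be sketched in plain Python. This shows only the generic recursion for fitting AR coefficients to a one-dimensional series; the paper's scanning pattern, filtering, and interpolation are not reproduced, and the helper names `autocorr` and `predict_next` are illustrative assumptions.

```python
def autocorr(series, max_lag):
    """Biased sample autocovariance r[0..max_lag] of a pixel series."""
    n = len(series)
    mean = sum(series) / n
    d = [v - mean for v in series]
    return [sum(d[i] * d[i + k] for i in range(n - k)) / n
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return (phi, err): AR coefficients and final prediction error variance.

    Assumes r[0] > 0, i.e. the series is not constant.
    """
    phi, err = [], r[0]
    for m in range(1, order + 1):
        # Reflection (partial autocorrelation) coefficient for order m.
        k = (r[m] - sum(phi[j] * r[m - 1 - j] for j in range(m - 1))) / err
        # Update the lower-order coefficients, then append the new one.
        phi = [phi[j] - k * phi[m - 2 - j] for j in range(m - 1)] + [k]
        err *= 1.0 - k * k
    return phi, err


def predict_next(history, phi, mean=0.0):
    """One-step AR prediction from the most recent len(phi) samples."""
    return mean + sum(c * (history[-1 - j] - mean) for j, c in enumerate(phi))
```

In an inpainting setting, `predict_next` would be applied along the scanned pixel series to fill samples that fall inside a detected scratch, which is why the series needs to be close to stationary.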

Alternative Tracing Method for Moving Object Using Reference Template in Real-time Image - Focusing on Parking Management System (참조 템플릿 기반 실시간 이동체 영상을 이용한 대안적 탐지 방안 - 주차관리시스템을 대상으로)

  • Joo, Yong Jin;Kang, Lee Seul;Hahm, Chang Hahk
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.32 no.5 / pp.495-503 / 2014
  • As the number of vehicles has increased sharply, safe and effective operation of parking lots, which form part of the transportation system, is receiving more emphasis. Several recent studies have addressed parking management by detecting moving objects; however, recognizing many fast-moving vehicles simultaneously in an image is still a challenging problem. Parking lots in public areas or large buildings have clearly marked parking sections, and the sensor system is configured to monitor a plurality of parking spaces. Considering such parking lots, we propose a real-time parking availability information system that applies real-time image processing techniques with the help of template matching. The study aims to provide an alternative method for parking management systems that uses reference template markers to recognize the movement of parked vehicles by their size and shape, rather than directly detecting driving movements. In addition, we evaluated the applicability and performance of the information system presented in this study and implemented a prototype system to simulate the parking status of each floor. As a result, it was possible to manage and analyze statistics on the total number of parking spaces and the number of parked vehicles from real-time video frames. We expect the results of the study to improve user-friendliness, reduce the cost of operating parking management systems, and provide information through efficient analysis of the parking situation.
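
One simple way to picture a reference-template check per parking space is shown below. The abstract does not give the matching criterion, so this sketch assumes grayscale frames stored as nested lists, one empty-slot reference patch per space, and a mean absolute difference threshold; all names and the threshold value are hypothetical.

```python
def mean_abs_diff(a, b):
    """Mean absolute gray-level difference between two equally sized patches."""
    total = sum(abs(p - q) for row_a, row_b in zip(a, b) for p, q in zip(row_a, row_b))
    return total / (len(a) * len(a[0]))


def crop(frame, top, left, height, width):
    """Cut the rectangular region of one parking space out of the frame."""
    return [row[left:left + width] for row in frame[top:top + height]]


def occupied_slots(frame, templates, slots, threshold=25.0):
    """Map each slot name to True if it differs enough from its empty template.

    templates[name] is the reference patch of the empty slot;
    slots[name] is (top, left, height, width) in frame coordinates.
    """
    result = {}
    for name, (top, left, height, width) in slots.items():
        patch = crop(frame, top, left, height, width)
        result[name] = mean_abs_diff(patch, templates[name]) > threshold
    return result


# Hypothetical 2x4 frame: slot A matches its empty template, slot B does not.
frame = [[50, 50, 200, 200],
         [50, 50, 200, 200]]
empty = [[50, 50], [50, 50]]
status = occupied_slots(frame, {"A": empty, "B": empty},
                        {"A": (0, 0, 2, 2), "B": (0, 2, 2, 2)})
free_count = sum(1 for taken in status.values() if not taken)
```

Counting the `False` entries per frame gives the kind of availability statistic the abstract mentions, without tracking vehicles while they drive.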

Study On The Signal Radar Plan Position Indicator Scope Of The Data Expressed Scanning System Implemented As An Sticking Image On LCD Display (Plan Position Indicator Scope 주사방식의 Radar 영상신호를 LCD Display에 잔상영상으로 데이터 표출 구현에 관한 연구)

  • Shin, Hyun Jong;Yu, Hyeung Keun
    • Journal of Satellite, Information and Communications / v.10 no.3 / pp.94-101 / 2015
  • The display device is an important video information and communication device that connects humans and machines. It conveys information as characters, shapes, images, and patterns so that it can be recognized by eye. Key functions for displaying information quickly are therefore essential. The PPI scope of a cathode-ray tube (CRT) display performs this role by presenting information for analysis. This research proposes a radar display device that presents the received signal as information. The radar display approach can be applied to large-capacity fixed-function graphics pipeline algorithms through the vertical blanking interval and buffer swap of the display unit. Through synchronization, the algorithms can also be implemented in FPGA logic without a high-performance graphics processing unit (GPU), which makes it possible to implement a display system. In this paper, the proposed approach improves affordability and reliability. We thus studied a radar display unit that replaces a CRT radar display with a flat-panel display.

A Study on Treatment Target Position Verification by using Electronic Portal Imaging Device & Fractionated Stereotatic Radiotherapy (EPID와 FSRT를 이용한 치료표적위치 검증에 관한 연구)

  • Lee, Dong-Hoon;Kwon, Jang-Woo;Park, Seung-Woo;Kim, Yoon-Jong;Lee, Dong-Han;Ji, Young-Hoon
    • Journal of the Institute of Electronics Engineers of Korea SC / v.46 no.3 / pp.44-51 / 2009
  • In cancer therapy with high-energy radiation, it is very important to verify setup errors and to deliver the radiation precisely. Verification of the treatment position is especially crucial in special therapies such as fractionated stereotactic radiotherapy (FSRT), which normally uses a high dose and a small field size to treat small intracranial lesions. To evaluate the developed FSRT system, the isocenter accuracy of the gantry, couch, and collimator was measured, and the total inaccuracy was less than ±1 mm. The EPID image of a 3 mm lead ball mounted at the isocenter with a 25 mm collimator cone was acquired, and after processing the image, the difference between the center of the 25 mm collimator cone and the center of the 3 mm ball was detected to within one pixel (0.76 mm). This paper shows that radiation treatment efficiency can be improved by performing precise radiation therapy in near real time with the developed video-based EPID and FSRT.
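
The center comparison described here amounts to measuring the distance between two centroids in pixels and converting it to millimeters. The sketch below assumes the cone field and the ball have already been segmented into binary masks (the segmentation itself is not described in the abstract); only the 0.76 mm pixel size comes from the text, and the function names are illustrative.

```python
import math

PIXEL_MM = 0.76  # pixel size stated in the abstract


def centroid(mask):
    """Centroid (row, col) of the nonzero pixels in a binary mask."""
    points = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    n = len(points)
    return (sum(r for r, _ in points) / n, sum(c for _, c in points) / n)


def center_offset_mm(cone_mask, ball_mask, pixel_mm=PIXEL_MM):
    """Distance between the cone center and the ball center, in millimeters."""
    (r1, c1), (r2, c2) = centroid(cone_mask), centroid(ball_mask)
    return math.hypot(r1 - r2, c1 - c2) * pixel_mm


# Hypothetical masks: a 5x5 cone field and a ball one pixel right of its center.
cone = [[1] * 5 for _ in range(5)]
ball = [[0] * 5 for _ in range(5)]
ball[2][3] = 1
offset = center_offset_mm(cone, ball)
```

An offset of one pixel corresponds to 0.76 mm, which is consistent with the sub-millimeter isocenter accuracy the abstract reports.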