• Title/Summary/Keyword: automatic shot

Search Result 53, Processing Time 0.028 seconds

A Method for Structuring Digital Video

  • Lee, Jae-Yeon;Jeong, Se-Yoon;Yoon, Ho-Sub;Kim, Kyu-Heon;Bae, Younglae-J;Jang, Jong-whan
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1998.06b
    • /
    • pp.92-97
    • /
    • 1998
  • For the efficient searching and browsing of digital video, it is essential to extract the internal structure of the video contents. As an example, a news video consists of several sections such as politics, economics, sports and others, and also each section consists of individual topics. With this information in hand, users can ore easily access the required video frames. This paper addresses the problem of automatic shot boundary detection and selection of representative frames (R-frames), which are the essential step in recognizing the internal structure of video contents. In the shot boundary detection, a new algorithm that have dual detectors which are designed specifically for the abrupt boundaries (cuts) and gradually changing bounaries respectively is proposed. Compared to the existing 미algorithms that mostly have tried to detect both types by a single mechanism, the proposed algorithm is proved to be more robust and accurate. Also in the problem of R-frame selection, simple mechanical approaches such as selecting one frame every other second have been adopted. However this approach often selects too many R-frames in static short, while drops important frames in dynamic shots. To improve the selection mechanism, a new R-frame selection algorithm that uses motion information extracted from pixel difference is proposed.

  • PDF

Development of Independent Target Approximation by Auto-computation of 3-D Distribution Units for Stereotactic Radiosurgery (정위적 방사선 수술시 3차원적 공간상 단위분포들의 자동계산법에 의한 간접적 병소 근사화 방법의 개발)

  • Choi Kyoung Sik;Oh Seung Jong;Lee Jeong Woo;Kim Jeung Kee;Suh Tae Suk;Choe Bo Young;Kim Moon Chan;Chung Hyun-Tai
    • Progress in Medical Physics
    • /
    • v.16 no.1
    • /
    • pp.24-31
    • /
    • 2005
  • The stereotactic radiosurgery (SRS) describes a method of delivering a high dose of radiation to a small tar-get volume in the brain, generally in a single fraction, while the dose delivered to the surrounding normal tissue should be minimized. To perform automatic plan of the SRS, a new method of multi-isocenter/shot linear accelerator (linac) and gamma knife (GK) radiosurgery treatment plan was developed, based on a physical lattice structure in target. The optimal radiosurgical plan had been constructed by many beam parameters in a linear accelerator or gamma knife-based radiation therapy. In this work, an isocenter/shot was modeled as a sphere, which is equal to the circular collimator/helmet hole size because the dimension of the 50% isodose level in the dose profile is similar to its size. In a computer-aided system, it accomplished first an automatic arrangement of multi-isocenter/shot considering two parameters such as positions and collimator/helmet sizes for each isocenter/shot. Simultaneously, an irregularly shaped target was approximated by cubic structures through computation of voxel units. The treatment planning method by the technique was evaluated as a dose distribution by dose volume histograms, dose conformity, and dose homogeneity to targets. For irregularly shaped targets, the new method performed optimal multi-isocenter packing, and it only took a few seconds in a computer-aided system. The targets were included in a more than 50% isodose curve. The dose conformity was ordinarily acceptable levels and the dose homogeneity was always less than 2.0, satisfying for various targets referred to Radiation Therapy Oncology Group (RTOG) SRS criteria. In conclusion, this approach by physical lattice structure could be a useful radiosurgical plan without restrictions in the various tumor shapes and the different modality techniques such as linac and GK for SRS.

  • PDF

Acquisition of Subcentimeter GSD Images Using UAV and Analysis of Visual Resolution (UAV를 이용한 Subcentimeter GSD 영상의 취득 및 시각적 해상도 분석)

  • Han, Soohee;Hong, Chang-Ki
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.35 no.6
    • /
    • pp.563-572
    • /
    • 2017
  • The purpose of the study is to investigate the effect of flight height, flight speed, exposure time of camera shutter and autofocusing on the visual resolution of the image in order to obtain ultra-high resolution images with a GSD less than 1cm. It is also aimed to evaluate the ease of recognition of various types of aerial targets. For this purpose, we measured the visual resolution using a 7952*5304 pixel 35mm CMOS sensor and a 55mm prime lens at 20m intervals from 20m to 120m above ground. As a result, with automatic focusing, the visual resolution is measured 1.1~1.6 times as the theoretical GSD, and without automatic focusing, 1.5~3.5 times. Next, the camera was shot at 80m above ground at a constant flight speed of 5m/s, while reducing the exposure time by 1/2 from 1/60sec to 1/2000sec. Assuming that blur is allowed within 1 pixel, the visual resolution is 1.3~1.5 times larger than the theoretical GSD when the exposure time is kept within the longest exposure time, and 1.4~3.0 times larger when it is not kept. If the aerial targets are printed on A4 paper and they are shot within 80m above ground, the encoded targets can be recognized automatically by commercial software, and various types of general targets and coded ones can be manually recognized with ease.

Exploring automatic scoring of mathematical descriptive assessment using prompt engineering with the GPT-4 model: Focused on permutations and combinations (프롬프트 엔지니어링을 통한 GPT-4 모델의 수학 서술형 평가 자동 채점 탐색: 순열과 조합을 중심으로)

  • Byoungchul Shin;Junsu Lee;Yunjoo Yoo
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.187-207
    • /
    • 2024
  • In this study, we explored the feasibility of automatically scoring descriptive assessment items using GPT-4 based ChatGPT by comparing and analyzing the scoring results between teachers and GPT-4 based ChatGPT. For this purpose, three descriptive items from the permutation and combination unit for first-year high school students were selected from the KICE (Korea Institute for Curriculum and Evaluation) website. Items 1 and 2 had only one problem-solving strategy, while Item 3 had more than two strategies. Two teachers, each with over eight years of educational experience, graded answers from 204 students and compared these with the results from GPT-4 based ChatGPT. Various techniques such as Few-Shot-CoT, SC, structured, and Iteratively prompts were utilized to construct prompts for scoring, which were then inputted into GPT-4 based ChatGPT for scoring. The scoring results for Items 1 and 2 showed a strong correlation between the teachers' and GPT-4's scoring. For Item 3, which involved multiple problem-solving strategies, the student answers were first classified according to their strategies using prompts inputted into GPT-4 based ChatGPT. Following this classification, scoring prompts tailored to each type were applied and inputted into GPT-4 based ChatGPT for scoring, and these results also showed a strong correlation with the teachers' scoring. Through this, the potential for GPT-4 models utilizing prompt engineering to assist in teachers' scoring was confirmed, and the limitations of this study and directions for future research were presented.

A Method of Generating Table-of-Contents for Educational Video (교육용 비디오의 ToC 자동 생성 방법)

  • Lee Gwang-Gook;Kang Jung-Won;Kim Jae-Gon;Kim Whoi-Yul
    • Journal of Broadcast Engineering
    • /
    • v.11 no.1 s.30
    • /
    • pp.28-41
    • /
    • 2006
  • Due to the rapid development of multimedia appliances, the increasing amount of multimedia data enforces the development of automatic video analysis techniques. In this paper, a method of ToC generation is proposed for educational video contents. The proposed method consists of two parts: scene segmentation followed by scene annotation. First, video sequence is divided into scenes by the proposed scene segmentation algorithm utilizing the characteristics of educational video. Then each shot in the scene is annotated in terms of scene type, existence of enclosed caption and main speaker of the shot. The ToC generated by the proposed method represents the structure of a video by the hierarchy of scenes and shots and gives description of each scene and shot by extracted features. Hence the generated ToC can help users to perceive the content of a video at a glance and. to access a desired position of a video easily. Also, the generated ToC automatically by the system can be further edited manually for the refinement to effectively reduce the required time achieving more detailed description of the video content. The experimental result showed that the proposed method can generate ToC for educational video with high accuracy.

Generating Extreme Close-up Shot Dataset Based On ROI Detection For Classifying Shots Using Artificial Neural Network (인공신경망을 이용한 샷 사이즈 분류를 위한 ROI 탐지 기반의 익스트림 클로즈업 샷 데이터 셋 생성)

  • Kang, Dongwann;Lim, Yang-mi
    • Journal of Broadcast Engineering
    • /
    • v.24 no.6
    • /
    • pp.983-991
    • /
    • 2019
  • This study aims to analyze movies which contain various stories according to the size of their shots. To achieve this, it is needed to classify dataset according to the shot size, such as extreme close-up shots, close-up shots, medium shots, full shots, and long shots. However, a typical video storytelling is mainly composed of close-up shots, medium shots, full shots, and long shots, it is not an easy task to construct an appropriate dataset for extreme close-up shots. To solve this, we propose an image cropping method based on the region of interest (ROI) detection. In this paper, we use the face detection and saliency detection to estimate the ROI. By cropping the ROI of close-up images, we generate extreme close-up images. The dataset which is enriched by proposed method is utilized to construct a model for classifying shots based on its size. The study can help to analyze the emotional changes of characters in video stories and to predict how the composition of the story changes over time. If AI is used more actively in the future in entertainment fields, it is expected to affect the automatic adjustment and creation of characters, dialogue, and image editing.

A Study on Automatic Precision Landing for Small UAV's Industrial Application (소형 UAV의 산업 응용을 위한 자동 정밀 착륙에 관한 연구)

  • Kim, Jong-Woo;Ha, Seok-Wun;Moon, Yong-Ho
    • Journal of Convergence for Information Technology
    • /
    • v.7 no.3
    • /
    • pp.27-36
    • /
    • 2017
  • In almost industries, such as the logistics industry, marine fisheries, agriculture, industry, and services, small unmanned aerial vehicles are used for aerial photographing or closing flight in areas where human access is difficult or CCTV is not installed. Also, based on the information of small unmanned aerial photographing, application research is actively carried out to efficiently perform surveillance, control, or management. In order to carry out tasks in a mission-based manner in which the set tasks are assigned and the tasks are automatically performed, the small unmanned aerial vehicles must not only fly steadily but also be able to charge the energy periodically, In addition, the unmanned aircraft need to land automatically and precisely at certain points after the end of the mission. In order to accomplish this, an automatic precision landing method that leads landing by continuously detecting and recognizing a marker located at a landing point from a video shot of a small UAV is required. In this paper, it is shown that accurate and stable automatic landing is possible even if simple template matching technique is applied without using various recognition methods that require high specification in using low cost general purpose small unmanned aerial vehicle. Through simulation and actual experiments, the results show that the proposed method will be made good use of industrial fields.

Video Segmentation and Video Browsing using the Edge and Color Distribution (윤곽선과 컬러 분포를 이용한 비디오 분할과 비디오 브라우징)

  • Heo, Seoung;Kim, Woo-Saeng
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.9
    • /
    • pp.2197-2207
    • /
    • 1997
  • In this paper, we propose a video data segmentation method using edge and color distribution of video frames and also develop a video browser by using the proposed algorithm. To segment a video, we use a 644-bin HSV color histogram and the edge information which generated with automatic threshold method. We consider scene's characteristics by using positions and colo distributions of object in each frame. We develop a hierarchical and a shot-based browser for video browsing. We also show that our proposed method is less sensitive to light effects and more robust to motion effects than previous ones like a histogram-based method by testing with various video data.

  • PDF

Backup Software for SAN with NDMP (SAN환경에서 NDMP를 이용한 백업소프트웨어)

  • 복경수;황홍연;송석일;유재수
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.9 no.4
    • /
    • pp.455-469
    • /
    • 2003
  • Recently, as new technologies such as SAN and NAS come into wide use to deal with a large amount of data, an efficient backup software for SAN and NAS is very required. In this paper, we design and implement a backup software for SAN that fully supports NDMP. The NUMP is an open standard protocol for network-based backup. The proposed backup software has various unique features such as SAN based tan free backup, automatic and manual backup, on-line backup by using snap shot, file-system, raw-device, database backup and so on. The proposed backup software also can be configured as a backup center that uses SAN as a backup media.

Automatic Indexing for the Content-based Retrieval of News Video (뉴스 비디오의 내용기반 검색을 위한 자동 인덱싱)

  • Yang, Myung-Sup;Yoo, Cheol-Jung;Chang, Ok-Bae
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.5
    • /
    • pp.1130-1139
    • /
    • 1998
  • This paper presents an integrated solution for the content-based news video indexing and the retrieval. Currently, it is impossible to automatically index a general video, but we can index a specific structural video such as news videos. Our proposed model extracts automatically the key frames by using the structured knowledge of news and consists of the news item segmentation, caption recognition and search browser modules. We present above three modules in the following: the news event segmentation module recognizes an anchor-person shot based on face recognition, and then its news event are divided by the anchor-person's frame information. The caption recognition module detects the caption-frames with the caption characteristics, extracts their character region by the using split-merge method, and then recognizes characters with OCR software. Finally, the search browser module could make a various of searching mechanism possible.

  • PDF