• Title/Summary/Keyword: Video Software Method

Search Result 308, Processing Time 0.027 seconds

Motion Vector Coding Using Adaptive Motion Resolution (적응적인 움직임 벡터 해상도를 이용한 움직임 벡터 부호화 방법)

  • Jang, Myung-Hun;Seo, Chan-Won;Han, Jong-Ki
    • Journal of Broadcast Engineering
    • /
    • v.17 no.1
    • /
    • pp.165-178
    • /
    • 2012
  • In most conventional video codecs, such as MPEG-2 and MPEG-4, inter coding is performed with the fixed motion vector resolution. When KTA software was developed, resolution for MVs can be selected in each slice. Although KTA codec uses a variety of resolutions for ME, the selected resolution is applied over the entire pixels in the slice and the statistical property of the local area is not considered. In this paper, we propose an adaptive decision scheme for motion vector resolution which depends on region, where MV search area is divided to multiple regions according to the distance from PMV. In each region, the assigned resolution is used to estimate MV. Each region supports different resolution for ME from other regions. The efficiency of the proposed scheme is affected from threshold values to divide the search area and the entropy coding method to encode the estimated MV. Simulation results with HM3.0 which is the reference software of HEVC show that the proposed scheme provides bit rate gains of 0.9%, 0.6%, and 2.9% in Random Access, Low Delay with B picture, and Low Delay with P picture structures, respectively.

Comparison of Adversarial Example Restoration Performance of VQ-VAE Model with or without Image Segmentation (이미지 분할 여부에 따른 VQ-VAE 모델의 적대적 예제 복원 성능 비교)

  • Tae-Wook Kim;Seung-Min Hyun;Ellen J. Hong
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.4
    • /
    • pp.194-199
    • /
    • 2022
  • Preprocessing for high-quality data is required for high accuracy and usability in various and complex image data-based industries. However, when a contaminated hostile example that combines noise with existing image or video data is introduced, which can pose a great risk to the company, it is necessary to restore the previous damage to ensure the company's reliability, security, and complete results. As a countermeasure for this, restoration was previously performed using Defense-GAN, but there were disadvantages such as long learning time and low quality of the restoration. In order to improve this, this paper proposes a method using adversarial examples created through FGSM according to image segmentation in addition to using the VQ-VAE model. First, the generated examples are classified as a general classifier. Next, the unsegmented data is put into the pre-trained VQ-VAE model, restored, and then classified with a classifier. Finally, the data divided into quadrants is put into the 4-split-VQ-VAE model, the reconstructed fragments are combined, and then put into the classifier. Finally, after comparing the restored results and accuracy, the performance is analyzed according to the order of combining the two models according to whether or not they are split.

BuddyMirror: A Smart Mirror Supporting Image-Making Service (BuddyMirror: 이미지 메이킹 서비스를 지원하는 스마트 미러)

  • Jo, Yeon-Jeong;Sim, Chae-Lin;Jang, Hyo-Won;Jin, Jae-Hwan;Lee, Myung-Joon
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.9 no.5
    • /
    • pp.811-821
    • /
    • 2019
  • Image making for a person is a way to improve various factors that can express oneself, such as appearance, impression, and confidence. In general, people use mirror or camera as a traditional method to make their own image or perform presentation exercises. Recently, as smart mirrors are widely used in various fields, attempts to use smart mirrors as image making tools instead of mirrors have been frequently made. Smart Mirror is considered as a suitable tool to provide image making service because it can attach various devices such as a camera and a microphone in addition to the main advantage of a mirror that it is easily accessible. In this paper, we present BuddyMirror - a smart mirror software that provides image-making service to users, and a dedicated mobile app for flexibly running the mirror software. BuddyMirror provides functions for presentation, mock interview, and styling service at the request of users, interworking with the dedicated mobile app. We also describe the techniques developed for implementing and activating each of the new services as a module of MagicMirror, a widely used smart mirror development platform. The developed mobile app enables users to deliver presentations to BuddyMirror or to download the recorded video for image-making services.

R-lambda Model based Rate Control for GOP Parallel Coding in A Real-Time HEVC Software Encoder (HEVC 실시간 소프트웨어 인코더에서 GOP 병렬 부호화를 지원하는 R-lambda 모델 기반의 율 제어 방법)

  • Kim, Dae-Eun;Chang, Yongjun;Kim, Munchurl;Lim, Woong;Kim, Hui Yong;Seok, Jin Wook
    • Journal of Broadcast Engineering
    • /
    • v.22 no.2
    • /
    • pp.193-206
    • /
    • 2017
  • In this paper, we propose a rate control method based on the $R-{\lambda}$ model that supports a parallel encoding structure in GOP levels or IDR period levels for 4K UHD input video in real-time. For this, a slice-level bit allocation method is proposed for parallel encoding instead of sequential encoding. When a rate control algorithm is applied in the GOP level or IDR period level parallelism, the information of how many bits are consumed cannot be shared among the frames belonging to a same frame level except the lowest frame level of the hierarchical B structure. Therefore, it is impossible to manage the bit budget with the existing bit allocation method. In order to solve this problem, we improve the bit allocation procedure of the conventional ones that allocate target bits sequentially according to the encoding order. That is, the proposed bit allocation strategy is to assign the target bits in GOPs first, then to distribute the assigned target bits from the lowest depth level to the highest depth level of the HEVC hierarchical B structure within each GOP. In addition, we proposed a processing method that is used to improve subjective image qualities by allocating the bits according to the coding complexities of the frames. Experimental results show that the proposed bit allocation method works well for frame-level parallel HEVC software encoders and it is confirmed that the performance of our rate controller can be improved with a more elaborate bit allocation strategy by using the preprocessing results.

Spatial Correlation Based Fast Coding Depth Decision and Reference Frame Selection in HEVC (HEVC의 공간적 상관성 기반 고속 부호화 깊이 및 참조영상 결정 방법)

  • Lee, Sang-Yong;Kim, Dong-Hyun;Kim, Jae-Gon;Choi, Hae-Chul;Kim, Jin-Soo;Choi, Jin-Soo
    • Journal of Broadcast Engineering
    • /
    • v.17 no.5
    • /
    • pp.716-724
    • /
    • 2012
  • In this paper, we propose a fast decision method of maximum coding depth decision and reference frame selection in HEVC. To reduce computational complexity and encoding time of HEVC, two methods are proposed. In the first method, the maximum depth of each coding unit (CU) in a largest CU (LCU) is constrained by using the maximum coding depth used by adjacent LCUs based on the assumption that the spatial correlation is very high and rate-distortion (R-D) cost. And we constrain the number of reference pictures for prediction unit (PU) performing motion estimation by using the motion information of the upper depth PU. The proposed methods reduce computational complexity of the HEVC encoder by constraining the maximum coding depth and the reference frame. We could achieve about 39% computational complexity reduction with marginal bitrate increase of 1.2% in the comparison with HM6.1 HEVC reference software.

Dose Verification of Intensity Modulated Radiation Therapy with Beam Intensity Scanner System

  • Vahc, Young-Woo;Park, Kwangyl;Ohyun Kwon;Park, Kyung-Ran;Lee, Yong-Ha;Yi, Byung-Yong;Kim, Sookil
    • Proceedings of the Korean Society of Medical Physics Conference
    • /
    • 2002.09a
    • /
    • pp.248-251
    • /
    • 2002
  • The intensity modulated radiation therapy (IMRT) with a multileaf collimator (MLC) requires the conversion of a radiation fluence map into a leaf sequence file that controls the movement of the MLC during radiation treatment of patients. Patient dose verification is clinically one of the most important parts in the treatment delivery of the radiation therapy. The three dimensional (3D) reconstruction of dose distribution delivered to the target helps to verify patient dose and to determine the physical characteristics of beams used in IMRT. A new method is presented for the pretreatment dosimetric verification of two dimensional distributions of photon intensity by means of Beam Intensity Scanner System (BISS) as a radiation detector with a custom-made software for dose calculation of fluorescence signals from scintillator. The scintillator is used to produce fluorescence from the irradiation of 6MV photons on a Varian Clinac 21EX. The BISS reproduces 3D- relative dose distribution from the digitized fluoroscopic signals obtained by digital video camera-based scintillator(DVCS) device in the IMRT. For the intensity modulated beams (IMBs), the calculations of absorbed dose are performed in absolute beam fluence profiles which are used for calculation of the patient dose distribution. The 3D-dose profiles of the IMBs with the BISS were demonstrated by relative measurements of photon beams and shown good agreement with radiographic film. The mechanical and dosimetric properties of the collimating of dynamic and/or step MLC system alter the generated intensity. This is mostly due to leaf transmission, leaf penumbra and geometry of leaves. The variations of output according to the multileaf opening during the irradiation need to be accounted for as well. These phenomena result in a fluence distribution that can be substantially different from the initial and calculative intensity modulation and therefore, should be taken into account by the treatment planning for accurate dose calculations delivered to the target volume in IMRT.

  • PDF

An Approach to Generation Monitoring Module using UML Model (UML모델을 이용한 모니터링 모듈 생성 방법)

  • Park, Jeong-Min;Jeong, Ok-Ran
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.9
    • /
    • pp.57-68
    • /
    • 2011
  • Self-healing is an approach to evaluating constraints defined in target system and to applying an appropriate strategy when violating the constrains. Today, the computing environment is very complex, so researches that endow a system with the self-healing's ability that recognizes problem arising in a target system are being an important issues. However, most of the existing researches are that self-healing developers need much effort and time to analyze and model constraints. Thus, in order to improve these problems, this paper proposes the method that automatically generates monitoring module by using UML models for self-healing. The approach proposes: 1) defining system knowledge required for self-healing from UML model, 2) process for generating monitor, by using monitor generated, and process for monitoring the problems. Through these, we can reduce the efforts of self-healing developers to analyze target system, and secure monitoring scope based on information of system knowledge. Also we can minimize the efforts to develop the monitoring environment automatically. to evaluate the proposed approach, we apply proposed approach to ATM prototype system for qualitative result, and perform quantitative evaluation through video conference system in our existing research.

Chromosome Analysis in Clinical Samples by Chromosome Diagnostic System Using Fluorescence in Situ Hybridization (국산 Fluorescence in Situ Hybridization 시스템을 이용한 다양한 검체에서의 염색체 분석)

  • Moon, Shin-Yong;Pang, Myung-Geol;Oh, Sun-Kyung;Ryu, Buom-Yong;Hwang, Do-Yeong;Jung, Byeong-Jun;Choe, Jin;Sohn, Cherl;Chang, Jun-Keun;Kim, Jong-Won;Kim, Seok-Hyun;Choi, Young-Min
    • Clinical and Experimental Reproductive Medicine
    • /
    • v.24 no.3
    • /
    • pp.335-340
    • /
    • 1997
  • Fluorescence in situ hybridization (FISH) techniques allow the enumeration of chromosome abnormalities and from a great potential for many clinical applications. In order to produce quantitative and reproducible results, expensive tools such as a cooled CCD camera and a computer software are required. We have developed a Chromosome Image Processing System (Chips) using FISH that allows the detection and mapping of the genetic aberrations. The aim of our study, therefore, is to evaluate the capabilities of our original system using a black-and-white video camera. As a model system, three repetitive DNA probes (D18Z1, DXZ1, and DYZ3) were hybridized to variety different clinical samples such as human metaphase spreads and interphase nuclei obtained from uncultured peripheral blood lymphocytes, uncultured amniocytes, and germ cells. The visualization of the FISH signals was performed using our system for image acquisition and pseudocoloring. FISH images were obtained by combining images from each of probes and DAPI counterstain captured separately. Using our original system, the aberrations of single or multiple chromosomes in a single hybridization experiment using chromosomes and interphase nuclei from a variety of cell types, including lymphocytes, amniocytes, sperm, and biopsied blastomeres, were enabled to evaluate. There were no differences in the image quality in accordance with FISH method, fluorochrome types, or different clinical samples. Always bright signals were detected using our system. Our system also yielded constant results. Our Chips would permit a level of performance of FISH analysis on metaphase chromosomes and interphase nuclei with unparalleled capabilities. Thus, it would be useful for clinical purposes.

  • PDF

Kinematic Characteristics Based on Proficiency In Geoduepyeopchagi in Taekwondo Poomsae Koryo

  • So, Jae Moo;Kang, Sung-Sun;Hong, AhReum;Jung, Jong Min;Kim, Jai Jeong
    • Korean Journal of Applied Biomechanics
    • /
    • v.26 no.4
    • /
    • pp.343-351
    • /
    • 2016
  • Objective: The purpose of this study was to help improve game performance and provide preliminary data to enhance the efficiency of the kick and stability of the support foot by comparing the kinematic characteristics of the repeated side kick (geodeupyeopchagi) in poomsaeKoryo between expert and non-expert groups. Method: The subjects were divided into 2 groups according to proficiency in Taekwondo, an expert group and a non-expert group (n = 7 in each group), to observe the repeated side-kick technique. Four video cameras were set at a speed of 60 frames/sec and exposure time of 1/500 sec to measure the kinematic factors of the 2 groups. The Kwon3D XPprogramas used to collect and analyze three-dimensional spatial coordinates. Ground reaction force data were obtained through a force plate with a 1.200-Hz frequency. An independent samplesttest was performed, and statistical significance was defined as .05. The SPSS 18.0 software was used to calculate the mean and standard deviation of the kinematic factors and to identify the difference between the experts and non-experts. Results: The angular displacement of the hip joint in both the expert and non-expert groups showed statistical significance on E1 and E4 of the left support foot and E5 of the right foot (p<.05). The angle displacement of the knee joint in both groups showed statistical significance on E4 of the left support foot, and E1 and E2 of the right foot (p<.05). The angular velocity of the lower leg in both groups showed no statistical significance on the left support foot but showed statistical significance on E2 and E6 of the right foot (p<.05). The angular velocity of the foot in both groups showed no statistical significance on the left support foot but showed statistical significance on E2 of the right foot (p<.05). The vertical ground reaction force in both groups showed statistical significance on E2 (p<.05). The center of pressure in all directions in both groups showed statistical significance (p<.5). Conclusion: While performing the repeated side kick (geodeupyeopchagi), the experts maintainedconsistency and stability of the angle of the support leg while the kick foot moved high and fast. On the other hand, the angle of the support foot of non-experts appeared inconsistent, and the kick foot was raised, relying on the support leg, resulting in unstable and inaccurate movement.

A Road Luminance Measurement Application based on Android (안드로이드 기반의 도로 밝기 측정 어플리케이션 구현)

  • Choi, Young-Hwan;Kim, Hongrae;Hong, Min
    • Journal of Internet Computing and Services
    • /
    • v.16 no.2
    • /
    • pp.49-55
    • /
    • 2015
  • According to the statistics of traffic accidents over recent 5 years, traffic accidents during the night times happened more than the day times. There are various causes to occur traffic accidents and the one of the major causes is inappropriate or missing street lights that make driver's sight confused and causes the traffic accidents. In this paper, with smartphones, we designed and implemented a lane luminance measurement application which stores the information of driver's location, driving, and lane luminance into database in real time to figure out the inappropriate street light facilities and the area that does not have any street lights. This application is implemented under Native C/C++ environment using android NDK and it improves the operation speed than code written in Java or other languages. To measure the luminance of road, the input image with RGB color space is converted to image with YCbCr color space and Y value returns the luminance of road. The application detects the road lane and calculates the road lane luminance into the database sever. Also this application receives the road video image using smart phone's camera and improves the computational cost by allocating the ROI(Region of interest) of input images. The ROI of image is converted to Grayscale image and then applied the canny edge detector to extract the outline of lanes. After that, we applied hough line transform method to achieve the candidated lane group. The both sides of lane is selected by lane detection algorithm that utilizes the gradient of candidated lanes. When the both lanes of road are detected, we set up a triangle area with a height 20 pixels down from intersection of lanes and the luminance of road is estimated from this triangle area. Y value is calculated from the extracted each R, G, B value of pixels in the triangle. The average Y value of pixels is ranged between from 0 to 100 value to inform a luminance of road and each pixel values are represented with color between black and green. We store car location using smartphone's GPS sensor into the database server after analyzing the road lane video image with luminance of road about 60 meters ahead by wireless communication every 10 minutes. We expect that those collected road luminance information can warn drivers about safe driving or effectively improve the renovation plans of road luminance management.