• Title/Summary/Keyword: frame operator

Search Result 70, Processing Time 0.024 seconds

Reproducing Summarized Video Contents based on Camera Framing and Focus

  • Hyung Lee;E-Jung Choi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.10
    • /
    • pp.85-92
    • /
    • 2023
  • In this paper, we propose a method for automatically generating story-based abbreviated summaries from long-form dramas and movies. From the shooting stage, the basic premise was to compose a frame with illusion of depth considering the golden division as well as focus on the object of interest to focus the viewer's attention in terms of content delivery. To consider how to extract the appropriate frames for this purpose, we utilized elemental techniques that have been utilized in previous work on scene and shot detection, as well as work on identifying focus-related blur. After converting the videos shared on YouTube to frame-by-frame, we divided them into a entire frame and three partial regions for feature extraction, and calculated the results of applying Laplacian operator and FFT to each region to choose the FFT with relative consistency and robustness. By comparing the calculated values for the entire frame with the calculated values for the three regions, the target frames were selected based on the condition that relatively sharp regions could be identified. Based on the selected results, the final frames were extracted by combining the results of an offline change point detection method to ensure the continuity of the frames within the shot, and an edit decision list was constructed to produce an abbreviated summary of 62.77% of the footage with F1-Score of 75.9%

A study of Artificial Intelligence (AI) Speaker's Development Process in Terms of Social Constructivism: Focused on the Products and Periodic Co-revolution Process (인공지능(AI) 스피커에 대한 사회구성 차원의 발달과정 연구: 제품과 시기별 공진화 과정을 중심으로)

  • Cha, Hyeon-ju;Kweon, Sang-hee
    • Journal of Internet Computing and Services
    • /
    • v.22 no.1
    • /
    • pp.109-135
    • /
    • 2021
  • his study classified the development process of artificial intelligence (AI) speakers through analysis of the news text of artificial intelligence (AI) speakers shown in traditional news reports, and identified the characteristics of each product by period. The theoretical background used in the analysis are news frames and topic frames. As analysis methods, topic modeling and semantic network analysis using the LDA method were used. The research method was a content analysis method. From 2014 to 2019, 2710 news related to AI speakers were first collected, and secondly, topic frames were analyzed using Nodexl algorithm. The result of this study is that, first, the trend of topic frames by AI speaker provider type was different according to the characteristics of the four operators (communication service provider, online platform, OS provider, and IT device manufacturer). Specifically, online platform operators (Google, Naver, Amazon, Kakao) appeared as a frame that uses AI speakers as'search or input devices'. On the other hand, telecommunications operators (SKT, KT) showed prominent frames for IPTV, which is the parent company's flagship business, and 'auxiliary device' of the telecommunication business. Furthermore, the frame of "personalization of products and voice service" was remarkable for OS operators (MS, Apple), and the frame for IT device manufacturers (Samsung) was "Internet of Things (IoT) Integrated Intelligence System". The econd, result id that the trend of the topic frame by AI speaker development period (by year) showed a tendency to develop around AI technology in the first phase (2014-2016), and in the second phase (2017-2018), the social relationship between AI technology and users It was related to interaction, and in the third phase (2019), there was a trend of shifting from AI technology-centered to user-centered. As a result of QAP analysis, it was found that news frames by business operator and development period in AI speaker development are socially constituted by determinants of media discourse. The implication of this study was that the evolution of AI speakers was found by the characteristics of the parent company and the process of co-evolution due to interactions between users by business operator and development period. The implications of this study are that the results of this study are important indicators for predicting the future prospects of AI speakers and presenting directions accordingly.

Voice Activity Detection Using Modified Power Spectral Deviation Based on Teager Energy (Teager Energy 기반의 수정된 파워 스펙트럼 편차를 이용한 음성 검출)

  • Song, J.H.;Song, Y.R.;Shim, H.M.;Lee, S.M.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.8 no.1
    • /
    • pp.41-46
    • /
    • 2014
  • In this paper, we propose a novel voice activity detection (VAD) algorithm using feature vectors based on TE (teager energy). Specifically, power spectral deviation (PSD), which is used as the feature for the VAD in the IS-127 noise suppression algorithm, is obtained after the input signal is transfomed by Teager energy operator. In addition, the TE-based likelihhod ratio are derived in each frame to modifiy the PSD for further VAD. The performance of our proposed VAD algorithm are evaluated by objective testing (total error rate, receiver operating characteristics, perceptual evaluation of speech quality) under various environments, and it is found that the proposed method yields better results than conventional VAD algorithms in the non-stationary noise environments under 5 dB SNR (total error rate = 2.6% decrease, PESQ score = 0.053 improvement).

  • PDF

Modal parameter identification of tall buildings based on variational mode decomposition and energy separation

  • Kang Cai;Mingfeng Huang;Xiao Li;Haiwei Xu;Binbin Li;Chen Yang
    • Wind and Structures
    • /
    • v.37 no.6
    • /
    • pp.445-460
    • /
    • 2023
  • Accurate estimation of modal parameters (i.e., natural frequency, damping ratio) of tall buildings is of great importance to their structural design, structural health monitoring, vibration control, and state assessment. Based on the combination of variational mode decomposition, smoothed discrete energy separation algorithm-1, and Half-cycle energy operator (VMD-SH), this paper presents a method for structural modal parameter estimation. The variational mode decomposition is proved to be effective and reliable for decomposing the mixed-signal with low frequencies and damping ratios, and the validity of both smoothed discrete energy separation algorithm-1 and Half-cycle energy operator in the modal identification of a single modal system is verified. By incorporating these techniques, the VMD-SH method is able to accurately identify and extract the various modes present in a signal, providing improved insights into its underlying structure and behavior. Subsequently, a numerical study of a four-story frame structure is conducted using the Newmark-β method, and it is found that the relative errors of natural frequency and damping ratio estimated by the presented method are much smaller than those by traditional methods, validating the effectiveness and accuracy of the combined method for the modal identification of the multi-modal system. Furthermore, the presented method is employed to estimate modal parameters of a full-scale tall building utilizing acceleration responses. The identified results verify the applicability and accuracy of the presented VMD-SH method in field measurements. The study demonstrates the effectiveness and robustness of the proposed VMD-SH method in accurately estimating modal parameters of tall buildings from acceleration response data.

Improved changed region detection and motion estimation for object-oriented coding (객체기반 부호화에서의 개선된 움직임 영역 추출 및 추정 기법)

  • 정의윤;박영식;송근원;한규필;하영호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.22 no.9
    • /
    • pp.2043-2052
    • /
    • 1997
  • The object-oriented coding technique which is one of the coding methods in very low bit rate environment is suitable for videophone image sequence. The selection of source model affect image analysis. In this paper, an image analysis method for the object-oriented coding is presented. The process is composed of changed region detection andmotion estimateion. First, we use the standard deviation of frame difference as thrreshold to extract themoving area. If thesum of gray values in mask is greater than the threshold, the center pixel of the mask is regarded as moving region. After moving is detected in changed region by edge operator, observation point is determined from moving region. The motion is estimated by 6-parameter mapping method with determined observation point. The experimantal resutls show that the proposed method can significantly improve the image quality.

  • PDF

Forward Error Correction based Adaptive data frame format for Optical camera communication

  • Nguyen, Quoc Huy;Kim, Hyung-O;Lee, Minwoo;Cho, Juphil;Lee, Seonhee
    • International journal of advanced smart convergence
    • /
    • v.4 no.2
    • /
    • pp.94-102
    • /
    • 2015
  • Optical camera communication (OCC) is an extension of Visible Light Communication. Different from traditional visible light communication, optical camera communications is an almost no additional cost technology by taking the advantage of build-in camera in devices. It was became a candidate for communication protocol for IoT. Camera module can be easy attached to IoT device, because it is small and flexible. Furthermore almost smartphone equip one or two camera for both back and font side with high quality and resolution. It can be utilized for receiving the data from LED or positioning. Actually, OCC combines illumination and communication. It can supply communication for special areas or environment where do not allow Radio frequency such as hospital, airplane etc. There are many concept and experiment be proposed. In this paper we proposed utilizing Android smart-phone camera for receiver and introduce new approach in modulation scheme for LED at transmitter. It also show how Manchester coding can be used encode bits while at the same time being successfully decoded by Android smart-phone camera. We introduce new data frame format for easy decoded and can be achieve high bit rate. This format can be easy to adapt to performance limit of Android operator or embedded system.

A study on the MVNO Wholesale Price in Competitive Communication Service Market (경쟁적인 통신서비스 시장에서 MVNO 도매대가 산정에 관한 연구)

  • Sawng, Yeong-Wha;Bae, Khee-Su;Jeon, Heung-Joo
    • Journal of Information Technology Applications and Management
    • /
    • v.19 no.2
    • /
    • pp.217-231
    • /
    • 2012
  • In the past, companies should make enormous facility investment and acquire a right to do business in order to join communication markets, but now they can do business without important facilities, such as communication networks. Such a movement to ease regulations about companies which want to newly join the communication industry is expected not only to change a competition frame of the mobile communication market but also to greatly affect the entire communication industry. Through this study aiming to look into a way to calculate a reasonable wholesale price related to the government's introduction of the Mobile Virtual Network Operator (MVNO) system, I came up with a following result. I applied the operating profit percentage and the ratio of operating gain to cost to the cost plus model and retail minus model, respectively, to calculate the wholesale price and found that when I calculated with the cost plus model applying the operating profit percentage, I could get the highest wholesale price. On the other hand, I got the lowest wholesale price with the retail minus model by applying the operating profit percentage. Division of expenses and calculation of profit percentage are important factors in calculating the wholesale price and such results are expected to help accurate calculation of the MVNO wholesale price.

Digital TV Revenue Models and T-commerce strategies (디지털 TV 방송서비스 수익모델과 사업자별 T-commerce 통합전략)

  • 정충영
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.4
    • /
    • pp.589-597
    • /
    • 2003
  • This paper discusses revenue model of digital TV broadcast service(D-TV) and presents the basic framework for integrated strategy of applications. Also this paper presents the cases of D-TV in the frame of integrated model. The broadcasting operators should focus on interactive advertising and revenue generation utilizing customer participation. Also, they should utilize the strengths as a platform operator. The contents provider should be concerned about the retail revenue rather than commission revenue. The middle ware provider should develop new interactive D-TV service rather than system use fee or consulting fee.

Real-Time Water Wave Simulation with Surface Advection based on Mass Conservancy

  • Kim, Dong-Young;Yoo, Kwan-Hee
    • International Journal of Contents
    • /
    • v.4 no.2
    • /
    • pp.7-12
    • /
    • 2008
  • In this paper, we present a real-time physical simulation model of water surfaces with a novel method to represent the water mass flow in full three dimensions. In a physical simulation model, the state of the water surfaces is represented by a set of physical values, including height, velocity, and the gradient. The evolution of the velocity field in previous works is handled by a velocity solver based on the Navier-Stokes equations, which occurs as a result of the unevenness of the velocity propagation. In this paper, we integrate the principle of the mass conservation in a fluid of equilateral density to upgrade the height field from the unevenness, which in mathematical terms can be represented by the divergence operator. Thus the model generates waves induced by horizontal velocity, offering a simulation that puts forces added in all direction into account when calculating the values for height and velocity for the next frame. Other effects such as reflection off the boundaries, and interactions with floating objects are involved in our method. The implementation of our method demonstrates to run with fast speed scalable to real-time rates even for large simulation domains. Therefore, our model is appropriate for a real-time and large scale water surface simulation into which the animator wishes to visualize the global fluid flow as a main emphasis.

Improvement of Image Processing Algorithm for Particle Size Measurement Using Hough Transform (Hough 변환을 이용한 입경 측정을 위한 영상처리 알고리즘의 개선)

  • Kim, Yu-Dong;Lee, Sang-Yong
    • Journal of ILASS-Korea
    • /
    • v.6 no.1
    • /
    • pp.35-43
    • /
    • 2001
  • Previous studies on image processing techniques for panicle size measurement usually have focused on a single panicle or weakly overlapped particles. In the present work, the image processing algorithm for particle size measurement has been improved to process heavily-overlapped spherical-particle images. The algorithm consists of two steps; detection of boundaries which separate the images of the overlapped panicles from the background and the panicle identification process. For the first step, Sobel operator (using gray-level gradient) and the thinning process was adopted, and compared with the gray-level thresholding method that has been widely adopted. In the second, Hough transform was used. Hough transform is the detection algorithm of parametric curves such as straight lines or circles which can be described by several parameters. To reduce the measurement error, the process of finding the true center was added. The improved algorithm was tested by processing an image frame which contains heavily overlapped spherical panicles. The results showed that both the performances of detecting the overlapped images and separating the panicle from them were improved.

  • PDF