• Title/Summary/Keyword: 영상 이미지

Search Result 3,018, Processing Time 0.028 seconds

Group-based Adaptive Rendering for 6DoF Immersive Video Streaming (6DoF 몰입형 비디오 스트리밍을 위한 그룹 분할 기반 적응적 렌더링 기법)

  • Lee, Soonbin;Jeong, Jong-Beom;Ryu, Eun-Seok
    • Journal of Broadcast Engineering
    • /
    • v.27 no.2
    • /
    • pp.216-227
    • /
    • 2022
  • The MPEG-I (Immersive) group is working on a standardization project for immersive video that provides 6 degrees of freedom (6DoF). The MPEG Immersion Video (MIV) standard technology is intended to provide limited 6DoF based on depth map-based image rendering (DIBR) technique. Many efficient coding methods have been suggested for MIV, but efficient transmission strategies have received little attention in MPEG-I. This paper proposes group-based adaptive rendering method for immersive video streaming. Each group can be transmitted independently using group-based encoding, enabling adaptive transmission depending on the user's viewport. In the rendering process, the proposed method derives weights of group for view synthesis and allocate high quality bitstream according to a given viewport. The proposed method is implemented through the Test Model for Immersive Video (TMIV) test model. The proposed method demonstrates 17.0% Bjontegaard-delta rate (BD-rate) savings on the peak signalto-noise ratio (PSNR) and 14.6% on the Immersive Video PSNR(IV-PSNR) in terms of various end-to-end evaluation metrics in the experiment.

A Study on Applicability of Smartphone Camera and Lens for Concrete Crack Measurement Using Image Processing Techniques (이미지 처리기법을 이용한 균열 측정시 스마트폰 카메라 및 렌즈 적용성에 대한 연구)

  • Seo, Seunghwan;Kim, Dong-Hyun;Chung, Moonkyung
    • Journal of the Korean Geosynthetics Society
    • /
    • v.20 no.4
    • /
    • pp.63-71
    • /
    • 2021
  • Recently, high-resolution cameras in smartphones enable measurement of minute objects such as cracks in concrete using image processing techniques. The technology to investigate the crack width using an application at an adjacent distance of the close shot range has already been implemented, but the use is limited, so it is necessary to verify the usability of the high-resolution smartphone camera to measure cracks at a longer distance. This study focuses on recognizing the size of subdivided crack widths at a thickness within 1.0 mm of crack width at a distance of 2 m. In recent Android-based smartphones, an experiment was conducted focusing on the relationship between the unit pixel size, which is a measurement component, and the shooting distance, depending on the camera resolution. As a result, it was possible to confirm the necessity of a smartphone lens for the classification and quantification of microcrack widths of 0.3 mm to 1mm. The universal telecentric lens for smartphones needed to be installed in an accurate position to minimize the effect of distortion. In addition, as a result of applying a 64 MP high-resolution smartphone camera and double magnification lens, the crack width could be calculated within 2 m in pixel units, and crack widths of 0.3, 0.5, and 1mm could be distinguished.

Multimodal Sentiment Analysis Using Review Data and Product Information (리뷰 데이터와 제품 정보를 이용한 멀티모달 감성분석)

  • Hwang, Hohyun;Lee, Kyeongchan;Yu, Jinyi;Lee, Younghoon
    • The Journal of Society for e-Business Studies
    • /
    • v.27 no.1
    • /
    • pp.15-28
    • /
    • 2022
  • Due to recent expansion of online market such as clothing, utilizing customer review has become a major marketing measure. User review has been used as a tool of analyzing sentiment of customers. Sentiment analysis can be largely classified with machine learning-based and lexicon-based method. Machine learning-based method is a learning classification model referring review and labels. As research of sentiment analysis has been developed, multi-modal models learned by images and video data in reviews has been studied. Characteristics of words in reviews are differentiated depending on products' and customers' categories. In this paper, sentiment is analyzed via considering review data and metadata of products and users. Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Self Attention-based Multi-head Attention models and Bidirectional Encoder Representation from Transformer (BERT) are used in this study. Same Multi-Layer Perceptron (MLP) model is used upon every products information. This paper suggests a multi-modal sentiment analysis model that simultaneously considers user reviews and product meta-information.

The Performance Improvement of U-Net Model for Landcover Semantic Segmentation through Data Augmentation (데이터 확장을 통한 토지피복분류 U-Net 모델의 성능 개선)

  • Baek, Won-Kyung;Lee, Moung-Jin;Jung, Hyung-Sup
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_2
    • /
    • pp.1663-1676
    • /
    • 2022
  • Recently, a number of deep-learning based land cover segmentation studies have been introduced. Some studies denoted that the performance of land cover segmentation deteriorated due to insufficient training data. In this study, we verified the improvement of land cover segmentation performance through data augmentation. U-Net was implemented for the segmentation model. And 2020 satellite-derived landcover dataset was utilized for the study data. The pixel accuracies were 0.905 and 0.923 for U-Net trained by original and augmented data respectively. And the mean F1 scores of those models were 0.720 and 0.775 respectively, indicating the better performance of data augmentation. In addition, F1 scores for building, road, paddy field, upland field, forest, and unclassified area class were 0.770, 0.568, 0.433, 0.455, 0.964, and 0.830 for the U-Net trained by original data. It is verified that data augmentation is effective in that the F1 scores of every class were improved to 0.838, 0.660, 0.791, 0.530, 0.969, and 0.860 respectively. Although, we applied data augmentation without considering class balances, we find that data augmentation can mitigate biased segmentation performance caused by data imbalance problems from the comparisons between the performances of two models. It is expected that this study would help to prove the importance and effectiveness of data augmentation in various image processing fields.

A case study of ground subsidence analysis using the InSAR technique (InSAR 기술을 이용한 지반침하분석 사례연구)

  • Moon, Joon-Shik;Oh, Hyoung-seok
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.24 no.2
    • /
    • pp.171-182
    • /
    • 2022
  • InSAR (Interferometry SAR) technique is a technique that uses complex data to obtain phase difference information from two or more SAR image data, and enables high-resolution image extraction, surface change detection, elevation measurement, and glacial change observation. In many countries, research on the InSAR technique is being conducted in various fields of study such as volcanic activity detection, glacier observation in Antarctica, and ground subsidence analysis. In this study, a case of large ground settlement due to groundwater level drawdown during tunnelling was introduced, and ground settlement analyses using InSAR technique and numerical analysis method were compared. The maximum settlement and influence radius estimated by the InSAR technique and numerical method were found to be quite similar, which confirms the reliability of the InSAR technique. Through this case study, it was found that the InSAR technique reliable to use for estimating ground settlement and can be used as a key technology to identify the long-term ground settlement history in the absence of measurement data.

Analysis Study on the Detection and Classification of COVID-19 in Chest X-ray Images using Artificial Intelligence (인공지능을 활용한 흉부 엑스선 영상의 코로나19 검출 및 분류에 대한 분석 연구)

  • Yoon, Myeong-Seong;Kwon, Chae-Rim;Kim, Sung-Min;Kim, Su-In;Jo, Sung-Jun;Choi, Yu-Chan;Kim, Sang-Hyun
    • Journal of the Korean Society of Radiology
    • /
    • v.16 no.5
    • /
    • pp.661-672
    • /
    • 2022
  • After the outbreak of the SARS-CoV2 virus that causes COVID-19, it spreads around the world with the number of infections and deaths rising rapidly caused a shortage of medical resources. As a way to solve this problem, chest X-ray diagnosis using Artificial Intelligence(AI) received attention as a primary diagnostic method. The purpose of this study is to comprehensively analyze the detection of COVID-19 via AI. To achieve this purpose, 292 studies were collected through a series of Classification methods. Based on these data, performance measurement information including Accuracy, Precision, Area Under Cover(AUC), Sensitivity, Specificity, F1-score, Recall, K-fold, Architecture and Class were analyzed. As a result, the average Accuracy, Precision, AUC, Sensitivity and Specificity were achieved as 95.2%, 94.81%, 94.01%, 93.5%, and 93.92%, respectively. Although the performance measurement information on a year-on-year basis gradually increased, furthermore, we conducted a study on the rate of change according to the number of Class and image data, the ratio of use of Architecture and about the K-fold. Currently, diagnosis of COVID-19 using AI has several problems to be used independently, however, it is expected that it will be sufficient to be used as a doctor's assistant.

Deep Learning Based Rescue Requesters Detection Algorithm for Physical Security in Disaster Sites (재난 현장 물리적 보안을 위한 딥러닝 기반 요구조자 탐지 알고리즘)

  • Kim, Da-hyeon;Park, Man-bok;Ahn, Jun-ho
    • Journal of Internet Computing and Services
    • /
    • v.23 no.4
    • /
    • pp.57-64
    • /
    • 2022
  • If the inside of a building collapses due to a disaster such as fire, collapse, or natural disaster, the physical security inside the building is likely to become ineffective. Here, physical security is needed to minimize the human casualties and physical damages in the collapsed building. Therefore, this paper proposes an algorithm to minimize the damage in a disaster situation by fusing existing research that detects obstacles and collapsed areas in the building and a deep learning-based object detection algorithm that minimizes human casualties. The existing research uses a single camera to determine whether the corridor environment in which the robot is currently located has collapsed and detects obstacles that interfere with the search and rescue operation. Here, objects inside the collapsed building have irregular shapes due to the debris or collapse of the building, and they are classified and detected as obstacles. We also propose a method to detect rescue requesters-the most important resource in the disaster situation-and minimize human casualties. To this end, we collected open-source disaster images and image data of disaster situations and calculated the accuracy of detecting rescue requesters in disaster situations through various deep learning-based object detection algorithms. In this study, as a result of analyzing the algorithms that detect rescue requesters in disaster situations, we have found that the YOLOv4 algorithm has an accuracy of 0.94, proving that it is most suitable for use in actual disaster situations. This paper will be helpful for performing efficient search and rescue in disaster situations and achieving a high level of physical security, even in collapsed buildings.

Analysis of Influence Factors on the Satisfaction of Viewers on China's CCTV-9 Channel (중국 CCTV-9 채널 시청자의 프로그램 관람 만족도 결정요인 분석)

  • Guo, Yuan;Wang, Zhifeng
    • Journal of Korea Entertainment Industry Association
    • /
    • v.15 no.8
    • /
    • pp.107-116
    • /
    • 2021
  • In recent years, the research on audience satisfaction after watching programs has been carried out in various fields. However, there is no precedent for the study of simply analyzing the influencing factors of audience satisfaction with the newly established CCTV-9 channel. For CCTV-9, how to explore the strategy of industrial development based on the needs of the audience in the era of big data is a very important part. This article exploratively focuses on the influencing factors related to CCTV-9 audience satisfaction. Using questionnaires, 101 samples of the satisfaction with the channel of men and women of different ages, education backgrounds, majors, and incomes were collected to test, and 9 hypotheses were tentatively proposed as relevant influencing factors of channel satisfaction. Through empirical analysis, this research searches for the determinants. The reliability and validity of the measurement were properly analyzed, and all hypotheses were statistically tested. The empirical results show that: subject matter, program format, program scheduling, program broadcast time, channel advertising, simulcast series of documentaries, diversified communication platforms, brand image packaging and audience satisfaction are significantly positively correlated.

Real-time Segmentation of Black Ice Region in Infrared Road Images

  • Li, Yu-Jie;Kang, Sun-Kyoung;Jung, Sung-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.2
    • /
    • pp.33-42
    • /
    • 2022
  • In this paper, we proposed a deep learning model based on multi-scale dilated convolution feature fusion for the segmentation of black ice region in road image to send black ice warning to drivers in real time. In the proposed multi-scale dilated convolution feature fusion network, different dilated ratio convolutions are connected in parallel in the encoder blocks, and different dilated ratios are used in different resolution feature maps, and multi-layer feature information are fused together. The multi-scale dilated convolution feature fusion improves the performance by diversifying and expending the receptive field of the network and by preserving detailed space information and enhancing the effectiveness of diated convolutions. The performance of the proposed network model was gradually improved with the increase of the number of dilated convolution branch. The mIoU value of the proposed method is 96.46%, which was higher than the existing networks such as U-Net, FCN, PSPNet, ENet, LinkNet. The parameter was 1,858K, which was 6 times smaller than the existing LinkNet model. From the experimental results of Jetson Nano, the FPS of the proposed method was 3.63, which can realize segmentation of black ice field in real time.

A Study on 'Character Creation' of Personality Actor - Focusing on Actor Jung Woo-sung and the Characters He Played - (퍼스낼리티 배우의 '인물창조' 연구 -배우 '정우성-캐릭터'들을 중심으로-)

  • Oh, Youn-Hong
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.8
    • /
    • pp.141-152
    • /
    • 2020
  • This thesis is a study on the creation of characters of "personality actors and character actors" among the concepts of "star persona" of S. Prince. This study selected 7 representative films of Jung Woo-sung, a representative personality actor in the Korean film industry, and analyzed how 'Jung Woo-sung-character' is embodied in the film as a character component suggested by R. Dyer. In the traditional sense, the actor's acting has been highly valued for transformation and impersonation in character creation, but discussing the method of creating a character focusing only on the actor's acting in a movie acting implemented in combination with advanced technology I don't think it's suitable for movie acting research. Theorists such as R.Dyer and Edgar Morin, who studied film actors and stars, also emphasize that 'movie actors are different from theater actors' (by the film medium). Therefore, in this thesis, a detailed analysis of the acting of a personality actor was avoided. As Morin pointed out that the character in the movie can be implemented through the actor's image and type, not playing the type of role. I tried to propose the significance of this character creation method of the personality actor by analyzing actor Jung Woo-sung and his character.