• Title/Summary/Keyword: 특징변환

Search Result 1,728, Processing Time 0.031 seconds

A Thoracic Spine Segmentation Technique for Automatic Extraction of VHS and Cobb Angle from X-ray Images (X-ray 영상에서 VHS와 콥 각도 자동 추출을 위한 흉추 분할 기법)

  • Ye-Eun, Lee;Seung-Hwa, Han;Dong-Gyu, Lee;Ho-Joon, Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.1
    • /
    • pp.51-58
    • /
    • 2023
  • In this paper, we propose an organ segmentation technique for the automatic extraction of medical diagnostic indicators from X-ray images. In order to calculate diagnostic indicators of heart disease and spinal disease such as VHS(vertebral heart scale) and Cobb angle, it is necessary to accurately segment the thoracic spine, carina, and heart in a chest X-ray image. A deep neural network model in which the high-resolution representation of the image for each layer and the structure converted into a low-resolution feature map are connected in parallel was adopted. This structure enables the relative position information in the image to be effectively reflected in the segmentation process. It is shown that learning performance can be improved by combining the OCR module, in which pixel information and object information are mutually interacted in a multi-step process, and the channel attention module, which allows each channel of the network to be reflected as different weight values. In addition, a method of augmenting learning data is presented in order to provide robust performance against changes in the position, shape, and size of the subject in the X-ray image. The effectiveness of the proposed theory was evaluated through an experiment using 145 human chest X-ray images and 118 animal X-ray images.

Analysis of calcium fluoride single crystal grown by the czochralski method (초크랄스키 방법으로 성장한 CaF2 단결정 분석)

  • Lee, Ha-Lin;Na, Jun-Hyuck;Park, Mi-Seon;Jang, Yeon-Suk;Jung, Hea-Kyun;Kim, Doo-Gun;Lee, Won-Jae
    • Journal of the Korean Crystal Growth and Crystal Technology
    • /
    • v.32 no.6
    • /
    • pp.219-224
    • /
    • 2022
  • CaF2 single crystal has a large band gap (12 eV), and it is used for optical windows, prisms, and lenses due to its excellent transmittance in a wide wavelength range and low refractive index. Moreover, it is expected to be one of the materials for ultraviolet transmissive laser optical components. CaF2 belongs to the fluoride compounds and has a face-centered cubic (FCC) structure with three sub-lattices. The representative method for CaF2 single crystal growth is Czochralski, which method has the advantages of high production efficiency and the ability to make large crystals. In this study, X-ray diffraction (XRD), X-ray rocking curves (XRC) measurement, and chemical etching were performed to analyze the crystallinity and defect density of the CaF2 single crystals, grown by the Czochralski method. Fourier-transform infrared spectroscopy (FT-IR) and UV-VIS-NIR spectroscopy systems were used to investigate the optical properties of the CaF2 crystal. The provability of various applications, including UV application, was systematically investigated with various analysis results.

Analysis and Prediction Methods of Marine Accident Patterns related to Vessel Traffic using Long Short-Term Memory Networks (장단기 기억 신경망을 활용한 선박교통 해양사고 패턴 분석 및 예측)

  • Jang, Da-Un;Kim, Joo-Sung
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.28 no.5
    • /
    • pp.780-790
    • /
    • 2022
  • Quantitative risk levels must be presented by analyzing the causes and consequences of accidents and predicting the occurrence patterns of the accidents. For the analysis of marine accidents related to vessel traffic, research on the traffic such as collision risk analysis and navigational path finding has been mainly conducted. The analysis of the occurrence pattern of marine accidents has been presented according to the traditional statistical analysis. This study intends to present a marine accident prediction model using the statistics on marine accidents related to vessel traffic. Statistical data from 1998 to 2021, which can be accumulated by month and hourly data among the Korean domestic marine accidents, were converted into structured time series data. The predictive model was built using a long short-term memory network, which is a representative artificial intelligence model. As a result of verifying the performance of the proposed model through the validation data, the RMSEs were noted to be 52.5471 and 126.5893 in the initial neural network model, and as a result of the updated model with observed datasets, the RMSEs were improved to 31.3680 and 36.3967, respectively. Based on the proposed model, the occurrence pattern of marine accidents could be predicted by learning the features of various marine accidents. In further research, a quantitative presentation of the risk of marine accidents and the development of region-based hazard maps are required.

Analysis of teaching and learning contents of matrix in German high school mathematics (독일 고등학교 수학에서 행렬 교수·학습 내용 분석)

  • Ahn, Eunkyung;Ko, Ho Kyoung
    • The Mathematical Education
    • /
    • v.62 no.2
    • /
    • pp.269-287
    • /
    • 2023
  • Matrix theory is widely used not only in mathematics, natural sciences, and engineering, but also in social sciences and artificial intelligence. In the 2009 revised mathematics curriculum, matrices were removed from high school math education to reduce the burden on students, but in anticipation of the age of artificial intelligence, they will be reintegrated into the 2022 revised education curriculum. Therefore, there is a need to analyze the matrix content covered in other countries to suggest a meaningful direction for matrix education and to derive implications for textbook composition. In this study, we analyzed the German mathematics curriculum and standard education curriculum, as well as the matrix units in the German Hesse state mathematics curriculum and textbook, and identified the characteristics of their content elements and development methods. As a result of our analysis, it was found that the German textbooks cover matrices in three categories: matrices for solving linear equations, matrices for explaining linear transformations, and matrices for explaining transition processes. It was also found that the emphasis was on mathematical reasoning and modeling when learning matrices. Based on these findings, we suggest that if matrices are to be reintegrated into school mathematics, the curriculum should focus on deep conceptual understanding, mathematical reasoning, and mathematical modeling in textbook composition.

An Iterative Digital Image Watermarking Technique using Encrypted Binary Phase Computer Generated Hologram in the DCT Domain (DCT 영역에서 암호화된 이진 위상 컴퓨터형성 홀로그램을 이용한 반복적 디지털 영상 워터마킹 기술)

  • Kim, Cheol-Su
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.14 no.3
    • /
    • pp.15-21
    • /
    • 2009
  • In this paper, we proposed an iterative digital image watermarking technique using encrypted binary phase computer generated hologram in the discrete cosine transform(OCT) domain. For the embedding process of watermark, using simulated annealing algorithm, we would generate a binary phase computer generated hologram(BPCGH) which can reconstruct hidden image perfectly instead of hidden image and repeat the hologram and encrypt it through the XOR operation with key image that is ramdomly generated binary phase components. We multiply the encrypted watermark by the weight function and embed it into the DC coefficients in the DCT domain of host image and an inverse DCT is performed. For the extracting process of watermark, we compare the DC coefficients of watermarked image and original host image in the DCT domain and dividing it by the weight function and decrypt it using XOR operation with key image. And we recover the hidden image by inverse Fourier transforming the decrypted watermark. Finally, we compute the correlation between the original hidden image and recovered hidden image to determine if a watermark exits in the host image. The proposed watermarking technique use the hologram information of hidden image which consist of binary values and encryption technique so it is very secure and robust to the external attacks such as compression, noises and cropping. We confirmed the advantages of the proposed watermarking technique through the computer simulations.

An Automatic ROI Extraction and Its Mask Generation based on Wavelet of Low DOF Image (피사계 심도가 낮은 이미지에서 웨이블릿 기반의 자동 ROI 추출 및 마스크 생성)

  • Park, Sun-Hwa;Seo, Yeong-Geon;Lee, Bu-Kweon;Kang, Ki-Jun;Kim, Ho-Yong;Kim, Hyung-Jun;Kim, Sang-Bok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.3
    • /
    • pp.93-101
    • /
    • 2009
  • This paper suggests a new algorithm automatically searching for Region-of-Interest(ROI) with high speed, using the edge information of high frequency subband transformed with wavelet. The proposed method executes a searching algorithm of 4-direction object boundary by the unit of block using the edge information, and detects ROIs. The whole image is splitted by $64{\times}64$ or $32{\times}32$ sized blocks and the blocks can be ROI block or background block according to taking the edges or not. The 4-directions searche the image from the outside to the center and the algorithm uses a feature that the low-DOF image has some edges as one goes to center. After searching all the edges, the method regards the inner blocks of the edges as ROI, and makes the ROI masks and sends them to server. This is one of the dynamic ROI method. The existing methods have had some problems of complicated filtering and region merge, but this method improved considerably the problems. Also, it was possible to apply to an application requiring real-time processing caused by the process of the unit of block.

A Study on the Thermal Prediction Model cf the Heat Storage Tank for the Optimal Use of Renewable Energy (신재생 에너지 최적 활용을 위한 축열조 온도 예측 모델 연구)

  • HanByeol Oh;KyeongMin Jang;JeeYoung Oh;MyeongBae Lee;JangWoo Park;YongYun Cho;ChangSun Shin
    • Smart Media Journal
    • /
    • v.12 no.10
    • /
    • pp.63-70
    • /
    • 2023
  • Recently, energy consumption for heating costs, which is 35% of smart farm energy costs, has increased, requiring energy consumption efficiency, and the importance of new and renewable energy is increasing due to concerns about the realization of electricity bills. Renewable energy belongs to hydropower, wind, and solar power, of which solar energy is a power generation technology that converts it into electrical energy, and this technology has less impact on the environment and is simple to maintain. In this study, based on the greenhouse heat storage tank and heat pump data, the factors that affect the heat storage tank are selected and a heat storage tank supply temperature prediction model is developed. It is predicted using Long Short-Term Memory (LSTM), which is effective for time series data analysis and prediction, and XGBoost model, which is superior to other ensemble learning techniques. By predicting the temperature of the heat pump heat storage tank, energy consumption may be optimized and system operation may be optimized. In addition, we intend to link it to the smart farm energy integrated operation system, such as reducing heating and cooling costs and improving the energy independence of farmers due to the use of solar power. By managing the supply of waste heat energy through the platform and deriving the maximum heating load and energy values required for crop growth by season and time, an optimal energy management plan is derived based on this.

Reproducing Summarized Video Contents based on Camera Framing and Focus

  • Hyung Lee;E-Jung Choi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.10
    • /
    • pp.85-92
    • /
    • 2023
  • In this paper, we propose a method for automatically generating story-based abbreviated summaries from long-form dramas and movies. From the shooting stage, the basic premise was to compose a frame with illusion of depth considering the golden division as well as focus on the object of interest to focus the viewer's attention in terms of content delivery. To consider how to extract the appropriate frames for this purpose, we utilized elemental techniques that have been utilized in previous work on scene and shot detection, as well as work on identifying focus-related blur. After converting the videos shared on YouTube to frame-by-frame, we divided them into a entire frame and three partial regions for feature extraction, and calculated the results of applying Laplacian operator and FFT to each region to choose the FFT with relative consistency and robustness. By comparing the calculated values for the entire frame with the calculated values for the three regions, the target frames were selected based on the condition that relatively sharp regions could be identified. Based on the selected results, the final frames were extracted by combining the results of an offline change point detection method to ensure the continuity of the frames within the shot, and an edit decision list was constructed to produce an abbreviated summary of 62.77% of the footage with F1-Score of 75.9%

Multi-View 3D Human Pose Estimation Based on Transformer (트랜스포머 기반의 다중 시점 3차원 인체자세추정)

  • Seoung Wook Choi;Jin Young Lee;Gye Young Kim
    • Smart Media Journal
    • /
    • v.12 no.11
    • /
    • pp.48-56
    • /
    • 2023
  • The technology of Three-dimensional human posture estimation is used in sports, motion recognition, and special effects of video media. Among various methods for this, multi-view 3D human pose estimation is essential for precise estimation even in complex real-world environments. But Existing models for multi-view 3D human posture estimation have the disadvantage of high order of time complexity as they use 3D feature maps. This paper proposes a method to extend an existing monocular viewpoint multi-frame model based on Transformer with lower time complexity to 3D human posture estimation for multi-viewpoints. To expand to multi-viewpoints our proposed method first generates an 8-dimensional joint coordinate that connects 2-dimensional joint coordinates for 17 joints at 4-vieiwpoints acquired using the 2-dimensional human posture detector, CPN(Cascaded Pyramid Network). This paper then converts them into 17×32 data with patch embedding, and enters the data into a transformer model, finally. Consequently, the MLP(Multi-Layer Perceptron) block that outputs the 3D-human posture simultaneously updates the 3D human posture estimation for 4-viewpoints at every iteration. Compared to Zheng[5]'s method the number of model parameters of the proposed method was 48.9%, MPJPE(Mean Per Joint Position Error) was reduced by 20.6 mm (43.8%) and the average learning time per epoch was more than 20 times faster.

  • PDF

Analysis of Research Trends in Deep Learning-Based Video Captioning (딥러닝 기반 비디오 캡셔닝의 연구동향 분석)

  • Lyu Zhi;Eunju Lee;Youngsoo Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.13 no.1
    • /
    • pp.35-49
    • /
    • 2024
  • Video captioning technology, as a significant outcome of the integration between computer vision and natural language processing, has emerged as a key research direction in the field of artificial intelligence. This technology aims to achieve automatic understanding and language expression of video content, enabling computers to transform visual information in videos into textual form. This paper provides an initial analysis of the research trends in deep learning-based video captioning and categorizes them into four main groups: CNN-RNN-based Model, RNN-RNN-based Model, Multimodal-based Model, and Transformer-based Model, and explain the concept of each video captioning model. The features, pros and cons were discussed. This paper lists commonly used datasets and performance evaluation methods in the video captioning field. The dataset encompasses diverse domains and scenarios, offering extensive resources for the training and validation of video captioning models. The model performance evaluation method mentions major evaluation indicators and provides practical references for researchers to evaluate model performance from various angles. Finally, as future research tasks for video captioning, there are major challenges that need to be continuously improved, such as maintaining temporal consistency and accurate description of dynamic scenes, which increase the complexity in real-world applications, and new tasks that need to be studied are presented such as temporal relationship modeling and multimodal data integration.