• Title/Summary/Keyword: Processing

Search Result 69,719, Processing Time 0.091 seconds

Effective Multi-Modal Feature Fusion for 3D Semantic Segmentation with Multi-View Images (멀티-뷰 영상들을 활용하는 3차원 의미적 분할을 위한 효과적인 멀티-모달 특징 융합)

  • Hye-Lim Bae;Incheol Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.505-518
    • /
    • 2023
  • 3D point cloud semantic segmentation is a computer vision task that involves dividing the point cloud into different objects and regions by predicting the class label of each point. Existing 3D semantic segmentation models have some limitations in performing sufficient fusion of multi-modal features while ensuring both characteristics of 2D visual features extracted from RGB images and 3D geometric features extracted from point cloud. Therefore, in this paper, we propose MMCA-Net, a novel 3D semantic segmentation model using 2D-3D multi-modal features. The proposed model effectively fuses two heterogeneous 2D visual features and 3D geometric features by using an intermediate fusion strategy and a multi-modal cross attention-based fusion operation. Also, the proposed model extracts context-rich 3D geometric features from input point cloud consisting of irregularly distributed points by adopting PTv2 as 3D geometric encoder. In this paper, we conducted both quantitative and qualitative experiments with the benchmark dataset, ScanNetv2 in order to analyze the performance of the proposed model. In terms of the metric mIoU, the proposed model showed a 9.2% performance improvement over the PTv2 model using only 3D geometric features, and a 12.12% performance improvement over the MVPNet model using 2D-3D multi-modal features. As a result, we proved the effectiveness and usefulness of the proposed model.

Comparative Analysis of Self-supervised Deephashing Models for Efficient Image Retrieval System (효율적인 이미지 검색 시스템을 위한 자기 감독 딥해싱 모델의 비교 분석)

  • Kim Soo In;Jeon Young Jin;Lee Sang Bum;Kim Won Gyum
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.519-524
    • /
    • 2023
  • In hashing-based image retrieval, the hash code of a manipulated image is different from the original image, making it difficult to search for the same image. This paper proposes and evaluates a self-supervised deephashing model that generates perceptual hash codes from feature information such as texture, shape, and color of images. The comparison models are autoencoder-based variational inference models, but the encoder is designed with a fully connected layer, convolutional neural network, and transformer modules. The proposed model is a variational inference model that includes a SimAM module of extracting geometric patterns and positional relationships within images. The SimAM module can learn latent vectors highlighting objects or local regions through an energy function using the activation values of neurons and surrounding neurons. The proposed method is a representation learning model that can generate low-dimensional latent vectors from high-dimensional input images, and the latent vectors are binarized into distinguishable hash code. From the experimental results on public datasets such as CIFAR-10, ImageNet, and NUS-WIDE, the proposed model is superior to the comparative model and analyzed to have equivalent performance to the supervised learning-based deephashing model. The proposed model can be used in application systems that require low-dimensional representation of images, such as image search or copyright image determination.

Extraction and Utilization of DEM based on UAV Photogrammetry for Flood Trace Investigation and Flood Prediction (침수흔적조사를 위한 UAV 사진측량 기반 DEM의 추출 및 활용)

  • Jung-Sik PARK;Yong-Jin CHOI;Jin-Duk LEE
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.26 no.4
    • /
    • pp.237-250
    • /
    • 2023
  • Orthophotos and DEMs were generated by UAV-based aerial photogrammetry and an attempt was made to apply them to detailed investigations for the production of flood traces. The cultivated area located in Goa-eup, Gumi, where the embankment collapsed and inundated inundation occurred due to the impact of 6th Typhoon Sanba in 2012, was selected as rhe target area. To obtain optimal accuracy of UAV photogrammetry performance, the UAV images were taken under the optimal placement of 19 GCPs and then point cloud, DEM, and orthoimages were generated through image processing using Pix4Dmapper software. After applying CloudCompare's CSF Filtering to separate the point cloud into ground elements and non-ground elements, a finally corrected DEM was created using only non-ground elements in GRASS GIS software. The flood level and flood depth data extracted from the final generated DEM were compared and presented with the flood level and flood depth data from existing data as of 2012 provided through the public data portal site of the Korea Land and Geospatial Informatix Corporation(LX).

Effect of calcium silicate-based sealer to bone tissue of mandible of rats (칼슘 실리케이트 계열 실러가 흰쥐의 하악골 조직에 미치는 영향)

  • Jee-Seon Tae;Ki-Yeon Yoo;Jin-Woo Kim;Kyung-Mo Cho;Yoon Lee;Se-Hee Park
    • Journal of Dental Rehabilitation and Applied Science
    • /
    • v.40 no.1
    • /
    • pp.1-12
    • /
    • 2024
  • Purpose: To histologically evaluate the effects of three calcium silicate-based sealers on rat mandible tissue. Materials and Methods: Rats were randomly divided as follows: A group that sacrificed immediately after cavity preparation, a group that sacrificed two weeks after cavity preparation, a group that sacrificed two weeks after CeraSeal (CS), AH Plus Bioceramic (AHB), or One-Fil (OF) sealer injection, respectively. After tissue processing for all groups, the bone tissue area (%) and the number of osteoclasts in and around the cavity were measured under a microscope. The results of each group were compared and statistical analysis was performed using one-way ANOVA and Tukey's test. Results: The formation of bone tissue and the presence of osteoclasts in the cavity were observed in the group that sacrificed two weeks after cavity preparation and the group sacrificed two weeks after AHB sealer injection, and these groups showed significantly higher average bone tissue area (%) than the other groups. In the other groups, no inflammation or foreign body reaction occurred in the cavity, and no osteoclasts were observed. Conclusion: All calcium silicate-based sealers used in this study showed a favorable bone tissue response when injected into the rat mandible. In particular, higher bone formation in the cavity was observed in AHB.

Changes in the Teaching Expertise of Teachers Participating in an In-School Professional Learning Community for Elementary Science Instructional Research (초등과학 수업 연구를 위한 학교 안 전문적 학습공동체 참여 교사들의 수업 전문성 변화 양상)

  • Kim, Eun Seo;Lee, Sun-Kyung
    • Journal of Korean Elementary Science Education
    • /
    • v.43 no.1
    • /
    • pp.185-200
    • /
    • 2024
  • This study explored the changes in the elementary science teaching expertise of teachers who participated in an in-school professional learning community for elementary science instructional research. Six elementary school teachers from grades 4, 5, and 6 at an 18-class S elementary school in a medium-sized city in Chungcheongbuk-do conducted collaborative instructional research on elementary science lessons as part of an in-school professional learning community, which was held 26 times over 7 months in 2020. During the professional learning community, video and audio recordings of the activities, research lessons, course materials, and professional learning community reflection activities were collected for analysis. The collected data were analyzed using qualitative research methods; data processing, reading, note-taking, description, classification, interpretation, reporting, and visualization; and the instructional professionalism elements were extracted based on the instructional professionalism framework. In the early professional learning community activity stages, the participating teachers first discussed their teaching perspectives, their experiences, and their goals for teaching science, which resulted in a selection of research questions. The teachers then collaboratively designed and implemented research lessons for each grade level, after which lesson reflections were conducted. The teachers' abilities to engage in qualitative reflection on the research questions improved after each reflection iteration. It was found that this professional learning community collaborative lesson study experience positively contributed to teaching expertise development. Based on the study findings, the implications for using professional learning communities to improve elementary teachers' science teaching expertise are given.

Analysis of Scoring Difficulty in Different Match Situations in Relation to First Athlete to Score in World Taekwondo Athletes (세계태권도 겨루기 선수들의 선제득점에 따른 경기 내용별 득점 난이도 분석)

  • Mi-Na Jin;Jung-Hyun Yun;Chang-Jin Lee
    • Journal of Industrial Convergence
    • /
    • v.22 no.4
    • /
    • pp.21-29
    • /
    • 2024
  • This study analyzed the difficulty of scoring in different match situations in relation to which competitor scored first. The study analyzed the data from the 2022 Guadalajara World Taekwondo Championships. The analysis was performed for two separate weight classes: lightweight and heavyweight. Four game content variables were used: whether the athlete scored first, attack type, attack area, and game situation. Descriptive statistics, the Rasch model, and discrimination function questions were applied for data processing. SPSS and Winsteps were used for the statistical analysis, and the statistical significance level was set at 0.05. Consequently, in the lightweight class, the scoring frequency of the first scorer was high for all the game variables. In the heavyweight class, the scoring frequency for the first scorer was high for the attack type and attack area. By contrast, those who did not score first were more frequently found to be in a loss situation. By analyzing the scoring difficulties in different match situations based on whether the competitor scored first, the athletes who scored first in attack type most easily scored first. In losing situations, the athletes who scored first in attack area scored most easily, whereas those who did not score first scored most easily in body and match situations. For the heavyweight class, those who scored first in terms of attack type, counter-attack, and attack area scored the most easily while winning in body and match situations.

Multifaceted Evaluation Methodology for AI Interview Candidates - Integration of Facial Recognition, Voice Analysis, and Natural Language Processing (AI면접 대상자에 대한 다면적 평가방법론 -얼굴인식, 음성분석, 자연어처리 영역의 융합)

  • Hyunwook Ji;Sangjin Lee;Seongmin Mun;Jaeyeol Lee;Dongeun Lee;kyusang Lim
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2024.01a
    • /
    • pp.55-58
    • /
    • 2024
  • 최근 각 기업의 AI 면접시스템 도입이 증가하고 있으며, AI 면접에 대한 실효성 논란 또한 많은 상황이다. 본 논문에서는 AI 면접 과정에서 지원자를 평가하는 방식을 시각, 음성, 자연어처리 3영역에서 구현함으로써, 면접 지원자를 다방면으로 분석 방법론의 적절성에 대해 평가하고자 한다. 첫째, 시각적 측면에서, 면접 지원자의 감정을 인식하기 위해, 합성곱 신경망(CNN) 기법을 활용해, 지원자 얼굴에서 6가지 감정을 인식했으며, 지원자가 카메라를 응시하고 있는지를 시계열로 도출하였다. 이를 통해 지원자가 면접에 임하는 태도와 특히 얼굴에서 드러나는 감정을 분석하는 데 주력했다. 둘째, 시각적 효과만으로 면접자의 태도를 파악하는 데 한계가 있기 때문에, 지원자 음성을 주파수로 환산해 특성을 추출하고, Bidirectional LSTM을 활용해 훈련해 지원자 음성에 따른 6가지 감정을 추출했다. 셋째, 지원자의 발언 내용과 관련해 맥락적 의미를 파악해 지원자의 상태를 파악하기 위해, 음성을 STT(Speech-to-Text) 기법을 이용하여 텍스트로 변환하고, 사용 단어의 빈도를 분석하여 지원자의 언어 습관을 파악했다. 이와 함께, 지원자의 발언 내용에 대한 감정 분석을 위해 KoBERT 모델을 적용했으며, 지원자의 성격, 태도, 직무에 대한 이해도를 파악하기 위해 객관적인 평가지표를 제작하여 적용했다. 논문의 분석 결과 AI 면접의 다면적 평가시스템의 적절성과 관련해, 시각화 부분에서는 상당 부분 정확도가 객관적으로 입증되었다고 판단된다. 음성에서 감정분석 분야는 면접자가 제한된 시간에 모든 유형의 감정을 드러내지 않고, 또 유사한 톤의 말이 진행되다 보니 특정 감정을 나타내는 주파수가 다소 집중되는 현상이 나타났다. 마지막으로 자연어처리 영역은 면접자의 발언에서 나오는 말투, 특정 단어의 빈도수를 넘어, 전체적인 맥락과 느낌을 이해할 수 있는 자연어처리 분석모델의 필요성이 더욱 커졌음을 판단했다.

  • PDF

Voice Synthesis Detection Using Language Model-Based Speech Feature Extraction (언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지)

  • Seung-min Kim;So-hee Park;Dae-seon Choi
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.3
    • /
    • pp.439-449
    • /
    • 2024
  • Recent rapid advancements in voice generation technology have enabled the natural synthesis of voices using text alone. However, this progress has led to an increase in malicious activities, such as voice phishing (voishing), where generated voices are exploited for criminal purposes. Numerous models have been developed to detect the presence of synthesized voices, typically by extracting features from the voice and using these features to determine the likelihood of voice generation.This paper proposes a new model for extracting voice features to address misuse cases arising from generated voices. It utilizes a deep learning-based audio codec model and the pre-trained natural language processing model BERT to extract novel voice features. To assess the suitability of the proposed voice feature extraction model for voice detection, four generated voice detection models were created using the extracted features, and performance evaluations were conducted. For performance comparison, three voice detection models based on Deepfeature proposed in previous studies were evaluated against other models in terms of accuracy and EER. The model proposed in this paper achieved an accuracy of 88.08%and a low EER of 11.79%, outperforming the existing models. These results confirm that the voice feature extraction method introduced in this paper can be an effective tool for distinguishing between generated and real voices.

Nutritional analysis of amino acid composition and zinc bioavailability in plant-based meats (대체육의 아미노산 조성 및 아연 생체 이용률의 영양학적 분석)

  • Seohyun Kang;Solmin Lee;Min Seo Chang;Soorin Kim;Young-gyun Lim;Yujin Kim;Wonhyeong Jang
    • Analytical Science and Technology
    • /
    • v.37 no.3
    • /
    • pp.155-165
    • /
    • 2024
  • This study aimed to assess whether plant-based meat substitutes can effectively replace animal meat products in terms of amino acid composition and zinc bioavailability. The evaluation was conducted in response to the growing demand for meat substitutes, driven by the increasing vegan population and the expansion of vegan culture. For this purpose, a chicken product and two plant-based meat substitutes in tender form were selected. The amino acid content and composition were measured using HPLC, while the levels of trace elements like zinc and calcium were determined through ICP-AES. Additionally, the presence of phytic acid, which inhibits zinc bioavailability, was extracted and quantified using UV-Vis spectroscopy. The results were analyzed in the context of daily product consumption. The findings revealed that certain essential amino acids, such as valine and lysine, were found to be deficient in plant-based meat substitutes compared to animal meat products. It was challenging to meet the recommended daily intake of these amino acids solely through the use of meat substitutes. Regarding zinc bioavailability, the inhibitory effect of calcium on zinc bioavailability was expected to be minimal. The zinc bioavailability of the meat substitutes varied significantly depending on the zinc and phytic acid content of the ingredients. Therefore, ingredients of plant-based meat substitutes should be carefully modulated to reach appropriate zinc bioavailability by selecting and processing plant materials with high zinc and low phytic acid content.

The Prognostic Value of 18F-Fluorodeoxyglucose PET/CT in the Initial Assessment of Primary Tracheal Malignant Tumor: A Retrospective Study

  • Dan Shao;Qiang Gao;You Cheng;Dong-Yang Du;Si-Yun Wang;Shu-Xia Wang
    • Korean Journal of Radiology
    • /
    • v.22 no.3
    • /
    • pp.425-434
    • /
    • 2021
  • Objective: To investigate the potential value of 18F-fluorodeoxyglucose (FDG) PET/CT in predicting the survival of patients with primary tracheal malignant tumors. Materials and Methods: An analysis of FDG PET/CT findings in 37 primary tracheal malignant tumor patients with a median follow-up period of 43.2 months (range, 10.8-143.2 months) was performed. Cox proportional hazards regression analyses were used to assess the associations between quantitative 18F-FDG PET/CT parameters, other clinic-pathological factors, and overall survival (OS). A risk prognosis model was established according to the independent prognostic factors identified on multivariate analysis. A survival curve determined by the Kaplan-Meier method was used to assess whether the prognosis prediction model could effectively stratify patients with different risks factors. Results: The median survival time of the 37 patients with tracheal tumors was 38.0 months, with a 95% confidence interval of 10.8 to 65.2 months. The 3-year, 5-year and 10-year survival rate were 54.1%, 43.2%, and 16.2%, respectively. The metabolic tumor volume (MTV), total lesion glycolysis (TLG), maximum standardized uptake value, age, pathological type, extension categories, and lymph node stage were included in multivariate analyses. Multivariate analysis showed MTV (p = 0.011), TLG (p = 0.020), pathological type (p = 0.037), and extension categories (p = 0.038) were independent prognostic factors for OS. Additionally, assessment of the survival curve using the Kaplan-Meier method showed that our prognosis prediction model can effectively stratify patients with different risks factors (p < 0.001). Conclusion: This study shows that 18F-FDG PET/CT can predict the survival of patients with primary tracheal malignant tumors. Patients with an MTV > 5.19, a TLG > 16.94 on PET/CT scans, squamous cell carcinoma, and non-E1 were more likely to have a reduced OS.