Search | Korea Science

Multi-focus Image Fusion using Fully Convolutional Two-stream Network for Visual Sensors

Xu, Kaiping;Qin, Zheng;Wang, Guolong;Zhang, Huidi;Huang, Kai;Ye, Shuxiong
- KSII Transactions on Internet and Information Systems (TIIS) / v.12 no.5 / pp.2253-2272 / 2018
We propose a deep learning method for multi-focus image fusion. Unlike most existing pixel-level fusion methods, whether in the spatial domain or in the transform domain, our method directly learns an end-to-end fully convolutional two-stream network. The framework maps a pair of differently focused images to a clean version through a chain of convolutional layers, a fusion layer, and deconvolutional layers. Our deep fusion model is efficient and robust, yet demonstrates state-of-the-art fusion quality. We explore different parameter settings to achieve trade-offs between performance and speed. Moreover, the experimental results on our training dataset show that our network achieves good performance in both subjective visual perception and objective assessment metrics.
https://doi.org/10.3837/tiis.2018.05.019

Multiple Fusion-based Deep Cross-domain Recommendation (다중 융합 기반 심층 교차 도메인 추천)

Hong, Minsung;Lee, WonJin
- Journal of Korea Multimedia Society / v.25 no.6 / pp.819-832 / 2022
Cross-domain recommender systems transfer knowledge across different domains to improve recommendation performance in a target domain that has a relatively sparse model. However, they suffer from "negative transfer," in which transferred knowledge acts as noise. This paper proposes a novel Multiple Fusion-based Deep Cross-Domain Recommendation method named MFDCR. We exploit Doc2Vec, one of the well-known word embedding techniques, to fuse data user-wise and transfer knowledge across multiple domains, which alleviates the "negative transfer" problem. Additionally, we introduce a simple multi-layer perceptron to learn the user-item interactions and predict how likely users are to prefer items. Extensive experiments with three domain datasets from Amazon, one of the best-known services, demonstrate that MFDCR outperforms recent single-domain and cross-domain recommendation algorithms. Furthermore, experimental results show that MFDCR can address the problem of "negative transfer" and improve recommendation performance for multiple domains simultaneously. In addition, we show that our approach extends efficiently to more domains.
https://doi.org/10.9717/kmms.2022.25.6.819

ADxClass: Multi-Domain Attention Fusion and Imputation of Missing Heterogeneous Tabular Data

Dhivyaa S P;Hyung-Jeong Yang;Sae-Ryung Kang;Soo-Hyung Kim
- Annual Conference of KIPS / 2024.10a / pp.507-510 / 2024
Alzheimer's Disease (AD) is a neurodegenerative disorder characterized by a progressive decline in cognitive function. Accurate and early diagnosis of AD is crucial for effective management and treatment. Traditional machine learning models, though commonly applied, often fall short in capturing the intricate relationships between diverse tabular data. Furthermore, the missing data issue, typically addressed using conventional imputation techniques, leads to reduced accuracy and generalizability of AD classification models. This paper introduces ADxClass, a novel deep learning framework that enhances AD classification by leveraging multi-domain attention fusion and data type-based imputation techniques for handling missing heterogeneous tabular data. ADxClass integrates data from various domains, including demographic, cognitive, genetic, and biomarkers obtained from neuroimaging measurements, to improve the robustness and accuracy of AD classification models. The model's efficiency is validated via a 5-fold cross-validation on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, showing significant improvements in classification performance compared to traditional machine learning approaches.
https://doi.org/10.3745/PKIPS.y2024m10a.507

Multi-Focus Image Fusion Using Transformation Techniques: A Comparative Analysis

Ali Alferaidi
- International Journal of Computer Science & Network Security / v.23 no.4 / pp.39-47 / 2023
This study compares various transformation techniques for multi-focus image fusion. Multi-focus image fusion is the process of merging multiple images captured at different focus distances to produce a single composite image with improved sharpness and clarity. The purpose of this research is to compare popular frequency-domain approaches for multi-focus image fusion, such as the Discrete Wavelet Transform (DWT), Stationary Wavelet Transform (SWT), DCT-based Laplacian Pyramid (DCT-LP), Discrete Cosine Harmonic Wavelet Transform (DC-HWT), and Dual-Tree Complex Wavelet Transform (DT-CWT). The objective is to increase the understanding of these transformation techniques and how they can be used in conjunction with one another. The analysis evaluates the 10 most crucial parameters and highlights the unique features of each method. The results help determine which transformation technique is best for multi-focus image fusion applications. Based on the visual and statistical analysis, DCT-LP is suggested as the most appropriate technique, but the results also provide valuable insights into choosing the right approach.
https://doi.org/10.22937/IJCSNS.2023.23.4.6

New Medical Image Fusion Approach with Coding Based on SCD in Wireless Sensor Network

Zhang, De-gan;Wang, Xiang;Song, Xiao-dong
- Journal of Electrical Engineering and Technology / v.10 no.6 / pp.2384-2392 / 2015
The technical development and practical application of big data for health is a hot topic under the banner of big data. Big-data medical image fusion is one of the key problems. This paper proposes a new fusion approach with coding based on the Spherical Coordinate Domain (SCD) in a Wireless Sensor Network (WSN) for big-data medical images. In this approach, the three high-frequency coefficients in the wavelet domain of a medical image are pre-processed, which reduces the redundancy ratio of big-data medical images. First, the high-frequency coefficients are transformed to the spherical coordinate domain to reduce the correlation within the same scale. Then, a multi-scale model product (MSMP) is used to control the shrinkage function so that small wavelet coefficients and some noise are removed. The high-frequency parts in the spherical coordinate domain are coded by an improved SPIHT algorithm. Finally, the image is fused and reconstructed based on the multi-scale edges of the medical image. Experimental results indicate that the novel approach is effective and very useful for the transmission of big-data medical images, especially in a wireless environment.
https://doi.org/10.5370/JEET.2015.10.6.2384

Understanding and Development of Programmable Automation Controllers (PACs) having Multi-Domain Functionality (다양한 도메인 기능을 갖는 PAC에 대한 이해와 개발)

Kim Kyung-Don;Lee Kang-Joo;Kim Chan-Bong
- Journal of the Korean Society for Precision Engineering / v.22 no.6 s.171 / pp.15-21 / 2005

Fusion-in-Decoder for Open Domain Multi-Modal Question Answering (FiD를 이용한 멀티 모달 오픈 도메인 질의 응답)

Eunhwan Park;Sung-Min Lee;Daeryong Seo;Donghyeon Jeon;Inho Kang;Seung-Hoon Na
- Annual Conference on Human and Language Technology / 2022.10a / pp.95-99 / 2022
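Open-domain question answering (ODQA) is the task of finding the answer to a given question, and it requires a "retrieval" step that finds knowledge relevant to the question. Recently, multi-modal ODQA, which requires retrieval over images, tables, and other modalities, has been studied extensively, and its importance in industry is also growing. Among the various kinds of multi-modal ODQA, this paper proposes a study of multi-modal open-domain question answering using Fusion-in-Decoder (FiD) on a table-text multi-modal ODQA dataset, and shows improvements of up to 20.5 EM and 23.2 F1 over the baseline.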

Multi-view Clustering by Spectral Structure Fusion and Novel Low-rank Approximation

Long, Yin;Liu, Xiaobo;Murphy, Simon
- KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.3 / pp.813-829 / 2022
In multi-view subspace clustering, integrating the complementary information between perspectives to construct a unified representation is a critical problem. In existing works, the unified representation is usually constructed in the original data space. However, when the data representation in each view is very diverse, a unified representation derived directly in the original data domain may lead to a huge information loss. To address this issue, and in contrast to existing works, we construct the unified representation in the spectral embedding domain, inspired by the recent finding that the data across all perspectives have a very similar spectral block structure. In this way, the complementary information across all perspectives can be fused into a unified representation with little information loss, since the spectral block structures from all views are highly consistent. In addition, to capture the global structure of the data in each view with both high accuracy and robustness, we propose a novel low-rank approximation via a tight lower bound on the rank function. Finally, experimental results show that the proposed method is both effective and robust compared with state-of-the-art approaches.
https://doi.org/10.3837/tiis.2022.03.004

Dialog-based multi-item recommendation using automatic evaluation

Euisok Chung;Hyun Woo Kim;Byunghyun Yoo;Ran Han;Jeongmin Yang;Hwa Jeon Song
- ETRI Journal / v.46 no.2 / pp.277-289 / 2024
In this paper, we describe a neural network-based application that recommends multiple items using dialog context input and simultaneously outputs a response sentence. Further, we describe a multi-item recommendation by specifying it as a set of clothing recommendations. For this, a multimodal fusion approach that can process both cloth-related text and images is required. We also examine achieving the requirements of downstream models using a pretrained language model. Moreover, we propose a gate-based multimodal fusion and multiprompt learning based on a pretrained language model. Specifically, we propose an automatic evaluation technique to solve the one-to-many mapping problem of multi-item recommendations. A fashion-domain multimodal dataset based on Koreans is constructed and tested. Various experimental environment settings are verified using an automatic evaluation method. The results show that our proposed method can be used to obtain confidence scores for multi-item recommendation results, which is different from traditional accuracy evaluation.
https://doi.org/10.4218/etrij.2022-0333

Characteristics Analysis of Total Internal Reflection-based Dielectric Multi-layer Sensor Using Plasmonics Phenomena (플라즈모닉스 현상을 이용한 전반사 기반 다층 유전체 박막 센서의 특성 분석)

Kim, Hong-Seung;Lee, Tae-Kyeong;Kim, Doo-Gun;Jung, You-Ra;Oh, Geum-Yoon;Lee, Byeong-Hyeon;Ki, Hyun-Chul;Choi, Young-Wan
- Journal of the Korean Institute of Electrical and Electronic Material Engineers / v.25 no.7 / pp.516-520 / 2012
In this paper, we theoretically analyze and design a dielectric multi-layer sensor with SPR (surface plasmon resonance) using analytical calculation and FDTD (finite-difference time-domain) methods. The proposed structure is composed of periodic layers and a thin metal film. It has several advantages: one is the high sensitivity of SPR, and another is the high Q-factor of the PhC (photonic crystal) micro-cavity structure. The incident light has double resonance characteristics because the light filtered by the PhC structure (the dielectric multi-layer) meets the thin metal film, producing the SPR effect. We also observe the change in resonance characteristics as the effective index on the metal film varies.
https://doi.org/10.4313/JKEM.2012.25.7.516

