• Title/Summary/Keyword: Heterogeneity Learning

Search Result 48, Processing Time 0.023 seconds

Stress Affect Detection At Wearable Devices Via Clustered Federated Learning Based On Number of Samples Mahalanobis Distance (웨어러블 기기에서 데이터수 기반 마하라노비스 군집화 연합학습을 통한 스트레스 및 감정탐지)

  • Tae-Hwan Yoon;Bong-Jun Choi
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.764-767
    • /
    • 2024
  • 웨어러블 디바이스에서는 사용자의 다양한 메타데이터를 수집할 수 있다. 그러나 이런 개인정보를 함유하고 있는 데이터를 수집하는 것은 사용자에게 개인정보침해 위협을 야기한다. 때문에 본 논문에서는 개인정보보호를 통한 웨어러블 디바이스 데이터활용방안으로 연합학습을 채택하였다. 다만 기존 연합학습에서도 해결해야할 문제점들이 있다. 우리는 그중에서도 데이터이질성(Data Heterogeneity) 문제해결을 위해 군집화(Clustering) 방법을 활용하였다. 또한 기존의 코사인유사도 기반 군집화에서 파라미터중요도가 반영되지 않는다는 문제점을 해결하고자 데이터수 기반 마하라노비스거리(Number of Samples Mahalanobis Distance) 군집화 방법을 제시하였다. 이를 통해 WESAD(Werable Stress Affect Detection)데이터에서 피실험자의 데이터 이질성이 존재하는 상황에서 기존 연합학습보다 학습 안정성 측면에서 좋음을 보여주었다.

THE ELEVATION OF EFFICACY IDENTIFYING PITUITARY TISSUE ABNORMALITIES WITHIN BRAIN IMAGES BY EMPLOYING MEMORY CONTRAST LEARNING TECHNIQUES

  • S. SINDHU;N. VIJAYALAKSHMI
    • Journal of applied mathematics & informatics
    • /
    • v.42 no.4
    • /
    • pp.931-943
    • /
    • 2024
  • Accurately identifying brain tumors is crucial for medical imaging's precise diagnosis and treatment planning. This study presents a novel approach that uses cutting-edge image processing techniques to automatically segment brain tumors. with the use of the Pyramid Network algorithm. This technique accurately and robustly delineates tumor borders in MRI images. Our strategy incorporates special algorithms that efficiently address problems such as tumor heterogeneity and size and shape fluctuations. An assessment using the RESECT Dataset confirms the validity and reliability of the method and yields promising results in terms of accuracy and computing efficiency. This method has a great deal of promise to help physicians accurately identify tumors and assess the efficacy of treatments, which could lead to higher standards of care in the field of neuro-oncology.

Current Applications and Future Perspectives of Brain Tumor Imaging (뇌종양 영상의 현재와 미래)

  • Ji Eun Park;Ho Sung Kim
    • Journal of the Korean Society of Radiology
    • /
    • v.81 no.3
    • /
    • pp.467-487
    • /
    • 2020
  • Anatomical imaging is the basis of the diagnosis and treatment response assessment of brain tumors. Among the existing imaging techniques currently available in clinical practice, diffusion-weighted imaging and perfusion imaging provide additional information. Recently, with the increasing importance of evaluation of the genomic variation and heterogeneity of tumors, clinical application of imaging techniques using radiomics and deep learning is expected. In this review, we will describe recommendations for magnetic resonance imaging protocols focusing on anatomical images that are still important in the clinical application of brain tumor imaging, and the basic principles of diffusion-weighted imaging and perfusion imaging among the advanced imaging techniques, as well as their pathophysiological background and clinical application. Finally, we will review the future perspectives of radiomics and deep learning applications in brain tumor imaging, which have been studied to a great extent due to the development of computer technology.

Comparison of performance of automatic detection model of GPR signal considering the heterogeneous ground (지반의 불균질성을 고려한 GPR 신호의 자동탐지모델 성능 비교)

  • Lee, Sang Yun;Song, Ki-Il;Kang, Kyung Nam;Ryu, Hee Hwan
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.24 no.4
    • /
    • pp.341-353
    • /
    • 2022
  • Pipelines are buried in urban area, and the position (depth and orientation) of buried pipeline should be clearly identified before ground excavation. Although various geophysical methods can be used to detect the buried pipeline, it is not easy to identify the exact information of pipeline due to heterogeneous ground condition. Among various non-destructive geo-exploration methods, ground penetration radar (GPR) can explore the ground subsurface rapidly with relatively low cost compared to other exploration methods. However, the exploration data obtained from GPR requires considerable experiences because interpretation is not intuitive. Recently, researches on automated detection technology for GPR data using deep learning have been conducted. However, the lack of GPR data which is essential for training makes it difficult to build up the reliable detection model. To overcome this problem, we conducted a preliminary study to improve the performance of the detection model using finite difference time domain (FDTD)-based numerical analysis. Firstly, numerical analysis was performed with homogeneous soil media having single permittivity. In case of heterogeneous ground, numerical analysis was performed considering the ground heterogeneity using fractal technique. Secondly, deep learning was carried out using convolutional neural network. Detection Model-A is trained with data set obtained from homogeneous ground. And, detection Model-B is trained with data set obtained from homogeneous ground and heterogeneous ground. As a result, it is found that the detection Model-B which is trained including heterogeneous ground shows better performance than detection Model-A. It indicates the ground heterogeneity should be considered to increase the performance of automated detection model for GPR exploration.

A Design of SOA-based Data Integration Framework for Effective Spatial Data Mining (효과적인 공간 데이터 마이닝을 위한 SOA 기반 데이터 통합 프레임워크 설계)

  • Moon, Il-Hwan;Hur, Hwan;Kim, Sam-Keun
    • The KIPS Transactions:PartD
    • /
    • v.18D no.5
    • /
    • pp.385-392
    • /
    • 2011
  • Recently, the concern of IT-in-Agriculture convergence technology that combines information technology and agriculture is increasing rapidly. Especially, the crop cultivation related prediction services by spatial data mining (SDM) can play an important role in reducing the damage of natural disaster and enhancing crop productivity. However, the data conversion and integration procedure to acquire the learning dataset of SDM for the prediction service need a lot of effort and time, because of their heterogeneity between distributed data. In addition, calculating spatial neighborhood relationships between spatial and non-spatial data necessitates requires the complicated calculation procedure for large dataset. In this paper, we suggest a SOA-based data integration framework that can effectively integrate distributed heterogeneous data by treating each data source as a service unit and support to find the optimal prediction service by improving productivity of learning dataset for SDM. In our experiment, we confirmed that our framework can be effectively applied to find the optimal prediction service for the frost damage area, by considering the case of peach crop cultivation in Icheon in Korea.

Big Data Smoothing and Outlier Removal for Patent Big Data Analysis

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.8
    • /
    • pp.77-84
    • /
    • 2016
  • In general statistical analysis, we need to make a normal assumption. If this assumption is not satisfied, we cannot expect a good result of statistical data analysis. Most of statistical methods processing the outlier and noise also need to the assumption. But the assumption is not satisfied in big data because of its large volume and heterogeneity. So we propose a methodology based on box-plot and data smoothing for controling outlier and noise in big data analysis. The proposed methodology is not dependent upon the normal assumption. In addition, we select patent documents as target domain of big data because patent big data analysis is a important issue in management of technology. We analyze patent documents using big data learning methods for technology analysis. The collected patent data from patent databases on the world are preprocessed and analyzed by text mining and statistics. But the most researches about patent big data analysis did not consider the outlier and noise problem. This problem decreases the accuracy of prediction and increases the variance of parameter estimation. In this paper, we check the existence of the outlier and noise in patent big data. To know whether the outlier is or not in the patent big data, we use box-plot and smoothing visualization. We use the patent documents related to three dimensional printing technology to illustrate how the proposed methodology can be used for finding the existence of noise in the searched patent big data.

Big IoT Healthcare Data Analytics Framework Based on Fog and Cloud Computing

  • Alshammari, Hamoud;El-Ghany, Sameh Abd;Shehab, Abdulaziz
    • Journal of Information Processing Systems
    • /
    • v.16 no.6
    • /
    • pp.1238-1249
    • /
    • 2020
  • Throughout the world, aging populations and doctor shortages have helped drive the increasing demand for smart healthcare systems. Recently, these systems have benefited from the evolution of the Internet of Things (IoT), big data, and machine learning. However, these advances result in the generation of large amounts of data, making healthcare data analysis a major issue. These data have a number of complex properties such as high-dimensionality, irregularity, and sparsity, which makes efficient processing difficult to implement. These challenges are met by big data analytics. In this paper, we propose an innovative analytic framework for big healthcare data that are collected either from IoT wearable devices or from archived patient medical images. The proposed method would efficiently address the data heterogeneity problem using middleware between heterogeneous data sources and MapReduce Hadoop clusters. Furthermore, the proposed framework enables the use of both fog computing and cloud platforms to handle the problems faced through online and offline data processing, data storage, and data classification. Additionally, it guarantees robust and secure knowledge of patient medical data.

Analysis about the actual situation of Arabic education and his culture in France and his view (프랑스에서의 아랍어와 아랍문화의 현황과 전망 분석 - Sabhan Rabina Al-Baldhawe의 논문을 중심으로)

  • JUNG, Il Young
    • Cross-Cultural Studies
    • /
    • v.25
    • /
    • pp.107-129
    • /
    • 2011
  • This article aims to observe the role of Arabic and analyze the future of Arabic in France under the base of the Al-Sabhan Rabina Baldhawe's article, published in mettre l'importance sur University Paris 8 in 2007. In the first part, we have focus into the historical analysis: in France, with a few Arabic and French policy has been settled for what were examined. Also enable the use of Arabic in France with regard to trends of Maghreb countries and other Arab countries, is being led by noted. In the second part, we put on the importance about the situation of Arabic in the France's educational institution. And we have analysed the reasons why Arabic became the most important reason for learning the target language: - in order to faciliate the children of immigrants living in Maghreb able to speak French - Due to differences in culture and language experience to relieve the psychological insecurity above sea - By using the Arabic language at home among family members, strengthen solidarity and resolve heterogeneity In the third part, we have recognized that the French education system was looked at in the Journal of Arabic teaching elementary, middle and high school courses, separated by a learning Arabic as the target language. Finally, we have tried to find a way to revitalize Arabic in France in connection with Sabhan Rabina Al-Baldhawe concrete example of the paper were based on a survey. France and the Arab countries' relationship has been long enough to prove the historic aspects and economic cooperation have maintained a relationship even tighter. Arabic, many of the French people also need education and children to learn Arabic in the French educational institution that has shown a positive stance. French students learning Arabic as a future career in choosing the width of the wider benefits it helps to have. Learning Arabic in the course need to be addressed is also true that a lot of points. But the Arabic and various aspects of internal organization is considered a minority in the popular Arabic language training in France has become more competitive in research and analysis to be active stance is required externally, such as the increase of trade agreements and economic systems side at the level of cultural exchange and international co-operation system, strengthening its position as the Arabic language in France.

Mapping Categories of Heterogeneous Sources Using Text Analytics (텍스트 분석을 통한 이종 매체 카테고리 다중 매핑 방법론)

  • Kim, Dasom;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.193-215
    • /
    • 2016
  • In recent years, the proliferation of diverse social networking services has led users to use many mediums simultaneously depending on their individual purpose and taste. Besides, while collecting information about particular themes, they usually employ various mediums such as social networking services, Internet news, and blogs. However, in terms of management, each document circulated through diverse mediums is placed in different categories on the basis of each source's policy and standards, hindering any attempt to conduct research on a specific category across different kinds of sources. For example, documents containing content on "Application for a foreign travel" can be classified into "Information Technology," "Travel," or "Life and Culture" according to the peculiar standard of each source. Likewise, with different viewpoints of definition and levels of specification for each source, similar categories can be named and structured differently in accordance with each source. To overcome these limitations, this study proposes a plan for conducting category mapping between different sources with various mediums while maintaining the existing category system of the medium as it is. Specifically, by re-classifying individual documents from the viewpoint of diverse sources and storing the result of such a classification as extra attributes, this study proposes a logical layer by which users can search for a specific document from multiple heterogeneous sources with different category names as if they belong to the same source. Besides, by collecting 6,000 articles of news from two Internet news portals, experiments were conducted to compare accuracy among sources, supervised learning and semi-supervised learning, and homogeneous and heterogeneous learning data. It is particularly interesting that in some categories, classifying accuracy of semi-supervised learning using heterogeneous learning data proved to be higher than that of supervised learning and semi-supervised learning, which used homogeneous learning data. This study has the following significances. First, it proposes a logical plan for establishing a system to integrate and manage all the heterogeneous mediums in different classifying systems while maintaining the existing physical classifying system as it is. This study's results particularly exhibit very different classifying accuracies in accordance with the heterogeneity of learning data; this is expected to spur further studies for enhancing the performance of the proposed methodology through the analysis of characteristics by category. In addition, with an increasing demand for search, collection, and analysis of documents from diverse mediums, the scope of the Internet search is not restricted to one medium. However, since each medium has a different categorical structure and name, it is actually very difficult to search for a specific category insofar as encompassing heterogeneous mediums. The proposed methodology is also significant for presenting a plan that enquires into all the documents regarding the standards of the relevant sites' categorical classification when the users select the desired site, while maintaining the existing site's characteristics and structure as it is. This study's proposed methodology needs to be further complemented in the following aspects. First, though only an indirect comparison and evaluation was made on the performance of this proposed methodology, future studies would need to conduct more direct tests on its accuracy. That is, after re-classifying documents of the object source on the basis of the categorical system of the existing source, the extent to which the classification was accurate needs to be verified through evaluation by actual users. In addition, the accuracy in classification needs to be increased by making the methodology more sophisticated. Furthermore, an understanding is required that the characteristics of some categories that showed a rather higher classifying accuracy of heterogeneous semi-supervised learning than that of supervised learning might assist in obtaining heterogeneous documents from diverse mediums and seeking plans that enhance the accuracy of document classification through its usage.

Trade Liberalization, Growth, and Bi-polarization in Korean Manufacturing: Evidence from Microdata (우리나라 제조업에서 무역자유화가 성장 및 양극화에 미치는 영향: 미시자료를 통한 실증적 증거들)

  • Hahn, Chin Hee
    • KDI Journal of Economic Policy
    • /
    • v.35 no.4
    • /
    • pp.1-29
    • /
    • 2013
  • This paper examines the effect of trade liberalization or globalization, more broadly, on plants' growth as well as on "bi-polarization". To do so, we reviewed the possible theoretical mechanisms put forward by recent heterogeneous firm trade theories, and provided available micro-evidence from existing empirical studies on Korean manufacturing sector. Above all, the empirical evidence provided in this paper strongly suggests that globalization promoted growth of Korean manufacturing plants. Specifically, evidence suggests that exporting not only increases within-plant productivity but also promotes introduction of new products and dropping of old products. However, the empirical evidence also suggest that globalization has some downsides: widening productivity differences across plants and rising wage inequality between skilled and unskilled workers. Specifically, trade liberalization widens the initial productivity differences among plants through learning from export market participation as well as through interactions between exporting and R&D, both of which increase plants' productivity. We also show that there is only a small group of large and productive "superstar" plants engaged in both R&D and exporting activity, which can fully utilize the potential benefits from globalization. Finally, we also show evidence that trade liberalization interacts with innovation to increase the skilled-unskilled wage inequality.

  • PDF