통합 검색 | Korea Science

멀티에이전트 강화학습에서 견고한 지식 전이를 위한 확률적 초기 상태 랜덤화 기법 연구 (Stochastic Initial States Randomization Method for Robust Knowledge Transfer in Multi-Agent Reinforcement Learning)

김도현;배정호
- 한국군사과학기술학회지
- /
- 제27권4호
- /
- pp.474-484
- /
- 2024
Reinforcement learning, which are also studied in the field of defense, face the problem of sample efficiency, which requires a large amount of data to train. Transfer learning has been introduced to address this problem, but its effectiveness is sometimes marginal because the model does not effectively leverage prior knowledge. In this study, we propose a stochastic initial state randomization(SISR) method to enable robust knowledge transfer that promote generalized and sufficient knowledge transfer. We developed a simulation environment involving a cooperative robot transportation task. Experimental results show that successful tasks are achieved when SISR is applied, while tasks fail when SISR is not applied. We also analyzed how the amount of state information collected by the agents changes with the application of SISR.
https://doi.org/10.9766/KIMST.2024.27.4.474 인용 PDF

Robust Stability eEaluation of Multi-loop Control Systems Based on Experimental Data of Frequency Response

Chen, Hong;Okuyama, Yoshifumi;Takemori, Fumiaki
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 1995년도 Proceedings of the Korea Automation Control Conference, 10th (KACC); Seoul, Korea; 23-25 Oct. 1995
- /
- pp.360-363
- /
- 1995
In this paper, we describe the composition of frequency response bands based on experimental data of plants (controlled systems) with uncertainty and nonlinearity, and the robust stability evaluation of feedback control systems. Analysis and design of control systems using the upper and lower bounds of such experimental data would be effective as a practicable method which is not heavily dependent upon mathematical models such as the transfer function. First, we present a method to composite gain characteristic bands of frequency response of cascade connected plants with uncertainty and a recurrent inequality for the composition. Next, evaluation methods of the robust stability of multi-loop control systems obtained through feedback from the output terminals and multi-loop control systems obtained through feedback into the input terminals are described. In actual control systems, experimental data of frequency responses often depends on the amplitude of input. Therefore, we present the evaluation method of the nominal value and the width of the frequency response band in such a case, and finally give numerical examples based on virtual experimental data.
PDF

A Four-Layer Robust Storage in Cloud using Privacy Preserving Technique with Reliable Computational Intelligence in Fog-Edge

Nirmala, E.;Muthurajkumar, S.
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제14권9호
- /
- pp.3870-3884
- /
- 2020
The proposed framework of Four Layer Robust Storage in Cloud (FLRSC) architecture involves host server, local host and edge devices in addition to Virtual Machine Monitoring (VMM). The goal is to protect the privacy of stored data at edge devices. The computational intelligence (CI) part of our algorithm distributes blocks of data to three different layers by partially encoded and forwarded for decoding to the next layer using hash and greed Solomon algorithms. VMM monitoring uses snapshot algorithm to detect intrusion. The proposed system is compared with Tiang Wang method to validate efficiency of data transfer with security. Hence, security is proven against the indexed efficiency. It is an important study to integrate communication between local host software and nearer edge devices through different channels by verifying snapshot using lamport mechanism to ensure integrity and security at software level thereby reducing the latency. It also provides thorough knowledge and understanding about data communication at software level with VMM. The performance evaluation and feasibility study of security in FLRSC against three-layered approach is proven over 2³² blocks of data with 98% accuracy. Practical implications and contributions to the growing knowledge base are highlighted along with directions for further research.
https://doi.org/10.3837/tiis.2020.09.017 인용 PDF KSCI HTML

Utilizing Mean Teacher Semi-Supervised Learning for Robust Pothole Image Classification

Inki Kim;Beomjun Kim;Jeonghwan Gwak
- 한국컴퓨터정보학회논문지
- /
- 제28권5호
- /
- pp.17-28
- /
- 2023
포장도로에서 발생하는 포트홀은 고속 주행 차량에 치명적인 영향을 미치며, 사망사고를 유발할 수 있는 도로상의 장애물이다. 이를 방지하기 위해 일반적으로는 작업자가 직접 포트홀을 탐지하는 방식을 사용해왔으나, 이는 작업자의 안전 문제와 예측하기 어려운 범주에서 발생하는 모든 포트홀을 인력으로 탐지하는 것이 비효율적이기 때문에 한계가 있다. 또한, 도로 환경과 관련된 지반 환경이 포트홀 생성에 영향을 미치기 때문에, 완벽한 포트홀 방지는 어렵다. 데이터셋 구축을 위해서는 전문가의 지도하에 라벨링 작업이 필요하지만, 이는 매우 시간과 비용이 많이 필요하다. 따라서, 본 논문에서는 Mean Teacher 기법을 사용하여 라벨링된 데이터의 샘플 수가 적더라도 지도학습보다 더욱 강인한 포트홀 이미지 분류 성능을 보여준다. 이러한 결과는 성능지표와 GradCAM을 통해 입증되었으며, 준지도학습을 사용할 때 15개의 사전 학습된 CNN 모델이 평균 90.41%의 정확도를 달성하며, 지도학습과 비교하여 2%에서 9%의 차이로 강인한 성능을 나타내는 것을 확인하였다.
https://doi.org/10.9708/jksci.2023.28.05.017 인용 PDF HTML

An Enhanced Time Delay Observer for Nonlinear Systems

Park, Suk-Ho;Chang, Pyung-Hun
- Transactions on Control, Automation and Systems Engineering
- /
- 제2권3호
- /
- pp.149-156
- /
- 2000
Time delay observer (TDO), thanks to the time delay control (TDC) concept, requires little knowledge of a plant model, and hence is easy to design, robust to parameter variation and computationally efficient, yet can reconstruct states rather reliable for nonlinear plant. In this paper, we propose an improved version of TDO that solves two problems inherent in TDO as follows: TDO displays large reconstruction errors due to low-frequency uncertainty and has some restrictions on selecting its gains. By introducing a low pass filter and a state associated with it, we obtain an enhanced time delay observer (ETDO). This observer turns out to have smaller reconstruction errors than those of TDO and not to have any restriction on selecting its gains, thereby solving the problems. Through performance comparison by transfer function and simulation, we validate the analysis results of two observers (TDO and ETDO) and evaluate the performances. Finally, through experiments on BLDC motor system, the analysis results are clearly conformed.
PDF

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning

Gao, Zan;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제8권2호
- /
- pp.483-503
- /
- 2014
In this paper, human action recognition using pyramid histograms of oriented gradients and collaborative multi-task learning is proposed. First, we accumulate global activities and construct motion history image (MHI) for both RGB and depth channels respectively to encode the dynamics of one action in different modalities, and then different action descriptors are extracted from depth and RGB MHI to represent global textual and structural characteristics of these actions. Specially, average value in hierarchical block, GIST and pyramid histograms of oriented gradients descriptors are employed to represent human motion. To demonstrate the superiority of the proposed method, we evaluate them by KNN, SVM with linear and RBF kernels, SRC and CRC models on DHA dataset, the well-known dataset for human action recognition. Large scale experimental results show our descriptors are robust, stable and efficient, and outperform the state-of-the-art methods. In addition, we investigate the performance of our descriptors further by combining these descriptors on DHA dataset, and observe that the performances of combined descriptors are much better than just using only sole descriptor. With multimodal features, we also propose a collaborative multi-task learning method for model learning and inference based on transfer learning theory. The main contributions lie in four aspects: 1) the proposed encoding the scheme can filter the stationary part of human body and reduce noise interference; 2) different kind of features and models are assessed, and the neighbor gradients information and pyramid layers are very helpful for representing these actions; 3) The proposed model can fuse the features from different modalities regardless of the sensor types, the ranges of the value, and the dimensions of different features; 4) The latent common knowledge among different modalities can be discovered by transfer learning to boost the performance.
https://doi.org/10.3837/tiis.2014.02.009 인용 PDF KSCI KPUBS

Interspecies Transfer and Regulation of Pseudomonas stutzeri A1501 Nitrogen Fixation Island in Escherichia coli

Han, Yunlei;Lu, Na;Chen, Qinghua;Zhan, Yuhua;Liu, Wei Liu;Lu, Wei;Zhu, Baoli;Lin, Min;Yang, Zhirong;Yan, Yongliang
- Journal of Microbiology and Biotechnology
- /
- 제25권8호
- /
- pp.1339-1348
- /
- 2015
Until now, considerable effort has been made to engineer novel nitrogen-fixing organisms through the transfer of nif genes from various diazotrophs to non-nitrogen fixers; however, regulatory coupling of the heterologous nif genes with the regulatory system of the new host is still not well understood. In this work, a 49 kb nitrogen fixation island from P. stutzeri A1501 was transferred into E. coli using a novel and efficient transformation strategy, and a series of recombinant nitrogen-fixing E. coli strains were obtained. We found that the nitrogenase activity of the recombinant E. coli strain EN-01, similar to the parent strain P. stutzeri A1501, was dependent on external ammonia concentration, oxygen tension, and temperature. We further found that there existed a regulatory coupling between the E. coli general nitrogen regulatory system and the heterologous P. stutzeri nif island in the recombinant E. coli strain. We also provided evidence that the E. coli general nitrogen regulator GlnG protein was involved in the activation of the nif-specific regulator NifA via a direct interaction with the NifA promoter. To the best of our knowledge, this work plays a groundbreaking role in increasing understanding of the regulatory coupling of the heterologous nitrogen fixation system with the regulatory system of the recipient host. Furthermore, it will shed light on the structure and functional integrity of the nif island and will be useful for the construction of novel and more robust nitrogen-fixing organisms through biosynthetic engineering.
https://doi.org/10.4014/jmb.1502.02027 인용 PDF KSCI KPUBS HTML

검색결과 7건 처리시간 0.02초

멀티에이전트 강화학습에서 견고한 지식 전이를 위한 확률적 초기 상태 랜덤화 기법 연구 (Stochastic Initial States Randomization Method for Robust Knowledge Transfer in Multi-Agent Reinforcement Learning)

Robust Stability eEaluation of Multi-loop Control Systems Based on Experimental Data of Frequency Response

A Four-Layer Robust Storage in Cloud using Privacy Preserving Technique with Reliable Computational Intelligence in Fog-Edge

Utilizing Mean Teacher Semi-Supervised Learning for Robust Pothole Image Classification

An Enhanced Time Delay Observer for Nonlinear Systems

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning

Interspecies Transfer and Regulation of Pseudomonas stutzeri A1501 Nitrogen Fixation Island in Escherichia coli

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)