Search | Korea Science

Fuzzy Q-learning using Weighted Eligibility (가중 기여도를 이용한 퍼지 Q-learning)

정석일;이연정
- Proceedings of the Korean Institute of Intelligent Systems Conference / 2000.11a / pp.163-167 / 2000
Eligibility is used to solve the credit-assignment problem, one of the important problems in reinforcement learning. Conventional eligibilities, namely the accumulating eligibility and the replacing eligibility, make ineffective use of the rewards acquired during learning, because they learn only the executed action in a visited state. We therefore propose a new eligibility, called the weighted eligibility, with which not only the executed action but also its neighboring actions in a visited state are learned. The fuzzy Q-learning algorithm using the proposed eligibility is applied to a cart-pole balancing problem and shows improved learning speed.
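The difference between the three eligibilities named in this abstract can be sketched in a few lines of Python. The accumulating and replacing updates follow their standard textbook definitions. The `weighted` update is only an assumption made for illustration: the abstract does not give the paper's weighting formula, so the `width` and `falloff` parameters and the geometric weights are hypothetical.

```python
def decay(traces, gamma, lam):
    """Decay every trace by gamma * lambda, once per time step."""
    return [[e * gamma * lam for e in row] for row in traces]


def accumulating(traces, s, a):
    """Accumulating eligibility: add 1 to the executed action only."""
    traces[s][a] += 1.0


def replacing(traces, s, a):
    """Replacing eligibility: reset the executed action's trace to 1."""
    traces[s][a] = 1.0


def weighted(traces, s, a, width=1, falloff=0.5):
    """Hypothetical weighted eligibility (not the paper's formula):
    the executed action gets 1.0 and each action within `width` index
    steps gets falloff ** distance, so neighbors also receive credit."""
    n = len(traces[s])
    for b in range(max(0, a - width), min(n, a + width + 1)):
        traces[s][b] = max(traces[s][b], falloff ** abs(b - a))


def run(update, visits, n_states=1, n_actions=5, gamma=0.9, lam=0.8):
    """Apply decay followed by one trace update for each (state, action)."""
    traces = [[0.0] * n_actions for _ in range(n_states)]
    for s, a in visits:
        traces = decay(traces, gamma, lam)
        update(traces, s, a)
    return traces


visits = [(0, 2), (0, 2)]  # action 2 executed twice in state 0
acc = run(accumulating, visits)
rep = run(replacing, visits)
wei = run(weighted, visits)
print(acc[0], rep[0], wei[0])
```

With the conventional updates only action 2 carries a non-zero trace, whereas the weighted sketch also leaves traces on actions 1 and 3, so a later reward would update those neighboring actions as well.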

Optimal Method for Binary Neural Network using AETLA (AETLA를 이용한 이진 신경회로망의 최적 합성방법)

성상규;정종원;이준탁
- Proceedings of the Korean Institute of Intelligent Systems Conference / 2001.05a / pp.105-108 / 2001
In this paper, a learning algorithm called the advanced expanded and truncate algorithm (AETLA) is proposed for training multilayer binary neural networks to approximate binary-to-binary mappings. AETLA combines the merits of the ETL and MTGA learning algorithms. The new learning algorithm is designed to decrease the number of hidden layers. As a result, the proposed AETLA learns much faster than other learning algorithms.

Development of a Real-time Safest Evacuation Route using Internet of Things and Reinforcement Learning in Case of Fire in a Building (건물 내 화재 발생 시 사물 인터넷과 강화 학습을 활용한 실시간 안전 대피 경로 방안 개발)

Ahn, Yusun;Choi, Haneul
- Journal of the Korean Society of Safety / v.37 no.2 / pp.97-105 / 2022
Human casualties from fires are increasing worldwide. The majority of human deaths occur during the evacuation process, as occupants panic and are unaware of the location of the fire and evacuation routes. Using an Internet of Things (IoT) sensor and reinforcement learning, we propose a method to find the safest evacuation route by considering the fire location, flame speed, occupant position, and walking conditions. The first step is detecting the fire with IoT-based devices. The second step is identifying the occupant's position via a beacon connected to the occupant's mobile phone. In the third step, the collected information, flame speed, and walking conditions are input into the reinforcement learning model to derive the optimal evacuation route. This study makes it possible to provide the safest evacuation route for individual occupants in real time. This study is expected to reduce human casualties caused by fires.
https://doi.org/10.14346/JKOSOS.2022.37.2.97
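The third step of the abstract above, deriving a route from fire and occupant positions with reinforcement learning, can be illustrated with a minimal tabular Q-learning sketch. Everything concrete here is an assumption: the 4x4 floor grid, the static fire cells, the reward values, and the hyperparameters are hypothetical, and the sketch ignores flame speed and walking conditions, which the paper's model takes as inputs.

```python
import random

ROWS, COLS = 4, 4
START, EXIT = (0, 0), (3, 3)      # hypothetical occupant and exit cells
FIRE = {(1, 1), (2, 1)}           # hypothetical burning cells
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(pos, action):
    """Move one cell; return (next_pos, reward, done)."""
    r, c = pos[0] + MOVES[action][0], pos[1] + MOVES[action][1]
    if not (0 <= r < ROWS and 0 <= c < COLS):
        return pos, -1.0, False   # walked into a wall, stay in place
    nxt = (r, c)
    if nxt in FIRE:
        return nxt, -100.0, True  # entering fire ends the episode
    if nxt == EXIT:
        return nxt, 100.0, True   # reached the exit
    return nxt, -1.0, False       # each ordinary step costs a little


def train(episodes=3000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        pos = START
        for _ in range(100):
            if rng.random() < epsilon:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: q[pos][i])
            nxt, reward, done = step(pos, a)
            target = reward if done else reward + gamma * max(q[nxt])
            q[pos][a] += alpha * (target - q[pos][a])
            pos = nxt
            if done:
                break
    return q


def greedy_route(q, start=START, limit=20):
    """Follow the highest-valued action from `start` until a terminal cell."""
    route, pos = [start], start
    while pos != EXIT and pos not in FIRE and len(route) <= limit:
        a = max(range(4), key=lambda i: q[pos][i])
        pos, _, _ = step(pos, a)
        route.append(pos)
    return route


q_table = train()
route = greedy_route(q_table)
print(route)
```

In a real-time setting the fire cells and the start position would be refreshed from sensor and beacon data and the route recomputed, which this static sketch does not attempt.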

Development of a Real-Time Automatic Passenger Counting System using Head Detection Based on Deep Learning

Kim, Hyunduk;Sohn, Myoung-Kyu;Lee, Sang-Heon
- Journal of Information Processing Systems / v.18 no.3 / pp.428-442 / 2022
A reliable automatic passenger counting (APC) system is key to the efficient scheduling and management of transport routes. In this study, we introduce a lightweight head detection network using deep learning that is applicable to embedded systems. Object detection algorithms using deep learning have been successful, but they essentially need a graphics processing unit (GPU) to run in real time. We therefore modify a Tiny-YOLOv3 network using certain techniques to make it faster and more accurate in a non-GPU environment. Finally, we introduce an APC system that runs in real time on embedded systems using the proposed head detection algorithm. We implement and test the proposed APC system on a Samsung ARTIK 710 board. Experimental results on three public head datasets show the detection accuracy and efficiency of the proposed head detection network compared with Tiny-YOLOv3. Moreover, to test the proposed APC system, we measured its accuracy and recognition speed over 50 instances of entering and 50 instances of exiting. The system achieved 99% accuracy and a recognition time of 0.041 seconds even though only the CPU was used.
https://doi.org/10.3745/JIPS.04.0246

Vision Sensor and Deep Learning-based Around View Monitoring System for Ship Berthing (비전 센서 및 딥러닝 기반 선박 접안을 위한 어라운드뷰 모니터링 시스템)

Kim, Hanguen;Kim, Donghoon;Park, Byeolteo;Lee, Seung-Mok
- IEMEK Journal of Embedded Systems and Applications / v.15 no.2 / pp.71-78 / 2020
This paper proposes a vision sensor and deep learning-based around view monitoring system for ship berthing. Berthing a ship at a port requires precise information on the relative position and relative speed between the mooring facility and the ship. Ships of Handysize or larger must be docked with the help of pilots and tugboats. In the case of ships handling dangerous cargo, tugboats push the ship and dock it in the port using the distance and velocity information received from the berthing aid system (BAS). However, the existing BAS is very expensive, and there is a limit on the size of the vessel that can be measured. It is also difficult to measure distance and speed when there are obstacles near the port. This paper proposes a relative distance and speed estimation system that can be used as a ship berthing assist system. The proposed system is verified by comparing its performance with that of the existing laser-based distance and speed measurement system in field tests at an actual port.
https://doi.org/10.14372/IEMEK.2020.15.2.71

Analysis and Prediction of Behavioral Changes in Angelfish Pterophyllum scalare Under Stress Conditions (스트레스 조건에 노출된 Angelfish Pterophyllum scalare의 행동 변화 분석 및 예측)

Kim, Yoon-Jae;NO, Hea-Min;Kim, Do-Hyung
- Korean Journal of Fisheries and Aquatic Sciences / v.54 no.6 / pp.965-973 / 2021
The behavior of angelfish Pterophyllum scalare exposed to low and high temperatures was monitored by video tracking, and information such as the initial speed, changes in speed, and locations of the fish in the tank were analyzed. The water temperature was raised from 26℃ to 36℃ or lowered from 26℃ to 16℃ for 4 h. The control group was maintained at 26℃ for 8 h. The experiment was repeated five times for each group. Machine learning analysis comprising a long short-term memory model was used to train and test the behavioral data (80 s) after pre-processing. Results showed that when the water temperature changed to 36℃ or 16℃, the average speed, changes in speed and fractal dimension value were significantly lower than those in the control group. Machine learning analysis revealed that the accuracy of 80-s video footage data was 87.4%. The machine learning used in this study could distinguish between the optimal temperature group and changing temperature groups with specificity and sensitivity percentages of 86.9% and 87.4%, respectively. Therefore, video tracking technology can be used to effectively analyze fish behavior. In addition, it can be used as an early warning system for fish health in aquariums and fish farms.
https://doi.org/10.5657/KFAS.2021.0965
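The abstract above reports accuracy, specificity, and sensitivity for a two-class problem (optimal temperature versus changing temperature). As a reminder of how these three figures relate to a confusion matrix, here is a short sketch. The counts in the example are made up for illustration and are not the study's data.

```python
def classification_rates(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp / fn: stressed-group clips classified correctly / incorrectly.
    tn / fp: control-group clips classified correctly / incorrectly.
    """
    sensitivity = tp / (tp + fn)   # share of positives that are detected
    specificity = tn / (tn + fp)   # share of negatives that are cleared
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Illustrative counts only (hypothetical, not taken from the paper).
sens, spec, acc = classification_rates(tp=90, fn=10, tn=80, fp=20)
print(sens, spec, acc)
```

For an early warning system, sensitivity indicates how often a stressed group is flagged, and specificity indicates how often a healthy group is left alone.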

Sensorless Speed Control of Direct Current Motor by Neural Network (신경회로망을 이용한 직류전동기의 센서리스 속도제어)

김종수;강성주
- Journal of the Korea Institute of Information and Communication Engineering / v.7 no.8 / pp.1743-1750 / 2003
A DC motor requires a rotor speed sensor for accurate speed control. Speed sensors such as resolvers and encoders are used as speed detectors, but they increase the cost and size of the motor and restrict industrial drive applications. Many papers have therefore reported on the sensorless operation of DC motors〔35〕. This paper presents a new sensorless strategy using neural networks〔68〕. The neural network has three layers: an input layer, a hidden layer, and an output layer. The optimal network structure was found by trial and error, and a 4-16-1 structure gave suitable results for the instantaneous rotor speed. The learning method is also very important in a neural network. Supervised learning methods〔8〕 are typically used to train the network on the presented input/output patterns, and the backpropagation technique adjusts the network weights during training. The rotor speed is obtained from the weights and the four inputs to the network. The experimental results were satisfactory in terms of both independence from machine parameters and insensitivity to load conditions.
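The three-layer network described above, read here as four inputs, 16 hidden units, and one output, can be sketched in plain Python together with one backpropagation step. The abstract does not say which four signals feed the network or which activation is used, so the input vector, the `tanh` hidden activation, the linear output, and the learning rate below are all assumptions for illustration.

```python
import math
import random


def init_network(n_in=4, n_hidden=16, seed=0):
    """Random small weights for an n_in -> n_hidden -> 1 network."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    b1 = [0.0] * n_hidden
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
    b2 = [0.0]  # single-element list so it can be updated in place
    return w1, b1, w2, b2


def forward(net, x):
    """Return (hidden activations, scalar output)."""
    w1, b1, w2, b2 = net
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w1, b1)]
    out = sum(w * h for w, h in zip(w2, hidden)) + b2[0]
    return hidden, out


def train_step(net, x, target, lr=0.05):
    """One backpropagation update on a single sample; returns the loss
    measured before the update."""
    w1, b1, w2, b2 = net
    hidden, out = forward(net, x)
    err = out - target
    # Hidden-layer deltas use the output weights from before the update.
    d_hidden = [err * w2[j] * (1.0 - hidden[j] ** 2) for j in range(len(hidden))]
    for j in range(len(hidden)):
        w2[j] -= lr * err * hidden[j]
        b1[j] -= lr * d_hidden[j]
        for i in range(len(x)):
            w1[j][i] -= lr * d_hidden[j] * x[i]
    b2[0] -= lr * err
    return 0.5 * err ** 2


net = init_network()
sample = [0.2, -0.1, 0.4, 0.3]   # hypothetical normalized input signals
speed = 0.5                      # hypothetical normalized rotor speed
losses = [train_step(net, sample, speed) for _ in range(200)]
hidden, estimate = forward(net, sample)
print(losses[0], losses[-1], estimate)
```

Once trained, only `forward` is needed at run time, which is what makes the speed estimate a function of the stored weights and the four inputs.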

On the configuration of learning parameter to enhance convergence speed of back propagation neural network (역전파 신경회로망의 수렴속도 개선을 위한 학습파라메타 설정에 관한 연구)

홍봉화;이승주;조원경
- Journal of the Korean Institute of Telematics and Electronics B / v.33B no.11 / pp.159-166 / 1996
In this paper, a method is proposed for improving the convergence speed and learning rate of back propagation algorithms. It updates the learning rate parameter and momentum term of each weight according to the generated error: the update produces a high value when the output of the network's output layer is far from the desired value and a low value in the opposite case. This method decreases the number of iterations and learns effectively. The effectiveness of the proposed method is verified through simulations of the XOR and 3-parity problems.
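The idea of scaling the learning rate and momentum term with the output error can be sketched as follows. The abstract does not state the actual update rule, so the linear interpolation, the clipping of the error to 1, and the parameter ranges are assumptions chosen only to show the qualitative behavior: large coefficients when the output is far from the target and small ones when it is close.

```python
def adaptive_coefficient(error, low, high):
    """Interpolate between `low` and `high` using |error| clipped to [0, 1]."""
    magnitude = min(abs(error), 1.0)
    return low + (high - low) * magnitude


def update_weight(weight, gradient, previous_delta, error,
                  eta_range=(0.05, 0.9), alpha_range=(0.1, 0.9)):
    """Gradient step with momentum whose coefficients depend on the error.

    eta_range and alpha_range are hypothetical bounds for the learning
    rate and momentum term; returns (new_weight, delta).
    """
    eta = adaptive_coefficient(error, *eta_range)
    alpha = adaptive_coefficient(error, *alpha_range)
    delta = -eta * gradient + alpha * previous_delta
    return weight + delta, delta


# Far from the target: large step. Close to the target: small step.
far_weight, far_delta = update_weight(0.0, 1.0, 0.0, error=1.0)
near_weight, near_delta = update_weight(0.0, 1.0, 0.0, error=0.0)
print(far_delta, near_delta)
```

A rule of this shape takes large steps early in training and small ones near convergence, which is consistent with the reduced iteration count the abstract reports.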

Actor-Critic Algorithm with Transition Cost Estimation

Sergey, Denisov;Lee, Jee-Hyong
- International Journal of Fuzzy Logic and Intelligent Systems / v.16 no.4 / pp.270-275 / 2016
We present an approach for accelerating the actor-critic algorithm for reinforcement learning with a continuous action space. The actor-critic algorithm has already proved robust to infinitely large action spaces in various high-dimensional environments. Despite that success, its main problem remains the same: the speed of convergence to the optimal policy. In high-dimensional state and action spaces, searching for the correct action in each state takes an enormously long time. Therefore, in this paper we suggest a search-accelerating function that speeds up convergence of the algorithm so that it reaches the optimal policy faster. In our method, we assume that actions may have their own distribution of preference that is independent of the state. Since the agent acts randomly in the environment at the beginning of learning, it would be more efficient if actions were taken according to some heuristic function. We demonstrate that the heuristically accelerated actor-critic algorithm learns the optimal policy faster, using the Educational Process Mining dataset with records of students' course learning processes and their grades.
https://doi.org/10.5391/IJFIS.2016.16.4.270

A Study on Efficient Memory Management Using Machine Learning Algorithm

Park, Beom-Joo;Kang, Min-Soo;Lee, Minho;Jung, Yong Gyu
- International journal of advanced smart convergence / v.6 no.1 / pp.39-43 / 2017
As industry grows, the amount of data grows exponentially, and analysis of these data serves as a predictive solution. As data size and processing speed increase, data analysis has begun to be applied to new fields by combining artificial intelligence technology with simple big data analysis. In this paper, we propose a method to quickly apply a machine learning-based algorithm through efficient resource allocation. The proposed algorithm allocates memory for each attribute: it learns the distinct values of each attribute and allocates the appropriate memory. To evaluate the performance of the proposed algorithm, we compared it with the existing K-means algorithm. Measurements of the execution time showed that the speed was improved.
https://doi.org/10.7236/IJASC.2017.6.1.39
