• Title/Abstract/Keywords: Learning Speed

Search results: 1,154 items (processing time: 0.024 seconds)

가중 기여도를 이용한 퍼지 Q-learning (Fuzzy Q-learning using Weighted Eligibility)

  • 정석일;이연정
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2000년도 추계학술대회 학술발표 논문집
    • /
    • pp.163-167
    • /
    • 2000
  • Eligibility is used to solve the credit-assignment problem, one of the important problems in reinforcement learning. Conventional eligibilities, namely the accumulating eligibility and the replacing eligibility, make ineffective use of the rewards acquired during learning because they learn only the executed action in a visited state. We therefore propose a new eligibility, called the weighted eligibility, with which not only the executed action but also neighboring actions in a visited state are learned. The fuzzy Q-learning algorithm using the proposed eligibility is applied to a cart-pole balancing problem and shows improved learning speed.

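As a rough illustration of the trace updates contrasted in the abstract above, the following Python sketch implements the standard accumulating and replacing rules side by side. The "weighted" branch is only a hypothetical stand-in: the abstract does not give the authors' weighting formula, so the `spread` parameter, the use of adjacent action indices as "neighboring actions", and the function name `update_trace` are all assumptions made for this example.

```python
def update_trace(trace, state, action, mode, n_actions, decay=0.9, spread=0.5):
    """Return a decayed copy of `trace` with credit added for (state, action).

    trace: dict mapping (state, action) -> eligibility value.
    mode: "accumulating", "replacing", or "weighted" (hypothetical rule).
    decay: stands in for the usual product gamma * lambda.
    spread: assumed share of credit given to neighboring actions.
    """
    # Every existing eligibility fades before the new visit is recorded.
    new_trace = {key: decay * value for key, value in trace.items()}
    key = (state, action)
    if mode == "accumulating":
        # Accumulating trace: add 1 on top of the decayed value.
        new_trace[key] = new_trace.get(key, 0.0) + 1.0
    elif mode == "replacing":
        # Replacing trace: reset the visited pair to exactly 1.
        new_trace[key] = 1.0
    elif mode == "weighted":
        new_trace[key] = 1.0
        # Assumption: actions with adjacent indices also receive partial credit.
        for neighbor in (action - 1, action + 1):
            if 0 <= neighbor < n_actions:
                nkey = (state, neighbor)
                new_trace[nkey] = max(new_trace.get(nkey, 0.0), spread)
    else:
        raise ValueError(f"unknown mode: {mode}")
    return new_trace


# Visit the same state-action pair twice under each conventional rule.
first = update_trace({}, state=0, action=1, mode="accumulating", n_actions=3)
accumulated = update_trace(first, 0, 1, "accumulating", n_actions=3)
replaced = update_trace(first, 0, 1, "replacing", n_actions=3)
weighted = update_trace({}, 0, 1, "weighted", n_actions=3)
```

Under the conventional rules only the pair `(0, 1)` ever holds a nonzero eligibility, whereas the assumed weighted rule also marks `(0, 0)` and `(0, 2)`, so a single reward would update the Q-values of neighboring actions as well.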

AETLA를 이용한 이진 신경회로망의 최적 합성방법 (Optimal Method for Binary Neural Network using AETLA)

  • 성상규;정종원;이준탁
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2001년도 춘계학술대회 학술발표 논문집
    • /
    • pp.105-108
    • /
    • 2001
  • In this paper, a learning algorithm called the advanced expanded and truncate algorithm (AETLA) is proposed for training multilayer binary neural networks to approximate binary-to-binary mappings. AETLA combines the merits of the ETL and MTGA learning algorithms. The new learning algorithm is designed to decrease the number of hidden layers. As a result, the learning speed of the proposed AETLA is much faster than that of other learning algorithms.


건물 내 화재 발생 시 사물 인터넷과 강화 학습을 활용한 실시간 안전 대피 경로 방안 개발 (Development of a Real-time Safest Evacuation Route using Internet of Things and Reinforcement Learning in Case of Fire in a Building)

  • 안유선;최하늘
    • 한국안전학회지
    • /
    • Vol. 37, No. 2
    • /
    • pp.97-105
    • /
    • 2022
  • Human casualties from fires are increasing worldwide. The majority of human deaths occur during the evacuation process, as occupants panic and are unaware of the location of the fire and evacuation routes. Using an Internet of Things (IoT) sensor and reinforcement learning, we propose a method to find the safest evacuation route by considering the fire location, flame speed, occupant position, and walking conditions. The first step is detecting the fire with IoT-based devices. The second step is identifying the occupant's position via a beacon connected to the occupant's mobile phone. In the third step, the collected information, flame speed, and walking conditions are input into the reinforcement learning model to derive the optimal evacuation route. This study makes it possible to provide the safest evacuation route for individual occupants in real time. This study is expected to reduce human casualties caused by fires.

Development of a Real-Time Automatic Passenger Counting System using Head Detection Based on Deep Learning

  • Kim, Hyunduk;Sohn, Myoung-Kyu;Lee, Sang-Heon
    • Journal of Information Processing Systems
    • /
    • Vol. 18, No. 3
    • /
    • pp.428-442
    • /
    • 2022
  • A reliable automatic passenger counting (APC) system is key to the efficient scheduling and management of transport routes. In this study, we introduce a lightweight deep learning-based head detection network applicable to embedded systems. Object detection algorithms using deep learning have been successful, but they essentially need a graphics processing unit (GPU) to run in real time. We therefore modify a Tiny-YOLOv3 network using certain techniques to speed up the proposed network and to make it more accurate in a non-GPU environment. Finally, we introduce an APC system that runs in real time on embedded systems using the proposed head detection algorithm. We implement and test the proposed APC system on a Samsung ARTIK 710 board. The experimental results on three public head datasets reflect the detection accuracy and efficiency of the proposed head detection network against Tiny-YOLOv3. Moreover, to test the proposed APC system, we measured the accuracy and recognition speed by repeating 50 instances of entering and 50 instances of exiting. These experimental results showed 99% accuracy and a 0.041-second recognition speed despite the fact that only the CPU was used.

비전 센서 및 딥러닝 기반 선박 접안을 위한 어라운드뷰 모니터링 시스템 (Vision Sensor and Deep Learning-based Around View Monitoring System for Ship Berthing)

  • 김한근;김동훈;박별터;이승목
    • 대한임베디드공학회논문지
    • /
    • Vol. 15, No. 2
    • /
    • pp.71-78
    • /
    • 2020
  • This paper proposes a vision sensor and deep learning-based around view monitoring system for ship berthing. Berthing a ship at a port requires precise information on the relative position and relative speed between the mooring facility and the ship. Ships of Handysize or larger must be docked with the help of pilots and tugboats. In the case of ships handling dangerous cargo, tugboats push the ship and dock it in the port using the distance and velocity information received from the berthing aid system (BAS). However, the existing BAS is very expensive, and there is a limit on the size of the vessel that can be measured. In addition, it is difficult to measure distance and speed when there are obstacles near the port. This paper proposes a relative distance and speed estimation system that can be used as a ship berthing assist system. The proposed system is verified by comparing its performance with that of the existing laser-based distance and speed measurement system through field tests at an actual port.

스트레스 조건에 노출된 Angelfish Pterophyllum scalare의 행동 변화 분석 및 예측 (Analysis and Prediction of Behavioral Changes in Angelfish Pterophyllum scalare Under Stress Conditions)

  • 김윤재;노혜민;김도형
    • 한국수산과학회지
    • /
    • Vol. 54, No. 6
    • /
    • pp.965-973
    • /
    • 2021
  • The behavior of angelfish Pterophyllum scalare exposed to low and high temperatures was monitored by video tracking, and information such as the initial speed, changes in speed, and locations of the fish in the tank were analyzed. The water temperature was raised from 26℃ to 36℃ or lowered from 26℃ to 16℃ for 4 h. The control group was maintained at 26℃ for 8 h. The experiment was repeated five times for each group. Machine learning analysis comprising a long short-term memory model was used to train and test the behavioral data (80 s) after pre-processing. Results showed that when the water temperature changed to 36℃ or 16℃, the average speed, changes in speed and fractal dimension value were significantly lower than those in the control group. Machine learning analysis revealed that the accuracy of 80-s video footage data was 87.4%. The machine learning used in this study could distinguish between the optimal temperature group and changing temperature groups with specificity and sensitivity percentages of 86.9% and 87.4%, respectively. Therefore, video tracking technology can be used to effectively analyze fish behavior. In addition, it can be used as an early warning system for fish health in aquariums and fish farms.

신경회로망을 이용한 직류전동기의 센서리스 속도제어 (Sensorless Speed Control of Direct Current Motor by Neural Network)

  • 김종수;강성주
    • 한국정보통신학회논문지
    • /
    • Vol. 7, No. 8
    • /
    • pp.1743-1750
    • /
    • 2003
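  • Recently, efforts have been made to drive motors using sensorless speed estimation methods that are accurate and reliable. This paper presents the results of a study that realizes sensorless speed control of a DC motor using a neural network, which is highly robust to disturbances [6-8]. Just as the human brain learns through experience, a neural network learns to produce the optimal output for a given input. Training is carried out with the back-propagation learning algorithm [8], using the voltage, current, and rotor speed obtained from a mathematical model of the DC motor as input and output data. After training is complete, the speed is estimated using the resulting optimal connection weights. The neural network approach can estimate speed accurately without complex algorithms, and it resolves a known problem of DC motors, namely the performance degradation and difficulty of speed control caused by heating of the rotor winding. As a result, the controller is robust to disturbances arising from operating conditions and shows excellent speed response over the entire speed range.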

역전파 신경회로망의 수렴속도 개선을 위한 학습파라메타 설정에 관한 연구 (On the configuration of learning parameter to enhance convergence speed of back propagation neural network)

  • 홍봉화;이승주;조원경
    • 전자공학회논문지B
    • /
    • Vol. 33B, No. 11
    • /
    • pp.159-166
    • /
    • 1996
  • In this paper, a method is proposed for improving the convergence speed and learning rate of the back-propagation algorithm. The method updates the learning rate parameter and momentum term for each weight according to the generated error: it generates a high value when the output of the network's output layer is far from the desired value and a low value in the opposite case. This method decreases the number of iterations and enables effective learning. The effectiveness of the proposed method is verified through simulations of the XOR and 3-parity problems.

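The idea in the abstract above, a larger step when the output is far from the target and a smaller one when it is close, can be sketched in a few lines of Python. This is not the authors' update rule, which the abstract does not specify: the linear interpolation in `error_scaled_rate`, the bounds `low` and `high`, and the single-neuron OR example are assumptions chosen to keep the sketch short, and the momentum term is omitted.

```python
import math


def error_scaled_rate(error, low=0.1, high=0.9):
    """Interpolate between `low` and `high` by the error magnitude (clipped to 1)."""
    magnitude = min(abs(error), 1.0)
    return low + (high - low) * magnitude


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def predict(weights, bias, inputs):
    """Output of one sigmoid neuron."""
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)) + bias)


def train_neuron(samples, epochs=2000):
    """Train one sigmoid neuron online with an error-scaled learning rate."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for inputs, target in samples:
            output = predict(weights, bias, inputs)
            error = target - output
            # Assumed rule: the step size grows with the output error.
            rate = error_scaled_rate(error)
            # Delta rule for a sigmoid unit under squared error.
            delta = error * output * (1.0 - output)
            weights = [w + rate * delta * x for w, x in zip(weights, inputs)]
            bias += rate * delta
    return weights, bias


# Logical OR is used here because a single neuron can represent it;
# the XOR and 3-parity problems in the paper need a hidden layer.
or_samples = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
weights, bias = train_neuron(or_samples)
predictions = [round(predict(weights, bias, x)) for x, _ in or_samples]
```

Scaling the rate by the error keeps early updates large while the network is far from the targets and shrinks them automatically as training converges, which is the behavior the abstract credits with reducing the iteration count.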

Actor-Critic Algorithm with Transition Cost Estimation

  • Sergey, Denisov;Lee, Jee-Hyong
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • Vol. 16, No. 4
    • /
    • pp.270-275
    • /
    • 2016
  • We present an approach for accelerating the actor-critic algorithm for reinforcement learning with a continuous action space. The actor-critic algorithm has already proved robust to infinitely large action spaces in various high-dimensional environments. Despite that success, its main problem remains the same: the speed of convergence to the optimal policy. In high-dimensional state and action spaces, searching for the correct action in each state takes an enormously long time. Therefore, in this paper we suggest a search-accelerating function that improves the speed of convergence and reaches the optimal policy faster. In our method, we assume that actions may have their own distribution of preference that is independent of the state. Since the agent acts randomly in the environment at the beginning of learning, it would be more efficient if actions were taken according to some heuristic function. We demonstrate that the heuristically accelerated actor-critic algorithm learns the optimal policy faster, using the Educational Process Mining dataset with records of students' course learning processes and their grades.

A Study on Efficient Memory Management Using Machine Learning Algorithm

  • Park, Beom-Joo;Kang, Min-Soo;Lee, Minho;Jung, Yong Gyu
    • International journal of advanced smart convergence
    • /
    • Vol. 6, No. 1
    • /
    • pp.39-43
    • /
    • 2017
  • As industry grows, the amount of data grows exponentially, and analysis of these data serves as a predictive solution. As data sizes and processing speeds increase, data analysis has begun to be applied to new fields by combining artificial intelligence technology with simple big data analysis. In this paper, we propose a method to quickly apply a machine learning-based algorithm through efficient resource allocation. The proposed algorithm allocates memory for each attribute by learning the distinct values of the attribute and allocating the appropriate amount of memory. To evaluate the performance of the proposed algorithm, we compared it with the existing K-means algorithm. Measurements of execution time showed that the speed was improved.