Integrated Search | Korea Science

Actor-Critic Algorithm with Transition Cost Estimation

Sergey, Denisov;Lee, Jee-Hyong
- International Journal of Fuzzy Logic and Intelligent Systems / Vol. 16, No. 4 / pp. 270-275 / 2016
We present an approach for accelerating the actor-critic algorithm for reinforcement learning with a continuous action space. The actor-critic algorithm has already proved robust to infinitely large action spaces in various high-dimensional environments. Despite that success, its main problem remains the same: the speed of convergence to the optimal policy. In high-dimensional state and action spaces, searching for the correct action in each state takes an enormously long time. Therefore, in this paper we suggest a search-accelerating function that speeds up convergence and reaches the optimal policy faster. In our method, we assume that actions may have their own distribution of preference that is independent of the state. Since the agent acts randomly in the environment at the beginning of learning, it would be more efficient if actions were taken according to some heuristic function. Using the Educational Process Mining dataset, which records students' course learning processes and their grades, we demonstrate that the heuristically accelerated actor-critic algorithm learns the optimal policy faster.
https://doi.org/10.5391/IJFIS.2016.16.4.270

An Efficient Local Search Algorithm for the Asymmetric Traveling Salesman Problem Using 3-Opt

김경구;권상호;강맹규
- 산업경영시스템학회지 / Vol. 23, No. 59 / pp. 1-10 / 2000
The traveling salesman problem is a representative NP-complete problem. The time needed to obtain a solution grows as the number of cities increases, so an efficient heuristic algorithm that finds a good solution in a short time is needed. Most edges that belong to an optimal tour have relatively low cost. This paper discusses the properties of nearest neighbor and 3-opt. It uses the nearest-neighbor property to select candidate edges, a set of edges with a high probability of improving the tour. One of the candidate edges is inserted into the initial tour. Because the inserted edge connects two cities, the tour no longer satisfies the Hamiltonian cycle rule that every city must be entered and left exactly once. The paper therefore uses the 3-opt method to maintain a Hamiltonian cycle while inserting the edge. The resulting heuristic algorithm is reported to be highly efficient, as verified by numerous experiments.
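The candidate-edge idea in this abstract can be illustrated with a minimal sketch. It assumes that "nearest-neighbor property" means keeping, for each city, the directed edges to its k cheapest successors; the function name `candidate_edges`, the parameter `k`, and the cost matrix are hypothetical and are not taken from the paper. The 3-opt repair step is not shown.

```python
from typing import List, Set, Tuple


def candidate_edges(cost: List[List[float]], k: int) -> Set[Tuple[int, int]]:
    """Collect directed edges (i, j) where j is one of the k cheapest successors of i.

    The matrix may be asymmetric: cost[i][j] need not equal cost[j][i].
    """
    n = len(cost)
    edges: Set[Tuple[int, int]] = set()
    for i in range(n):
        # Rank every other city by the cost of travelling from i to it.
        successors = sorted((j for j in range(n) if j != i), key=lambda j: cost[i][j])
        for j in successors[:k]:
            edges.add((i, j))
    return edges


# Illustrative asymmetric cost matrix (made-up numbers).
example_cost = [
    [0, 2, 9, 10],
    [1, 0, 6, 4],
    [15, 7, 0, 8],
    [6, 3, 12, 0],
]
print(sorted(candidate_edges(example_cost, k=1)))
```

Restricting insertion attempts to such a set keeps the number of moves examined per city small, which is the usual motivation for candidate lists in local search.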

Design of multi-span steel box girder using lion pride optimization algorithm

Kaveh, A.;Mahjoubi, S.
- Smart Structures and Systems / Vol. 20, No. 5 / pp. 607-618 / 2017
In this research, a newly developed nature-inspired optimization method, the Lion Pride Optimization algorithm (LPOA), is utilized for the optimal design of composite steel box girder bridges. The composite box girder bridge is a common type of bridge for medium spans because of its economic, aesthetic, and structural benefits. The aim of the present optimization procedure is to provide a feasible set of design variables that minimizes the weight of the steel trapezoidal box girders. The solution space is delimited by different types of design constraints specified by the American Association of State Highway and Transportation Officials. Additionally, the optimal solution obtained by LPOA is compared with the results of other well-established meta-heuristic algorithms, namely Gray Wolf Optimization (GWO) and Ant Lion Optimizer (ALO), and with the results of previous research. This comparison demonstrates the capability of LPOA in the optimal design of composite steel box girder bridges.
https://doi.org/10.12989/sss.2017.20.5.607

Implementation and Evaluation of Path-Finding Algorithm using Abstract Graphs

김지수;이지완;조대수
- 한국정보통신학회: Conference Proceedings / 한국해양정보통신학회 2009 Fall Conference / pp. 245-248 / 2009
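Recent research on device-based path finding aims to reflect dynamic information. However, most of the proposed algorithms are based on the $A^{\ast}$ algorithm. Heuristic-based algorithms can suffer from increased search cost in the following cases: when no actual path exists along the estimated path determined by the heuristic, and when two or more paths have similar heuristic weight values. This paper evaluates the performance of abstract graphs built by different generation methods. An abstract graph is a simplified version of the real road network, proposed to reduce both the dependence on the heuristic and the search cost. Depending on the generation method, abstract graphs are divided into those built by merging nodes with the same characteristics ($AG^H$) and those built by merging connected nodes ($AG^C$). In the experiments, $AG^C$ performed better in terms of generation cost, whereas $AG^H$ performed better in terms of search performance.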

Economic Life Assessment of Power Transformer using HS Optimization Algorithm

이태봉;손진근
- 전기학회논문지P / Vol. 66, No. 3 / pp. 123-128 / 2017
Electric utilities have considered introducing asset management (AM) of electric power facilities in order to reduce the maintenance cost of existing facilities and maximize profit. Decisions on repairing or replacing power transformers require not only a tally of parts and labor costs but also a comprehensive comparison that includes reliability and cost. Therefore, this study models the input cost of a power transformer over its entire life and applies the life cycle cost (LCC) technique. In particular, this paper applies the heuristic harmony search (HS) optimization algorithm to the convergence and validity of the LCC-based economic life assessment of power transformers. The recently developed HS algorithm is conceptualized on the musical process of searching for a perfect state of harmony. It uses a stochastic random search instead of a gradient search, so derivative information is unnecessary. The effectiveness of the proposed method is demonstrated through an economic life assessment simulation of a power transformer using the HS optimization algorithm.
https://doi.org/10.5370/KIEEP.2017.66.3.123
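The derivative-free, stochastic character of harmony search described in this abstract can be sketched as follows. This is a generic textbook-style HS loop under assumed parameter names (`hmcr` for the memory considering rate, `par` for the pitch adjusting rate, `bandwidth`); it is not the authors' implementation, and the objective `toy_cost` is a placeholder, not the paper's LCC model.

```python
import random
from typing import Callable, List, Tuple


def harmony_search(
    objective: Callable[[List[float]], float],
    bounds: List[Tuple[float, float]],
    memory_size: int = 10,
    hmcr: float = 0.9,
    par: float = 0.3,
    bandwidth: float = 0.1,
    iterations: int = 2000,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Minimize `objective` inside `bounds` without using any gradient information."""
    rng = random.Random(seed)
    # Harmony memory: a pool of random candidate solutions and their scores.
    memory = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(memory_size)]
    scores = [objective(h) for h in memory]
    for _ in range(iterations):
        new: List[float] = []
        for d, (lo, hi) in enumerate(bounds):
            if rng.random() < hmcr:
                # Reuse a remembered value, optionally nudged (pitch adjustment).
                value = rng.choice(memory)[d]
                if rng.random() < par:
                    value += rng.uniform(-bandwidth, bandwidth)
            else:
                # Otherwise draw a fresh random value.
                value = rng.uniform(lo, hi)
            new.append(min(hi, max(lo, value)))
        score = objective(new)
        # Replace the worst remembered harmony only if the new one is better.
        worst = max(range(memory_size), key=lambda i: scores[i])
        if score < scores[worst]:
            memory[worst], scores[worst] = new, score
    best = min(range(memory_size), key=lambda i: scores[i])
    return memory[best], scores[best]


def toy_cost(x: List[float]) -> float:
    """Placeholder convex cost with its minimum at x[0] = 30 (illustrative only)."""
    return (x[0] - 30.0) ** 2


best_point, best_value = harmony_search(toy_cost, [(1.0, 60.0)])
print(best_point, best_value)
```

Because the worst harmony is replaced only by a strictly better one, the best score in memory never gets worse as the iterations proceed.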

STUDY ON THE REAL TIME VOLTAGE-REACTIVE POWER CONTROL USING THE FUZZY THEORY

송길영;김세영;조준우
- 대한전기학회: Conference Proceedings / Proceedings of the 대한전기학회 1990 Fall Conference, 학회본부 / pp. 231-234 / 1990
This paper presents a real-time control technique for voltage and reactive power using fuzzy theory. The major benefits of applying fuzzy set theory are as follows. First, the heuristic knowledge of operators is used in the operation and control of the power system. Second, difficulties in traditional multi-objective numerical solution methods are resolved. In addition, a conventional search method is used to carry out the optimization process for voltage-reactive power control.

A Symbolic Layout Generator for CMOS Standard Cells Using Artificial Intelligence Approach

유종근;이문기
- 대한전자공학회논문지 / Vol. 24, No. 6 / pp. 1080-1086 / 1987
SLAGEN, a system for symbolic cell layout based on an artificial intelligence approach, takes as input a transistor connection description of CMOS standard cells and environment information, and outputs a symbolic layout description. SLAGEN performs transistor grouping by a heuristic search method in order to minimize the number of separations, and then performs group reordering and transistor reordering with an eye toward minimizing routing. Next, SLAGEN creates a rough initial routing in order to guarantee functionality and correctness, and then improves the initial routing with a rule-based approach.

Enhanced Methods of Path Finding Based on An Abstract Graph with Extension of Search Space

조대수
- 한국정보통신학회논문지 / Vol. 15, No. 1 / pp. 157-162 / 2011
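This paper proposes search-space extension techniques to address the increase in the cost of paths found by abstract-graph-based path-finding algorithms. The proposed techniques extract buffering cells and set them, together with the valid cells, as the search space; simple buffering, speed-limited buffering, and distance-limited buffering are proposed and evaluated. Simple buffering extracts the cells adjacent to valid cells as buffering cells, while speed-limited and distance-limited buffering restrict the buffering cells extracted by simple buffering with respect to speed and distance, excluding buffering cells that fall short of a threshold. The evaluation shows that extending the search space reduces the cost of the paths found. The proposed techniques are expected to be useful in developing telematics application services such as path finding and logistics management.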
https://doi.org/10.6109/jkiice.2011.15.1.157
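The simple buffering step described in this abstract (taking the cells adjacent to valid cells as buffering cells) might look like the sketch below. It assumes a rectangular grid addressed by `(row, col)` and an 8-cell neighborhood; the abstract specifies neither, and the function name `simple_buffering` is hypothetical. The speed-limited and distance-limited variants are not shown because their thresholds are not given here.

```python
from typing import Set, Tuple

Cell = Tuple[int, int]


def simple_buffering(valid_cells: Set[Cell], rows: int, cols: int) -> Set[Cell]:
    """Return the grid cells adjacent to any valid cell that are not valid themselves."""
    buffering: Set[Cell] = set()
    for r, c in valid_cells:
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                neighbor = (r + dr, c + dc)
                inside = 0 <= neighbor[0] < rows and 0 <= neighbor[1] < cols
                if inside and neighbor not in valid_cells:
                    buffering.add(neighbor)
    return buffering


# The extended search space is the union of valid cells and buffering cells.
valid = {(0, 0), (1, 1)}
search_space = valid | simple_buffering(valid, rows=3, cols=3)
print(sorted(search_space))
```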

A Heuristic Optimal Path Search Considering Cumulative Transfer Functions

신성일;백남철;남두희
- 한국ITS학회 논문지 / Vol. 15, No. 3 / pp. 60-67 / 2016
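In a cumulative transfer function, as the number of transfers increases, the effect of each individual transfer on the transfer cost grows linearly or nonlinearly. The function effectively describes the behavior of passengers choosing routes on public transit lines such as bus or rail. It makes it possible to model the common situation in which a passenger chooses a transit route with fewer transfers even if it takes more travel time. However, a travel cost that includes a cumulative transfer function is non-additive, so finding the optimal path involves the difficult task of path enumeration. This study proposes an effective method for finding the optimal path while considering the cumulative transfer function. To this end, it first explains the optimal path reversal phenomenon that appears during path search when the cumulative transfer function is included. It then proposes a heuristic method that searches for multiple paths and selects the minimum-cost one as the optimal path. The entering-link-based entire path deletion method is adopted as the multiple-path search technique, and a method for finding K paths is proposed based on the provability of the algorithm's path optimality condition. The applicability of the proposed method to real transportation networks is discussed through a case study that introduces a transfer coefficient.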
https://doi.org/10.12815/kits.2016.15.3.060
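The optimal path reversal mentioned in this abstract can be reproduced with a small worked example. The quadratic penalty, the coefficient value, and all travel times below are assumptions chosen for illustration; the paper's actual transfer function and its K-path search procedure are not given here.

```python
from typing import Sequence


def transfer_penalty(transfers: int, coefficient: float = 5.0) -> float:
    """Assumed nonlinear cumulative penalty: grows with the square of the transfer count."""
    return coefficient * transfers ** 2


def path_cost(link_times: Sequence[float], transfers: int, coefficient: float = 5.0) -> float:
    """Total cost = additive travel time + non-additive cumulative transfer penalty."""
    return sum(link_times) + transfer_penalty(transfers, coefficient)


# Two subpaths that reach the same intermediate stop.
a_partial = path_cost([10.0], transfers=1)  # faster, but already one transfer
b_partial = path_cost([17.0], transfers=0)  # slower, no transfer yet

# Both continue over the same final leg (5 minutes), which needs one more transfer.
a_full = path_cost([10.0, 5.0], transfers=2)
b_full = path_cost([17.0, 5.0], transfers=1)

# Evaluating complete candidate paths and keeping the cheapest avoids the trap.
candidates = {"A": a_full, "B": b_full}
best = min(candidates, key=candidates.get)
print(a_partial, b_partial, a_full, b_full, best)
```

Subpath A is cheaper at the intermediate stop, yet path B is cheaper overall, so a label-setting search that discards B at the intermediate stop would miss the optimum. This is why complete candidate paths have to be compared.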

Optimal solution search method by using modified local updating rule in ACS-subpath algorithm

홍석미;이승관
- 디지털융복합연구 / Vol. 11, No. 11 / pp. 443-448 / 2013
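Ant Colony System (ACS) is a biologically inspired metaheuristic approach for solving combinatorial optimization problems. It is based on the trail-following behavior of real ants, which deposit pheromone on the paths they have traveled and use it as a communication medium. Finding the optimal path requires exploring a more diverse set of edges. The local updating rule of the existing ACS assigns a fixed pheromone update value to each traversed edge. In this paper, however, the local updating rule assigns pheromone by taking into account the total number of visits to the currently selected node in previous iterations. The search uses the ACS algorithm with subpaths. By using more information during the search, the method avoids falling into local optima and finds better solutions than the existing method.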
https://doi.org/10.14400/JDPM.2013.11.11.443
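For context, the fixed-value local updating rule of standard ACS can be sketched as below. The code shows only the conventional rule, in which the deposit is the constant initial pheromone level; the abstract does not give the formula of the frequency-based modification, so none is shown. The names `local_update`, `rho`, and `deposit` are illustrative.

```python
def local_update(tau: float, rho: float, deposit: float) -> float:
    """Standard ACS local update: tau <- (1 - rho) * tau + rho * deposit.

    In conventional ACS, `deposit` is the fixed initial pheromone level tau_0.
    A visit-frequency-based scheme would supply a different deposit value here.
    """
    return (1.0 - rho) * tau + rho * deposit


# Each time an ant crosses the edge, its pheromone moves toward the deposit level,
# which makes the edge less attractive to the ants that follow.
tau = 1.0
for _ in range(3):
    tau = local_update(tau, rho=0.1, deposit=0.5)
print(tau)
```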

285 search results (processing time: 0.023 s)
