References
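- 박찬건, 양성봉, 'Implementation of an agent using general-purpose online Q-learning that balances exploration and exploitation in reinforcement learning,' Journal of the Korea Information Science Society (B), Vol. 30, No. 7, pp. 672-680, 2003 (in Korean)
- 정태진, 장병탁, 'Web information retrieval using reinforcement learning,' In Proceedings of the 28th Korea Information Science Society Fall Conference, Vol. 28, No. 2, pp. 94-96, 2001 (in Korean)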
- C. J. Watkins and P. Dayan, 'Technical note: Q-learning,' Machine Learning, Vol. 8, pp. 279-292, 1992
- F. Menczer, 'ARACHNID: Adaptive retrieval agents choosing heuristic neighborhoods for information discovery,' In Proceedings of the 14th International Conference on Machine Learning, pp. 227-235, 1997
- H. Lieberman, 'Letizia: An agent that assists web browsing,' In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI-95), pp. 924-929, 1995
- J. Boyan, D. Freitag, and T. Joachims, 'A machine learning architecture for optimizing web search engines,' In Proceedings of the AAAI Workshop on Internet-Based Information Systems, pp. 1-8, 1996
- J. Peng and R. Williams, 'Incremental multi-step Q-learning,' Machine Learning, Vol. 22, pp. 283-290, 1996
- J. Rennie and A. McCallum, 'Using Reinforcement Learning to Spider the Web Efficiently,' In Proceedings of the 16th International Conference on Machine Learning (ICML-99), pp. 335-343, 1999
- L. P. Kaelbling, 'Learning in Embedded Systems,' PhD thesis, Department of Computer Science, Stanford University, 1990
- R. Dearden, N. Friedman, and S. Russell, 'Bayesian Q-Learning,' In Proceedings of AAAI-98, 1998
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, The MIT Press, 1998
- S. B. Thrun, 'The role of exploration in learning control,' Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, 1992
- T. Joachims, D. Freitag, and T. M. Mitchell, 'WebWatcher: A Tour Guide for the World Wide Web,' In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI'97), pp. 770-777, 1997
- T. M. Mitchell, Machine Learning, McGraw-Hill, 1997
- M. Tan, 'Multi-agent reinforcement learning: Independent vs. cooperative agents,' In Proceedings of the Tenth International Conference on Machine Learning, pp. 330-337, 1993