[KSCI] Korea Science Citation Index Service

Balancing the Tradeoffs Between Exploration and Exploitation

Park, Sun-Ju (Department of Business Administration, Yonsei University)

Publication Information

Journal of KIISE: Software and Applications / v.32, no.11, 2005, pp. 1099-1110

Abstract

As auctions become popular, developing good agent bidding strategies has become an important focus of agent-based electronic commerce research. The bidding strategy has particular practical significance for continuous double auctions, where no single dominant strategy is known. This paper introduces an adaptive agent strategy for the continuous double auction. The central idea is to let the agent determine at run time when the sophisticated strategy (called the p-strategy) is beneficial and when a simpler strategy is better. Exploration and exploitation are balanced by a heuristic exploration function that trades off the expected profit of each strategy against the number of times it has been tried. We experimentally evaluated the adaptive strategy in a wide variety of environments. The experimental results indicate that the adaptive strategy outperforms the plain p-strategy when the p-strategy performs poorly, and performs similarly to the p-strategy when the p-strategy dominates the simpler strategies.
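
The abstract does not give the form of the exploration function, so the following is only an illustrative sketch of the general idea, not the paper's method. It assumes the simple optimistic form common in reinforcement-learning textbooks: a strategy that has been tried fewer than `min_tries` times is valued at an optimistic profit, and afterwards at its observed mean profit. The names `StrategyStats`, `exploration_value`, `choose_strategy`, `record_outcome`, `optimistic_profit`, and `min_tries` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class StrategyStats:
    """Running record for one bidding strategy (hypothetical structure)."""
    tries: int = 0
    total_profit: float = 0.0

    @property
    def mean_profit(self) -> float:
        # Observed average profit; 0.0 before the strategy has been tried.
        return self.total_profit / self.tries if self.tries else 0.0


def exploration_value(mean_profit: float, tries: int,
                      optimistic_profit: float, min_tries: int) -> float:
    """Assumed exploration function: optimistic until tried min_tries times."""
    return optimistic_profit if tries < min_tries else mean_profit


def choose_strategy(stats: dict[str, StrategyStats],
                    optimistic_profit: float = 1.0,
                    min_tries: int = 5) -> str:
    """Pick the strategy with the highest exploration value."""
    return max(
        stats,
        key=lambda name: exploration_value(
            stats[name].mean_profit, stats[name].tries,
            optimistic_profit, min_tries),
    )


def record_outcome(stats: dict[str, StrategyStats],
                   name: str, profit: float) -> None:
    """Update the record after a strategy has been used once."""
    stats[name].tries += 1
    stats[name].total_profit += profit


# Usage: the under-tried simple strategy is explored first; once it has
# been tried enough, the choice falls back to observed mean profit.
stats = {
    "p-strategy": StrategyStats(tries=5, total_profit=1.0),
    "simple": StrategyStats(tries=2, total_profit=0.0),
}
first_choice = choose_strategy(stats)
for _ in range(3):
    record_outcome(stats, "simple", 0.0)
later_choice = choose_strategy(stats)
```

With these made-up numbers, `first_choice` is the under-tried simple strategy and `later_choice` is the p-strategy, which shows how the number of tries and the expected profit each influence the selection.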

Keywords

Auctions; Continuous Double Auctions; Auction Agents; Bidding Strategy; Agent Development

