Reinforcement Learning Papers

 

A Learning Based Algorithm for Traveling Salesman Problem : 임준묵, 강진규, 길본일수, 임재국, Korean Institute of Industrial Engineers, 2002

Design and Implementation of Robot Soccer Agent Based on Reinforcement Learning : 김인철, Korea Information Processing Society, 2002

Multiagent Control Strategy Using Reinforcement Learning : 김병천, 이형일, Korea Information Processing Society, 2003

A Reinforcement Learning Method Using TD-Error in Ant Colony System : 정태충, 이승관, Korea Information Processing Society, 2004

Reinforcement Learning Using Propagation of Goal-State-Value : 윤병주, 김병천, Korea Information Processing Society, 1999

Online Reinforcement Learning to Search the Shortest Path in Maze Environments : 김병천, 김삼근, 윤병주, Korea Information Processing Society, 2002

Reinforcement Learning Using State Space Compression : 윤병주, 김병천, Korea Information Processing Society, 1999

A Study on Environmental Adaptation and Expansion of Intelligent Agent : 백혜정, 박영택, Korea Information Processing Society, 2003

Function Approximation for Reinforcement Learning Using Fuzzy Clustering : 이영아, 정태충, 정경숙, Korea Information Processing Society, 2003

A Study about Additional Reinforcement in Local Updating and Global Updating for Efficient Path Search in Ant Colony System : 이승관, 정태충, Korea Information Processing Society, 2003

Web Information Retrieval Using Reinforcement Learning : 정태진, Master's thesis, Department of Computer Engineering, Seoul National University, 2002

The National Science Foundation Workshop on Reinforcement Learning : Sridhar Mahadevan and Leslie Pack Kaelbling, AI Magazine 17(4): Winter 1996, 89-93

A Review of Reinforcement Learning : Sebastian Thrun and Michael L. Littman, AI Magazine 21(1): Spring 2000, 103-105

Reinforcement Learning: A Survey : Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore, Journal of Artificial Intelligence Research 4 (1996), pp. 237–285

Reinforcement Learning: An Introduction : Richard Sutton and Andrew Barto, MIT Press, 1998

Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions : Sridhar Mahadevan, in Proceedings of the Ninth International Workshop on Machine Learning (ML92), pp. 290-299, San Francisco: Morgan Kaufmann, 1992.

Automatic Programming of Behavior-Based Robots Using Reinforcement Learning : Sridhar Mahadevan and Jonathan Connell, Artificial Intelligence, 55(2-3):311-365, 1992.