Journal articles on the topic 'Q-learning'
Consult the top 50 journal articles for your research on the topic 'Q-learning.'
Watkins, Christopher J. C. H., and Peter Dayan. "Q-learning." Machine Learning 8, no. 3-4 (May 1992): 279–92. http://dx.doi.org/10.1007/bf00992698.
Clausen, C., and H. Wechsler. "Quad-Q-learning." IEEE Transactions on Neural Networks 11, no. 2 (March 2000): 279–94. http://dx.doi.org/10.1109/72.839000.
ten Hagen, Stephan, and Ben Kröse. "Neural Q-learning." Neural Computing & Applications 12, no. 2 (November 1, 2003): 81–88. http://dx.doi.org/10.1007/s00521-003-0369-9.
Wang, Yin-Hao, Tzuu-Hseng S. Li, and Chih-Jui Lin. "Backward Q-learning: The combination of Sarsa algorithm and Q-learning." Engineering Applications of Artificial Intelligence 26, no. 9 (October 2013): 2184–93. http://dx.doi.org/10.1016/j.engappai.2013.06.016.
Evseenko, Alla, and Dmitrii Romannikov. "Application of Deep Q-learning and double Deep Q-learning algorithms to the task of control an inverted pendulum." Transaction of Scientific Papers of the Novosibirsk State Technical University, no. 1-2 (August 26, 2020): 7–25. http://dx.doi.org/10.17212/2307-6879-2020-1-2-7-25.
Abedalguni, Bilal. "Bat Q-learning Algorithm." Jordanian Journal of Computers and Information Technology 3, no. 1 (2017): 51. http://dx.doi.org/10.5455/jjcit.71-1480540385.
Zhu, Rong, and Mattia Rigotti. "Self-correcting Q-learning." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 12 (May 18, 2021): 11185–92. http://dx.doi.org/10.1609/aaai.v35i12.17334.
Borkar, Vivek S., and Siddharth Chandak. "Prospect-theoretic Q-learning." Systems & Control Letters 156 (October 2021): 105009. http://dx.doi.org/10.1016/j.sysconle.2021.105009.
Ganger, Michael, and Wei Hu. "Quantum Multiple Q-Learning." International Journal of Intelligence Science 09, no. 01 (2019): 1–22. http://dx.doi.org/10.4236/ijis.2019.91001.
John, Indu, Chandramouli Kamanchi, and Shalabh Bhatnagar. "Generalized Speedy Q-Learning." IEEE Control Systems Letters 4, no. 3 (July 2020): 524–29. http://dx.doi.org/10.1109/lcsys.2020.2970555.
Horiuchi, Tadashi, Akinori Fujino, Osamu Katai, and Tetsuo Sawaragi. "Q-PSP Learning: An Exploitation-Oriented Q-Learning Algorithm and Its Applications." Transactions of the Society of Instrument and Control Engineers 35, no. 5 (1999): 645–53. http://dx.doi.org/10.9746/sicetr1965.35.645.
Ghazanfari, Behzad, and Nasser Mozayani. "Enhancing Nash Q-learning and Team Q-learning mechanisms by using bottlenecks." Journal of Intelligent & Fuzzy Systems 26, no. 6 (2014): 2771–83. http://dx.doi.org/10.3233/ifs-130945.
Yang, Min-Gyu, Kuk-Hyun Ahn, and Jae-Bok Song. "Tidy-up Task Planner based on Q-learning." Journal of Korea Robotics Society 16, no. 1 (February 1, 2021): 56–63. http://dx.doi.org/10.7746/jkros.2021.16.1.056.
Kim, Min-Soeng, Sun-Gi Hong, and Ju-Jang Lee. "Self-Learning Fuzzy Logic Controller using Q-Learning." Journal of Advanced Computational Intelligence and Intelligent Informatics 4, no. 5 (September 20, 2000): 349–54. http://dx.doi.org/10.20965/jaciii.2000.p0349.
Moodie, Erica E. M., Nema Dean, and Yue Ru Sun. "Q-Learning: Flexible Learning About Useful Utilities." Statistics in Biosciences 6, no. 2 (September 12, 2013): 223–43. http://dx.doi.org/10.1007/s12561-013-9103-z.
Hatcho, Yasuyo, Kiyohiko Hattori, and Keiki Takadama. "Time Horizon Generalization in Reinforcement Learning: Generalizing Multiple Q-Tables in Q-Learning Agents." Journal of Advanced Computational Intelligence and Intelligent Informatics 13, no. 6 (November 20, 2009): 667–74. http://dx.doi.org/10.20965/jaciii.2009.p0667.
Clifton, Jesse, and Eric Laber. "Q-Learning: Theory and Applications." Annual Review of Statistics and Its Application 7, no. 1 (March 9, 2020): 279–301. http://dx.doi.org/10.1146/annurev-statistics-031219-041220.
He, Ningxia. "Image Sampling Using Q-Learning." International Journal of Computer Science and Engineering 8, no. 1 (January 25, 2021): 5–12. http://dx.doi.org/10.14445/23488387/ijcse-v8i1p102.
Ganapathi Subramanian, Sriram, Matthew E. Taylor, Kate Larson, and Mark Crowley. "Multi-Agent Advisor Q-Learning." Journal of Artificial Intelligence Research 74 (May 5, 2022): 1–74. http://dx.doi.org/10.1613/jair.1.13445.
Hu, Yuepeng, Lehan Yang, and Yizhu Lou. "Path Planning with Q-Learning." Journal of Physics: Conference Series 1948, no. 1 (June 1, 2021): 012038. http://dx.doi.org/10.1088/1742-6596/1948/1/012038.
Sarigül, Mehmet, and Mutlu Avci. "Q LEARNING REGRESSION NEURAL NETWORK." Neural Network World 28, no. 5 (2018): 415–31. http://dx.doi.org/10.14311/nnw.2018.28.023.
Kamanchi, Chandramouli, Raghuram Bharadwaj Diddigi, and Shalabh Bhatnagar. "Successive Over-Relaxation $Q$-Learning." IEEE Control Systems Letters 4, no. 1 (January 2020): 55–60. http://dx.doi.org/10.1109/lcsys.2019.2921158.
Patnaik, Srikanta, and N. P. Mahalik. "Multiagent coordination utilising Q-learning." International Journal of Automation and Control 1, no. 4 (2007): 377. http://dx.doi.org/10.1504/ijaac.2007.015863.
Lecué, Guillaume, and Philippe Rigollet. "Optimal learning with Q-aggregation." Annals of Statistics 42, no. 1 (February 2014): 211–24. http://dx.doi.org/10.1214/13-aos1190.
Ahmadabadi, M. N., and M. Asadpour. "Expertness based cooperative Q-learning." IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics) 32, no. 1 (2002): 66–76. http://dx.doi.org/10.1109/3477.979961.
Linn, Kristin A., Eric B. Laber, and Leonard A. Stefanski. "Interactive Q-Learning for Quantiles." Journal of the American Statistical Association 112, no. 518 (March 31, 2017): 638–49. http://dx.doi.org/10.1080/01621459.2016.1155993.
Goldberg, Yair, and Michael R. Kosorok. "Q-learning with censored data." Annals of Statistics 40, no. 1 (February 2012): 529–60. http://dx.doi.org/10.1214/12-aos968.
Peng, Jing, and Ronald J. Williams. "Incremental multi-step Q-learning." Machine Learning 22, no. 1-3 (1996): 283–90. http://dx.doi.org/10.1007/bf00114731.
Hosoya, Yu, and Motohide Umano. "Improvement of Updating Method of Q Values in Fuzzy Q-Learning." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 27, no. 6 (2015): 942–48. http://dx.doi.org/10.3156/jsoft.27.942.
Duryea, Ethan, Michael Ganger, and Wei Hu. "Exploring Deep Reinforcement Learning with Multi Q-Learning." Intelligent Control and Automation 07, no. 04 (2016): 129–44. http://dx.doi.org/10.4236/ica.2016.74012.
Hwang, Kao-Shing, Wei-Cheng Jiang, and Yu-Jen Chen. "ADAPTIVE MODEL LEARNING BASED ON DYNA-Q LEARNING." Cybernetics and Systems 44, no. 8 (November 17, 2013): 641–62. http://dx.doi.org/10.1080/01969722.2013.803387.
da Costa, Luis Antonio L. F., Rafael Kunst, and Edison Pignaton de Freitas. "Q-FANET: Improved Q-learning based routing protocol for FANETs." Computer Networks 198 (October 2021): 108379. http://dx.doi.org/10.1016/j.comnet.2021.108379.
Meng, Xiao-Li. "Discussion: The Q-q Dynamic for Deeper Learning and Research." International Statistical Review 84, no. 2 (December 16, 2015): 181–89. http://dx.doi.org/10.1111/insr.12151.
Guo, Yanqin. "Enhancing Flappy Bird Performance With Q-Learning and DQN Strategies." Highlights in Science, Engineering and Technology 85 (March 13, 2024): 396–402. http://dx.doi.org/10.54097/qrded191.
D'Orazio, Tiziana, and Grazia Cicirelli. "Q-Learning: computation of optimal Q-values for evaluating the learning level in robotic tasks." Journal of Experimental & Theoretical Artificial Intelligence 13, no. 3 (July 2001): 241–70. http://dx.doi.org/10.1080/09528130110063100.
古, 彭. "Improvement and Implementation of Q-Learning Algorithm." Computer Science and Application 11, no. 07 (2021): 1994–2007. http://dx.doi.org/10.12677/csa.2021.117204.
Sun, Wei-Kai, Xiao-Mei Wang, Bin Wang, Jia-Sen Zhang, and Hai-Yang Du. "MR-SFAMA-Q: A MAC Protocol based on Q-Learning for Underwater Acoustic Sensor Networks." 電腦學刊 35, no. 1 (February 2024): 051–63. http://dx.doi.org/10.53106/199115992024023501004.
Liu, Peiyi. "Q-Learning: Applications and Convergence Rate Optimization." Highlights in Science, Engineering and Technology 63 (August 8, 2023): 210–15. http://dx.doi.org/10.54097/hset.v63i.10878.
Chen, Bo-Wei, Shih-Hung Yang, Yu-Chun Lo, Ching-Fu Wang, Han-Lin Wang, Chen-Yang Hsu, Yun-Ting Kuo, et al. "Enhancement of Hippocampal Spatial Decoding Using a Dynamic Q-Learning Method With a Relative Reward Using Theta Phase Precession." International Journal of Neural Systems 30, no. 09 (August 12, 2020): 2050048. http://dx.doi.org/10.1142/s0129065720500483.
Zhang, Chunyuan, Qi Song, and Zeng Meng. "Minibatch Recursive Least Squares Q-Learning." Computational Intelligence and Neuroscience 2021 (October 8, 2021): 1–9. http://dx.doi.org/10.1155/2021/5370281.
Shin, YongWoo. "Q-learning to improve learning speed using Minimax algorithm." Journal of Korea Game Society 18, no. 4 (August 31, 2018): 99–106. http://dx.doi.org/10.7583/jkgs.2018.18.4.99.
Xu, Haoran, Xianyuan Zhan, and Xiangyu Zhu. "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (June 28, 2022): 8753–60. http://dx.doi.org/10.1609/aaai.v36i8.20855.
Charypar, David, and Kai Nagel. "Q-Learning for Flexible Learning of Daily Activity Plans." Transportation Research Record: Journal of the Transportation Research Board 1935, no. 1 (January 2005): 163–69. http://dx.doi.org/10.1177/0361198105193500119.
Tan, Chunxi, Ruijian Han, Rougang Ye, and Kani Chen. "Adaptive Learning Recommendation Strategy Based on Deep Q-learning." Applied Psychological Measurement 44, no. 4 (July 25, 2019): 251–66. http://dx.doi.org/10.1177/0146621619858674.
Gokul, Vignesh, Parinitha Kannan, Sharath Kumar, and Shomona Gracia. "Deep Q-Learning for Home Automation." International Journal of Computer Applications 152, no. 6 (October 17, 2016): 1–5. http://dx.doi.org/10.5120/ijca2016911873.
Notsu, Akira, and Katsuhiro Honda. "Discounted UCB1-tuned for Q-Learning." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 26, no. 6 (2014): 913–23. http://dx.doi.org/10.3156/jsoft.26.913.
Hu, Wei, and James Hu. "Q Learning with Quantum Neural Networks." Natural Science 11, no. 01 (2019): 31–39. http://dx.doi.org/10.4236/ns.2019.111005.
Zheng, Zhang, Ji-Hoon Seung, Tae-Yeong Kim, and Kil-To Chong. "Traffic Control using Q-Learning Algorithm." Journal of the Korea Academia-Industrial cooperation Society 12, no. 11 (November 30, 2011): 5135–42. http://dx.doi.org/10.5762/kais.2011.12.11.5135.
Liu, Jingchen, Gongjun Xu, and Zhiliang Ying. "Theory of self-learning $Q$-matrix." Bernoulli 19, no. 5A (November 2013): 1790–817. http://dx.doi.org/10.3150/12-bej430.
Ma, Yu chien (Calvin), Zoe Wang, and Alexander Fleiss. "Deep Q-Learning for Trading Cryptocurrency." Journal of Financial Data Science 3, no. 3 (June 8, 2021): 121–27. http://dx.doi.org/10.3905/jfds.2021.1.064.