Journal articles on the topic 'Q-learning'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Q-learning.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Watkins, Christopher J. C. H., and Peter Dayan. "Q-learning." Machine Learning 8, no. 3-4 (1992): 279–92. http://dx.doi.org/10.1007/bf00992698.
Full textClausen, C., and H. Wechsler. "Quad-Q-learning." IEEE Transactions on Neural Networks 11, no. 2 (2000): 279–94. http://dx.doi.org/10.1109/72.839000.
Full textten Hagen, Stephan, and Ben Kr�se. "Neural Q-learning." Neural Computing & Applications 12, no. 2 (2003): 81–88. http://dx.doi.org/10.1007/s00521-003-0369-9.
Full textWang, Yin-Hao, Tzuu-Hseng S. Li, and Chih-Jui Lin. "Backward Q-learning: The combination of Sarsa algorithm and Q-learning." Engineering Applications of Artificial Intelligence 26, no. 9 (2013): 2184–93. http://dx.doi.org/10.1016/j.engappai.2013.06.016.
Full textEvseenko, Alla, and Dmitrii Romannikov. "Application of Deep Q-learning and double Deep Q-learning algorithms to the task of control an inverted pendulum." Transaction of Scientific Papers of the Novosibirsk State Technical University, no. 1-2 (August 26, 2020): 7–25. http://dx.doi.org/10.17212/2307-6879-2020-1-2-7-25.
Full textAbedalguni, Bilal. "Bat Q-learning Algorithm." Jordanian Journal of Computers and Information Technology 3, no. 1 (2017): 51. http://dx.doi.org/10.5455/jjcit.71-1480540385.
Full textZhu, Rong, and Mattia Rigotti. "Self-correcting Q-learning." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 12 (2021): 11185–92. http://dx.doi.org/10.1609/aaai.v35i12.17334.
Full textBorkar, Vivek S., and Siddharth Chandak. "Prospect-theoretic Q-learning." Systems & Control Letters 156 (October 2021): 105009. http://dx.doi.org/10.1016/j.sysconle.2021.105009.
Full textGanger, Michael, and Wei Hu. "Quantum Multiple Q-Learning." International Journal of Intelligence Science 09, no. 01 (2019): 1–22. http://dx.doi.org/10.4236/ijis.2019.91001.
Full textJohn, Indu, Chandramouli Kamanchi, and Shalabh Bhatnagar. "Generalized Speedy Q-Learning." IEEE Control Systems Letters 4, no. 3 (2020): 524–29. http://dx.doi.org/10.1109/lcsys.2020.2970555.
Full textHORIUCHI, Tadashi, Akinori FUJINO, Osamu KATAI, and Tetsuo SAWARAGI. "Q-PSP Learning: An Exploitation-Oriented Q-Learning Algorithm and Its Applications." Transactions of the Society of Instrument and Control Engineers 35, no. 5 (1999): 645–53. http://dx.doi.org/10.9746/sicetr1965.35.645.
Full textGhazanfari, Behzad, and Nasser Mozayani. "Enhancing Nash Q-learning and Team Q-learning mechanisms by using bottlenecks." Journal of Intelligent & Fuzzy Systems 26, no. 6 (2014): 2771–83. http://dx.doi.org/10.3233/ifs-130945.
Full textKim, Min-Soeng, Sun-Gi Hong, and Ju-Jang Lee. "Self-Learning Fuzzy Logic Controller using Q-Learning." Journal of Advanced Computational Intelligence and Intelligent Informatics 4, no. 5 (2000): 349–54. http://dx.doi.org/10.20965/jaciii.2000.p0349.
Full textYang, Min-Gyu, Kuk-Hyun Ahn, and Jae-Bok Song. "Tidy-up Task Planner based on Q-learning." Journal of Korea Robotics Society 16, no. 1 (2021): 56–63. http://dx.doi.org/10.7746/jkros.2021.16.1.056.
Full textMoodie, Erica E. M., Nema Dean, and Yue Ru Sun. "Q-Learning: Flexible Learning About Useful Utilities." Statistics in Biosciences 6, no. 2 (2013): 223–43. http://dx.doi.org/10.1007/s12561-013-9103-z.
Full textHatcho, Yasuyo, Kiyohiko Hattori, and Keiki Takadama. "Time Horizon Generalization in Reinforcement Learning: Generalizing Multiple Q-Tables in Q-Learning Agents." Journal of Advanced Computational Intelligence and Intelligent Informatics 13, no. 6 (2009): 667–74. http://dx.doi.org/10.20965/jaciii.2009.p0667.
Full textClifton, Jesse, and Eric Laber. "Q-Learning: Theory and Applications." Annual Review of Statistics and Its Application 7, no. 1 (2020): 279–301. http://dx.doi.org/10.1146/annurev-statistics-031219-041220.
Full textHe, Ningxia. "Image Sampling Using Q-Learning." International Journal of Computer Science and Engineering 8, no. 1 (2021): 5–12. http://dx.doi.org/10.14445/23488387/ijcse-v8i1p102.
Full textGanapathi Subramanian, Sriram, Matthew E. Taylor, Kate Larson, and Mark Crowley. "Multi-Agent Advisor Q-Learning." Journal of Artificial Intelligence Research 74 (May 5, 2022): 1–74. http://dx.doi.org/10.1613/jair.1.13445.
Full textHu, Yuepeng, Lehan Yang, and Yizhu Lou. "Path Planning with Q-Learning." Journal of Physics: Conference Series 1948, no. 1 (2021): 012038. http://dx.doi.org/10.1088/1742-6596/1948/1/012038.
Full textSarigül, Mehmet, and Mutlu Avci. "Q LEARNING REGRESSION NEURAL NETWORK." Neural Network World 28, no. 5 (2018): 415–31. http://dx.doi.org/10.14311/nnw.2018.28.023.
Full textKamanchi, Chandramouli, Raghuram Bharadwaj Diddigi, and Shalabh Bhatnagar. "Successive Over-Relaxation ${Q}$ -Learning." IEEE Control Systems Letters 4, no. 1 (2020): 55–60. http://dx.doi.org/10.1109/lcsys.2019.2921158.
Full textPatnaik, Srikanta, and N. P. Mahalik. "Multiagent coordination utilising Q-learning." International Journal of Automation and Control 1, no. 4 (2007): 377. http://dx.doi.org/10.1504/ijaac.2007.015863.
Full textLecué, Guillaume, and Philippe Rigollet. "Optimal learning with Q-aggregation." Annals of Statistics 42, no. 1 (2014): 211–24. http://dx.doi.org/10.1214/13-aos1190.
Full textAhmadabadi, M. N., and M. Asadpour. "Expertness based cooperative Q-learning." IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics) 32, no. 1 (2002): 66–76. http://dx.doi.org/10.1109/3477.979961.
Full textLinn, Kristin A., Eric B. Laber, and Leonard A. Stefanski. "Interactive Q-Learning for Quantiles." Journal of the American Statistical Association 112, no. 518 (2017): 638–49. http://dx.doi.org/10.1080/01621459.2016.1155993.
Full textGoldberg, Yair, and Michael R. Kosorok. "Q-learning with censored data." Annals of Statistics 40, no. 1 (2012): 529–60. http://dx.doi.org/10.1214/12-aos968.
Full textPeng, Jing, and Ronald J. Williams. "Incremental multi-step Q-learning." Machine Learning 22, no. 1-3 (1996): 283–90. http://dx.doi.org/10.1007/bf00114731.
Full textEl Wafi, Mouna, My Abdelkader Youssefi, Rachid Dakir, and Mohamed Bakir. "Intelligent Robot in Unknown Environments: Walk Path Using Q-Learning and Deep Q-Learning." Automation 6, no. 1 (2025): 12. https://doi.org/10.3390/automation6010012.
Full textMustafa, Hasan Kathim, Azma Zakaria Nurul, Abidin Z.Zainal, Kamil Maseer Ziadoon, and Hasan Alzamili Ali. "Online Sequential Extreme Learning Machine (OSELM) based Q-learning(OSELM-QL)." Seybold Report V16, no. 11 (2021): 1–14. https://doi.org/10.5281/zenodo.6553518.
Full textHOSOYA, Yu, and Motohide UMANO. "Improvement of Updating Method of Q Values in Fuzzy Q-Learning." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 27, no. 6 (2015): 942–48. http://dx.doi.org/10.3156/jsoft.27.942.
Full textDuryea, Ethan, Michael Ganger, and Wei Hu. "Exploring Deep Reinforcement Learning with Multi Q-Learning." Intelligent Control and Automation 07, no. 04 (2016): 129–44. http://dx.doi.org/10.4236/ica.2016.74012.
Full textHwang, Kao-Shing, Wei-Cheng Jiang, and Yu-Jen Chen. "ADAPTIVE MODEL LEARNING BASED ON DYNA-Q LEARNING." Cybernetics and Systems 44, no. 8 (2013): 641–62. http://dx.doi.org/10.1080/01969722.2013.803387.
Full textBOCHOK, Viacheslav, and Nataliia FEDOROVA. "CENTRALIZED LEARNING FOR THE DEEP Q-LEARNING MODELS." Information Technology and Society, no. 2 (13) (2024): 6–11. http://dx.doi.org/10.32689/maup.it.2024.2.1.
Full textda Costa, Luis Antonio L. F., Rafael Kunst, and Edison Pignaton de Freitas. "Q-FANET: Improved Q-learning based routing protocol for FANETs." Computer Networks 198 (October 2021): 108379. http://dx.doi.org/10.1016/j.comnet.2021.108379.
Full textMeng, Xiao-Li. "Discussion: The Q-q Dynamic for Deeper Learning and Research." International Statistical Review 84, no. 2 (2015): 181–89. http://dx.doi.org/10.1111/insr.12151.
Full textGuo, Yanqin. "Enhancing Flappy Bird Performance With Q-Learning and DQN Strategies." Highlights in Science, Engineering and Technology 85 (March 13, 2024): 396–402. http://dx.doi.org/10.54097/qrded191.
Full textD'Orazio, Tiziana, and Grazia Cicirelli. "Q-Learning: computation of optimal Q-values for evaluating the learning level in robotic tasks." Journal of Experimental & Theoretical Artificial Intelligence 13, no. 3 (2001): 241–70. http://dx.doi.org/10.1080/09528130110063100.
Full textChen, Bo-Wei, Shih-Hung Yang, Yu-Chun Lo, et al. "Enhancement of Hippocampal Spatial Decoding Using a Dynamic Q-Learning Method With a Relative Reward Using Theta Phase Precession." International Journal of Neural Systems 30, no. 09 (2020): 2050048. http://dx.doi.org/10.1142/s0129065720500483.
Full textLiu, Peiyi. "Q-Learning: Applications and Convergence Rate Optimization." Highlights in Science, Engineering and Technology 63 (August 8, 2023): 210–15. http://dx.doi.org/10.54097/hset.v63i.10878.
Full textRaza, Ali, Asfand Ali, Alaptageen Qayyum, Ghulam Shabir, Zahid Hussain, and Ghulam Murtaza. "HYPERPARAMETER IMPACT ON LEARNING EFFICIENCY IN Q-LEARNING AND DQN USING OPENAI GYMNASIUM ENVIRONMENTS." International Journal of Advanced Research 13, no. 05 (2025): 1164–76. https://doi.org/10.21474/ijar01/21007.
Full text古, 彭. "Improvement and Implementation of Q-Learning Algorithm." Computer Science and Application 11, no. 07 (2021): 1994–2007. http://dx.doi.org/10.12677/csa.2021.117204.
Full textXu, Shenghua, Yang Gu, Xiaoyan Li, et al. "Indoor Emergency Path Planning Based on the Q-Learning Optimization Algorithm." ISPRS International Journal of Geo-Information 11, no. 1 (2022): 66. http://dx.doi.org/10.3390/ijgi11010066.
Full textZhang, Chunyuan, Qi Song, and Zeng Meng. "Minibatch Recursive Least Squares Q-Learning." Computational Intelligence and Neuroscience 2021 (October 8, 2021): 1–9. http://dx.doi.org/10.1155/2021/5370281.
Full textTakashi, Sato and Fumiko Shirasaki NIT Okinawa College Japan. "A Comparative Study on the Performances of Q-Learning and Neural Q-Learning Agents toward Analysis of Emergence of Communication." Journal of Information and Communication Engineering(JICE) Volume 3, Issue 5 (2020): 128–35. https://doi.org/10.5281/zenodo.4309746.
Full textWei-Kai Sun, Wei-Kai Sun, Xiao-Mei Wang Wei-Kai Sun, Bin Wang Xiao-Mei Wang, Jia-Sen Zhang Bin Wang, and Hai-Yang Du Jia-Sen Zhang. "MR-SFAMA-Q: A MAC Protocol based on Q-Learning for Underwater Acoustic Sensor Networks." 電腦學刊 35, no. 1 (2024): 051–63. http://dx.doi.org/10.53106/199115992024023501004.
Full textMd Nurul Raihen and Jason Tran. "Optimizing reinforcement learning in complex environments using neural networks." International Journal of Science and Research Archive 12, no. 2 (2024): 2047–62. http://dx.doi.org/10.30574/ijsra.2024.12.2.1471.
Full textHao, Qixuan. "The Achievement of Dynamic Obstacle Avoidance Based on Improved Q-Learning Algorithm." Highlights in Science, Engineering and Technology 63 (August 8, 2023): 252–58. http://dx.doi.org/10.54097/hset.v63i.10883.
Full textShin, YongWoo. "Q-learning to improve learning speed using Minimax algorithm." Journal of Korea Game Society 18, no. 4 (2018): 99–106. http://dx.doi.org/10.7583/jkgs.2018.18.4.99.
Full textXu, Haoran, Xianyuan Zhan, and Xiangyu Zhu. "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (2022): 8753–60. http://dx.doi.org/10.1609/aaai.v36i8.20855.
Full text