Journal articles on the topic 'Reward functions'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Reward functions.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Meder, Björn, and Jonathan D. Nelson. "Information search with situation-specific reward functions." Judgment and Decision Making 7, no. 2 (2012): 119–48. http://dx.doi.org/10.1017/s1930297500002977.
Full textSoltani, A. Reza. "Reward processes with nonlinear reward functions." Journal of Applied Probability 33, no. 4 (1996): 1011–17. http://dx.doi.org/10.2307/3214982.
Full textSoltani, A. Reza. "Reward processes with nonlinear reward functions." Journal of Applied Probability 33, no. 04 (1996): 1011–17. http://dx.doi.org/10.1017/s0021900200100440.
Full textXu, Zhe, Ivan Gavran, Yousef Ahmad, et al. "Joint Inference of Reward Machines and Policies for Reinforcement Learning." Proceedings of the International Conference on Automated Planning and Scheduling 30 (June 1, 2020): 590–98. http://dx.doi.org/10.1609/icaps.v30i1.6756.
Full textTUMER, KAGAN, and ADRIAN AGOGINO. "MULTIAGENT LEARNING FOR BLACK BOX SYSTEM REWARD FUNCTIONS." Advances in Complex Systems 12, no. 04n05 (2009): 475–92. http://dx.doi.org/10.1142/s0219525909002295.
Full textCorazza, Jan, Ivan Gavran, and Daniel Neider. "Reinforcement Learning with Stochastic Reward Machines." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 6 (2022): 6429–36. http://dx.doi.org/10.1609/aaai.v36i6.20594.
Full textPastor-Bernier, Alexandre, Arkadiusz Stasiak, and Wolfram Schultz. "Reward-specific satiety affects subjective value signals in orbitofrontal cortex during multicomponent economic choice." Proceedings of the National Academy of Sciences 118, no. 30 (2021): e2022650118. http://dx.doi.org/10.1073/pnas.2022650118.
Full textBooth, Serena, W. Bradley Knox, Julie Shah, Scott Niekum, Peter Stone, and Alessandro Allievi. "The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 5 (2023): 5920–29. http://dx.doi.org/10.1609/aaai.v37i5.25733.
Full textMguni, David, Taher Jafferjee, Jianhong Wang, et al. "Learning to Shape Rewards Using a Game of Two Partners." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 10 (2023): 11604–12. http://dx.doi.org/10.1609/aaai.v37i10.26371.
Full textToro Icarte, Rodrigo, Toryn Q. Klassen, Richard Valenzano, and Sheila A. McIlraith. "Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning." Journal of Artificial Intelligence Research 73 (January 11, 2022): 173–208. http://dx.doi.org/10.1613/jair.1.12440.
Full textHuo, Liangyu, Zulin Wang, and Mai Xu. "Learning Noise-Induced Reward Functions for Surpassing Demonstrations in Imitation Learning." Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 7 (2023): 7953–61. http://dx.doi.org/10.1609/aaai.v37i7.25962.
Full textForbes, Grant C., and David L. Roberts. "Potential-Based Reward Shaping for Intrinsic Motivation (Student Abstract)." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 21 (2024): 23488–89. http://dx.doi.org/10.1609/aaai.v38i21.30441.
Full textZhou, Zhiheng. "A Meta-Analysis of Reward Function and Childhood Obesity." Lecture Notes in Education Psychology and Public Media 2, no. 1 (2023): 1015–20. http://dx.doi.org/10.54254/2753-7048/2/2022640.
Full textMizuhiki, Takashi, Barry J. Richmond, and Munetaka Shidara. "Encoding of reward expectation by monkey anterior insular neurons." Journal of Neurophysiology 107, no. 11 (2012): 2996–3007. http://dx.doi.org/10.1152/jn.00282.2011.
Full textNordhaug, Odd. "Reward Functions of Personnel Training." Human Relations 42, no. 5 (1989): 373–88. http://dx.doi.org/10.1177/001872678904200501.
Full textLiu, Zhixiang, Rui Lin, and Minmin Luo. "Reward Contributions to Serotonergic Functions." Annual Review of Neuroscience 43, no. 1 (2020): 141–62. http://dx.doi.org/10.1146/annurev-neuro-093019-112252.
Full textDharmavaram, Akshay, Matthew Riemer, and Shalabh Bhatnagar. "Hierarchical Average Reward Policy Gradient Algorithms (Student Abstract)." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 10 (2020): 13777–78. http://dx.doi.org/10.1609/aaai.v34i10.7160.
Full textRamakrishnan, Arjun, Yoon Woo Byun, Kyle Rand, Christian E. Pedersen, Mikhail A. Lebedev, and Miguel A. L. Nicolelis. "Cortical neurons multiplex reward-related signals along with sensory and motor information." Proceedings of the National Academy of Sciences 114, no. 24 (2017): E4841—E4850. http://dx.doi.org/10.1073/pnas.1703668114.
Full textWeng, Paul, and Olivier Spanjaard. "Functional Reward Markov Decision Processes: Theory and Applications." International Journal on Artificial Intelligence Tools 26, no. 03 (2017): 1760014. http://dx.doi.org/10.1142/s0218213017600144.
Full textProper, Scott, and Kagan Tumer. "Multiagent Learning with a Noisy Global Reward Signal." Proceedings of the AAAI Conference on Artificial Intelligence 27, no. 1 (2013): 826–32. http://dx.doi.org/10.1609/aaai.v27i1.8580.
Full textOstaszewski, Pawel, and Katarzyna Karzel. "Discounting of Delayed and Probabilistic Losses of Different Amounts." European Psychologist 7, no. 4 (2002): 295–301. http://dx.doi.org/10.1027//1016-9040.7.4.295.
Full textLinke, Cam, Nadia M. Ady, Martha White, Thomas Degris, and Adam White. "Adapting Behavior via Intrinsic Reward: A Survey and Empirical Study." Journal of Artificial Intelligence Research 69 (December 14, 2020): 1287–332. http://dx.doi.org/10.1613/jair.1.12087.
Full textNiekum, Scott. "Evolved Intrinsic Reward Functions for Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 24, no. 1 (2010): 1955–56. http://dx.doi.org/10.1609/aaai.v24i1.7772.
Full textStefanov, Valeri T. "Exact distributions for reward functions on semi-Markov and Markov additive processes." Journal of Applied Probability 43, no. 4 (2006): 1053–65. http://dx.doi.org/10.1239/jap/1165505207.
Full textStefanov, Valeri T. "Exact distributions for reward functions on semi-Markov and Markov additive processes." Journal of Applied Probability 43, no. 04 (2006): 1053–65. http://dx.doi.org/10.1017/s0021900200002424.
Full textWang, Min, Xin Li, Leiji Zhang, and Mingzhong Wang. "MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 14 (2024): 15555–62. http://dx.doi.org/10.1609/aaai.v38i14.29482.
Full textSchultz, Wolfram. "Predictive Reward Signal of Dopamine Neurons." Journal of Neurophysiology 80, no. 1 (1998): 1–27. http://dx.doi.org/10.1152/jn.1998.80.1.1.
Full textMaulana, Mohammad Deni Irkhamil, and Langgeng Budianto. "THE STUDENT'S PERCEPTION OF REWARDS TO INCREASE THEIR MOTIVATION IN ENGLISH LEARNING IN JUNIOR HIGH SCHOOL." English Edu: Journal of English Teaching and Learning 1, no. 1 (2022): 18–25. http://dx.doi.org/10.18860/jetl.v1i1.1623.
Full textVelasquez, Alvaro, Brett Bissey, Lior Barak, et al. "Dynamic Automaton-Guided Reward Shaping for Monte Carlo Tree Search." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 13 (2021): 12015–23. http://dx.doi.org/10.1609/aaai.v35i13.17427.
Full textLouie, Kenway. "Asymmetric and adaptive reward coding via normalized reinforcement learning." PLOS Computational Biology 18, no. 7 (2022): e1010350. http://dx.doi.org/10.1371/journal.pcbi.1010350.
Full textLamberton, Damien. "Optimal stopping with irregular reward functions." Stochastic Processes and their Applications 119, no. 10 (2009): 3253–84. http://dx.doi.org/10.1016/j.spa.2009.05.005.
Full textSchultz, Wolfram. "Reward functions of the basal ganglia." Journal of Neural Transmission 123, no. 7 (2016): 679–93. http://dx.doi.org/10.1007/s00702-016-1510-0.
Full textKnox, W. Bradley, Alessandro Allievi, Holger Banzhaf, Felix Schmitt, and Peter Stone. "Reward (Mis)design for Autonomous Driving (Abstract Reprint)." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 20 (2024): 22702. http://dx.doi.org/10.1609/aaai.v38i20.30602.
Full textSchultz, Wolfram. "Neuronal Reward and Decision Signals: From Theories to Data." Physiological Reviews 95, no. 3 (2015): 853–951. http://dx.doi.org/10.1152/physrev.00023.2014.
Full textGehring, Clement, Masataro Asai, Rohan Chitnis, et al. "Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators." Proceedings of the International Conference on Automated Planning and Scheduling 32 (June 13, 2022): 588–96. http://dx.doi.org/10.1609/icaps.v32i1.19846.
Full textRajendran, Janarthanan, Richard Lewis, Vivek Veeriah, Honglak Lee, and Satinder Singh. "How Should an Agent Practice?" Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (2020): 5454–61. http://dx.doi.org/10.1609/aaai.v34i04.5995.
Full textYang, Luting, Jianyi Yang, and Shaolei Ren. "Contextual Bandits with Delayed Feedback and Semi-supervised Learning (Student Abstract)." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 18 (2021): 15943–44. http://dx.doi.org/10.1609/aaai.v35i18.17968.
Full textWang, Xusheng, Jiexin Xie, Shijie Guo, Yue Li, Pengfei Sun, and Zhongxue Gan. "Deep reinforcement learning-based rehabilitation robot trajectory planning with optimized reward functions." Advances in Mechanical Engineering 13, no. 12 (2021): 168781402110670. http://dx.doi.org/10.1177/16878140211067011.
Full textVILASECA, JORDI, ANTONI MESEGUER, JOAN TORRENT, and RAQUEL FERRERAS. "REWARD FUNCTIONS AND COOPERATIVE GAMES: CHARACTERIZATION AND ECONOMIC APPLICATION." International Game Theory Review 10, no. 02 (2008): 165–76. http://dx.doi.org/10.1142/s0219198908001856.
Full textDragoni Divrak, Dora. "Reward actualities." Journal of Historical Archaeology & Anthropological Sciences 6, no. 2 (2021): 62–64. http://dx.doi.org/10.15406/jhaas.2021.06.00247.
Full textMa, Shuai, and Jia Yuan Yu. "State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 4512–19. http://dx.doi.org/10.1609/aaai.v33i01.33014512.
Full textFard, Neshat Elhami, and Rastko Selmic. "Consensus of Multi-agent Reinforcement Learning Systems: The Effect of Immediate Rewards." Journal of Robotics and Control (JRC) 3, no. 2 (2022): 115–27. http://dx.doi.org/10.18196/jrc.v3i2.13082.
Full textZuo, Guoyu, Qishen Zhao, Jiahao Lu, and Jiangeng Li. "Efficient hindsight reinforcement learning using demonstrations for robotic tasks with sparse rewards." International Journal of Advanced Robotic Systems 17, no. 1 (2020): 172988141989834. http://dx.doi.org/10.1177/1729881419898342.
Full textUchibe, Eiji, and Kenji Doya. "Hierarchical Reinforcement Learning for Multiple Reward Functions." Journal of the Robotics Society of Japan 22, no. 1 (2004): 120–29. http://dx.doi.org/10.7210/jrsj.22.120.
Full textGrundel, Soesja, Peter Borm, and Herbert Hamers. "Resource allocation problems with concave reward functions." TOP 27, no. 1 (2018): 37–54. http://dx.doi.org/10.1007/s11750-018-0482-7.
Full textLuo, Xudong, Yufeng Yang, and Ho-fung Leung. "Reward and Penalty Functions in Automated Negotiation." International Journal of Intelligent Systems 31, no. 7 (2015): 637–72. http://dx.doi.org/10.1002/int.21797.
Full textKim, MyeongSeop, Jung-Su Kim, and Jae-Han Park. "Automated Hyperparameter Tuning in Reinforcement Learning for Quadrupedal Robot Locomotion." Electronics 13, no. 1 (2023): 116. http://dx.doi.org/10.3390/electronics13010116.
Full textHahn, Ernst Moritz, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, and Dominik Wojtczak. "Omega-Regular Decision Processes." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 19 (2024): 21125–33. http://dx.doi.org/10.1609/aaai.v38i19.30105.
Full textRolls, Edmund T. "Précis of The brain and emotion." Behavioral and Brain Sciences 23, no. 2 (2000): 177–91. http://dx.doi.org/10.1017/s0140525x00002429.
Full textChen, Yang, Xiao Lin, Bo Yan, et al. "Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 10 (2024): 11407–15. http://dx.doi.org/10.1609/aaai.v38i10.29021.
Full text