SARSA (State-Action-Reward-State-Action) is an on-policy reinforcement learning algorithm for learning a policy for a Markov decision process. It updates the Q-value of a state-action pair using five quantities: the current state, the action taken, the reward received, the next state, and the next action chosen. The initials of this quintuple give the algorithm its name. It was proposed by Rummery and Niranjan in a technical note.
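The update described above can be sketched for the tabular case as follows. This is a minimal illustration, not code from the original technical note: the function names `sarsa_update` and `epsilon_greedy`, the step size `alpha`, the discount factor `gamma`, the `terminal` flag, and the dictionary-based Q-table are all assumptions made for the example.

```python
import random
from collections import defaultdict


def epsilon_greedy(Q, state, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one.

    Hypothetical helper: SARSA needs some behaviour policy to choose the
    next action; epsilon-greedy is a common choice, assumed here.
    """
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])


def sarsa_update(Q, s, a, r, s_next, a_next,
                 alpha=0.1, gamma=0.9, terminal=False):
    """Apply one SARSA step to the table Q and return the new Q(s, a).

    Update rule (tabular form):
        Q(s, a) <- Q(s, a) + alpha * (r + gamma * Q(s', a') - Q(s, a))
    When the transition ends the episode there is no next action,
    so the target is just the reward r.
    """
    target = r if terminal else r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q[(s, a)]
```

A typical use keeps `Q = defaultdict(float)`, chooses `a_next` with `epsilon_greedy` in the next state, and then calls `sarsa_update` with the full (s, a, r, s', a') quintuple. Because the target uses the action that the behaviour policy actually selected, rather than the maximum over actions, the update is on-policy.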
Stanford University
Autumn 2022-2023
Stanford's CS 221 course teaches the foundational principles and practical implementation of AI systems. It covers machine learning, game playing, constraint satisfaction, graphical models, and logic. It is a rigorous course that requires solid foundational skills in programming, math, and probability.
UC Berkeley
Fall 2008
This advanced course focuses on applications of machine learning in robotics and control. It covers a wide range of topics, including Markov decision processes, control theory, estimation methods, and robotics principles. It is recommended for graduate students.