IT354

Course Name: 

Reinforcement Learning (IT354)

Programme: 

B.Tech (AI)

Category: 

Programme Specific Electives (PSE)

Credits (L-T-P): 

(3-0-2) 4

Content: 

Introduction to Reinforcement Learning, Markov Processes Markov Reward Processes (MRPs) Markov Decision Processes (MDPs), MDP Policies, Policy Evaluation, Policy Improvement, Policy Iteration, Value operators, Model-free learning - Q-learning, SARSA, Scaling up: RL with function approximation, RL with function approximation, Imitation learning in large spaces, Policy search, Exploration/Exploitation, Meta-Learning, Batch Reinforcement Learning, Bandit problems and online learning, Solution methods: dynamic programming, Monte Carlo learning, Temporal difference learning, Eligibility traces, Value function approximation, Models and planning, Case studies: successful examples of RL systems, Frontiers of RL research

References: 

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition.
Reinforcement Learning: State-of-the-Art, Marco Wiering and Martijn van Otterlo, Eds
Artificial Intelligence: A Modern Approach, Stuart J. Russell and Peter Norvig.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville.

Department: 

Information Technology

Contact us

Head of the Department,
Department of Information Technology,
National Institute of Technology Karnataka,
SurathkalP. O. Srinivasnagar, Mangalore - 575 025
Ph.:    +91-824-2474056
Email:  hodit [at] nitk [dot] edu [dot] in
 

Web Admin: Sowmya Kamath S

Connect with us

We're on Social Networks. Follow us & stay in touch.