Chief Delphi

Chief Delphi (http://www.chiefdelphi.com/forums/index.php)
-   Programming (http://www.chiefdelphi.com/forums/forumdisplay.php?f=51)
-   -   Q learning instead of PIDS (http://www.chiefdelphi.com/forums/showthread.php?t=135384)

faust1706 02-03-2015 21:26

Q learning instead of PIDS
 
Has any team implemented a q learning algorithm instead of a PID? Just curious. I did this in the lab I work at few times as practice with new algorithms.

Altainia 02-03-2015 21:49

Re: Q learning instead of PIDS
 
I don't know much about Q learning, but from what I can tell it's for picking discrete moves rather than a continuous range of possibilities? If that's the case, it would be a horrible replacement to PID or neural network.

faust1706 02-03-2015 22:01

Re: Q learning instead of PIDS
 
You are right that most q learning algorithms involve searching for the best sequence of finite moves, but there is plenty of work done with a continuous state and action. The best example I can think of is with robot navigation using q learning.

Even with the discrete amount of steps, it still works. (I know first hand). What essentially happens is that its initial state is always being updated (as fast as the sensor(s) can update it at least). What I did was put a delay on the input, 5hz. I gave the learning algorithm 100ms to find the best action(s), another 100ms to execute it, then repeat.


All times are GMT -5. The time now is 01:45.

Powered by vBulletin® Version 3.6.4
Copyright ©2000 - 2017, Jelsoft Enterprises Ltd.
Copyright © Chief Delphi