Closed Thread
#1
02-03-2015, 21:26
faust1706
Registered User
FRC #1706 (Ratchet Rockers)
Team Role: College Student
 
Join Date: Apr 2012
Rookie Year: 2011
Location: St Louis
Posts: 498
Q learning instead of PIDS

Has any team implemented a Q-learning algorithm in place of a PID controller? Just curious. I have done this a few times in the lab I work at, as practice with new algorithms.
__________________
"You're a gentleman," they used to say to him. "You shouldn't have gone murdering people with a hatchet; that's no occupation for a gentleman."
#2
02-03-2015, 21:49
Altainia
That one geeky guy...
FRC #5098 (Sting-R)
Team Role: Mentor
 
Join Date: Jan 2008
Rookie Year: 2007
Location: Kansas City, MO
Posts: 18
Re: Q learning instead of PIDS

I don't know much about Q-learning, but from what I can tell it picks among discrete moves rather than from a continuous range of possibilities? If that's the case, it would be a horrible replacement for a PID controller or a neural network.
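
To make the discrete-versus-continuous distinction concrete, here is a minimal sketch, not taken from any team's code. The `PID` class, its gains, and the three-entry `ACTIONS` list are all hypothetical and chosen only for illustration. The PID step can return any real-valued output, while a greedy pick over one row of a Q-table can only ever return one of the listed actions.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class PID:
    """Textbook PID controller; the gains used below are illustrative, not tuned."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, error: float, dt: float) -> float:
        # Continuous output: any real number, depending on the error history.
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


# Hypothetical discrete action set: the only motor commands the learner may choose.
ACTIONS = [-1.0, 0.0, 1.0]


def greedy_action(q_row: Sequence[float]) -> float:
    """Return the action whose Q-value is largest for the current state."""
    best_index = max(range(len(ACTIONS)), key=lambda i: q_row[i])
    return ACTIONS[best_index]


pid = PID(kp=0.5, ki=0.1, kd=0.05)
pid_output = pid.step(error=0.3, dt=0.02)    # a value from a continuum
q_output = greedy_action([0.1, 0.4, 0.9])    # always one of ACTIONS
```

A finer action list narrows the gap between the two, at the cost of a larger table to learn.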
#3
02-03-2015, 22:01
faust1706
Re: Q learning instead of PIDS

You are right that most Q-learning algorithms search for the best sequence of moves drawn from a finite set, but plenty of work has been done with continuous states and actions. The best example I can think of is robot navigation using Q-learning.

Even with a discrete set of moves, it still works (I know firsthand). What essentially happens is that the starting state is constantly refreshed, at least as fast as the sensor(s) can update it. What I did was rate-limit the input to 5 Hz: I gave the learning algorithm 100 ms to find the best action(s), another 100 ms to execute them, then repeated.
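
The loop described above might look roughly like the sketch below. It is not the code from the lab, and several pieces are assumptions made for illustration: the toy `plant` (an integer track from 0 to 10), the `TARGET` setpoint, the negative-distance reward, and the learning parameters. To keep the example deterministic, training sweeps every state-action pair in turn rather than exploring online, and the demo passes a no-op `sleep` so it does not actually wait.

```python
import time

ACTIONS = (-1, 0, 1)        # hypothetical discrete moves: back, hold, forward
TRACK_MAX = 10              # toy track: integer positions 0..10
TARGET = 5                  # hypothetical setpoint
ALPHA, GAMMA = 0.5, 0.9     # illustrative learning rate and discount
PLAN_S = ACT_S = 0.1        # 100 ms to choose + 100 ms to execute = 5 Hz


def plant(position, move):
    """Toy stand-in for the mechanism: shift one cell, clamped to the track."""
    return min(TRACK_MAX, max(0, position + move))


def greedy(q, position):
    """Index of the highest-valued action for this position."""
    return max(range(len(ACTIONS)), key=lambda i: q[position][i])


def train(sweeps=200):
    """Tabular Q-learning updates, applied to every state-action pair per sweep."""
    q = [[0.0] * len(ACTIONS) for _ in range(TRACK_MAX + 1)]
    for _ in range(sweeps):
        for pos in range(TRACK_MAX + 1):
            for a, move in enumerate(ACTIONS):
                nxt = plant(pos, move)
                reward = -abs(TARGET - nxt)   # closer to the target is better
                # Move the estimate toward reward + discounted best next value.
                q[pos][a] += ALPHA * (reward + GAMMA * max(q[nxt]) - q[pos][a])
    return q


def run(q, position, cycles, sleep=time.sleep):
    """Sense, choose the greedy action, execute it, and repeat at a fixed rate."""
    for _ in range(cycles):
        start = time.monotonic()
        a = greedy(q, position)                                # plan
        sleep(max(0.0, PLAN_S - (time.monotonic() - start)))   # pad the planning slot
        position = plant(position, ACTIONS[a])                 # act
        sleep(ACT_S)                                           # let the move finish
    return position


q_table = train()
# No-op sleep so the demo finishes immediately; drop the argument to run at 5 Hz.
final_position = run(q_table, position=0, cycles=10, sleep=lambda _s: None)
```

On a real robot the position would come from a sensor reading binned into a table index, and the 100 ms slots would be enforced by the control loop's scheduler rather than by `sleep`.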
All times are GMT -5.