Loading
Feedback

orlov_alexander 232

Location

DE

Badges

1
0
2

Activity

Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Mon
Wed
Fri

Ratings Progression

Loading...

Challenge Categories

Loading...

Challenges Entered

Latest submissions

See All
graded 22438
graded 22217
graded 21638
Gold 1
EulerLearner
May 16, 2020
Silver 0
Bronze 2
Trustable
May 16, 2020
Newtonian
May 16, 2020

Badges

Participant Rating
Participant Rating

NeurIPS 2019: Learn to Move - Walk Around

Constant baseline solution ~60 reward

11 months ago

Just guys if you are wondering - for baseline comparison
action = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.886927] * 2
gives you about ~60 reward, more or less depends on starting position etc.

How I found it? I tried to make a symmetric vector of constant parameters and started from [0.1] * 22. Then I pulled each muscle and observed through visualisation the effect of it.
After founding that [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.8] stands a little longer than usual (e.g. 10 reward), I manually tuned 0.8 coefficient to make longest standing. I tuned 0.8 coefficient manually via binary search, thatโ€™s why it is so weird looking.

I spend like 1 hour for it, it is an easy one solution.

Different reward on local and remote environments

12 months ago

We have same issues. Our RL model could score 100+ on local, and score like 14 on remote.

Flatland Challenge

Any discord server?

About 1 year ago

Do you have a discord server?

orlov_alexander has not provided any information yet.