orlov_alexander 332

Challenges Entered

Latest submissions

graded 22438
graded 22217
graded 21638
Gold 1
May 16, 2020
Silver 0
Bronze 1
May 16, 2020

NeurIPS 2019: Learn to Move - Walk Around

Constant baseline solution ~60 reward

7 months ago

In case anyone is wondering, here is a constant action for baseline comparison:
action = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.886927] * 2
It gives a reward of about 60, give or take, depending on the starting position and so on.
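
For anyone who wants to try it, here is a minimal sketch of running a constant action for a whole episode. It assumes the environment follows the classic Gym convention (`reset()` returns an observation, `step(action)` returns `(observation, reward, done, info)`). `run_constant_policy` and `DummyEnv` are hypothetical names made up for illustration; `DummyEnv` only stands in for the real simulator so the snippet runs on its own.

```python
from typing import Sequence

# Per-leg muscle activations from the post; the full action repeats them for both legs.
ONE_LEG = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.886927]
ACTION = ONE_LEG * 2  # 22 values, symmetric across the two legs


def run_constant_policy(env, action: Sequence[float], max_steps: int = 1000) -> float:
    """Apply the same action at every step and return the total reward.

    Assumption: `env` exposes reset() and step(action) -> (obs, reward, done, info).
    """
    env.reset()
    total = 0.0
    for _ in range(max_steps):
        _, reward, done, _ = env.step(list(action))
        total += reward
        if done:
            break
    return total


class DummyEnv:
    """Hypothetical stand-in environment: pays 1.0 per step, ends after `horizon` steps."""

    def __init__(self, horizon: int = 5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return None

    def step(self, action):
        assert len(action) == 22  # one activation per muscle
        self.t += 1
        return None, 1.0, self.t >= self.horizon, {}


if __name__ == "__main__":
    print(run_constant_policy(DummyEnv(), ACTION))
```

Swap `DummyEnv()` for the actual challenge environment to get a real score; the numbers printed by the stand-in mean nothing.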

How did I find it? I wanted a symmetric vector of constant parameters, so I started from [0.1] * 22. Then I pulled each muscle in turn and watched the effect in the visualisation.
After finding that [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.8] stands a little longer than usual (e.g. 10 reward), I tuned the 0.8 coefficient by hand, using binary search, to make the model stand as long as possible. That's why the final value looks so weird.
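
The manual tuning could be automated roughly as below. This is only a sketch, not the procedure actually used: it swaps in a ternary-style interval narrowing instead of hand-driven binary search, and it assumes the reward has a single peak in the last coefficient and is not too noisy. The search range [0.8, 1.0] and the names `make_action`, `narrow_coefficient` and `score` are assumptions for illustration; `score` is whatever function returns the episode reward for a given 22-value action.

```python
from typing import Callable, List


def make_action(last: float) -> List[float]:
    """Build the symmetric 22-value action with the last per-leg coefficient set to `last`."""
    one_leg = [0.1] * 6 + [0.7] + [0.1] * 3 + [last]
    return one_leg * 2


def narrow_coefficient(
    score: Callable[[List[float]], float],
    lo: float = 0.8,
    hi: float = 1.0,
    iters: int = 20,
) -> float:
    """Shrink [lo, hi] around the coefficient that maximises `score`.

    Assumption: score is unimodal in the coefficient on [lo, hi].
    """
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if score(make_action(m1)) < score(make_action(m2)):
            lo = m1  # the peak lies to the right of m1
        else:
            hi = m2  # the peak lies to the left of m2
    return (lo + hi) / 2
```

Since the reward depends on the starting position, in practice `score` would probably need to average several episodes before the comparison is trustworthy.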

I spent about an hour on it, so it is an easy solution.

Different reward on local and remote environments

8 months ago

We have the same issue. Our RL model can score 100+ locally, but only about 14 on the remote environment.

Flatland Challenge

Any discord server?

10 months ago

Do you have a Discord server?
