AIcrowd | orlov_alexander

Challenges Entered

Completed

NeurIPS 2019: Learn to Move - Walk Around

Stanford Neuromuscular Biomechanics Laboratory

Reinforcement Learning on Musculoskeletal Models

Latest submissions

graded	22438	Sat, 26 Oct 2019 15:36:31
graded	22217	Sat, 26 Oct 2019 08:26:41
graded	21638	Thu, 24 Oct 2019 17:11:39

Completed

NeurIPS 2019 : Disentanglement Challenge

Max Planck Institute for Intelligent Systems

Disentanglement: from simulation to real-world

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2019 : MineRL Competition

MineRL Labs - Carnegie Mellon University

Sample-efficient reinforcement learning in Minecraft

Latest submissions

No submissions made in this challenge.

Completed

Flatland Challenge

SBB

Multi Agent Reinforcement Learning on Trains.

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2019 - Robot open-Ended Autonomous Learning

GOAL-Robots

Robots that learn to interact with the environment autonomously

Latest submissions

No submissions made in this challenge.

Participant	Rating

SimBodyWithDummyPlug NeurIPS 2019: Learn to Move - Walk Around

NeurIPS 2019: Learn to Move - Walk Around

Constant baseline solution ~60 reward

Over 6 years ago

Guys, in case you are wondering, here is a constant action for baseline comparison:
action = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.886927] * 2
It gives about 60 reward, more or less depending on the starting position etc.
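
For anyone who wants to try it, here is a minimal sketch of how a constant action like this could be fed to an environment. The helper `run_constant_policy` and the `ToyEnv` class are hypothetical names made up for illustration, and the old gym-style `step` that returns `(observation, reward, done, info)` is an assumption. `ToyEnv` is only a stand-in so the snippet runs without the simulator; with the real challenge environment (I believe `L2M2019Env` from osim-rl, but treat that as an assumption) you would pass that in instead.

```python
# Per-leg muscle excitations from the post (11 values), mirrored for both legs.
ONE_LEG = [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.886927]
ACTION = ONE_LEG * 2  # 22 values, symmetric across the two legs


def run_constant_policy(env, action, max_steps=1000):
    """Apply the same action every step and return the summed reward.

    Assumes a gym-style env: reset() and step(action) -> (obs, reward, done, info).
    """
    env.reset()
    total = 0.0
    for _ in range(max_steps):
        _, reward, done, _ = env.step(action)
        total += reward
        if done:
            break
    return total


class ToyEnv:
    """Hypothetical stand-in, NOT the challenge simulator.

    Gives +1 reward per step and ends the episode after `horizon` steps.
    """

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return None

    def step(self, action):
        assert len(action) == 22  # the post uses a 22-dimensional action
        self.t += 1
        return None, 1.0, self.t >= self.horizon, {}


toy_total = run_constant_policy(ToyEnv(), ACTION)
print(toy_total)
```

The number printed here only reflects the toy environment. The ~60 reward mentioned above comes from the real simulator and is not reproduced by this snippet.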

How did I find it? I wanted a symmetric vector of constant parameters, so I started from [0.1] * 22. Then I pulled each muscle in turn and watched the effect in the visualisation.
After finding that [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.7, 0.1, 0.1, 0.1, 0.8] stands a little longer than usual (e.g. 10 reward), I tuned the 0.8 coefficient by hand with a binary search to make the model stand as long as possible, which is why the final value looks so weird.
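
The tuning above was done by hand, but the same idea can be sketched in code. This is not the exact procedure from the post: it swaps in a ternary-style interval narrowing, which only works if the score has a single peak inside the interval (an assumption). `build_action`, `narrow_coefficient` and `toy_score` are hypothetical names, and `toy_score` is a made-up function with its peak placed at 0.886927 purely so the snippet runs; in practice the score would be the episode reward from the simulator.

```python
def build_action(last_coefficient):
    """Symmetric 22-value action with the last per-leg coefficient left tunable."""
    one_leg = [0.1] * 6 + [0.7] + [0.1] * 3 + [last_coefficient]
    return one_leg * 2


def narrow_coefficient(score, lo, hi, iterations=40):
    """Shrink [lo, hi] around the maximum of score(x).

    Assumes score has a single peak in [lo, hi]. Each round drops the third
    of the interval that cannot contain the peak.
    """
    for _ in range(iterations):
        third = (hi - lo) / 3.0
        left, right = lo + third, hi - third
        if score(left) < score(right):
            lo = left   # peak lies to the right of `left`
        else:
            hi = right  # peak lies to the left of `right`
    return (lo + hi) / 2.0


def toy_score(x):
    """Hypothetical stand-in for 'reward when standing with coefficient x'."""
    return -(x - 0.886927) ** 2


best = narrow_coefficient(toy_score, 0.6, 1.0)
action = build_action(best)
print(round(best, 6), len(action))
```

With the real simulator the reward is noisy (it depends on the starting position), so a search like this would probably need to average several episodes per candidate value.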

I spent about an hour on it, so it is an easy solution.