AIcrowd | leckofunny | Participants

0 Follower

0 Following

Leckofunny

Marco Pleines

Activity

Dec

Jan

Feb

Mar

Apr

May

Jun

Jul

Aug

Sep

Oct

Nov

Dec

Mon

Wed

Fri

Challenge Categories

Challenges Entered

Completed

NeurIPS 2020: Procgen Competition

OpenAI

Measure sample efficiency and generalization in reinforcement learning using procedurally generated environments

Latest submissions

See All

graded

68908

Tue, 23 Jun 2020 09:54:32

Completed

Unity Obstacle Tower Challenge

Unity Technologies

A new benchmark for Artificial Intelligence (AI) research in Reinforcement Learning

Latest submissions

See All

graded	9149	Wed, 17 Jul 2019 13:42:58
graded	9146	Wed, 17 Jul 2019 13:26:17
graded	9142	Wed, 17 Jul 2019 12:18:12

Participant	Rating

Participant	Rating

Leckofunny has not joined any teams yet...

NeurIPS 2020: Procgen Competition

Problems of using rllib for a research competition

Over 5 years ago

I decided to not participate in this competition as well, due to the aforementioned constraints. I’ll continue using Procgen though using my established workflow and code.

Selecting seeds during training

Over 5 years ago

My auto curriculum algorithm just alters the way seeds are sampled to provide much more useful data to the agent and hence improve sample efficiency. Having more seeds than 100 or 200 doesn’t even help in my opinion.

Selecting seeds during training

Over 5 years ago

I guess my assumption is correct since nobody negates it.
It is a pity that Curriculum Learning cannot be done during this challenge.

Selecting seeds during training

Over 5 years ago

@mohanty
I’d like to explicitly set a distinct seed for each worker during training, because I’ve got a concept for sampling seeds.
The implementation would probably look similar to this:
https://docs.ray.io/en/master/rllib-training.html#curriculum-learning

As far as I know, the Procgen environment has to be closed and instantiated again to apply a distinct seed (num_leves = 1, start_level = my_desired_seed), because I cannot enforce a new seed during the reset() call.

So I assume that 200 seeds will be sampled uniformly and it will not be possible to inject my logic to alter the sampling strategy of the 200 seeds.

Selecting seeds during training

Over 5 years ago

Any info about this @mohanty ?

Selecting seeds during training

Over 5 years ago

Hi!

How are you enforcing the usage of 200 training seeds once submitted?
I’m planning on a submission that has some logics to sample certain seeds for each environment.
And as far as Procgen is implemented, I’d have to close and instantiate again the environment to apply the designated seed.

FAQ: Regarding rllib based approach for submissions

Over 5 years ago

Is there any kind of interface that could be used to dynamically tell each environment instance which seed to use? I’ve got some curriculum concepts to sample seeds during training.

From first sight, I think that this is way too cumbersome using RLlib.

Multi-Task Challenge?

Over 5 years ago

According to this image, each environment is being trained and evaluated solely.
After all, the agent gets to train on the unknown environments as well, right?

And does this image mean that the training and the evaluation are done on your side?

Multi-Task Challenge?

Over 5 years ago

Hi!

I’m wondering whether this competition challenges us with a multi-task setting.

To my understanding, one agent shall train on 16 environments So this agent/model should be able to play each environment and the 4 unseen ones, right?

Unity Obstacle Tower Challenge

Good testing environment that does not need X?

Over 6 years ago

Unfortunately I did not find one yet.

Release of the evaluation seeds?

Over 6 years ago

1001, 1002, 1003, 1004, 1005 are the evaluation seeds.
The environment’s source is finally available.

Submissions are stuck

Over 6 years ago

Thanks, the fix works!

Submissions are stuck

Over 6 years ago

@mohanty

Are you going to fix the bug of the show post-challenge submission button?
Pressing this button does not change the leaderboard.

Good testing environment that does not need X?

Over 6 years ago

Hey,

do you guys know of an environment, which would be suitable for testing DRL features?

Obstacle Tower takes too much time as well as the dependency of using an X server is daunting.
I’m working on two clusters, one in Jülich and one in Dortmund and neither of them has a suitable strategy for making X available. X needs root privileges to be started and that’s basically their major issue.

If X was not an issue, I would build myself a Unity environment.

Does anybody know if Unreal Engine is dependant on X as well?

Release of the evaluation seeds?

Over 6 years ago

Hey @arthurj
when could we get an OT build with the evaluation seeds?
I guess we all would love to see what our agents are capable of doing.

Thanks for the great challenge!

Deep Reinforcement Learning PhD

Notebooks

Create Notebook

Filters

Private

Notebooks

Create Notebook

Filters

Private

Organization

Location

Badges

Connect

Activity

Challenge Categories

Challenges Entered

NeurIPS 2020: Procgen Competition

Latest submissions

Unity Obstacle Tower Challenge

Latest submissions

NeurIPS 2020: Procgen Competition

Problems of using rllib for a research competition

Selecting seeds during training

Selecting seeds during training

Selecting seeds during training

Selecting seeds during training

Selecting seeds during training

FAQ: Regarding rllib based approach for submissions

Multi-Task Challenge?

Multi-Task Challenge?

Unity Obstacle Tower Challenge

Good testing environment that does not need X?

Release of the evaluation seeds?

Submissions are stuck

Submissions are stuck

Good testing environment that does not need X?

Release of the evaluation seeds?

Submissions are stuck

Submissions are stuck

Is the evaluation seed truly random?

Evaluation perspective config?

Evaluation perspective config?

Notebooks

Notebooks