The MineRL package is open-source and we certainly welcome anyone to experiment with it!
Thanks for your question! This year we decided that you CAN use reward when learning the action distribution from human demonstrations. For example, it is permitted to learn the joint distribution between reward and human actions and to condition on reward when sampling from it.
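As a rough illustration of what "conditioning on reward when sampling" could look like, here is a minimal sketch that counts action frequencies per reward bucket and samples actions conditioned on the current reward. The function names, bucket size, and toy action labels are all illustrative, not part of the minerl API:

```python
import random
from collections import Counter, defaultdict

def fit_action_given_reward(demos, bucket=10):
    """Count action frequencies per reward bucket (a crude joint model).

    demos: iterable of (reward, action) pairs from human demonstrations.
    """
    table = defaultdict(Counter)
    for reward, action in demos:
        table[reward // bucket][action] += 1
    return table

def sample_action(table, reward, bucket=10):
    """Sample an action conditioned on the current reward bucket."""
    counts = table[reward // bucket]
    actions, weights = zip(*counts.items())
    return random.choices(actions, weights=weights)[0]

# Toy demonstration data: (cumulative reward, action taken)
demos = [(0, "chop"), (0, "chop"), (5, "craft"), (12, "mine"), (15, "mine")]
model = fit_action_given_reward(demos)
action = sample_action(model, reward=13)  # draws from the 10-19 bucket
```

A learned density model would replace the count table in practice; the key point is that the conditioning variable (reward) comes from the environment, while the conditional distribution is learned purely from the human data.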
What you are describing, however, sounds like a hard-coded meta-controller, as the policy is dictated by hand-coded reward thresholds.
One option to mitigate this would be to learn a meta-controller that observes only reward and decides among a fixed number of policies. You could then weight demonstrations by their reward to obtain a uniform sampling distribution.
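The reward-weighting idea above can be sketched in a few lines: weight each demonstration inversely to how common its reward value is, so that sampling becomes approximately uniform over reward values. The helper name and toy data are illustrative:

```python
import random
from collections import Counter

def uniform_over_reward(demos):
    """Weight demonstrations inversely to the frequency of their reward,
    so sampling is (approximately) uniform over distinct reward values.

    demos: list of (reward, demonstration) pairs.
    """
    freq = Counter(reward for reward, _ in demos)
    return [1.0 / freq[reward] for reward, _ in demos]

# Three low-reward demos and one high-reward demo
demos = [(0, "demo_a"), (0, "demo_b"), (0, "demo_c"), (64, "demo_d")]
weights = uniform_over_reward(demos)
# rewards 0 and 64 now each carry total weight 1.0
sampled = random.choices(demos, weights=weights, k=1)[0]
```

With these weights, the rare high-reward trajectory is no longer drowned out by the many low-reward ones when sampling training batches.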
That’s an error with the documentation - I thought we had fixed that but we must have missed a section, sorry!
Should be working now!
Miffyli is correct here - even pre-training using a small number of learned weights is not allowed.
We will also be inspecting code to validate submissions; the large-file restriction simply provides an easy way to enforce the pre-training rule in general.
Teams should be notified! Congratulations to the top teams!
Fixed! Install minerl 0.2.8
Great catch - until we can update the PyPI repo, using the MineRLObtainDiamondDense-v0 environment should be a close replacement, especially if you limit the number of steps!
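Limiting the number of steps can be done with a small gym-style wrapper. The class below is an illustrative sketch, not part of the minerl API; it assumes the usual `reset()`/`step(action)` signature and truncates the episode once the step budget is used up:

```python
class StepLimit:
    """Minimal gym-style wrapper that ends episodes after max_steps."""

    def __init__(self, env, max_steps):
        self.env = env
        self.max_steps = max_steps
        self._steps = 0

    def reset(self):
        self._steps = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self._steps += 1
        if self._steps >= self.max_steps:
            done = True  # truncate the episode at the budget
        return obs, reward, done, info

# Tiny stand-in environment to show the wrapper in action
class _DummyEnv:
    def reset(self):
        return 0
    def step(self, action):
        return 0, 0.0, False, {}

env = StepLimit(_DummyEnv(), max_steps=3)
env.reset()
done, steps = False, 0
while not done:
    _, _, done, _ = env.step(None)
    steps += 1
# the episode ends after exactly 3 steps
```

If you are using gym, `gym.wrappers.TimeLimit` provides the same behavior out of the box.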
Sorry I will take a look now - I thought this was covered by our unit tests!
To clarify - we have now moved to Round 1.5! The scores of Round 1 will be kept for archival purposes only.
Announcement - Round 1 Scores
We have reviewed multiple submissions that obtain rewards that should not be achievable in the environment. As this is due to an easily exploitable reward loop present in outdated minerl versions (prior to minerl 0.2.5), we have decided to add 5 additional submissions to each team. The new maximum number of submissions is now 25.
Please verify submissions locally to confirm your current scoreboard results. Top submissions made with outdated minerl versions (prior to minerl 0.2.5) will be re-run to verify their performance.
Additionally, participants should retrain their models to account for the reward loop removal.
On AIcrowd, the minerl version can be checked by looking for the minerl==<version> line in the submission's requirements file. Locally, the python package can be updated with the python -m pip install --upgrade minerl command.
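If helpful, the version pin can also be checked programmatically. This is a small sketch, not part of the AIcrowd tooling; it scans requirements-style lines for the minerl pin and compares it against the first version with the reward loop removed:

```python
import re

MIN_VERSION = (0, 2, 5)  # first release with the reward loop removed

def pinned_minerl_version(requirement_lines):
    """Return the (major, minor, patch) tuple pinned by a
    minerl==<version> line, or None if no pin is found."""
    for line in requirement_lines:
        m = re.match(r"\s*minerl==(\d+)\.(\d+)\.(\d+)", line)
        if m:
            return tuple(int(g) for g in m.groups())
    return None

lines = ["numpy==1.16.4", "minerl==0.2.4"]
version = pinned_minerl_version(lines)
outdated = version is not None and version < MIN_VERSION  # True for 0.2.4
```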
As long as the internal reward is learned from the data, this is allowed. This is not allowed if it is directly a function of the state and external data.
Unfortunately we are unable to release additional data at this time.
We will make an announcement if more data will be available for round 2.
Just to follow up here - this was indeed an issue and the fix is being bundled in minerl 0.2.5!
This was an issue with the obtainDiamond.xml - we have resolved it in the most recent release being deployed today or tomorrow!
Unfortunately, ImageNet pre-training is not allowed this year!
Re-training models is a key part of Round 2, and if pre-trained weights were used there would be no way to tell how those weights were generated. Additionally, if pre-training during evaluation were permitted, competitors could upload large amounts of data which could be used to load other pre-trained weights.
In future iterations of the competition, if pre-training on ImageNet is a common request, we could consider including certain datasets in the provided docker container. Note, however, that the texture pack of Minecraft will change in Round 2, so techniques that transfer well from natural images to Minecraft may not work well in Round 2!
They should have the same item-ID so this should not be an issue but I will verify this when checking it out!
The behavior is defined as it occurs in vanilla Minecraft (where possible). For movement, nothing happens when conflicting actions are requested. For place and attack, both actions will be processed, as the place handler goes through Malmo while the attack action is handled by default Minecraft.
Thanks for this, I will take a look. It could be a weird interaction between the Minecraft give commands and the Malmo agent; I will explore building a world with the needed resources and see if this is still the case.