AIcrowd | jyotish | Participants

13 Follower

4 Following

jyotish

Jyotish

Activity

Dec

Jan

Feb

Mar

Apr

May

Jun

Jul

Aug

Sep

Oct

Nov

Dec

Mon

Wed

Fri

Challenge Categories

Challenges Entered

20 hours left · Ending 31 Dec 23:55 UTC

Global Chess Challenge 2025

AIcrowd

AGI House

Train LLMs to Play Chess

Latest submissions

See All

failed	306073	Fri, 26 Dec 2025 07:46:03
graded	306050	Thu, 25 Dec 2025 22:25:06
failed	306046	Thu, 25 Dec 2025 22:11:24

32 days left

Orak Game Agent Challenge 2025

Krafton AI

Build an LLM agent for five real-world games

Latest submissions

See All

graded	306148	Fri, 26 Dec 2025 22:26:58
graded	304593	Sat, 13 Dec 2025 13:00:55
failed	304578	Sat, 13 Dec 2025 10:56:40

Completed

Flextrack Challenge 2025

University of Wollongong

Detecting Energy Flexibility in Buildings

Latest submissions

See All

graded

292399

Fri, 22 Aug 2025 14:43:13

Completed

Commonsense Persona-Grounded Dialogue Challenge 2025

Sony Group Corporation

Create Context-Aware, Dynamic, and Immersive In-Game Dialogue

Latest submissions

See All

failed	285403	Sun, 18 May 2025 19:27:43
failed	285312	Sat, 17 May 2025 18:11:27
failed	283221	Thu, 1 May 2025 16:59:55

Completed

Meta CRAG - MM Challenge 2025

Meta

Improve RAG with Real-World Benchmarks | KDD Cup 2025

Latest submissions

See All

failed	292367	Thu, 21 Aug 2025 16:45:21
failed	292325	Thu, 14 Aug 2025 13:11:44
graded	286048	Tue, 27 May 2025 18:51:52

Completed

Food Recognition Benchmark 2022

Seerave Foundation

A benchmark for image-based food recognition

Latest submissions

See All

failed	172430	Thu, 27 Jan 2022 12:59:55
graded	172229	Tue, 18 Jan 2022 08:11:07
failed	172228	Tue, 18 Jan 2022 06:58:34

Completed

NeurIPS 2022: CityLearn Challenge

AIcrowd

Intelligent Environments Lab

Using AI For Building’s Energy Management

Latest submissions

See All

failed	193327	Mon, 11 Jul 2022 06:59:55
failed	193315	Mon, 11 Jul 2022 06:02:30
failed	193310	Mon, 11 Jul 2022 05:26:04

Completed

Data Purchasing Challenge 2022

Leibniz Centre for European Economic Research

What data should you label to get the most value for your money?

Latest submissions

See All

failed	178246	Fri, 1 Apr 2022 09:59:15
failed	177490	Fri, 25 Mar 2022 17:44:08
failed	177425	Thu, 24 Mar 2022 13:54:22

Completed

ESCI Challenge for Improving Product Search

Amazon Search

Amazon KDD Cup 2022

Latest submissions

See All

graded	192426	Tue, 5 Jul 2022 16:38:35
failed	192410	Tue, 5 Jul 2022 14:43:38
submitted	192407	Tue, 5 Jul 2022 14:20:11

Completed

Multi Agent Behavior Challenge 2022

AIcrowd

MABe Team

Behavioral Representation Learning from Animal Poses.

Latest submissions

No submissions made in this challenge.

Completed

Airborne Object Tracking Challenge

Amazon Prime Air

Airborne Object Tracking Challenge

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2021 - The NetHack Challenge

AIcrowd

ASCII-rendered single-player dungeon crawl game

Latest submissions

See All

graded	155140	Wed, 8 Sep 2021 07:24:45
graded	147319	Sat, 19 Jun 2021 02:50:34

Completed

IJCAI 2022 - The Neural MMO Challenge

Parametrix.ai

MIT

THU_SIGS

AIcrowd

Latest submissions

No submissions made in this challenge.

Completed

ADDI Alzheimers Detection Challenge

ADDI

Machine Learning for detection of early onset of Alzheimers

Latest submissions

No submissions made in this challenge.

Completed

AI Blitz XIII

AIcrowd

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2021: MineRL BASALT Competition

C.H.A.I. - UC Berkeley

Sample Efficient Reinforcement Learning in Minecraft

Latest submissions

No submissions made in this challenge.

Completed

Learn-to-Race: Autonomous Racing Virtual Challenge

Carnegie Mellon University

Arrival

The first, open autonomous racing challenge.

Latest submissions

See All

graded	176785	Tue, 15 Mar 2022 07:56:02
graded	176487	Wed, 9 Mar 2022 09:01:40
graded	176466	Tue, 8 Mar 2022 19:54:31

Completed

NeurIPS 2020: Procgen Competition

OpenAI

Measure sample efficiency and generalization in reinforcement learning using procedurally generated environments

Latest submissions

See All

submitted	90059	Wed, 21 Oct 2020 06:28:38
graded	83575	Tue, 22 Sep 2020 13:34:25
failed	81249	Sun, 6 Sep 2020 17:59:17

Completed

AI Blitz XII

AIcrowd

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2021 AWS DeepRacer AI Driving Olympics Challenge

AIcrowd

Self-driving RL on DeepRacer cars - From simulation to real world

Latest submissions

No submissions made in this challenge.

Completed

The Neural-MMO Challenge

MIT

Robustness and teamwork in a massively multiagent environment

Latest submissions

No submissions made in this challenge.

Completed

Seismic Facies Identification Challenge

SEAM AI

3D Seismic Image Interpretation by Machine Learning

Latest submissions

See All

failed

99353

Wed, 18 Nov 2020 14:51:01

Completed

AI Blitz XI

AIcrowd

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

Music Demixing Challenge ISMIR 2021

Sony Group Corporation

Latest submissions

No submissions made in this challenge.

Completed

Insurance pricing game

Imperial CPG

Play in a realistic insurance market, compete for profit!

Latest submissions

See All

graded	125874	Thu, 11 Mar 2021 21:05:16
graded	121934	Tue, 16 Feb 2021 20:35:47
failed	116909	Sun, 24 Jan 2021 00:28:24

Completed

AI Blitz #9

AIcrowd

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

Flatland

SBB

Deutsche Bahn

SNCF

Multi-Agent Reinforcement Learning on Trains

Latest submissions

No submissions made in this challenge.

27333 days left

Spotify Million Playlist Dataset Challenge

Spotify

A dataset and open-ended challenge for music recommendation research

Latest submissions

See All

failed

303444

Sun, 2 Nov 2025 09:28:12

Completed

Food Recognition Challenge

Seerave Foundation

A benchmark for image-based food recognition

Latest submissions

See All

graded	114994	Fri, 15 Jan 2021 18:34:25
graded	114972	Fri, 15 Jan 2021 17:20:13
failed	114971	Fri, 15 Jan 2021 17:16:09

Completed

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2020: MineRL Competition

MineRL Labs - Carnegie Mellon University

Sample-efficient reinforcement learning in Minecraft

Latest submissions

No submissions made in this challenge.

Completed

Multi-Agent Behavior: Representation, Modeling, Measurement, and Applications

AIcrowd

MABe Team

Latest submissions

See All

failed	124981	Fri, 5 Mar 2021 18:00:04
failed	124727	Thu, 4 Mar 2021 17:17:28
failed	124726	Thu, 4 Mar 2021 17:10:38

Completed

AI Blitz #8

AIcrowd

5 Puzzles, 3 Weeks. Can you solve them all? 😉

Latest submissions

No submissions made in this challenge.

Completed

Dr. Derks Mutant Battlegrounds

AIcrowd

Dr Derk

Multi-agent RL in game environment. Train your Derklings, creatures with a neural network brain, to fight for you!

Latest submissions

No submissions made in this challenge.

Completed

Learning to Smell

Firmenich

Predicting smell of molecular compounds

Latest submissions

No submissions made in this challenge.

Completed

SnakeCLEF2021 - Snake Species Identification Challenge

Institute of Global Health

LifeCLEF

Classify images of snake species from around the world

Latest submissions

No submissions made in this challenge.

Completed

CYD Campus Aircraft Localization Competition

OpenSky Network

Cyber-Defence Campus, armasuisse

Find all the aircraft!

Latest submissions

No submissions made in this challenge.

Completed

AI Blitz #6

AIcrowd

5 Problems 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

AI Blitz #7

AIcrowd

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

Completed

AI Blitz 5 ⚡

AIcrowd

5 Puzzles, 3 Weeks | Can you solve them all?

Latest submissions

No submissions made in this challenge.

Completed

AI Blitz #4

AIcrowd

5 PROBLEMS 3 WEEKS. CAN YOU SOLVE THEM ALL?

Latest submissions

No submissions made in this challenge.

Completed

Hockey Team Classification

HAC Software Inc

Grouping/Sorting players into their respective teams

Latest submissions

No submissions made in this challenge.

Practice

Latest submissions

No submissions made in this challenge.

Completed

AIcrowd Blitz⚡#2

AIcrowd

5 Problems 15 Days. Can you solve it all?

Latest submissions

See All

failed	71051	Mon, 13 Jul 2020 13:08:04
failed	71041	Mon, 13 Jul 2020 12:44:31

Completed

NeurIPS 2019 : MineRL Competition

MineRL Labs - Carnegie Mellon University

Sample-efficient reinforcement learning in Minecraft

Latest submissions

No submissions made in this challenge.

Completed

Flatland Challenge

SBB

Multi Agent Reinforcement Learning on Trains.

Latest submissions

No submissions made in this challenge.

Practice

Latest submissions

See All

graded	191633	Thu, 30 Jun 2022 16:00:40
submitted	191628	Thu, 30 Jun 2022 15:49:33
submitted	191622	Thu, 30 Jun 2022 15:43:23

Practice

CRDSM

AIcrowd

KAIR

Crowdsourced Map Land Cover Prediction

Latest submissions

See All

graded	60315	Fri, 3 Apr 2020 17:40:13
graded	60314	Fri, 3 Apr 2020 17:38:46

1688 days left

Trajnet++ (A Trajectory Forecasting Challenge)

VITA (EPFL)

Latest submissions

No submissions made in this challenge.

Completed

AIcrowd Blitz - May 2020

AIcrowd

5 Problems 15 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

48 days left

EPFL ML Road Segmentation

EPFL ML

Project 2: Road extraction from satellite images

Latest submissions

No submissions made in this challenge.

48 days left

EPFL ML Text Classification

EPFL ML

Project 2: build our own text classifier system, and test its performance.

Latest submissions

No submissions made in this challenge.

Completed

Spotify Sequential Skip Prediction Challenge

Spotify

Predict if users will skip or listen to the music they're streamed

Latest submissions

No submissions made in this challenge.

Completed

ImageCLEF 2018 Caption - Concept Detection

ImageCLEF

Identifying relevant concepts in a large corpus of medical images

Latest submissions

No submissions made in this challenge.

Completed

ImageCLEF 2019 Tuberculosis - CT report

ImageCLEF

Latest submissions

No submissions made in this challenge.

Completed

AI for Good - AI Blitz #3

AIcrowd

AI for Good - ITU

5 PROBLEMS 3 WEEKS. CAN YOU SOLVE THEM ALL?

Latest submissions

See All

failed

77264

Fri, 21 Aug 2020 13:32:28

Completed

Latest submissions

See All

graded	67702	Sat, 30 May 2020 22:03:07
graded	67701	Sat, 30 May 2020 21:33:48
graded	67600	Tue, 26 May 2020 12:03:15

Completed

Latest submissions

No submissions made in this challenge.

Completed

Spotify Sequential Skip Prediction Challenge

Spotify

Predict if users will skip or listen to the music they're streamed

Latest submissions

No submissions made in this challenge.

7184 days left

ECCV 2020 Commands 4 Autonomous Vehicles

KU Leuven

Latest submissions

No submissions made in this challenge.

Completed

JIGSAW

AIcrowd

Solve the jigsaw and finish the picture!

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

No submissions made in this challenge.

Completed

DA Project RECID

DA IIIT-H

Predict whether an individual will be back to prison

Latest submissions

No submissions made in this challenge.

Completed

ImageCLEF 2021 Tuberculosis - TBT classification

ImageCLEF

Latest submissions

No submissions made in this challenge.

Completed

Sound Sentiment Prediction

AIcrowd

Analyse Sentiment From Sound Clips

Latest submissions

No submissions made in this challenge.

Completed

Evoked Expressions from Videos Challenge (@CVPR 2021)

cvpr-workshop-2021

Predict viewer reactions from a large-scale video dataset!

Latest submissions

See All

graded

124097

Sun, 28 Feb 2021 22:06:35

Completed

RLIITM-1

AIcrowd

IIT Madras

Reinforcement Learning, IIT-M, assignment 1

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

See All

failed

156316

Tue, 14 Sep 2021 18:18:28

Completed

AI Blitz⚡ Community Challenge

AIcrowd

5 puzzles and 1 week to solve them!

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

See All

graded

128368

Mon, 5 Apr 2021 14:26:50

Completed

Latest submissions

No submissions made in this challenge.

Completed

Flatland AMLD 2021

AIcrowd

Multi-Agent Reinforcement Learning on Trains

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

See All

graded	165868	Sat, 27 Nov 2021 13:42:25
failed	162152	Wed, 27 Oct 2021 16:38:24

Completed

[ICRA2022 & IROS2023] General Place Recognition: City-scale UGV Localization

AirLab - Carnegie Mellon University

Localization, SLAM, Place Recognition, Visual Navigation, Loop Closure Detection

Latest submissions

No submissions made in this challenge.

Completed

Lip Reading

AIcrowd

Identify Words from silent video inputs.

Latest submissions

No submissions made in this challenge.

Starting soon

Multi-Agent Reinforcement Learning for Iterative Reasoning

Strategic Intelligence for Machine Agents (SIGMA) Lab

Latest submissions

See All

failed	195996	Thu, 21 Jul 2022 18:04:09
submitted	195995	Thu, 21 Jul 2022 17:58:02
failed	183788	Mon, 16 May 2022 16:44:50

Completed

CVPR 2022 CLEAR Challenge

Carnegie Mellon University

A Challenge on Continual Learning using Real-World Imagery

Latest submissions

No submissions made in this challenge.

Completed

NeurIPS 2022 IGLU Challenge - RL Task

AIcrowd

IGLU Team

Use an RL agent to build a structure with natural language inputs

Latest submissions

No submissions made in this challenge.

Completed

Latest submissions

See All

graded	281041	Wed, 5 Mar 2025 10:16:28
failed	281038	Tue, 4 Mar 2025 12:21:34
graded	281037	Tue, 4 Mar 2025 05:20:57

Completed

Single-source Augmentation

Meta

Generating answers using image-linked data

Latest submissions

See All

failed	292367	Thu, 21 Aug 2025 16:45:21
failed	292325	Thu, 14 Aug 2025 13:11:44
graded	286048	Tue, 27 May 2025 18:51:52

Completed

Multi-source Augmentation

Meta

Synthesising answers from image and web sources

Latest submissions

See All

graded	283744	Wed, 7 May 2025 08:49:20
graded	282957	Mon, 28 Apr 2025 20:05:14
graded	282706	Fri, 25 Apr 2025 22:22:16

Completed

Multi-turn QA

Meta

Contextual answering in multi-turn dialogue

Latest submissions

See All

graded	282958	Mon, 28 Apr 2025 20:41:13
graded	282199	Thu, 17 Apr 2025 19:12:34
failed	282198	Thu, 17 Apr 2025 19:12:14

Participant	Rating
BhaviD	0
will_kwan	0
lars12llt	0
jansi_rani_s_v	0
branden_murray	0
saketha_ramanujam	0
vrv	0
jerome_patel	0
shivam	136
cadabullos	0
krishna_kaushik	0
unnikrishnan.r	261
ns601023	0

Participant	Rating
vrv	0
aicrowd-bot
shivam	136
unnikrishnan.r	261

ppo_plis DroneRL
View
teamA QM energy challenge
View
RandomTeam Seismic Facies Identification Challenge
View
AIcrowdHQ Insurance pricing game
View
AIcrowd AI Blitz #6
View
AIcrowdHQ ADDI Alzheimers Detection Challenge
View

Orak Game Agent Challenge

Intermittent UNAUTHENTICATED: Session expired (timeout: 120s) across all games (local + remote)

10 days ago

The sessions are ideally supposed to reauthenticate and continue normally. Can you share with us some script using which we can reproduce this error?

Supemario, game screenshot works?

11 days ago

Hello @RickySong @ChoiSoojin,

On the evaluation server, we run the games using the same scripts as the ones in the starter kit. To keep the behavior consistent, we updated the starter kit to run in headless mode as well.

If you’d like to enable visualizations again, you can switch the environment creation line in super_mario_env.py (around line 217):

to:

self.env = gym_super_mario_bros.make(
    'SuperMarioBros-1-1-v1',
    render_mode='human',
    apply_api_compatibility=True
)

Session cannot get data including mcp_urls

14 days ago

Hello @ns601023

It seems like you are using an older version of the starter kit. Can you please pull the new changes and try again?

git pull origin master --rebase

Pokemon map is broken?

17 days ago

@ilya_gusev is there a particular submission you are referring to that I can check?

We didn’t change anything specific to pokemon and wouldn’t expect this change. Maybe the game hasn’t started yet? The game ROM we received from Krafton team starts at the menu screen and the “Map on Screen” wouldn’t be defined for that screen.

Is there something I can cross check on the server logs or can you give the exact steps to replicate this issue so that we can pass this to Krafton team?

Starcraft submission

18 days ago

@mikhail1 @cheong_wei_xun

can you please take a look at this

Model submission

18 days ago

While running remote mode, it seems like the Star Craft failed after entering the second episode, is it possible that there’s a problem with the code in the MCP server or game envs of AIcrowd?

The issue was due to requests getting queued and dequeued arbitrarily at MCP server. We moved to a gRPC based implementation to get around this issue.

Please pull the recent changes made to starter kit and let us know if you still run into any issues.

long will the evaluation process usually take?

It’s subjective. However, if you use the random agents (not really random, they simply repeat the same static action), this is what you should expect.

Do score only appear after all games are completed?

Yes. Your submission must complete all games to be marked as graded. Your submission won’t appear on leaderboard otherwise.

will it directly show the score on the submission page once all games are completed or it will take some additional time to make the score shown on the submission page

Your submission would be marked as failed if it doesn’t complete all the games. In case your evaluation times out i.e. doesn’t finish all the games in 12 hours, the submission would eventually get marked as failed.

Clarification on ORAK Scoring Standards and Remote Mode Episode Settings

18 days ago

When running the games in remote mode, how many episodes are executed for each game?

Three episodes each for all games.

And for games like 2048, is the final score taken as the average of three rounds

Final score for each game would be the average scores across episodes.

Super_mario

18 days ago

Hey, we released a few patches to the starter kit that would remove fastmcp dependencies. Can you please pull the recent changes, give it a try and reach out to us if you are still running into problems?

📢 Starter Kit Update

18 days ago

📢 Starter Kit Update

18 days ago

Hello everyone! A quick heads up about an important stability update to the starter kit.

We’ve migrated the transport/communication backend (previously based on MCP/FastMCP) to gRPC to make the interaction between your agent and the game environment more robust and predictable .

What changed?

The underlying transport layer is now gRPC-based .
The agent-facing interface and APIs are unchanged . Your existing agent code should continue to work as is.
The main goal of this change is better resilience under long games and reconnections .

Why this matters?

Some of you were seeing:

Random stalls / timeouts during longer runs
Reconnection issues
Episodes hanging with no clear error

These issues were caused by how requests were queued and retried in the previous MCP-based setup. With gRPC, we now strictly enforce “one client, one action in flight” , and we get clearer error handling, which should eliminate these stalls and make reconnect behavior much more reliable.

What you need to do

Update your local starter kit
- Pull the latest changes from the repo (e.g., git pull --rebase ).
Reinstall/refresh dependencies if needed
- uv sync
Run your existing agents as usual
- No changes should be required to your agent logic or environment interaction code.

If you still see issues

If you run into:

Timeouts
Stalls
Reconnection problems

please share:

Logs (client + server, if possible)
Approximate episode length and map
Steps to reproduce

This will help us quickly track down any remaining edge cases.

Thanks for your patience while we tracked this down.

StarCraft Stuck After Episode 1 (‘Client is not connected’ Issue)

20 days ago

We use the Linux headless binary for evaluations, and the steps below should help you get everything set up clearly:

Download the last available version (4.10) from GitHub - Blizzard/s2client-proto: StarCraft II Client - protocol definitions used to communicate with StarCraft II..
Check the README for instructions on unzipping the archive. It’s password protected, and the password is provided in the README itself.
Extract the game into your $HOME directory. After extraction, the expected path should be $HOME/StarCraftII.
Depending on your burnysc2 version, the maps directory may need to be either $HOME/StarCraftII/Maps or $HOME/StarCraftII/maps. To avoid issues with missing map directories, create a symlink:
```
ln -s $HOME/StarCraftII/Maps $HOME/StarCraftII/maps
```
For additional details on configuring the maps, see: Question about SC2 map setting in starter kit

Model submission

21 days ago

The final evaluations would be manually run by Krafton AI team and the prizes would be decided based on the outcome of the manual runs. Krafton AI team would verify the size of the model you submit during the final evaluations.
No, we do not require any sort of access to your LLM server. However, for the final evaluations, you would need to include precise instructions and code that is needed to start your LLM server and ensure that Krafton AI team is able to run everything end-to-end.

Model submission

21 days ago

No, it doesn’t need to be hosted externally. We simply provide the endpoints that let you get game observations, and your agent only needs to return the actions to execute in the game. How you produce those actions whether through local models, external services, or any other setup is entirely up to you.

This means your machine must be able to access whichever models or services you rely on during inference.

In Local Mode, everything (your runner, game launcher, agents, and the MCP game servers) runs directly on your machine. The runner starts the game servers, initializes your agents, and your agents communicate with the servers over localhost. There is no connection to the AIcrowd backend.

In Remote Mode, your runner and agents (including the LLM calls you make etc.,) still run locally, but the MCP game servers run on AIcrowd’s remote infrastructure. Your runner creates a session via the Session API, receives the MCP server URLs, and your agents interact with the remote servers over HTTPS.

Local Mode hosts the entire stack on your machine.
Remote Mode keeps your agents local while offloading the game servers and environments to AIcrowd.

Hope this makes it clear

ModuleNotFoundError: No module named 'omegaconf'

22 days ago

uv looks for pyproject.toml and automatically manages the virtual environment at the repo level.

Can you verify that your environment is actually being used?

# check which Python uv is running
uv run python

# try importing a starter-kit–specific library, for example:
import sc2

If this import works, then uv is configured correctly.

You can install additional packages with:

uv add <package>

Although uv pip install -r pyproject.toml works, it’s generally better to use:

uv sync

This installs the exact dependency versions listed in uv.lock, matching the environment used during starter-kit testing.

Model submission

22 days ago

You don’t need to submit your model directly for this challenge. When you make a submission, we automatically launch an instance of each game and provide your agent with a unique MCP address for that run. Each game reports its score back to us, and we update the leaderboard accordingly.

Global Chess Challenge 2025

Submissions stuck at "Compiling model for Neuron"

10 days ago

@artist @whoamananand it seems like the models are hitting memory limits. Can you share the the config params you used to compile the model on trn1.2xlarge so that we can investigate this further?

Submitting models to Neuron: pick the right `--neuron.model-type` (and tune vLLM if you need to)

10 days ago

Hey @whoamananand

Your model is hitting memory limits and crashing the node on which the evaluation was running. Can you try submitting a smaller model?

We will figure out a way to relay the OOM errors properly on the submission details page.

Submitting models to Neuron: pick the right `--neuron.model-type` (and tune vLLM if you need to)

13 days ago

Submitting models to Neuron: pick the right `--neuron.model-type` (and tune vLLM if you need to)

13 days ago

When you run aicrowd submit-model, the platform spins up a vLLM server for your model. You can pass a handful of --vllm.* flags to control things like max context length, dtype, batching limits, LoRA settings, and a few inference-time parameters.

But there’s one flag you must get right for Neuron hardware:

--neuron.model-type <model-type>

Why this matters (Neuron compilation in one paragraph)

AWS Inferentia/Trainium (Neuron) doesn’t run your PyTorch model “as-is” the way a typical GPU setup might. The model needs to be compiled into a Neuron-compatible artifact before it can run on the accelerator.

Because the compilation path is model-architecture-specific, the submission system needs to know which backend/architecture you’re using. Hence --neuron.model-type.

If you don’t set --neuron.model-type, the submission will default to qwen3.

Supported model types (backends)

The NxD Inference model hub currently supports architectures including: Llama (text), Llama (multimodal), Llama4, Mixtral, DBRX, Qwen2.5, Qwen3, and FLUX.1 (beta).

github.com

aws-neuron/neuronx-distributed-inference/blob/main/src/neuronx_distributed_inference/inference_demo.py#L51-L56


      
          "llama": {"causal-lm": NeuronLlamaForCausalLM},
          "mixtral": {"causal-lm": NeuronMixtralForCausalLM},
          "dbrx": {"causal-lm": NeuronDbrxForCausalLM},
          "qwen2": {"causal-lm": NeuronQwen2ForCausalLM},
          "qwen3": {"causal-lm": NeuronQwen3ForCausalLM},
          "qwen3_moe": {"causal-lm": NeuronQwen3MoeForCausalLM},

(For the others, the exact string is typically the obvious lowercase name, aligned to the supported architecture list above. If you’re unsure, match it to your model family—e.g., Mixtral → mixtral, Qwen3 → qwen3—because picking the wrong type can lead to compile failures or incorrect behavior.)

Supported vLLM server flags

aicrowd submit-model currently supports these vLLM arguments:

--vllm.max-model-len
--vllm.dtype
--vllm.kv-cache-dtype
--vllm.quantization
--vllm.load-format
--vllm.rope-theta
--vllm.rope-scaling
--vllm.max-num-batched-tokens
--vllm.max-num-seqs
--vllm.enforce-eager true
--vllm.enable-lora true
--vllm.lora-dtype
--vllm.lora-extra-vocab-size
--vllm.enable-prefix-caching true
--vllm-env.allow-long-max-model-len

# inference time parameters
--vllm-inference.max-tokens

Example: complete submission command

At minimum, set your repo/tag and the Neuron model type:

aicrowd submit-model \
  --hf-repo <repo> \
  --hf-repo-tag <branch/tag> \
  --neuron.model-type llama

Or, for DBRX:

aicrowd submit-model \
  --hf-repo <repo> \
  --hf-repo-tag <branch/tag> \
  --neuron.model-type dbrx

If you need tighter control over serving behavior, add the relevant --vllm.* flags on the same command.

Concerns About Challenge Readiness - Seeking Clarification

14 days ago

Hello @sankar_ram

We are already accepting submissions. In an unlikely case that there are any more changes to the evaluation setup, we will re-evaluate the submissions as needed. You can start submitting your models right away.
As of now, you can make 5 submissions per day.
“Global Chess Challenge” is the name of the competition. “AWS Trainium Challenge 2025” was a placeholder we used during our internal development.

jyotish has not provided any information yet.

Notebooks

Create Notebook

Filters

Private

Solution for submission 128368 A detailed solution for submission 128368 submitted for challenge IIT-M RL-ASSIGNMENT-2-TAXI

jyotish
· Over 4 years ago

View
[Baseline] Detectron2 starter kit for food recognition 🍕 A beginner friendly notebook kick start your instance segmentation skills with detectron2

jyotish
· Almost 5 years ago

Open in Colab · View

Notebooks

Create Notebook

Filters

Private