Organization
Location
Badges
Activity
Challenge Categories
Challenges Entered
Build an LLM agent for five real-world games
Latest submissions
Detecting Energy Flexibility in Buildings
Latest submissions
Create Context-Aware, Dynamic, and Immersive In-Game Dialogue
Latest submissions
Improve RAG with Real-World Benchmarks | KDD Cup 2025
Latest submissions
Generate Synchronised & Contextually Accurate Videos
Latest submissions
Improve RAG with Real-World Benchmarks
Latest submissions
Revolutionise E-Commerce with LLM!
Latest submissions
Revolutionising Interior Design with AI
Latest submissions
Multi-Agent Dynamics & Mixed-Motive Cooperation
Latest submissions
Advanced Building Control & Grid-Resilience
Latest submissions
Specialize and Bargain in Brave New Worlds
Latest submissions
Trick Large Language Models
Latest submissions
Shopping Session Dataset
Latest submissions
Understand semantic segmentation and monocular depth estimation from downward-facing drone images
Latest submissions
Audio Source Separation using AI
Latest submissions
Identify user photos in the marketplace
Latest submissions
A benchmark for image-based food recognition
Latest submissions
Using AI For Buildingโs Energy Management
Latest submissions
Learning From Human-Feedback
Latest submissions
What data should you label to get the most value for your money?
Latest submissions
Interactive embodied agents for Human-AI collaboration
Latest submissions
Specialize and Bargain in Brave New Worlds
Latest submissions
Amazon KDD Cup 2022
Latest submissions
Behavioral Representation Learning from Animal Poses.
Latest submissions
Airborne Object Tracking Challenge
Latest submissions
ASCII-rendered single-player dungeon crawl game
Latest submissions
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
Measure sample efficiency and generalization in reinforcement learning using procedurally generated environments
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
Self-driving RL on DeepRacer cars - From simulation to real world
Latest submissions
3D Seismic Image Interpretation by Machine Learning
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
Multi-Agent Reinforcement Learning on Trains
Latest submissions
A dataset and open-ended challenge for music recommendation research
Latest submissions
A benchmark for image-based food recognition
Latest submissions
Latest submissions
Sample-efficient reinforcement learning in Minecraft
Latest submissions
Latest submissions
5 Puzzles, 3 Weeks. Can you solve them all? ๐
Latest submissions
Multi-agent RL in game environment. Train your Derklings, creatures with a neural network brain, to fight for you!
Latest submissions
Predicting smell of molecular compounds
Latest submissions
Find all the aircraft!
Latest submissions
5 Problems 21 Days. Can you solve it all?
Latest submissions
5 Puzzles 21 Days. Can you solve it all?
Latest submissions
5 Puzzles, 3 Weeks | Can you solve them all?
Latest submissions
Latest submissions
Grouping/Sorting players into their respective teams
Latest submissions
5 Problems 15 Days. Can you solve it all?
Latest submissions
5 Problems 15 Days. Can you solve it all?
Latest submissions
Predict Heart Disease
Latest submissions
5 PROBLEMS 3 WEEKS. CAN YOU SOLVE THEM ALL?
Latest submissions
Latest submissions
Remove Smoke from Image
Latest submissions
Classify Rotation of F1 Cars
Latest submissions
Can you classify Research Papers into different categories ?
Latest submissions
Can you dock a spacecraft to ISS ?
Latest submissions
Multi-Agent Reinforcement Learning on Trains
Latest submissions
Multi-Class Object Detection on Road Scene Images
Latest submissions
Localization, SLAM, Place Recognition, Visual Navigation, Loop Closure Detection
Latest submissions
Localization, SLAM, Place Recognition
Latest submissions
Detect Mask From Faces
Latest submissions
Identify Words from silent video inputs.
Latest submissions
A Challenge on Continual Learning using Real-World Imagery
Latest submissions
Latest submissions
See All| graded | 200977 |
Music source separation of an audio signal into separate tracks for vocals, bass, drums, and other
Latest submissions
Amazon KDD Cup 2023
Latest submissions
Amazon KDD Cup 2023
Latest submissions
Make Informed Decisions with Shopping Knowledge
Latest submissions
Generate Videos with Temporal and Semantic Audio Sync
Latest submissions
Create Videos with Spatially Aligned Stereo Audio
Latest submissions
Build Context-Aware Conversational NPC Agents
Latest submissions
Task-Oriented Conversational AI for NPC Agents
Latest submissions
Context-Aware & Task-Driven NPC Agents
Latest submissions
| Participant | Rating |
|---|---|
vrv
|
0 |
cadabullos
|
0 |
cavalier_anonyme
|
0 |
ReachAMY
|
0 |
pravesh_tiwari
|
0 |
| Participant | Rating |
|---|
-
powerpuff AI Blitz XView
-
teamux NeurIPS 2021 - The NetHack ChallengeView
-
tempteam NeurIPS 2022 IGLU ChallengeView
-
testing Sound Demixing Challenge 2023View
-
grogu HackAPrompt 2023View
-
apollo11 MosquitoAlert Challenge 2023View
-
testteam Commonsense Persona-Grounded Dialogue Challenge 2023View
-
temp-team Generative Interior Design Challenge 2024View
Orak Game Agent Challenge
When can we use the starter kit?
4 days agoHello @seokwoo_song
The starter kit can be found here: AIcrowd / Challenges / Orak Game Agent Challenge 2025 / orak-2025-starter-kit ยท GitLab
All the best!
When will the sponsored Brev credits be made available?
4 days agoHello @inchangbaek,
Details on the credits and how to claim them will be announced soon. Thank you for your patience and for participating in this challenge.
๐ฌ Feedback & Suggestions
14 days agoWe are constantly trying to improve this challenge for you and would appreciate any feedback you might have! ![]()
Please reply to this thread with your suggestions and feedback on making the challenge better for you!
- What have been your major pain points so far?
- What would you like to see improved?
All The Best!
๐ฅ Looking for teammates?
14 days agoSolving challenges is more fun with a team!
Introduce yourself here, and find others who are looking to team up! ![]()
- Introduce yourself and share a bit about your background.
- What brings you to this challenge?
Team registration deadline: 12 Dec 2025
All the best!
Flextrack Challenge 2025
๐ Final Results & Next Steps
About 1 month agoOpen-Source License
About 2 months agoHello @slimmer
MIT license would also be fine. Please use either Apache 2.0 or MIT license.
Important Update
About 2 months ago
Update (October 10th, 2025)
This announcement has been superseded by a newer post:
Phase 2 โ We hear you ! Hereโs how weโre updating the plan
Please refer to the latest update for the final Phase 2 format, dataset, and scoring details.
(This original announcement remains for archival reference only.)
Please note some key updates about the challenge:
Phase 1 (Competition Phase)
- Ends on Sunday, 19 October 2025.
- Includes submission of the Solution Documentation.
Phase 2
- Expected to open between 20 and 22 October 2025, 23:59 UTC.
- The Solution Documentation deadline is extended to this window.
Phase 2 Format
- Multiple one-year datasets will be sliced into shorter, equal-length context windows.
- Slices will include a mix of previously seen and new sites.
- In Phase 2, participants will make a single prediction per time series. This is to prevent look-ahead.
- Phase 2 is designed to ensure fairness and reward models that can generalise, are context-aware, and are transferable across sites.
Scoring and Final Ranking
- The final ranking will factor in two components with equal weight:
- the score from the Phase 1 private dataset (SiteF), and
- the score from Phase 2.
- Within Phase 2, scores will be the equal-weighted average across all slices and sites.
Scoring Code and Starter Kit
- We will publish the scoring script and a minimal code sample in the starter kit.
- These materials will be available when Phase 2 opens.
Challenge Description Clarification
- Participants must predict each timestamp tโ using only inputs where t_input โค tโ.
- Training data includes ground-truth time series and demand response flags.
- Models should learn consumption patterns both when demand response is inactive and when it is active.
Live Q&A with Challenge Organisers ๐ Join the Townhall on Sept 22!
2 months agoHello, The recording and slides are shared here: ๐น Townhall Recording & Q&A with Challenge Organisers | How to use digital twin data to predict demand response capacity
Clarification on Future Data and Back Cast
2 months ago๐น Townhall Recording & Q&A with Challenge Organisers | How to use digital twin data to predict demand response capacity
2 months agoHi Ryan, Thanks for flagging this. The end date in the pill on challenge banner is now fixed.
๐น Townhall Recording & Q&A with Challenge Organisers | How to use digital twin data to predict demand response capacity
2 months agoEach participant can only make one submission.
- If you are competing as part of a team, your team submits one entry total.
- If you are competing individually, you submit one entry on your own.
๐น Townhall Recording & Q&A with Challenge Organisers | How to use digital twin data to predict demand response capacity
2 months agoHello all,
Thank you to everyone who joined FlexTrack Challenge 2025 Townhall #1. In this session, we introduced the challenge and explored how digital twin data from commercial buildings can be used to forecast demand response flags and capacities.
If you missed it, you can catch up here:
Watch the recording: https://youtu.be/oKBcMxAQ3vg
Download the slides: Flextrack Townhall โ Google Drive
Highlights from the session:
- Context on the evolving role of prosumers, batteries, and smart devices in the National Electricity Market (NEM)
- Overview of the synthetic dataset, generated from digital twins of commercial office buildings
- Key variables like weather, HVAC setpoints, internal loads, and demand response flags
- Approach to building generalizable, context-aware models that work across multiple building sites
- Details on evaluation metrics, submission format, and documentation requirements
- Q&A on temporal modelling, model transferability, and real-world use cases for aggregators and VPP operators
- New! Top teams will have the chance to co-author a research publication based on their solutions
- Synthetic data design: sites modeled across different Australian climate zones for realistic diversity
- Final submission reminder: competition phase ends October 19, 2025, with one combined CSV per team
If you have questions or ideas to share, drop them in the comments below so the community and organisers can help.
Team FlexTrack
Live Q&A with Challenge Organisers ๐ Join the Townhall on Sept 22!
2 months agoWe will share the recording in the next day or two!
New to the challenge - participation rules
3 months agoHi Priya,
The competition phase is open to all!
All the best
Why does it show that the competition round is completed?
3 months agoHi @priya12
Yes, you can directly participate in Phase 2.
Live Q&A with Challenge Organisers ๐ Join the Townhall on Sept 22!
3 months agoHello everyone,
As Round 2 of the Flextrack Challenge 2025 kicks off, weโre excited to invite you to the first Flextrack Townhall! This session offers an opportunity to engage with the organisers, gain valuable insights into the challenge, and get your questions answered. Prepare to boost your Round 2 submissions and refine your strategies!
Date: 22nd September 2025 (Monday)
Time: 12:00 PM AEST
Zoom Link: Join the Townhall
Canโt make it? Donโt worry! A recording will be available after the event. You can also drop your questions in the comments for the organisers, and theyโll be addressed during the town hall.
What to Expect:
- Overview of the challenge and problem statement
- Task 1: Classification (Warm-Up Phase)
- Task 2: Regression (Competition Phase)
- Live Q&A session
Panellists
This townhall features leading experts:
-
Dr Emily Yap, Research Fellow, University of Wollongongโs Sustainable Buildings Research Centre
Emilyโs work involves exploring and developing digital tools including IoT, digital twins, and immersive technologies like virtual and augmented reality to improve energy performance and decarbonisation in both buildings and primary industries. With a background in materials science and applied sensing, she brings a systems-thinking approach to solving complex sustainability challenges through practical, data-informed solutions across diverse environments. -
Matt Amos, Senior Data Scientist, Commonwealth Scientific and Industrial Research Organisation (CSIRO)
Mattโs work focuses on supporting the digitisation of buildings to improve sustainable practices of energy use and to reduce related costs. He is part of the development team behind CSIROโs Data Clearing House digital buildings platform and leads CSIROโs NSW Digital Infrastructure for Energy Flexibility project.
Mark your calendars, prepare questions, and join us live for this event.
Looking forward to seeing you there!
Team AICrowd
Commonsense Persona-Grounded Dialogue Chall-0431ae
๐ Winners & Call for Paper
3 months agoAn update on Task 1 API Track Winners
3 months agoDear Participants,
Due to an error on AIcrowdโs end for the final compilation of results, the standings and announced winners for the Task 1 API track are invalid and not final. All other tasks and track results remain accurate and unchanged.
We have reviewed and corrected all submissions to ensure the final standings for this track are accurate and fair. The updated winners for Task 1 API are as follows:
Task 1 API โ Final Rankings
| Rank | Team | Automatic Score |
|---|---|---|
|
|
@nicholas_liu | 0.575 |
|
|
@MSRA_SC | 0.572 |
|
|
@TU_Character_lab | 0.563 |
Teams directly impacted by this change will be contacted by AIcrowd through their registered email addresses.
We sincerely apologise for the confusion and inconvenience this issue has caused. Thank you for your patience and understanding as we work to ensure accuracy and fairness in the final results.
Team AIcrowd

Questions about tracks and credits
4 days agoHello!