Activity
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Mon
Wed
Fri
Challenge Categories
Loading...
Challenges Entered
Train LLMs to Play Chess
Latest submissions
Build an LLM agent for five real-world games
Latest submissions
Create Context-Aware, Dynamic, and Immersive In-Game Dialogue
Latest submissions
See All| graded | 291799 | ||
| graded | 291798 | ||
| graded | 291797 |
Improve RAG with Real-World Benchmarks | KDD Cup 2025
Latest submissions
Latest submissions
Build Context-Aware Conversational NPC Agents
Latest submissions
See All| graded | 291799 | ||
| graded | 291797 | ||
| graded | 291794 |
Task-Oriented Conversational AI for NPC Agents
Latest submissions
See All| graded | 291798 | ||
| graded | 291796 | ||
| graded | 291793 |
| Participant | Rating |
|---|
| Participant | Rating |
|---|
-
zvers Commonsense Persona-Grounded Dialogue Challenge 2025View
-
ThinkingFace[o_O] Orak Game Agent Challenge 2025View
Commonsense Persona-Grounded Dialogue Chall-0431ae
dedicated to win :)

๐ Winners & Call for Paper
4 months agoHi Sneha @snehananavati and AIcrowd Team,
Thanks a lot for sharing the winners list!
I had one clarification: the
Task2 Human Rankcolumn in theTask 3 GPUtable seems different from theResponse Rank / Knowledge Rankfigures shown in theTask 2 GPUtable. Could you confirm if this is indeed a separate evaluation (specific to Task 3 finalists), and explain how it relates to the Task 2 rankings?Thanks in advance!
MN