Global Chess Challenge 2025
Challenge Rules
AWS Trainium Challenge 2025:
Official Rules
**NO PURCHASE NECESSARY TO PARTICIPATE. VOID WHERE PROHIBITED. \ ** By participating in this Challenge, you agree to abide by these Official Rules and all decisions of the Organizers.
Definitions
Sponsor: Amazon Web Services, Inc. (“AWS”), providing financial and brand sponsorship. AWS is not responsible for the organization or execution of the Challenge. \ Organizer: AGI House, responsible for administering, managing, and operating the Challenge. \ Platform Provider: AIcrowd SA, responsible for providing the online platform and leaderboard infrastructure. AIcrowd is not an Organizer or Sponsor.
Overview
The AWS Trainium Challenge 2025 ("Challenge") is a global hybrid competition organized by AGI House and sponsored by Amazon Web Services (AWS). Infrastructure and leaderboard services are provided by AIcrowd.
The competition focuses on optimizing performance and reasoning on Amazon Trainium chips using LLAMA-8B. The goal is to build a text-only setup where models play legal chess moves and explain them succinctly, with Stockfish as the objective verifier and large public chess corpora providing scale.
Sponsor
Amazon Web Services, Inc., 410 Terry Ave North, Seattle, WA 98109. \ AWS is the sponsor of the Challenge and is not responsible for the administration or organization of the event.
Organizers & Admins
AGI House is the sole Organizer of the AWS Trainium Challenge 2025. “Organizer Admins” are companies or entities authorized by the Organizer to support execution of the Challenge. For this event, this includes AIcrowd SA as a platform and evaluation infrastructure provider. Organizer Admins are not responsible for the overall organization or legal administration of the Challenge.
Timeline, Website & Registration
- Launch & Registration Opens: AWS re:Invent, December 4, 2025
- Round 1 Submissions Close: December 31
- Winners Announced: TBD
Note: Details regarding the submission process, including formatting, upload method, and platform requirements, will be added once the technical infrastructure has been finalized.
Challenge Evaluation
**Overview \ **All submitted models will be evaluated in a tournament-style format designed to assess performance, consistency, and strategic quality. Each model will compete against other participant models under standardized match conditions determined by the organizers.
**Evaluation Criteria \ **Models will be assessed using a combination of quantitative and qualitative metrics, including but not limited to:
Each submitted model will play 12 games against Stockfish Level 0 (Depth 5) and 12 games against Stockfish Level 1 (Depth 5), and 12 games with a Random Agent,
- Primary Score - Average Centipawn Loss (ACPL) as calculated against Stockfish Level 20 Depth 20 across
- Secondary Score: **Win Rate as **calculated across all the games.
All models will be run in a controlled environment with identical computational resources and time constraints. Final rankings will be determined based on cumulative performance scores and qualitative assessments of innovation. In the event of a tie, organizers may apply additional tie-breaking criteria such as head-to-head record or performance against a baseline reference model.
The organizers reserve the right to modify the evaluation procedure, weighting of criteria, or format of matches at any time to ensure fairness and technical integrity.
AWS Credits and Infrastructure
Participants will receive AWS credits to support their development and testing on Trainium infrastructure. If you exhaust your allocated credits during the competition, you may request additional credits by contacting the organizers through official Challenge channels. Credit allocation decisions will be made at the Organizer's discretion based on availability and demonstrated need.
Am I Eligible to Enter the Challenge?
You are eligible if you (and each team member):
- Are an individual,
- Are 18 years or older (and of majority age in your jurisdiction),
- Have Internet access, an email account, and a personal computer.
Residents of the following are not eligible for cash prizes: Crimea region of Ukraine, Cuba, Iran, North Korea, Sudan, Syria, Quebec (Canada), Brazil, Italy, Russian Federation.
Note: Residents of these regions may still participate and retain their leaderboard rank. Any cash prizes will be passed to the next eligible team.
You are responsible for reviewing and understanding your employer's and country's policies regarding participation. If you participate in violation of these policies, you and your Entry may be disqualified. Organizers disclaim liability for disputes arising between employers and employees or countries and residents.
If your employer has rights to intellectual property created during employment, you must confirm you have received authorization to participate, or that your employer has waived such rights for this Challenge. Organizers may request written confirmation.
Is the Entry an Eligible Entry?
The Entry MUST:
- be compatible with the official submission format;
- be in English;
- be the Team's own original work;
- not have been submitted previously in any promotion of any kind;
- not contain inappropriate, offensive, defamatory, or illegal content, or content that promotes discrimination, violence, or infringes on others' rights
And The Team members MUST:
- Obtain all necessary consents, approvals, and licenses for submission
- Obtain agreement from all team members to these Rules
- Not engage in false, fraudulent, or deceptive acts
- Not tamper with or abuse any aspect of the Challenge
- Ensure transparency in the use of external data sources with clear documentation
- Ensure external datasets are permissible for non-commercial or academic research and comply with licensing terms
- Provide rationale for any declared datasets
Violation of these terms, especially misuse of test data, will result in immediate disqualification.
Submission Timeouts and Infrastructure Requirements
Submissions will be evaluated on a standard hardware configuration disclosed prior to the deadline. Each submission will be subject to strict runtime limits per task, including per-turn timeouts in interactive tasks. Submissions that exceed time limits or fail to execute on the designated infrastructure may be marked as failed. Participants are responsible for ensuring compatibility with evaluation hardware and accounting for resource constraints such as memory limits and disabled external network access.
Submission Limits and Documentation
- Participants may submit up to 5/10 TBD entries per day
- Winning solutions must be validated for compliance with allowed data and thoroughly documented
- Minimum performance thresholds must be met to qualify for prizes
- Winners must submit a comprehensive solution report for publication on a platform designated by organizers
Disqualification
If you, any Team member, or the Entry is found to be ineligible for any reason—including but not limited to conflicts within Teams, noncompliance with these Rules, or tampering with submissions—the Organizer reserves the right to disqualify the Entry and/or you and/or your Team members from this Challenge. A participant may not create or use multiple accounts to participate. Violating this rule will result in immediate disqualification and potential exclusion from future events. The Organizer also reserves the right to disqualif