2 Follower
1 Following
liberifatali
Cless

Location

VN

Badges

3
2
1

Challenges Entered

Build an LLM agent for five real-world games

Latest submissions

No submissions made in this challenge.

Detecting Energy Flexibility in Buildings

Latest submissions

graded 300328
graded 300319
graded 300318

Improve RAG with Real-World Benchmarks

Latest submissions

graded 266335
graded 266193
graded 266168

A benchmark for image-based food recognition

Latest submissions

No submissions made in this challenge.

Latest submissions

graded 10024
graded 6995
failed 6992

5 Problems 15 Days. Can you solve it all?

Latest submissions

graded 74031
graded 73938
graded 73831

Latest submissions

No submissions made in this challenge.

A new benchmark for Artificial Intelligence (AI) research in Reinforcement Learning

Latest submissions

No submissions made in this challenge.

Real Time Mask Detection

Latest submissions

graded 73807
graded 73198
graded 72790

Testing RAG Systems with Limited Web Pages

Latest submissions

graded 266167
failed 266122
failed 265909

Evaluating RAG Systems With Mock KGs and APIs

Latest submissions

graded 266168
failed 266069
graded 266068

Enhance RAG systems With Multiple Web Sources & Mock API

Latest submissions

graded 266335
graded 266193
failed 266166

Create Videos with Spatially Aligned Stereo Audio

Latest submissions

No submissions made in this challenge.
Participant Rating
alchemi01 235
yusuf_dogu 0

Flextrack Challenge 2025

๐Ÿ† Final Results & Next Steps

About 1 month ago

Thank you for the update. I’m curious to see which models are causal.

Solution documentation

About 2 months ago

I have the same question

📹 Townhall Recording & Q&A with Challenge Organisers | How to use digital twin data to predict demand response capacity

2 months ago

I think ‘back-cast’ refers to the fact that these datasets were collected in the past. So now we cast predictions for previous events.

Test dataset v0.2 contains Site D, Site E, and Site F. My take is that all three sites are in the private test set, not only Site F.

Issue related to submission

3 months ago

Hi Tuan, you can check your submissions here: AIcrowd | Flextrack Challenge 2025 | Submissions

Meta Comprehensive RAG Benchmark: KDD Cup 2-9d1937

Did somebody try deploy three llama3-8b on four T4?

Over 1 year ago

There are 4 T4 cards. But I see that the total GPU memory is < 60GB.

Did somebody try deploy three llama3-8b on four T4?

Over 1 year ago

How large is each llama3-8b model? If it is 16GB each, it’s possible.
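
A rough back-of-the-envelope check of the numbers in this thread, as a minimal sketch. It assumes a round 8 billion parameters per model stored in 16-bit precision (2 bytes each), and it takes the 60 GB figure as the upper bound on total GPU memory mentioned above. The function name `weight_gb` is only illustrative. The estimate covers weights only and ignores the KV cache, activations, and framework overhead.

```python
def weight_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumptions (not confirmed by the thread): round parameter count, fp16 weights.
PARAMS_8B = 8e9
NUM_MODELS = 3
TOTAL_GPU_GB = 60.0  # upper bound on total memory reported for the four T4 cards

per_model = weight_gb(PARAMS_8B)            # weights of one model
total_weights = NUM_MODELS * per_model      # weights of all three models
headroom = TOTAL_GPU_GB - total_weights     # what is left for everything else

print(f"per model: {per_model:.1f} GB")
print(f"three models: {total_weights:.1f} GB")
print(f"headroom: {headroom:.1f} GB")
```

Under these assumptions the weights fit in aggregate, but the remaining headroom is small, and a single 16 GB copy would likely need to be sharded across cards or quantized rather than placed on one T4.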

Whether the task test phase can link to the Internet

Over 1 year ago

Internet access is available while building the Docker image, but not while submissions run.

The CRAG-Mock-API should move to the Meta Comprehensive RAG Benchmark starter kit project?

Over 1 year ago

The Dockerfile can be customized. It defines the environment in which the code runs on GitLab.

The CRAG-Mock-API should move to the Meta Comprehensive RAG Benchmark starter kit project?

Over 1 year ago

Look at crag_mock_api/apiwrapper/pycragapi.py to see how to use its functions.

The openai interface cannot be used during evaluation? Why?

Over 1 year ago

You need to pass an API key. There is also no internet access while the submission runs.

How can I delete my LFS?

Over 1 year ago

I had to delete the local repo to solve this.

When will the submission limit be reset?

Over 1 year ago

In the Submission tab of the main challenge page, we can see how many submissions are left this week.

Usage of GGUF and finetuned LLaMA Models

Over 1 year ago

I have the same question @mohanty

Meta KDD Cup 24 - CRAG - Retrieval Summarization

Failed to communicate with the grader

Over 1 year ago

I had the same issue. Is it fixed now? @aicrowd_team

Couldn't instantiate the backend tokenizer

Over 1 year ago

Have you tried installing the transformers package?

About Test Set Leakage in Round 1

Over 1 year ago

@aicrowd_team I suggest that the Round 2 test set should be truly private and not share any similarity or distribution with the data in Round 1.

Why am I not eligible to participate?

Over 1 year ago

I tried again and it still failed for the same reason.

Why am I not eligible to participate?

Over 1 year ago

I have the same issue.

CUDA out of memory issue

Over 1 year ago

What’s in the debug log?

A starfruit's worth, the golden prize. Three-span pouch holds, a fortune's rise.