Activity
Challenge Categories
Challenges Entered
Improve RAG with Real-World Benchmarks | KDD Cup 2025
Latest submissions
graded | 284893
graded | 284834
graded | 282712
Revolutionise E-Commerce with LLM!
Latest submissions
graded | 270741
graded | 270740
graded | 270655
- NVIDIA-Merlin | Amazon KDD Cup '23: Multilingual Recommendation Challenge
- Team_NVIDIA | Amazon KDD Cup 2024: Multi-Task Online Shopping Challenge for LLMs
Meta CRAG - MM Challenge 2025

When is Deadline to Team up?
9 days ago
I have two questions about teaming up:
- When is the deadline to team up? (In the “timeline” section of the website it says May 28th, and in the “participation and submission” section it says May 21st.)
- Can participants team up if their combined phase 2 submissions exceed 6? (The submission limit for phase 2 is “each team can make 6 total submissions across all three tracks”. If participant A has 6 phase 2 subs and participant B has 6 phase 2 subs, can they still team up? Because after teaming up, their team will have 12 phase 2 subs.)

During Submission How Do We Download Web Search URL?
14 days ago
One suggestion is that participants’ code cannot communicate directly with the internet (using wget, etc.). Instead, it would call an API provided by AIcrowd which fetches webpages. This ensures that participants’ code can only receive information from the internet and cannot submit information (i.e. the hidden test questions) to external websites.
If this is how it currently works, how do we call the API to fetch webpages?

During Submission How Do We Download Web Search URL?
15 days ago
The web search API docs here say that we need to download the result URLs ourselves:
Note: The Search APIs only return urls for images and webpages, instead of full contents. To get the full webpage contents and images, you will have to download it yourself. During the challenge, participants can assume that the connection to these urls are available.
During submission, do we need to do this? And how do we do it, given that internet access is turned off? Can we use wget on these websites?
Doesn’t this pose a risk that a participant who owns these websites could transfer all the test questions to their URL (using an HTTP GET or POST) during submission and receive all the hidden test questions (then hardcode the answers into future submissions)?
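For the local-validation setting at least, the “download it yourself” step the docs describe is just a plain HTTP fetch. A minimal sketch (the function name is mine; whether this is allowed inside the sandboxed submission environment is exactly the open question above):

```python
from urllib.request import urlopen

def download_page(url: str, timeout: float = 30.0) -> str:
    """Fetch the full contents of a URL returned by the search API."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```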

Questions about the leaderboard
19 days ago
Thanks for improving the leaderboard. There are still 5 things wrong with it. Can you please fix the following 5 things? Thanks.
1. Multi-source Augmentation (task 2) ranking is determined by “Ego Samples” when it should be determined by “All Samples”. Furthermore, when we click to see the multi-source augmentation LB, we should see “All Samples” first by default.
2. Multi-source Augmentation (task 2) ranking is sorted by “Accuracy” when it should be sorted by “Truthfulness”.
3. Multi-turn QA (task 3) ranking is determined by “Ego Samples” when it should be determined by “All Samples”. Furthermore, when we click to see the multi-turn QA LB, we should see “All Samples” first by default.
4. Multi-turn QA (task 3) “Ego Samples” is displaying all scores as NaN.
5. The top score on Single-Source Augmentation (task 1) incorrectly computes truthfulness as 0.889 when their hallucination rate is 0.219 and accuracy is 0.108 (i.e. their truthfulness should be -0.111; other teams’ truthfulness scores were updated yesterday but this score was not).
Thanks for fixing these 5 issues!
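If truthfulness here means accuracy minus hallucination rate (my reading of the scores quoted above, not an official formula), the arithmetic for issue 5 is:

```python
def truthfulness(accuracy: float, hallucination: float) -> float:
    # Assumed scoring rule: truthfulness = accuracy - hallucination rate.
    return accuracy - hallucination

# The top team's quoted scores: accuracy 0.108, hallucination rate 0.219.
print(round(truthfulness(0.108, 0.219), 3))  # -0.111, not the 0.889 shown on the LB
```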

Does "search_pipeline" source change during LB submission
22 days ago
I have noticed that you just (14 hours ago) updated the HuggingFace web search vector database from 113k entries to 647k entries. Is the new database similar to the LB database?
For us to tune our models during local validation, we need a local validation database similar to what our models will see during LB submission. Is the current (newly updated) web search database similar to the LB database? And is the image search validation database similar to the LB image search database?
=========
Let me clarify my question. (1) For validation, we have 647k entries in the web search database to help us answer 1548 validation queries, i.e. about 418 database entries per validation question. Is this the same ratio that our models will see during LB submission web search?
(2) Furthermore, a certain percentage of validation queries have their answer contained inside the web search vector database (with the rest of the vector database being noise). During LB submission, does the same percentage of answers and noise exist in the LB vector database?
And lastly, can you answer these 2 questions for image search? Thank you!
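The entries-per-query figure above is just arithmetic on the numbers quoted in the post:

```python
entries, queries = 647_000, 1_548
print(round(entries / queries))  # about 418 database entries per validation query
```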

Does "search_pipeline" source change during LB submission
29 days ago
In the old evaluation script, the agent defined the search pipeline as "crag-mm-2025/image-search-index-validation", which means that the same vector database is used for both local validation and LB submission.
I see the new starter kit changed this. My question is: does our submission use a different search pipeline, or does submission also use "crag-mm-2025/image-search-index-validation"?
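For anyone comparing the two starter kits, the question boils down to which index name the pipeline is constructed with. A stand-in sketch (the class is a placeholder, not the starter kit's real API; only the index string comes from the old evaluation script):

```python
VALIDATION_INDEX = "crag-mm-2025/image-search-index-validation"

class SearchPipeline:
    """Placeholder for whatever pipeline class the starter kit actually provides."""
    def __init__(self, index_name: str):
        self.index_name = index_name

# Old behavior: local validation and LB submission used the same index.
pipeline = SearchPipeline(VALIDATION_INDEX)
print(pipeline.index_name)  # crag-mm-2025/image-search-index-validation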


How to use private model
About 1 month ago
ChatGPT says we can add them as collaborators under Settings. What is AIcrowd’s HF username?

Where is the Starter Kit for submissions?
About 2 months ago
Hi everyone, I see that many teams have already submitted to the leaderboard. Where can I find the “For up to date instructions for your challenge, please refer to the starter kit provided by challenge organisers.” and “git clone ”?
Amazon KDD Cup 2024: Multi-Task Online Shopping Challenge for LLMs

Note for our final evaluation
10 months ago
@yilun_jin, sometimes the exact same code will succeed one time and fail another time. For example, we submitted the exact same code [here] and [here]. The first succeeded and the second failed. During the re-run, what happens if code fails that has previously succeeded? Will the admins run it a second time?
Also, can you tell us why the second link above failed?
When we select our final 2 submissions for each track, should we just select our best-scoring submission twice in case it fails the first time it is re-run?

All Submissions Are Failing
11 months ago
Our team’s last 6 submissions failed. And when I look at the list of submissions from the other teams in the past 4 hours, all other teams failed too. Is there a problem with the AIcrowd server?
Here are the links of our team’s last two failures [here] and [here]
Can an admin please investigate? Thank you.

Push gitlab and cannot find issue
11 months ago
The same thing has just happened to me. I have created 5 new tags. They all appear in my GitLab but none appear in my issues.
They are tags submission-200, submission-202, submission-203, submission-204, submission-205. Some are duplicates of each other because I tried submitting the same thing twice without success.
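For reference, the tag-based submission flow being described is just git tagging and pushing (the remote name aicrowd and the tag number are assumptions; use whatever your starter kit's README says):

```shell
# Create an annotated tag for the next submission attempt...
git tag -am "submission-206" submission-206
# ...and push it to the AIcrowd GitLab remote to trigger evaluation.
git push aicrowd submission-206
```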

All Submissions "Waiting In Queue" for 12 Hours
11 months ago
FYI, all submissions (from all teams) have been “waiting in queue” for the past 12 hours. Perhaps an admin can investigate. Thanks.

Submission stuck on "evaluation initiated"
11 months ago
The following two submissions [here] and [here] are stuck with the label “evaluation initiated” even though they have failed.
Can an admin switch the GitLab label to failed? As is, they are using up 2 submissions from our quota. Thanks.

Submission Failed - Please Tell Us When Submission Works Again
11 months ago
Yes, this is not fixed. I just submitted and got:
Submission failed : Failed to communicate with the grader. Please resubmit again in a few hours if this issue persists..
The GitLab issue is [here]
For the past 2 days, no team has been able to submit to track 5.
Please fix this issue and let us know when it is fixed. Thank you

Submissions fail
11 months ago
I am also seeing weird submission behavior today. I posted a discussion describing the errors I have been seeing today [here].

Submission Failed - Please Tell Us When Submission Works Again
11 months ago
Hi, for the past 4 hours I have been receiving “Submission failed : Failed to communicate with the grader. Please resubmit again in a few hours if this issue persists..” when submitting to track 5. An example GitLab issue (for admins to review) is [here].
I have tried 3 times and received 3 “failed” submissions. I do not want to try any more because I do not want to use up my failed-submission quota. Can an admin tell us when submissions are working for track 5 again? Thanks.

Track 2 LB Doesn't Show Retrieval Score
11 months ago
Hi, can admins @yilun_jin fix the track 2 leaderboard webpage to show each team’s retrieval score? Thank you.

Phase 2 launching!
12 months ago
I notice that the AIcrowd website says “Round 2: 21 days left”, which implies that phase 2 ends on June 15th. Is this the correct end of phase 2?
When is Deadline to Team up?
4 days ago
@yilun_jin8 @jyotish Hi, do either of you know the answers to these 2 questions?