
Organization
Location
Badges
Activity
Challenge Categories
Challenges Entered
Improve RAG with Real-World Benchmarks | KDD Cup 2025
Latest submissions
Status | Submission ID
---|---
graded | 289797
graded | 289788
graded | 289778
Improve RAG with Real-World Benchmarks
Latest submissions
Status | Submission ID
---|---
graded | 267130
graded | 267129
graded | 267099
Amazon KDD Cup 2022
Latest submissions
Testing RAG Systems with Limited Web Pages
Latest submissions
Status | Submission ID
---|---
graded | 266952
graded | 266951
graded | 266273
Enhance RAG systems With Multiple Web Sources & Mock API
Latest submissions
Status | Submission ID
---|---
graded | 267130
graded | 267129
failed | 266263
Generating answers using image-linked data
Latest submissions
Status | Submission ID
---|---
graded | 289797
graded | 289693
graded | 289626
Meta CRAG - MM Challenge 2025

Why did 289384, 289471 fail?
6 days ago
It has already reached 100%. And 289697 shows "Step has exceeded its deadline".

Why did 289428 fail?
7 days ago
ConnectionError: (MaxRetryError('HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/datasets/crag-mm-2025/crag-mm-single-turn-debug-private/revision/b5ff0aaa05fab0256d77682b4b7da582c0660a6b (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f7f00af3e50>: Failed to resolve 'huggingface.co' ([Errno -3] Temporary failure in name resolution)"))'), '(Request ID: 7c73f288-c699-438b-9794-be08cad15999)') Check the submission page for more details.

Suggestion: Make Evaluation Prompts More Flexible
26 days ago
Moreover, I believe the evaluation prompt should be made public. Anyone who wants to "hack" the prompt does not actually need to know its exact content; keeping it secret only widens the gap between local testing and server-side evaluation results.

Suggestion: Make Evaluation Prompts More Flexible
26 days ago
I think the current evaluation prompt is too strict: it pushes everyone to answer "I don't know" frequently just to keep their score above 0. In reality, many answers could be considered partially correct, and human evaluators would take that into account. Under the current setup, however, the top-10 models do not attempt partially correct answers, and they might actually perform worse in human evaluation than strategies scoring below 0 that do. Yet those strategies never even reach human review. I suggest the organizers relax the evaluation prompt to at least allow some score differentiation.

Why did Submission #285113 fail?
About 1 month ago
Evaluation failed with exit code 1. I would like to be able to see the error message.

📢 Dataset Release: CRAG-MM v0.1.1 🚀
2 months ago
In the current CragImageKG file (…/cragmm_search/image_search_mock_api/image_kg.py, cragmm-search-pipeline==0.2.10), the field read in the get_image_url function should be img_url; otherwise it causes an error.
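A minimal sketch of the kind of field-name fix described above. The class shape, constructor, and record layout here are assumptions for illustration; the actual CragImageKG in cragmm-search-pipeline may differ.

```python
# Hypothetical sketch of the reported fix: get_image_url must read the
# "img_url" field of a KG record. Everything else here (constructor,
# record layout) is assumed for illustration only.

class CragImageKG:
    def __init__(self, entries: dict):
        # entries maps an image id to a record dict loaded from the KG file
        self.entries = entries

    def get_image_url(self, image_id: str) -> str:
        record = self.entries[image_id]
        # The KG records store the link under "img_url"; reading any other
        # field name (e.g. "image_url") would raise a KeyError.
        return record["img_url"]


kg = CragImageKG({"img-001": {"img_url": "https://example.com/cat.jpg"}})
print(kg.get_image_url("img-001"))  # https://example.com/cat.jpg
```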

📢 Dataset Release: CRAG-MM v0.1.1 🚀
2 months ago
The current rag-agent does not differentiate between task1 and task2. How should UnifiedSearchPipeline be used specifically for task1?
Meta Comprehensive RAG Benchmark: KDD Cup 2024

How exactly is the ten-submissions-per-week limit counted?
About 1 year ago
From my testing, if an error occurs in the build environment, the submission is not deducted from the quota, but it is still recorded.

‼️ ⏰ Select Submission ID before 20th June, 2024 23:55 UTC
About 1 year ago
(post deleted by author)

🚨 IMP: Phase 2 Announcement
About 1 year ago
Same problem here. I also emailed help@aicrowd.com but got no response.

Has phase-2 started?
About 1 year ago
I don't know. I saw some successful submissions, so I tried submitting as well; it received a score, but the leaderboard did not update.

Has phase-2 started?
About 1 year ago
Hi, I have the same question and have not received any message either.

Submission failed
About 1 year ago
Submission failed: "You have exceeded the allowed number of parallel submissions. Please wait until your other submission(s) are graded."
I have no other submissions in progress, yet it still failed.
Meta KDD Cup 24 - CRAG - Retrieval Summarization

About Test Set Leakage in Round 1
About 1 year ago
In fact, the Round 1 test set is the dataset that was given to us, so there is no leakage problem.
Why did 289384, 289471 fail?
4 days ago
Will you consider resubmitting 289697? @yilun_jin8 @jyotish