AIcrowd | sigma_g | Participants


sigma_g


Challenges Entered

Completed

AI Blitz #6

AIcrowd

5 Problems 21 Days. Can you solve it all?

Latest submissions


graded	124481	Wed, 3 Mar 2021 13:26:50
graded	122093	Wed, 17 Feb 2021 18:37:24
graded	122080	Wed, 17 Feb 2021 16:57:26

Completed

AI Blitz 5 ⚡

AIcrowd

5 Puzzles, 3 Weeks | Can you solve them all?

Latest submissions

No submissions made in this challenge.

Practice

Latest submissions


graded	67522	Thu, 21 May 2020 15:56:01
graded	67521	Thu, 21 May 2020 15:51:03
failed	67519	Thu, 21 May 2020 15:09:19

Completed

AIcrowd Blitz - May 2020

AIcrowd

5 Problems 15 Days. Can you solve it all?

Latest submissions


graded	66745	Fri, 15 May 2020 13:54:25
graded	65264	Mon, 11 May 2020 03:56:00

Completed

ICCV 2019: Learning-to-Drive Challenge

Computer Vision Lab - ETH Zurich

Imitation Learning for Autonomous Driving

Latest submissions

No submissions made in this challenge.

Completed

AI for Good - AI Blitz #3

AIcrowd

AI for Good - ITU

5 PROBLEMS 3 WEEKS. CAN YOU SOLVE THEM ALL?

Latest submissions


graded	80476	Fri, 4 Sep 2020 08:36:57
graded	80473	Fri, 4 Sep 2020 08:34:10
graded	80472	Fri, 4 Sep 2020 08:31:40

Completed

Latest submissions

No submissions made in this challenge.

Participant	Rating
bhuvanesh_sridharan	0

Participant	Rating

BayesianMechanics AIcrowd Blitz - May 2020
BayesianMechanics AI for Good - AI Blitz #3
BayesianMechanics AI Blitz #6

AI Blitz #6

About the new datasets for WinPrediction

About 5 years ago

Hi, I think it does not matter whether these game positions came from real human players, grandmasters, or even the TCEC. Given a board position and the side to move, Stockfish 12+ gives one clear, unique evaluation: the evaluation that assumes best play from both sides.

In such positions, the win prediction has to assume best play from both sides. We cannot assume human play because it is irregular: the player could be rated 1200 Elo or 2100 Elo, and we have no way to account for that. Even a 2100 Elo player can have a bad day and perform 100 rating points below their usual level.

Having established that there is one unique answer, we come back to the position pictured above, and to another position in this post, to point out that the dataset contains labels that contradict what Stockfish gives when it evaluates the position. This is not rare: in the first 100 training samples, we observed 20 with the opposite win prediction. Even if we assume our OCR is wrong on half of them, that is still a 10% error rate in the training dataset.
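The arithmetic behind that 10% figure can be written out as a short Python sketch. The function name and its parameters are made up for illustration, and the 50% OCR fault share is the assumption stated in the post, not a measured value.

```python
def estimated_label_error_rate(samples_checked, disagreements, ocr_fault_fraction):
    """Estimate the share of checked samples whose dataset label is wrong.

    samples_checked    -- number of training samples compared against the engine
    disagreements      -- samples where the dataset label opposes the engine
    ocr_fault_fraction -- assumed share of disagreements caused by our own OCR
    """
    if samples_checked <= 0:
        raise ValueError("samples_checked must be positive")
    if not 0.0 <= ocr_fault_fraction <= 1.0:
        raise ValueError("ocr_fault_fraction must be between 0 and 1")
    # Only disagreements that are not explained by OCR mistakes count
    # against the dataset itself.
    genuine_disagreements = disagreements * (1.0 - ocr_fault_fraction)
    return genuine_disagreements / samples_checked


# Numbers from the post: 100 samples, 20 disagreements, half blamed on OCR.
rate = estimated_label_error_rate(100, 20, 0.5)
print(f"{rate:.0%}")
```

With no discount for OCR (a fault fraction of 0), the same call gives the raw 20% disagreement rate.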

Another issue is that not all positions are a few moves before checkmate, as the problem statement on the main page says. Several positions are already mated, so it makes no sense to say whose turn it is. On the other hand, several positions are far from mate: as you can see in the linked post, the evaluation is a meagre +3 or so. Any position near checkmate, however, will certainly get a ±Mx evaluation from Stockfish, which means mate in x moves by either White or Black.
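One way to separate the three kinds of position described above is to look at the `score` field that a UCI engine such as Stockfish prints in its `info` lines. The sketch below is a minimal illustration, not the challenge's own tooling: the function name and category labels are made up, and it assumes the engine reports `score mate 0` for a position that is already checkmate.

```python
import re

# UCI engines report either "score cp <centipawns>" or "score mate <moves>".
_SCORE_RE = re.compile(r"\bscore (cp|mate) (-?\d+)\b")


def classify_uci_score(info_line):
    """Return (category, value) for a single UCI "info" line.

    The score is given from the point of view of the side to move, so a
    negative mate value means the side to move is the one being mated.
    """
    match = _SCORE_RE.search(info_line)
    if match is None:
        raise ValueError("no score field found in line")
    kind, value = match.group(1), int(match.group(2))
    if kind == "cp":
        # A centipawn score means the engine found no forced mate.
        return ("no_forced_mate", value)
    if value == 0:
        # Assumption: "mate 0" is how the engine marks a finished game.
        return ("already_mated", 0)
    return ("forced_mate", value)


print(classify_uci_score("info depth 20 score cp 300 nodes 1000"))
print(classify_uci_score("info depth 12 score mate -3 nodes 500"))
print(classify_uci_score("info depth 0 score mate 0"))
```

Under this reading, the roughly +3 position from the linked post would fall into `no_forced_mate` (about 300 centipawns), which is the case the post argues should not appear in a dataset described as a few moves before checkmate.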

Let me know if any part is unclear and I will explain it again. I hope that, if the dataset is revised once more, these issues are taken care of, because as it stands it is almost impossible to submit a better score if we follow standard chess evaluation metrics.


Challenges Entered

AI Blitz #6

Latest submissions

AI Blitz 5 ⚡

Latest submissions

MNIST

Latest submissions

AIcrowd Blitz - May 2020

Latest submissions

ICCV 2019: Learning-to-Drive Challenge

Latest submissions

AI for Good - AI Blitz #3

Latest submissions

ML Battleground

Latest submissions
