AIcrowd | CAPTCHA | Challenges

Blitz : Completed #educational Weight: 10.0

AIcrowd

9272

400

🛠 Contribute: Found a typo? Or any other change in the description that you would like to see? Please consider sending us a pull request in the public repo of the challenge here.

🕵️ Introduction

You want to sign up for a new exciting web service and you’re almost done but the pesky CAPTCHA form has popped up and is now delaying you. If you have been annoyed by the CAPTCHA we feel you! This challenge is to get even -- create your own ML model that will identify values correctly and you may never have to solve a CAPTCHA form again.

For this puzzle, you will be given a dataset of CAPTCHA images created specifically for this challenge. The training set consists of two columns [filename] and [label] . Your task is to predict the captchas with the proper case and save them in a csv with labels filename and label for the test set.

Understand with code! Here is getting started code for you.😄

💾 Dataset

The CAPCHA dataset is a in-house created dataset designed specifically for this challenge. The train set and test sets consist of images of various types of captcha. In the train set you are also given a train.csv which has two columns filename and label. The captchas are a mix of capital letters, lowercase letters and numbers; representative of a real life captcha.

You are to predict the captchas with the proper case and save them in a csv with labels filename and label for the test set.

📁 Files

Following files are available in the resources section:

train_info.csv - (10,000 samples) This csv file contains the labels for each of the images in the train.zip .
train.tar.gz - (10,000 samples) This zip file contatins all the train images on which you can train your model on.
test_info.csv - (5,000 samples) This csv file contains the filename for each of the images in the test.zip for which labels needs to be predicted.
test.tar.gz - (5,000 samples) The tar file contains all the test images, on which the actual evaluation will take place.

🚀 Submission

Prepare a CSV containing filename and label and predicted the string that is in the captcha with proper case.
Name of the above file should be submission.csv.
Sample submission format available at sample_submission.csv in the resorces section.

Make your first submission here 🚀 !!

🖊 Evaluation Criteria

During evaluation mean over normalised Levenshtein Similarity Score will be used to test the efficiency of the model.

The score of only 60% of the test data will be revealed during the competition.

🕵️ Introduction

💾 Dataset

📁 Files

🚀 Submission

🖊 Evaluation Criteria

🔗 Links

📱 Contact

Getting Started