Loading
Feedback
AI Blitz #9: 7 days left #educational #blitz Weight: 30.0
429
194
17
38

🌈 Welcome thread | πŸ‘₯ Looking for teammates? | πŸš€ Easy-2-Follow Code Notebooks

πŸ“ Don't forget to participate in the Community Contribution Prize!

AIcrowd Server

 

 Introduction

 

This is the most interesting puzzle in this Blitz! It utilizes your learnings from the previous puzzles. The challenge is simple, using a sound clip as an input can you accurately output only numbers mentioned in a text format? Feeling confused? Read on to know how to solve it.

Access an easy-to-use beginner-friendly code notebook over here.


πŸ’ͺ Getting Started

This puzzle will also use the DeepSpeech model for the purpose of sound prediction. The starter kit breaks down all the steps in a simple way. This notebook will guide you in installing, training, processing and predict the sound clips using Mozilla's DeepSpeech library.

Pretty simple, right? Check out the notebook and make your submission! 


πŸ’Ύ Dataset

In this dataset, you will need to predict what words were spoken in a voice. The sounds are in .wav the extension. The sample rate of the sound is 8000

 

SoundID label
0 Optimal design of
1 Poisson brackets with

 


πŸ“ Files

Following files are available in the resources section:

  • train.csv - (20000 samples) This CSV file contains a SoundID column as the file's name located in the corresponding folder name and a label column as the number spoked from the sound. 

  • train.zip - (20000 samples) This CSV file containing sound files as the name corresponding to the SoundID column of the corresponding CSV file name.  

  • val.csv - (2000 samples) This CSV file contains a SoundID column as the file's name located in the corresponding folder name and a label column as the number spoked from the sound. 

  • val.zip - (2000 samples) This CSV file containing sound files as the name corresponding to the SoundID column of the corresponding CSV file name.

  • test.csv - (5000 samples) This CSV file contains a SoundID column as the file's name located in the corresponding folder name and a label column as the number spoked from the sound. 

  • test.zip - (5000 samples) This CSV file containing sound files as the name corresponding to the SoundID column of the corresponding CSV file name.


πŸš€  Submission

  • Creating a submission directory
  • Use test.csv and fill the corresponding labels. 
  • Save the test.csv in the submission directory. The name of the above file should be submission.csv.
  • Inside a submission directory, put the .ipynb notebook from which you trained the model and made inference and save it as original_notebook.ipynb.

         Overall, this is what your submission directory should look like - 

submission
β”œβ”€β”€ submission.csv
└── original_notebook.ipynb
  • Zip the submission directory!

Make your first submission here πŸš€ !!


πŸ–Š Evaluation Criteria

During the evaluation, the Mean Word Error Rate across each text in ground truth and submission will be used to test the efficiency of the model. We are using wer function from jiwer python library to calculate word error rate. 

 


πŸ”— Links

πŸ“± Contact

Notebooks

See all
[Getting Started Code] Sound Prediction
By
Shubhamaicrowd
22 days ago
1