Online News Prediction


🛠 Contribute: Found a typo, or is there any other change to the description that you would like to see? Please consider sending us a pull request in the public repo of the challenge.

đŸ•ĩī¸ Introduction

What does it take to go viral on social media? We give you the features of news articles; your task is to predict the number of shares each article gets on social platforms.

Understand with code! Here is some getting-started code for you. 😄

💾 Dataset

The dataset contains 61 columns. The goal is to predict shares; 58 of the remaining columns are used for prediction. Some of the columns are as follows:

  • n_tokens_title - Number of words in the title
  • num_imgs - Number of images present in the blog
  • average_token_length - Average length of the words in the content

More information is available in dataset_info.txt.


The following files can be found in the resources section:

  • train.csv - (26715 samples) This CSV file contains the attributes describing each blog post, along with its share count.
  • test.csv - (13082 samples) This file will be used for the actual evaluation for the leaderboard score.
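As a rough illustration of how the training file might be split into features and target, here is a minimal sketch that uses only the Python standard library. The function name split_features_target is hypothetical, and the target column name "shares" is an assumption: check the header of train.csv and dataset_info.txt for the exact spelling.

```python
import csv


def split_features_target(lines, target="shares"):
    """Split CSV lines into (features, targets).

    `lines` is any iterable of CSV text lines, such as an open file.
    `target` is an ASSUMED column name; adjust it to match train.csv.
    """
    # skipinitialspace tolerates headers written as "a, b, c".
    reader = csv.DictReader(lines, skipinitialspace=True)
    features, targets = [], []
    for row in reader:
        # Normalise column names in case they carry stray whitespace.
        row = {name.strip(): value for name, value in row.items()}
        targets.append(float(row.pop(target)))
        features.append(row)  # remaining columns, still as strings
    return features, targets
```

With the real data this could be called as split_features_target(open("train.csv", newline="")). The feature values are left as strings so that non-numeric columns, if any, can be handled separately.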

🚀 Submission

  • Prepare a CSV file with the header share, where each row contains the predicted number of shares the corresponding article will receive.
  • The file should be named submission.csv.
  • A sample submission format is available in sample_submission.csv in the resources section.
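The submission format described above could be produced along these lines. This is a sketch under assumptions: the helper name format_submission is hypothetical, and it assumes one prediction per row of test.csv, in the same order, with no index column. Compare the result against sample_submission.csv before uploading.

```python
import csv
import io


def format_submission(predictions, header="share"):
    """Return CSV text with a single column of predictions.

    Assumes one prediction per test row, in test.csv order.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([header])
    for value in predictions:
        writer.writerow([value])
    return buffer.getvalue()
```

The returned text can then be written to disk, for example with open("submission.csv", "w", newline="").write(format_submission(predictions)).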

Make your first submission here 🚀!

🖊 Evaluation Criteria

Submissions are evaluated using MAE (Mean Absolute Error) as the primary score.

The secondary score is RMSE (Root Mean Squared Error).
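To check a model locally before submitting, both metrics can be computed on a held-out part of train.csv. The sketch below uses the standard definitions of MAE and RMSE; the function names are illustrative, and the exact evaluation code used by the leaderboard is not shown in this description.

```python
import math


def mean_absolute_error(y_true, y_pred):
    """Average of |true - predicted| over all samples."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def root_mean_squared_error(y_true, y_pred):
    """Square root of the average squared error over all samples."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)
```

Because RMSE squares each error before averaging, a few badly mispredicted viral articles raise it much more than they raise MAE, so the two scores can rank models differently.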

📱 Contact

Pulkit Gera

📚 References

  • K. Fernandes, P. Vinagre and P. Cortez. A Proactive Intelligent Decision Support System for Predicting the Popularity of Online News. Proceedings of the 17th EPIA 2015 - Portuguese Conference on Artificial Intelligence, September, Coimbra, Portugal.


