Utilizing the Open Movie Data Base for Predicting the Review Class of Movies
published: July 15, 2015, recorded: May 2015, views: 1662
In this paper, we present our contribution to the Linked Data Mining Challenge 2015. Our approach predicts the review class of movies using external data from the Open Movie Database API (OMDb-API). We select specific features, such as movie ratings and box office, that are very likely to describe the quality of a movie. Using RapidMiner, we apply three basic classification algorithms to these features and train and validate the prediction model with 10-fold cross-validation. Our evaluation yields two notable results: (i) a few movie ratings from professional critics provide a higher accuracy (0.94) than many ratings from users (0.7), and (ii) the Decision Tree classifier (accuracy 0.83) outperforms Naive Bayes (accuracy 0.73), whereas k-NN is not suitable at all (accuracy 0.53).
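To make the validation scheme concrete, the following is a minimal sketch of 10-fold cross-validation in plain Python. It is not the authors' RapidMiner workflow: the data is synthetic, the two features (`critic_score`, `user_score`) and the rule that assigns the "good"/"bad" labels are assumptions made up for illustration, and a small hand-written k-NN stands in for the three classifiers named above.

```python
import random
from collections import Counter


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k nearest training points (squared Euclidean distance)."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def cross_val_accuracy(X, y, predict, folds=10, seed=0):
    """Mean accuracy over all folds: train on folds-1 parts, test on the held-out part."""
    scores = []
    for test_idx in k_fold_indices(len(X), folds, seed):
        held_out = set(test_idx)
        train_X = [X[i] for i in range(len(X)) if i not in held_out]
        train_y = [y[i] for i in range(len(X)) if i not in held_out]
        correct = sum(predict(train_X, train_y, X[i]) == y[i] for i in test_idx)
        scores.append(correct / len(test_idx))
    return sum(scores) / len(scores)


# Synthetic stand-in data (assumption): the label depends only on the critic score.
rng = random.Random(42)
X, y = [], []
for _ in range(200):
    critic_score = rng.uniform(0, 100)  # hypothetical critic rating, 0-100
    user_score = rng.uniform(0, 10)     # hypothetical user rating, 0-10
    X.append([critic_score, user_score])
    y.append("good" if critic_score > 60 else "bad")

accuracy = cross_val_accuracy(X, y, knn_predict)
print(f"10-fold cross-validated accuracy: {accuracy:.2f}")
```

Each sample is used for testing exactly once and for training in the other nine rounds, so the reported accuracy is an average over ten held-out evaluations rather than a single train/test split.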