Console Games

25 April 2022| Category: Data Engineering & Analytics| Project hosted at: GitHub
work-single-image

Overview

A console game is a type of interactive multimedia software that uses a video game console to provide an interactive multimedia experience via a television of other display device. The game console generally consists of a handheld control device (although some use cameras to monitor user movements) and a computer that runs the game's software. The global console games market is about $26.8b in 2018 with key vendors in the market like Microsoft, Sony, and Nintendo.

Our group decided to create a database that serves as the backend for any application or services, which can leverage it to provide product recommendations (videogames) based on key attributes such as rating, genre, console compatibility, and so on. Our goal is to set up a working database that can also be used for data-science tasks such as predicting success of a new game based on analyzing similar games’ information stored in our database.

This is an academic project. It is a project that showcases data engineering, database skills, as well as data science skills.

Project Key Steps

Database Design

Data Generation and Loading

Data Collection The data was collected from two sources. Games Data The games data primarily comes from Online Game System Repository (https://www.gamesdatabase.org/). The following screenshot shows one such page:

Since the webpage has a defined structure and predictable formatting, we used python to web-scrap this data. We took only those categories that has more than 1000 games.

Users Data For Users data, we used Kaggle dataset (https://www.kaggle.com/datasets/nathanlauga/nba-games). This dataset contains games and players information of NBA.

Data Cleaning

We used python for scraping the data from the web. Also, used data science libraries such as pandas and numpy to fornat and clean the data.



See project code at GitHub for more details.



Creating Database schema

The ER diagram of the created database is shown below:



Data Visualization

Following are some of the examples of data visualization using above database.

Game Counts by Genre


Game Counts by Console Type


Games Location


Machine Learning Models

Created a machine learning model in Azure ML studio.



Please see GitHub page for more details.

SO WHAT YOU THINK ?

I am available for short term projects as well as long term contracts.

Contact With Me