Hotel Ratings Classification: Project Overview

Demo:

GIF

Code and Resources Used

Python Version: 3.7
Packages: pandas, numpy, sklearn, matplotlib, seaborn, nltk, spacy, gensim, wordcloud, flask, pickle
For Web Framework Requirements: pip install -r requirements.txt

Business Problem/Objective:

The sample dataset consists of 20,000 reviews and ratings for different hotels. The goal is to examine how travellers communicate their positive and negative experiences of staying at a specific hotel on online platforms. The major objective is to identify the attributes travellers consider when selecting a hotel. With this, a manager can understand which elements of their hotel most influence positive reviews or improve the hotel's brand image.

Project Architecture / Flow

Project workflow

Dataset Details:

dataset details

Exploratory Data Analysis (EDA):

This is the most crucial part of any Data Science project. I adopted the following EDA steps for this NLP project:

1. Text pre-processing:

The following text pre-processing steps were included:
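
A minimal sketch of what a typical pre-processing pass might look like, assuming lowercasing, removal of digits and punctuation, whitespace tokenization and stop-word removal. The `preprocess` function and the tiny `STOP_WORDS` set are illustrative only and are not the project's actual code; nltk and spacy (listed under Packages) provide fuller stop-word lists and lemmatization.

```python
import re

# Illustrative stop-word list (an assumption); nltk or spacy ship fuller lists.
STOP_WORDS = {"the", "a", "an", "and", "was", "is", "were", "in", "to", "of", "it"}


def preprocess(review):
    """Lowercase, keep letters only, tokenize on whitespace, drop stop words."""
    text = review.lower()
    # Replace digits and punctuation with spaces so words stay separated.
    text = re.sub(r"[^a-z\s]", " ", text)
    tokens = text.split()
    return [t for t in tokens if t not in STOP_WORDS]


print(preprocess("The room was CLEAN, and the staff were friendly!"))
# ['room', 'clean', 'staff', 'friendly']
```

The resulting token lists can then feed later steps such as n-gram counting or word clouds.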

2. N-gram Analysis:
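
As an illustration of the idea, n-grams can be counted from already-tokenized reviews using only the standard library. The `ngram_counts` helper and the two sample reviews below are hypothetical and are not taken from the project's notebook or dataset.

```python
from collections import Counter


def ngram_counts(token_lists, n=2):
    """Count n-grams (as tuples of tokens) across a list of tokenized reviews."""
    counts = Counter()
    for tokens in token_lists:
        # zip over n shifted views of the token list to form sliding windows.
        counts.update(zip(*(tokens[i:] for i in range(n))))
    return counts


# Hypothetical tokenized reviews, for demonstration only.
reviews = [
    ["great", "location", "friendly", "staff"],
    ["friendly", "staff", "clean", "room"],
]

print(ngram_counts(reviews, n=2).most_common(1))
# [(('friendly', 'staff'), 2)]
```

Setting `n=1` gives plain word frequencies, and `n=3` gives trigrams.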

3. Word Clouds

4. Sentiment Analysis:

5. Named Entity Recognition (NER):

6. Topic Modeling:

Model Building:

model details

Model Evaluation

model evaluation

Challenges Faced: