Machine Learning, Text Analytics, Sentiment and Topic Analysis: Italian Restaurants TripAdvisor Case
Case
Reference no.
524-0013-1
Subject category:
Marketing
Published by:
SDA Bocconi
Length: 25 pages
Data source: Field research
Share a link:
https://casecent.re/p/196787
Abstract
This case examines the Italian restaurant sector from a reputation management perspective, using the online reviews that businesses receive. The project spans the full pipeline, from building the dataset by web scraping the TripAdvisor platform to applying a range of data analysis techniques to the collected data. The sample covers restaurants located in Italy's regional capitals, with the aim of providing nationwide coverage. The analyses draw on both structured data (the descriptive information about each establishment available on the platform) and unstructured data (the title and text of the reviews each business received). To extract information from the text, the central step of the study applies major Natural Language Processing (NLP) methods for text mining, such as Sentiment Analysis and Topic Analysis. The overall goal, at every stage of the project, is to understand which factors matter most in determining a restaurant's overall rating on the platform.
Teaching and learning
This item is suitable for executive education courses.
Time period
The events covered by this case took place in 2009-2022.
Geographical setting
Region:
Europe
Country:
Italy
Featured company
Tripadvisor Inc
Employees:
1001-5000
Turnover:
USD 1.20 Billion
Industry:
Travel services