Topic Modeling: Hogwarts Legacy Reviews

This Git repository contains R code for a topic modeling analysis of reviews of the game "Hogwarts Legacy". The project is divided into five steps:

1. Load data

The data is loaded from the CSV file "hogwarts_legacy_reviews.csv", downloaded from Kaggle. Irrelevant columns are removed, and only the review text is kept for further analysis. The dataset is available here: https://www.kaggle.com/datasets/georgescutelnicu/hogwarts-legacy-reviews
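
The loading step could look roughly like the sketch below. It uses base R only. The column name `Review` and the stand-in rows are assumptions made for illustration and should be checked against the header of the downloaded file; the repository's own script may differ.

```r
# Path to the Kaggle download (file name as given in this README)
csv_path <- "hogwarts_legacy_reviews.csv"

if (file.exists(csv_path)) {
  raw <- read.csv(csv_path, stringsAsFactors = FALSE)
} else {
  # Stand-in rows (not real data) so the sketch runs without the download
  raw <- data.frame(
    Playtime = c(12, 40),
    Review = c("Great game", "Too many bugs"),
    stringsAsFactors = FALSE
  )
}

# ASSUMPTION: the review text lives in a column called "Review"
text_col <- "Review"
stopifnot(text_col %in% names(raw))

# Keep only a document ID and the review text, dropping all other columns
reviews <- data.frame(
  doc_id = seq_len(nrow(raw)),
  text = raw[[text_col]],
  stringsAsFactors = FALSE
)

# Drop missing or empty reviews
reviews <- reviews[!is.na(reviews$text) & nzchar(trimws(reviews$text)), ]
```

Assigning `doc_id` at this point keeps every later token traceable to its original review.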

2. Preprocessing

The review text undergoes several cleaning steps, including the removal of emojis, numbers, punctuation, and stop words. The remaining words are tokenized and grouped by document ID.
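
As a minimal illustration of these cleaning steps, the base R sketch below lowercases the text, replaces everything that is not a letter (emojis, numbers, punctuation) with a space, splits into words, and drops stop words. The toy reviews, the tiny `stop_list`, and the helper names `clean_text` and `tokenize` are hypothetical; a real run would more likely use a full stop word list from a package such as tidytext or tm.

```r
# Toy reviews (not real data); "\U0001F600" is an emoji
reviews <- data.frame(
  doc_id = 1:3,
  text = c("Best game of 2023!!! 10/10 \U0001F600",
           "The castle is beautiful, but the combat gets repetitive.",
           "I love flying on a broom."),
  stringsAsFactors = FALSE
)

# Tiny illustrative stop word list (hypothetical, far from complete)
stop_list <- c("the", "is", "but", "of", "on", "a", "i", "gets")

clean_text <- function(x) {
  x <- tolower(x)
  x <- gsub("[^a-z ]", " ", x)  # drops emojis, digits and punctuation
  x <- gsub(" +", " ", x)       # collapses repeated spaces
  trimws(x)
}

tokenize <- function(doc_id, text, stop_words) {
  words <- strsplit(clean_text(text), " ", fixed = TRUE)
  out <- data.frame(
    doc_id = rep(doc_id, lengths(words)),
    word = unlist(words),
    stringsAsFactors = FALSE
  )
  # Remove empty strings and stop words
  out[nzchar(out$word) & !(out$word %in% stop_words), ]
}

tokens <- tokenize(reviews$doc_id, reviews$text, stop_list)

# Group the remaining words by document ID
tokens_by_doc <- split(tokens$word, tokens$doc_id)
```

The one-token-per-row layout of `tokens` is convenient because it can be counted per document and cast into a document-term matrix for the model.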

3. Model Building

A Latent Dirichlet Allocation (LDA) model is constructed to identify topics within the reviews. The number of topics is initially set to 20 and later optimised using coherence scores and other metrics.
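
One way to fit such a model is sketched below with the tm and topicmodels packages. This is an assumption about tooling, not a description of the repository's script. The toy documents stand in for the cleaned reviews, and the wrapper `fit_lda` is a hypothetical helper whose default `k = 20` mirrors the initial setting mentioned above. The toy corpus is far too small for 20 topics, so the call uses `k = 2`.

```r
library(tm)           # corpus and document-term matrix
library(topicmodels)  # LDA(), posterior(), terms()

# Toy documents standing in for the cleaned reviews (not real data)
docs <- c(
  "castle exploration beautiful world castle secrets",
  "combat spells enemies combat repetitive spells",
  "beautiful castle world exploration secrets broom",
  "enemies combat spells difficulty combat enemies",
  "broom flying world exploration castle beautiful",
  "spells combat difficulty enemies repetitive combat"
)

corpus <- VCorpus(VectorSource(docs))
dtm <- DocumentTermMatrix(corpus)  # rows = documents, columns = terms

# Hypothetical wrapper: k = 20 is the starting value named in this README
fit_lda <- function(dtm, k = 20, seed = 1234) {
  LDA(dtm, k = k, method = "Gibbs",
      control = list(seed = seed, iter = 500))
}

model <- fit_lda(dtm, k = 2)  # small k because the toy corpus is tiny

terms(model, 5)                    # five most probable terms per topic
theta <- posterior(model)$topics   # document-topic probabilities
phi   <- posterior(model)$terms    # topic-term probabilities
```

Fixing the seed makes the Gibbs sampler reproducible, which matters when models with different numbers of topics are compared later.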

4. Model Optimisation

Two metrics, CaoJuan2009 and Deveaud2014, are used to optimise the number of topics in the LDA model. Based on these metrics, the final model uses 8 topics.
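
Both metric names match options of `FindTopicsNumber()` in the ldatuning package, so the search might be sketched as follows. Whether the repository uses ldatuning is an assumption, and the toy documents and the narrow candidate range `2:4` are illustrative only; on the real corpus a wider range such as `seq(2, 20)` would be more appropriate. In ldatuning's convention, CaoJuan2009 should be minimised and Deveaud2014 maximised.

```r
library(tm)
library(ldatuning)

# Toy documents standing in for the cleaned reviews (not real data)
docs <- c(
  "castle exploration beautiful world castle secrets",
  "combat spells enemies combat repetitive spells",
  "beautiful castle world exploration secrets broom",
  "enemies combat spells difficulty combat enemies",
  "broom flying world exploration castle beautiful",
  "spells combat difficulty enemies repetitive combat"
)
dtm <- DocumentTermMatrix(VCorpus(VectorSource(docs)))

# Candidate numbers of topics (kept small for the toy corpus)
candidate_k <- 2:4

result <- FindTopicsNumber(
  dtm,
  topics = candidate_k,
  metrics = c("CaoJuan2009", "Deveaud2014"),
  method = "Gibbs",
  control = list(seed = 1234),
  mc.cores = 1L,
  verbose = FALSE
)

print(result)                  # one row per candidate k, one column per metric
FindTopicsNumber_plot(result)  # CaoJuan2009: lower is better; Deveaud2014: higher is better
```

A reasonable choice is a number of topics where CaoJuan2009 is low and Deveaud2014 is high at the same time.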

5. Visualisation

Various visualisations are provided, including the distribution of words across topics, the distribution of topics across documents (theta), and the word-topic probability matrix (phi). Additional visualisations such as word clouds and bar plots can be explored. [Still working on this part :)]
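
A bar plot of the most probable words per topic can be drawn directly from a phi matrix with base R graphics, as sketched below. The small `phi` matrix holds made-up probabilities for illustration, and `top_terms` is a hypothetical helper; in practice phi would come from the fitted model.

```r
# Illustrative topic-term matrix (made-up values, each row sums to 1)
phi <- matrix(
  c(0.40, 0.30, 0.15, 0.10, 0.05,
    0.05, 0.10, 0.15, 0.30, 0.40),
  nrow = 2, byrow = TRUE,
  dimnames = list(c("topic_1", "topic_2"),
                  c("castle", "broom", "story", "combat", "bugs"))
)

# Hypothetical helper: the n most probable terms of each topic
top_terms <- function(phi, n = 3) {
  tops <- lapply(seq_len(nrow(phi)), function(k) {
    head(sort(phi[k, ], decreasing = TRUE), n)
  })
  names(tops) <- rownames(phi)
  tops
}

tops <- top_terms(phi, n = 3)

# One horizontal bar plot per topic, most probable term on top
op <- par(mfrow = c(1, length(tops)), mar = c(4, 6, 2, 1))
for (k in names(tops)) {
  barplot(rev(tops[[k]]), horiz = TRUE, las = 1,
          main = k, xlab = "P(word | topic)")
}
par(op)
```

The same pattern applies to theta: replacing the rows of phi with rows of theta shows the topic mix of individual documents.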

