TikCheck

TikCheck combines rule-based checks and fast deep neural networks to deliver more trustworthy and relevant Google reviews.

📌 Project Overview

This project explores patterns in Google Local Reviews using Natural Language Processing (NLP) techniques. Our analysis focuses on uncovering insights that can inform platform strategies across five key stakeholder groups:

Users (General Audience): Ease of content creation, engagement, community building, accessibility, and data privacy.

Creators: Monetization opportunities, recognition, and growth support.

Businesses & Advertisers: Product promotion, e-commerce integration, brand safety, and advertising analytics.

Platform & Technology: AI-driven personalization, new content formats, scalability, and performance.

Regulators & Society: Compliance, transparency, and social responsibility.

By leveraging the dataset of millions of user-generated reviews (Google Local Reviews dataset), we aim to identify factors that enhance user engagement, content creation, business promotion, and trust on digital platforms.

⚙️ Setup Instructions

Clone the Repository git clone cd
Create a virtual environment
Install Dependencies: pip install -r requirements.txt

All dataset can be found here in the repo, and are clearly specified in the notebooks.

👥 Team Member Contributions

[Coco] – Data preprocessing, model development, README documentation.

[Xavier] – Model training, evaluation, and results analysis.

[Ungchan] – Model training, evaluation, and results analysis.

[Kai Jun] – Model training, evaluation, and results analysis.

[Jairus] – Model training, evaluation, and results analysis.

📊 Key Themes Explored

Content creation & engagement → Making platforms fun, interactive, and inclusive.

Business promotion → Enabling brands to effectively reach audiences.

E-commerce integration → Analyzing opportunities for in-app purchasing.

Privacy & accessibility → Ensuring trust, safety, and inclusivity.

AI & regulation → Balancing innovation with compliance and responsibility.

↩️ General Flow of Pipeline

Data Collection (Both Reviews Dataset & User-Metadata Dataset) -> Data Cleaning -> EDA -> Feature Engineering -> Dataset Split -> Modelling -> Final Evaluation

For the user_metadata dataset, we would be use the user's past reviews that we have collected, aggregate them to form user-level features, and merge them with the main Reviews Dataset.

⚠️ Disclaimer

All the relevant documentations are clearly done in the 11 notebooks.

Please note that since we have crawled data using APIs, used GPT to synthesize data, it would not be possible to be able to replicate the data collection process (step1_data_collection.ipynb).

The dataset that we have collected (initial datasets) can be found under final_data/initial_dataset.csv and final_data/user_metadata.csv.

As such, please start running the notebook from step2_data_cleaning.ipynb all the way to step8_reporting.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
DNN		DNN
archived		archived
final_data		final_data
raw_data		raw_data
test_results		test_results
val_results		val_results
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
step1_data_collection.ipynb		step1_data_collection.ipynb
step2_data_cleaning.ipynb		step2_data_cleaning.ipynb
step3_eda.ipynb		step3_eda.ipynb
step4a_feature_engineering_for_reviews_business.ipynb		step4a_feature_engineering_for_reviews_business.ipynb
step4b_feature_engineering_for_users.ipynb		step4b_feature_engineering_for_users.ipynb
step5_dataset_splitting.ipynb		step5_dataset_splitting.ipynb
step6a_rule_based.ipynb		step6a_rule_based.ipynb
step6b_dnn.ipynb		step6b_dnn.ipynb
step6c_hybrid_model.ipynb		step6c_hybrid_model.ipynb
step7_final_evaluation.ipynb		step7_final_evaluation.ipynb
step8_reporting.ipynb		step8_reporting.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TikCheck

📌 Project Overview

⚙️ Setup Instructions

👥 Team Member Contributions

📊 Key Themes Explored

↩️ General Flow of Pipeline

⚠️ Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TikCheck

📌 Project Overview

⚙️ Setup Instructions

👥 Team Member Contributions

📊 Key Themes Explored

↩️ General Flow of Pipeline

⚠️ Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages