dc.description.abstract | Online customer reviews play a crucial role in e-commerce, but the emergence of fake
reviews negatively impacts the e-commerce ecosystem. Developing effective methods for fake
reviews detection (FRD) has become an important research domain. Recent studies using
generative AI to create fake reviews focus on embellishing existing reviews. This research aims to
devise efficacious FRD methods and evaluate the effects of different aspects of fake reviews
generated by ChatGPT on detection performance. We use the YelpZip dataset and apply multiple
feature extraction methods, resampling techniques, and combining various machine learning (ML)
and deep learning (DL) approaches, such as Random Forest, eXtreme Gradient Boosting
(XGBoost), Logistic Regression (LR), Bidirectional Encoder Representations from Transformers
(BERT), and Robustly Optimized BERT Approach (RoBERTa), to find the most effective method.
Different types of fake reviews were generated using ChatGPT, including simply embellished fake
reviews, those containing different restaurant information, and reviews with various dining
experiences. We deeply analyze how various review aspects impact FRD performance. The results
show that among ML methods, the combination of BERT embeddings, Random Oversampling
(ROS), and LR achieved the best performance (AUC: 0.715). In DL methods, RoBERTa
performed the best (AUC: 0.770). The detection results suggest that ChatGPT-rephrased fake
reviews are easier to identify, and the inclusion of restaurant information and dining experiences
in the reviews impacts the performance of FRD. This study leverages ChatGPT to rephrase and
create fake reviews with varying aspects, providing new perspectives and valuable insights in fake
reviews detection. | en_US |