In e‑commerce, enormous amounts of data are generated through the interaction of customers with Web platforms. Customer feedback in the form of product reviews, for instance, is an example for unstructured data, which processing requires approaches from the fields of computer linguistics and machine learning. As an alternative to the classical approaches of supervised and unsupervised learning, which often reach their limits in the business context and the application domain of product reviews, this article presents a hybrid approach for categorizing product reviews that combines the advantages of machine learning and human expertise. The aim of this paper is to present an approach that allows to automatically extract structured topics and related aspects from product reviews based on practical requirements. Word2Vec is used to train semantic relationships between words that occur in product reviews. In this way, individual words of each review can be compared with in advance defined topic words regarding their similarity and can then be extracted from the reviews. This approach is demonstrated using around five million product reviews of the Amazon online platform. The results are getting compared with those from a common topic modelling technique.
mehr| Titel | Hybrider Ansatz zur automatisierten Themen-Klassifizierung von Produktrezensionen |
|---|---|
| Medien | HMD Praxis der Wirtschaftsinformatik |
| Verlag | Springer Fachmedien Wiesbaden |
| Heft | 5 |
| Band | 56 |
| Verfasser | René Götz, Prof. Dr. Alexander Piazza, Freimut Bodendorf |
| Seiten | 932-946 |
| Veröffentlichungsdatum | 01.04.2019 |
| Zitation | Götz, René; Piazza, Alexander; Bodendorf, Freimut (2019): Hybrider Ansatz zur automatisierten Themen-Klassifizierung von Produktrezensionen. HMD Praxis der Wirtschaftsinformatik 56 (5), 932-946. DOI: 10.1365/s40702-019-00521-w |