In this talk I will present experiences of using a combination of Hadoop and Python to build pipelines that process large amount of textual hotel reviews in more than a dozen a languages. In particular I will show the application Word2vec (via Gensim) to extract information and cluster similar hotels based on the opinion of users.
Miguel Fernando Cabrera 00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.
Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: [ Ссылка ]
Ещё видео!