command: sudo /usr/bin/pip3.7 install mrjob[aws]
mrjob lets you write MapReduce jobs in Python 2.7/3.4+ and run them on several platforms. You can:
Write multi-step MapReduce jobs in pure Python
Test on your local machine
Run on a Hadoop cluster
Run in the cloud using Amazon Elastic MapReduce (EMR)
Run in the cloud using Google Cloud Dataproc (Dataproc)
Easily run Spark jobs on EMR or your own Hadoop cluster
Ещё видео!