mrjob v0.7.4 documentation

  • ← Dataproc runner options
  • Elastic MapReduce Quickstart →
  • Home
  • Guides

Elastic MapReduceΒΆ

  • Elastic MapReduce Quickstart
    • Configuring AWS credentials
    • Configuring SSH credentials
    • Running an EMR Job
    • Choosing Type and Number of EC2 Instances
  • Cluster Pooling
  • EMR runner options
    • Amazon credentials
    • Instance configuration
    • Cluster software configuration
    • Monitoring your job
    • Cluster pooling
    • S3 Filesystem
    • Docker
    • API Endpoints
    • Other rarely used options
  • EMR Bootstrapping Cookbook
    • When to use bootstrap, and when to use setup
    • Installing Python packages with pip
    • Installing PyPy
    • Installing System Packages
  • Troubleshooting
    • Using persistent clusters
  • Advanced EMR usage
    • Spot Instances
    • Custom Python packages
    • Bootstrap-time configuration
    • Manually Reusing Clusters

Need help?

Join the mailing list by visiting the Google group page or sending an email to mrjob+subscribe@googlegroups.com.

This Page

  • Show Source
  • ← Dataproc runner options
  • Elastic MapReduce Quickstart →
  • Home
  • Guides
© 2009-2018 Yelp and Contributors. Created using Sphinx 1.3.1 with the better theme.