Archivo de la etiqueta: AWS

Getting latest EMR release label


Usually latest release label gets updated on EMR’s Whats New page. So a way to getting the last EMR release label would be:   curl -s https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-whatsnew.html |grep “(Latest)”|head -n1|awk ‘{ print $3 }’   Have fun!     Anuncios

Publicado en Uncategorized | Etiquetado , , | Deja un comentario

AWS S3 API – Throughput Notes


Some notes on settings to maximize throughput and increase parallelism while using S3 API: aws configure set default.s3.max_concurrent_requests 20 aws configure set default.s3.max_queue_size 10000 aws configure set default.s3.multipart_threshold 64MB aws configure set default.s3.multipart_chunksize 16MB aws configure set default.s3.max_bandwidth 50MB/s aws … Seguir leyendo

Publicado en Uncategorized | Etiquetado , | Deja un comentario

AWS EMR – Big Data in Strata New York


Will you be in New York next week (Sept 25th – Sept 28th)?                    Come meet the AWS Big Data team at Strata Data Conference, where we’ll be happy to answer your questions, hear about your requirements, and help you … Seguir leyendo

Publicado en Uncategorized | Etiquetado , , , , , | Deja un comentario