Tnhelearning.edu.vn - Various useful general information portal

davidjegan/AWS-EMR-Node-Calculator: AWS-EMR-Node-Calculator

March 30, 2024 by admin Category: Top


AWS Elastic MapReduce (EMR) Node Calculator – a serverless way

Table of Contents

  • Context
  • Cluster Node Calculation Formulae
  • Pre-Requisite
  • Services and components
  • Process Flow
  • Set-up
  • PS

Context

To get good parallelism, an EMR cluster needs the right number of nodes. Working that number out by hand involves tedious lookups and cross-referencing of instance and Hadoop configuration values. This tool simplifies that process: it returns an estimate of the number of nodes your application needs to run seamlessly.

Cluster Node Calculation Formulae

  1. Read the default mapred-site.xml
  2. Get the mapreduce.map.memory.mb and yarn.scheduler.maximum-allocation-mb values
  3. Instance Mapper Capacity (number of mappers per instance) = yarn.scheduler.maximum-allocation-mb / mapreduce.map.memory.mb

Total Mappers Required = Total Size of Input / Input Split Size

Numerator = Total Mappers * Time to Process Sample Files

Denominator = Instance Mapper Capacity * Desired Processing Time

Estimated number of nodes = Numerator / Denominator
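The formulae above can be sketched in a few lines of Python. This is only an illustration, not the tool's own code: the function names are made up for this example, the sample numbers are hypothetical, and rounding the results up to whole mappers and whole nodes is an assumption (the formulae themselves do not say how to round).

```python
def instance_mapper_capacity(max_allocation_mb: int, map_memory_mb: int) -> int:
    """Mappers one instance can run at once:
    yarn.scheduler.maximum-allocation-mb / mapreduce.map.memory.mb."""
    return max_allocation_mb // map_memory_mb


def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up (assumption: partial units count as whole)."""
    return -(-a // b)


def total_mappers(total_input_bytes: int, split_size_bytes: int) -> int:
    """Total Mappers Required = Total Size of Input / Input Split Size."""
    return ceil_div(total_input_bytes, split_size_bytes)


def estimated_nodes(mappers: int, sample_time_s: int,
                    mapper_capacity: int, desired_time_s: int) -> int:
    """Estimated number of nodes = Numerator / Denominator, rounded up."""
    numerator = mappers * sample_time_s
    denominator = mapper_capacity * desired_time_s
    return ceil_div(numerator, denominator)


# Hypothetical inputs: 100 GiB of input, 128 MiB splits, a sample run that
# took 120 s, a one-hour target, and made-up memory settings.
capacity = instance_mapper_capacity(max_allocation_mb=11520, map_memory_mb=1440)
mappers = total_mappers(100 * 1024**3, 128 * 1024**2)
nodes = estimated_nodes(mappers, sample_time_s=120,
                        mapper_capacity=capacity, desired_time_s=3600)
print(capacity, mappers, nodes)  # 8 800 4
```

With these inputs the numerator is 800 * 120 = 96000 and the denominator is 8 * 3600 = 28800, which gives about 3.33 and rounds up to 4 nodes.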

Pre-Requisite

  1. Get a test workload
  2. The number of sample files should match the number of mappers
  3. Run an EMR cluster with a single core node and process the sample files
  4. The time taken to process them is the processing time (the "Time to process sample files" term in the formula)

Services and components

  1. DynamoDB: the NoSQL database offering of AWS
  2. Lambda: a compute service that runs code without you deploying servers
  3. API Gateway: the AWS service for publishing APIs, used here to invoke the Lambda function
  4. Front-end components: HTML, CSS, JS, jQuery and AJAX

Process Flow

  1. Get the details of all AWS compute instance types and store them in a DB
  2. Create a Lambda function that reads this DB and returns its contents
  3. Create an API endpoint to invoke this Lambda function
  4. Embed this API endpoint in the front-end code
  5. Parse the response and render the contents of the webpage dynamically
  6. (Optional) Create a Lambda function that listens for AWS SNS notifications of service changes and updates the DynamoDB contents on the fly
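Step 2 of this flow could look roughly like the Python handler below. It is a sketch under stated assumptions, not the project's actual function: the table name `emr-instance-types` is hypothetical, the optional `table` parameter exists only so the handler can be exercised without AWS, and the CORS header assumes the front-end is served from a different origin than the API. The response shape (`statusCode`, `headers`, string `body`) is what Lambda proxy integration expects.

```python
import json
from decimal import Decimal

TABLE_NAME = "emr-instance-types"  # hypothetical table name


def _to_jsonable(value):
    """DynamoDB returns numbers as Decimal, which json.dumps cannot serialise."""
    if isinstance(value, Decimal):
        return int(value) if value == value.to_integral_value() else float(value)
    raise TypeError(f"not JSON serialisable: {type(value).__name__}")


def scan_all(table):
    """Collect every item, following LastEvaluatedKey across scan pages."""
    items = []
    kwargs = {}
    while True:
        page = table.scan(**kwargs)
        items.extend(page.get("Items", []))
        last_key = page.get("LastEvaluatedKey")
        if not last_key:
            return items
        kwargs["ExclusiveStartKey"] = last_key


def lambda_handler(event, context, table=None):
    """Return the whole instance table as a Lambda proxy integration response."""
    if table is None:
        import boto3  # provided by the Lambda Python runtime
        table = boto3.resource("dynamodb").Table(TABLE_NAME)
    return {
        "statusCode": 200,
        "headers": {
            "Content-Type": "application/json",
            "Access-Control-Allow-Origin": "*",  # assumption: browser calls cross-origin
        },
        "body": json.dumps(scan_all(table), default=_to_jsonable),
    }
```

A full table scan is acceptable here only because the list of instance types is small; a larger table would call for a query on a key instead.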

Set-up

  • DynamoDB => Contains the data of instances
    • Load the following contents into the DynamoDB using the following script
  • Lambda => To retrieve DB contents
    • Create a lambda function in the AWS console
  • API Gateway
    • Go to API Gateway and create an API
    • Provide a name, a description, and an endpoint type
    • Create a GET method
    • Choose Lambda Function as the Integration type
    • Turn on Use Lambda Proxy Integration
    • Provide the region and the name of the Lambda function created in the previous step
    • Click OK when the popup asks you to grant access to the Lambda function
    • Click on Actions and Deploy API
    • Provide a stage name and description
    • Deploy the API
    • Note the Invoke URL, this will be used in the next step.
  • Front-end update
    • Embed this endpoint in the front-end JS file
    • Open the HTML file, provide the inputs, and read off the number of nodes

PS

  • The number of mappers depends on the number of Hadoop splits
  • If your files are smaller than the HDFS or Amazon S3 split size, the number of mappers is equal to the number of files
  • If some or all of your files are larger than the HDFS or Amazon S3 split size (fs.s3.block.size), the number of mappers is the sum, over all files, of each file's size divided by the HDFS/Amazon S3 block size, rounded up per file
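The two rules above collapse into one calculation: each file contributes its size divided by the split size, rounded up. The sketch below is an approximation with a made-up function name and hypothetical file sizes. It assumes non-empty, splittable files, and the split count Hadoop actually produces can differ slightly.

```python
def mappers_for_files(file_sizes_bytes, split_size_bytes):
    """Sum of per-file split counts; each file needs ceil(size / split size) mappers."""
    return sum(-(-size // split_size_bytes) for size in file_sizes_bytes)


MIB = 1024 ** 2
split = 64 * MIB  # hypothetical split size

# Files no larger than the split size: one mapper per file.
print(mappers_for_files([10 * MIB, 20 * MIB, 64 * MIB], split))   # 3

# A 200 MiB file needs 4 splits, so the total is 1 + 1 + 4.
print(mappers_for_files([10 * MIB, 64 * MIB, 200 * MIB], split))  # 6
```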
