Using Dell servers for a Hadoop & Spark Cluster


When creating a Hadoop & Spark Cluster, two types of server configurations (Management and Data nodes) are desired.

 Server Types Service Placement Examples Server Design Focus
Management Node Name node, Job Tracker, HBase Master, Zookeeper, Hive, Oozie, etc… Fast CPU, Memory and Network
Data Node HDFS, GPFS, Task Tracker, HBase Region Fast Dense Storage and Network

Management nodes – Dell PowerEdge R430

When using Dell server models, the Dell  Power Edge R430 server configured with at least two LFF 3.5″ drives can be used as a Management node. Click HERE to download summary information. For detailed information click HERE.

CPU and memory resources are adjusted based on the workload planned for the Hadoop Cluster.

Cluster Purpose, Management Node(s) CPU Memory (min)
IA – Internet Analytics E5-2640 v3 64GB (1866 MHz)
LZ / DL – Landing Zone / Data Lake E5-2660 v3 128GB (2133 MHz)
NS / CA – NoSQL / Complex Data Analytics E5-2680 v3 128GB (2133 MHz)

Data nodes – Dell Power Edge R730xd

The Dell Power Edge R730xd server with up to 16 LFF 3.5″ drives can be used as a Data node. Click HERE to download summary information. For detailed information click HERE.

Image result for dell r730xd 

Cluster Purpose, Data Nodes CPU Memory (min)
IA – Internet Analytics E5-2620 v3 64GB (1866 MHz)
LZ / DL – Landing Zone / Data Lake E5-2640 v3 64GB (1866 MHz)
NS / CA – NoSQL / Complex Data Analytics E5-2680 v3 128GB (2133 MHz)