SC18 Demo – Hadoop
Modern Hadoop cluster workloads are scaling into the petabyte range, stressing infrastructure in both compute resources and storage I/O. ScaleFlux Computational Storage is the first turnkey, easy-to-deploy solution that accelerates GZIP compression and erasure coding (Hadoop 3), the main bottlenecks for big data ingest. In addition, ScaleFlux opens up storage I/O by adding low-latency PCIe flash as temporary storage space for analytical processing jobs assigned to HDDs. The end result? Optimized time-to-insight from massive data workloads. This demo uses the standard Hadoop TeraGen and TeraSort benchmarks to show how ScaleFlux Computational Storage can cost-effectively deliver 3x faster data ingest while running 60% more jobs on the same infrastructure footprint.
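For readers who want to reproduce a TeraGen/TeraSort run on their own cluster, the following is a minimal sketch of how the benchmark invocations might be assembled. It relies on the fact that TeraGen writes fixed 100-byte rows and takes a row count rather than a byte size. The jar name, the HDFS directories, and the helper names `teragen_rows` and `benchmark_commands` are assumptions for illustration; they are not taken from the demo setup, and the jar path in particular depends on the Hadoop installation.

```python
import shlex

# TeraGen writes fixed-size rows of 100 bytes each.
TERAGEN_ROW_BYTES = 100


def teragen_rows(target_bytes: int) -> int:
    """Return the number of rows needed to generate at least target_bytes."""
    return -(-target_bytes // TERAGEN_ROW_BYTES)  # ceiling division


def benchmark_commands(target_bytes: int, examples_jar: str,
                       gen_dir: str, sort_dir: str) -> list[str]:
    """Build the TeraGen and TeraSort command lines (hypothetical helper).

    examples_jar, gen_dir, and sort_dir are placeholders; adjust them to
    the local Hadoop installation and HDFS layout.
    """
    rows = teragen_rows(target_bytes)
    teragen = ["hadoop", "jar", examples_jar, "teragen", str(rows), gen_dir]
    terasort = ["hadoop", "jar", examples_jar, "terasort", gen_dir, sort_dir]
    return [shlex.join(teragen), shlex.join(terasort)]


if __name__ == "__main__":
    # Example: a 1 TB (10**12 byte) dataset; all paths are assumed.
    for cmd in benchmark_commands(10**12, "hadoop-mapreduce-examples.jar",
                                  "/bench/teragen", "/bench/terasort"):
        print(cmd)
```

The sketch only prints the commands, so it can be checked without a cluster. Timing the TeraGen step with and without acceleration would be one way to compare ingest times; the sketch does not itself measure anything.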