Cloud Hadoop

Hadoop Clusters for Easy and Fast Big Data Processing/Analytics

Samsung Cloud Platform offers Hadoop clusters for big data processing and analytics. Cloud Hadoop supports small-scale computing resources to enable clustering and parallel processing of large-capacity data. In addition, the Hadoop ecosystem and management environments with validated compatibility based on Apache open source are also provided for convenient use.

Overview

01

04

Service Architecture

Data Ingestion
  • Real-time Data Collection - Kafka
  • Structured/ Unstructured Data Collection - Sqoop, Flume
Data Process/Analytics
  • Data Processing - Map Reduce, Hive, Hue, Livy, Solr
  • Execution Engine - Tez, Spark
  • Data Operation - YARN
  • Coordinator - Zookeeper
  • Data Governance - Altas
  • Security - Ranger
  • NoSQL DB - HBase
  • Data Storage for Any Data Type - HDFS
  • Data Ingestion → Data Process/Analytics
* Data Ingestion to be applied in H2’22

Key Features

  • Automated Hadoop clusters

    - Offer Hadoop ecosystem with validated mutual compatibility and grant users server (VM) access
    - Initial installations : Monitoring servers, Zookeeper, HDFS, YARN, and HBase

  • Support for a number of open source software

    - HDFS, Zookeeper, YARN, Spark, Hive, TEZ, Atlas, Ranger, Livy, Hue, Kerberos, HBase, and Solr

  • User convenience features

    - Installation/Management by Hadoop ecosystem
    - Optimal configuration value and version management
    - Dashboard for integrated monitoring on system resources
    - Alerts on service failure

Pricing

    • Billing
    • Hourly rate for VM type of cluster node (VM + Hadoop application cost)
Let’s talk

Whether you’re looking for a specific business solution or just need some questions answered, we’re here to help