Brightics Data Hub

Distributed Data Platform - Optimized Hadoop Ecosystem for Enterprise Business

Easy management of a Hadoop environment optimized for your business needs

Brightics Data Hub provides the latest version of the open source Hadoop ecosystem, with compatibility verified for each service, so that you can manage your data efficiently. You can manage all of it through the web-based Hadoop Manager.

Major Services

Provides scalability and high stability based on easy and smart management capabilities.

  • Various components in the Hadoop ecosystem: simplified installation and management
  • Commercial software-level high stability: open source stability verification completed




Service Stack of Brightics Data Hub


Brightics Data Hub
Brightics Data Hub Manager
Data Processing : Map Reduce, Hive, Hue, Livy, Solr
Execution Engine : Tez, Spark
Data Operation : YARN
Coordinator : Zookeeper
Data Governance : Atlas
Security : Ranger
NoSQL DB : HBase
Data Storage for Any Data Type : HDFS
Brightics Data Hub Manager

Brightics Data Hub Manager provides distribution and management of service applications, configuration and version management for each service, recommendations of optimal configuration values, integrated monitoring, and clustering, making it easier to build and manage the Brightics Data Hub ecosystem.

Data Processing

It consists of Map Reduce, which processes large amounts of data quickly and securely through distributed parallel computing; Hive, which summarizes, queries, and analyzes data; and Hue, Livy, and Solr, which manage and monitor data tasks.
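
As a rough illustration of the distributed parallel computing model that Map Reduce follows, the sketch below runs the map, shuffle, and reduce phases inside a single Python process. The function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and are not part of Brightics Data Hub or Hadoop; on a real cluster the same phases would be spread across nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
```

Because each line is mapped independently and each key is reduced independently, both phases can be parallelized, which is what lets this model scale to large data sets.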

Execution Engine

It consists of Tez, an execution engine for data processing operations, and Spark for big data distributed processing.

Data Operation

YARN, which manages Hadoop execution engines and resources, is included.


Coordinator

Zookeeper, which provides distributed coordination services for various services in the Hadoop ecosystem, is included.

Data Governance

Atlas, which provides metadata-based data standards and lineage, is included.


Security

Ranger, which manages data security for the Hadoop cluster, is included.


NoSQL DB

HBase, which is the most widely used NoSQL database based on the Hadoop platform, is included.

Data Storage

HDFS is a distributed file system that spreads large amounts of data across multiple nodes so that the data can be processed on each node at the same time.
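
To make the idea of distributing data across nodes more concrete, the following sketch splits a file into fixed-size blocks and assigns each block's replicas to nodes. It is only an illustration: the function names and node names are hypothetical, the 128 MB block size and replication factor of 3 are the usual HDFS defaults rather than values stated for Brightics Data Hub, and the round-robin placement is a simplification of the rack-aware placement that HDFS actually uses.

```python
MB = 1024 * 1024


def split_into_blocks(file_size, block_size=128 * MB):
    """Return the size in bytes of each block for a file of file_size bytes."""
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size must be > 0")
    full, remainder = divmod(file_size, block_size)
    return [block_size] * full + ([remainder] if remainder else [])


def place_replicas(num_blocks, nodes, replication=3):
    """Assign each block to `replication` distinct nodes in round-robin order."""
    if replication > len(nodes):
        raise ValueError("replication cannot exceed the number of nodes")
    placement = {}
    for block in range(num_blocks):
        # Consecutive offsets modulo the node count never repeat a node
        # as long as replication <= len(nodes).
        placement[block] = [
            nodes[(block + i) % len(nodes)] for i in range(replication)
        ]
    return placement
```

For example, a 300 MB file yields two full 128 MB blocks and one 44 MB block, and each block lands on three different nodes, so any node can work on its local blocks in parallel with the others.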

Analyst Reports

  • Samsung SDS was positioned as a Major Player in the 2021 APEJ IDC MarketScape for Vision AI Software Platform.

    MarketScape Asia/Pacific (Excluding Japan) Vision AI Software Platform 2021 Vendor Assessment, IDC

  • Samsung SDS is honored to be named in Gartner’s Magic Quadrant for Data Science and Machine Learning Platforms.

    in Magic Quadrant for Data Science and Machine Learning Platforms, March 2021, Gartner


  • Samsung SDS Brightics AI was selected as a leading vendor in the 'Multimodal Predictive Analytics and Machine Learning' category of Forrester Wave™

    in The Forrester Wave™ : Multimodal Predictive Analytics and Machine Learning, Q3 2020, Forrester



Use Cases


    • Stability for enterprise business

      It is packaged through verification and version management so that the Hadoop distribution can be used for business, and it supports data standards and access management with Apache Atlas, Apache Ranger, and other components.

    • Easy and smart management

      Optimal configurations are easy to install and manage through the web UX, and configuration changes are version-managed. Administrators are notified immediately when an issue occurs, which makes management smarter.

    • Flexible scalability

      It provides easy storage and use of structured and unstructured data, along with a flexible interface environment that can be customized for your business.


    Recommended specifications
    • Data Hub Platform

      x86 servers, 3 or more nodes
      - CPU: 16 cores/node
      - Memory: 64 GB/node
      - Disk: 4 TB/node
      - Operating system: CentOS, RHEL 7.x

    • PC for platform users

      - Browser: Chrome (version 50.0 or higher)
      - Screen resolution: 1280 x 900 (recommended)
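
The platform figures above can be turned into a simple pre-installation check. The sketch below is a hypothetical helper, not a Brightics Data Hub tool: it assumes the recommended values are treated as minimums and that the node inventory is supplied by hand as a dictionary.

```python
# Recommended values from the list above, treated here as minimums (an assumption).
RECOMMENDED = {"min_nodes": 3, "cpu_cores": 16, "memory_gb": 64, "disk_tb": 4}


def check_cluster(nodes, spec=RECOMMENDED):
    """Return a list of shortfall messages; an empty list means the spec is met."""
    problems = []
    if len(nodes) < spec["min_nodes"]:
        problems.append(
            f"need at least {spec['min_nodes']} nodes, found {len(nodes)}"
        )
    for name, hw in nodes.items():
        for key in ("cpu_cores", "memory_gb", "disk_tb"):
            if hw[key] < spec[key]:
                problems.append(
                    f"{name}: {key} is {hw[key]}, recommended {spec[key]}"
                )
    return problems
```

Calling check_cluster with three nodes that each have 16 cores, 64 GB of memory, and 4 TB of disk returns an empty list; any node below a recommended value produces one message per shortfall.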



    Brightics AI is an AI-powered analytics platform designed to provide an integrated environment for fast and accurate business analytics. It covers the entire data lifecycle, from collection and analysis to utilization, through modules that include data preparation, machine learning, and deep learning. It allows everyone to easily perform structured and unstructured data analysis, build analytics models, and work with others seamlessly using intuitive, easy-to-use modeling tools and functions, without specialized data science knowledge or scripting. Brightics AI has more than 100 references across various industries, including manufacturing, marketing, logistics, security, and healthcare. Brightics AI represents one of Samsung SDS’s five key technology areas, namely AI, blockchain, cloud, data analytics, and security (ABCDS), and can be complemented with other proprietary offerings, such as Samsung Cloud and Brightics IoT. The Brightics AI trial service is provided free of charge for 60 days in a cloud environment.

    Disclaimer : Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner research publications consist of the opinions of Gartner’s research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.