Huawei Technologies

Enabling Huawei to implement new functionality and Presto and Hive integration support in CarbonData

Headquarters

USA

Industry

Telecommunications

Technologies Used

Scala, Java, Apache Spark, Spark Streaming, Presto, Hive, Hadoop, AWS S3

Apache CarbonData is an indexed columnar data format for fast analytics on big data platforms such as Apache Hadoop and Apache Spark. Knoldus worked in collaboration with Huawei to implement new functionality and integration support for other technologies, including Presto and Hive, in CarbonData.

Challenges

Huawei wanted to explore a setup in which the backend, the frontend, and continuous integration together ensure that older versions stay backward compatible while newer versions are rolled out frequently.

Solutions

Knoldus worked with Huawei to develop a file format that is faster and more efficient for processing and querying big data. Huawei's clients are now able to speed up their systems by using the features of CarbonData.

Results

With the rapid development and concise code offered by Scala, Knoldus could get the system into production in 4 months.

Challenges

Huawei wanted to explore a setup in which the backend, the frontend, and continuous integration together ensure that older versions stay backward compatible while newer versions are rolled out frequently. Knoldus also worked alongside the Huawei team to help CarbonData graduate from an incubating project to a top-level Apache project.

Solution

Knoldus worked closely with the Huawei team and helped in building the crucial functionalities, some of which are listed below:

Development of a dictionary generation tool for CarbonData.
Improved cost efficiency: automated cluster management reduced operational costs by more than 50%.
CarbonData integration with Presto, Hive, Flink, and S3.
Setting up continuous integration via Jenkins.
Creation of a performance testing tool for benchmarking.
Achieving zero bugs with automation testing.
Development and maintenance of the Apache CarbonData website.
Development and enhancement of the core packages of CarbonData.
Benchmarking CarbonData against other file formats such as Parquet and ORC, against frameworks such as Spark, Presto, and Impala, and against storage systems such as Hadoop, S3, and Kudu.
Our team also developed a proprietary performance benchmarking tool for CarbonData. The tool tests the performance of CarbonData against competing formats such as Parquet and ORC. Its key functions are as follows:
Generating TPC-H benchmarking data sized to the cluster, driven by configuration.
Defining workloads as configuration for particular datasets.
Loading the data into the Hive store in every format: CarbonData, Parquet, and ORC.
Configuration-based tuning for Spark, including parallelism settings and Spark configuration tailored to each workload.
Executing the workloads and capturing the response time and results under load for every format.
Comparing the results across all the formats.
Generating an Excel report that shows the comparison of the results as well as the successes and failures of test execution.
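The execute, compare, and report steps above can be pictured with a minimal Python sketch. It is not the Knoldus tool: the names `Workload`, `Result`, `run_benchmark`, `fastest_per_workload`, and `to_csv` are hypothetical, the query runner is passed in as a plain function (in a real setup it would presumably wrap a Spark SQL call), and a CSV report stands in for the Excel report.

```python
import csv
import io
import time
from dataclasses import dataclass

# Formats under comparison, as named in the case study.
FORMATS = ("carbondata", "parquet", "orc")


@dataclass
class Workload:
    """A named query, as it might be declared in a workload configuration."""
    name: str
    query: str


@dataclass
class Result:
    """Outcome of running one workload against one format."""
    workload: str
    fmt: str
    seconds: float
    succeeded: bool


def run_benchmark(workloads, run_query, formats=FORMATS, clock=time.perf_counter):
    """Run every workload against every format, timing each run.

    run_query(fmt, query) is a caller-supplied function (hypothetical here);
    any exception it raises is recorded as a failed run.
    """
    results = []
    for workload in workloads:
        for fmt in formats:
            start = clock()
            try:
                run_query(fmt, workload.query)
                succeeded = True
            except Exception:
                succeeded = False
            results.append(Result(workload.name, fmt, clock() - start, succeeded))
    return results


def fastest_per_workload(results):
    """Map each workload name to the format with the lowest successful time."""
    best = {}
    for result in results:
        if not result.succeeded:
            continue
        current = best.get(result.workload)
        if current is None or result.seconds < current.seconds:
            best[result.workload] = result
    return {name: result.fmt for name, result in best.items()}


def to_csv(results):
    """Render the results as CSV text (a stand-in for the Excel report)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["workload", "format", "seconds", "status"])
    for result in results:
        status = "success" if result.succeeded else "failure"
        writer.writerow([result.workload, result.fmt, f"{result.seconds:.3f}", status])
    return buffer.getvalue()
```

Passing the runner and the clock in as parameters keeps the comparison logic independent of any particular engine, so the same loop could be exercised with a stub runner before pointing it at a cluster.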

Results

With the rapid development and concise code offered by Scala, Knoldus got the system into production in 4 months. Alerts are routed to different buckets based on defined rules and reach consumers' mailboxes within seconds of the news breaking. The product is heavily used as part of the infrastructure.

© 2023 Knoldus, Inc. All Rights Reserved.
