Real time streaming data storage and processing using storm and analytics with Hive

D. Surekha; G. Swamy; Venkatramaphanikumar Sistla

doi:10.1109/icaccct.2016.7831712

ScienceGate Book Chapters

JOURNAL ARTICLE

Real time streaming data storage and processing using storm and analytics with Hive

D. Surekha G. Swamy Venkatramaphanikumar Sistla

Year: 2016 Pages: 606-610

DOI: 10.1109/icaccct.2016.7831712

Get Full-Text PDF Get Analytical Report

Abstract

In big data world, Hadoop Distributed File System (HDFS) is one of the famous file system to store huge data. HDFS will take care about managing and maintaining the data in distributed way. Based on research we did to discuss that how the real time streaming data can be processed and stored into Mongo DB and Hive. Big data analytics can be performed on data stored on Hadoop distributed file system using Apache Hive, Tez and Apache Presto. Hive is an ecosystem which is on top of Hadoop (MapReduce), and provides higher-level language to use Hadoop's core component MapReduce to process the data. The key benefits of this approach are it can able to store and process the large amount of data. It can also handle the millions of user requests concurrently. It can provide the scalability for the system is enhanced by adding new nodes. Integrating the Visualization tools with Big Data applications will give the big picture to the users to view the insights of the Big data. It can provide the analytic reports for giving the big picture about the system.

Keywords:

Big data Computer science Scalability Distributed File System File system Database Analytics Process (computing) Operating system Distributed database Stream processing

Metrics

Cited By

3.98

FWCI (Field Weighted Citation Impact)

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Cloud Computing and Resource Management

Physical Sciences → Computer Science → Information Systems

Advanced Data Storage Technologies

Physical Sciences → Computer Science → Computer Networks and Communications

IoT and Edge/Fog Computing

Physical Sciences → Computer Science → Computer Networks and Communications

Real time streaming data storage and processing using storm and analytics with Hive

Abstract

Metrics

Citation History

Topics

Related Documents

Real-Time Data Processing With Storm: Using Twitter Streaming

Real-Time Data Processing With Storm: Using Twitter Streaming

Real-Time Data Streaming and Processing using Synapse Analytics

Real-Time Data Streaming and Processing using Synapse Analytics

REAL TIME DATA PROCESSING USING STORM