Performing Big Data over Cloud on a Test-Bed

Vishal Dubey; Saumya Gupta; Sapeksh Garg

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 120 - Issue 10

Published: June 2015

DOI: 10.5120/21267-3872

Vishal Dubey, Saumya Gupta, Sapeksh Garg. Performing Big Data over Cloud on a Test-Bed. International Journal of Computer Applications. 120, 10 (June 2015), 49-52. DOI=10.5120/21267-3872

                        @article{10.5120/21267-3872,
                        author  = { Vishal Dubey and Saumya Gupta and Sapeksh Garg },
                        title   = { Performing Big Data over Cloud on a Test-Bed },
                        journal = { International Journal of Computer Applications },
                        year    = { 2015 },
                        volume  = { 120 },
                        number  = { 10 },
                        pages   = { 49--52 },
                        doi     = { 10.5120/21267-3872 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }

                        %0 Journal Article
                        %D 2015
                        %A Vishal Dubey
                        %A Saumya Gupta
                        %A Sapeksh Garg
                        %T Performing Big Data over Cloud on a Test-Bed
                        %J International Journal of Computer Applications
                        %V 120
                        %N 10
                        %P 49-52
                        %R 10.5120/21267-3872
                        %I Foundation of Computer Science (FCS), NY, USA

Abstract

Data analytics has been growing rapidly in a variety of application areas, such as business intelligence mining, that require processing huge amounts of data. The MapReduce programming paradigm lends itself well to these data-intensive analytics jobs, given its well-known ability to scale out and use several machines to process data in parallel. This paper presents a detailed analysis of big data over cloud computing with several MapReduce techniques, from both system and application aspects. We argue that such Mapper- and Reducer-based analytics give better results on a cloud platform, and that end users in this environment can use MapReduce applications to minimize the incurred cost while obtaining the best performance. From the implementation point of view, we describe the key issues and challenges of big data on the cloud as well as on a local system. Finally, we discuss the challenges encountered in implementing MapReduce functions over Hadoop and analyze standalone, clustered, and virtualized systems on our test-bed.
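
As a rough illustration of the Mapper and Reducer data flow that the abstract refers to, the sketch below counts words in plain Python. It is not the authors' implementation and does not use Hadoop: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and a thread pool merely stands in for the several machines that would process input splits in parallel.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(chunk):
    """Mapper: emit a (word, 1) pair for every word in one input chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(mapped_chunks):
    """Group intermediate values by key, the step that sits between map and reduce."""
    groups = defaultdict(list)
    for pairs in mapped_chunks:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: combine all values collected for one key."""
    return key, sum(values)


def word_count(chunks, workers=2):
    # Assumption: threads stand in for cluster nodes. This only illustrates
    # the data flow, not Hadoop's scheduling or HDFS storage.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(map_phase, chunks))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data on cloud", "big data on hadoop"]))
```

Each chunk is mapped independently, which is what allows the map step to be spread across workers; only the shuffle step needs to see all intermediate pairs at once.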

Index Terms

Computer Science

Information Sciences

Keywords

Cloud computing, Big data, Hadoop, HDFS, MapReduce, Virtualization