Research Article

Performing Big Data over Cloud on a Test-Bed

by  Vishal Dubey, Saumya Gupta, Sapeksh Garg
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 120 - Issue 10
Published: June 2015
Authors: Vishal Dubey, Saumya Gupta, Sapeksh Garg
10.5120/21267-3872
PDF

Vishal Dubey, Saumya Gupta, Sapeksh Garg . Performing Big Data over Cloud on a Test-Bed. International Journal of Computer Applications. 120, 10 (June 2015), 49-52. DOI=10.5120/21267-3872

                        @article{ 10.5120/21267-3872,
                        author  = { Vishal Dubey,Saumya Gupta,Sapeksh Garg },
                        title   = { Performing Big Data over Cloud on a Test-Bed },
                        journal = { International Journal of Computer Applications },
                        year    = { 2015 },
                        volume  = { 120 },
                        number  = { 10 },
                        pages   = { 49-52 },
                        doi     = { 10.5120/21267-3872 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2015
                        %A Vishal Dubey
                        %A Saumya Gupta
                        %A Sapeksh Garg
                        %T Performing Big Data over Cloud on a Test-Bed%T 
                        %J International Journal of Computer Applications
                        %V 120
                        %N 10
                        %P 49-52
                        %R 10.5120/21267-3872
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Data analytics has been rapidly growing in a variety of application areas like mining business intellect for processing the huge amount of data. MapReduce programming paradigm adds itself well to these data-intensive analytics jobs, given its one of the well known ability to scale-out and force several machines to parallely process data. This paper introduces a detailed analysis of big data over cloud computing with several mapred techniques from system and application aspects. Here in this work we say that such Mapper and Reducer based analytics provide a better result over cloud platform. However, End-Users in this environment has the ability to use MapReduce applications to minimize the incurred cost, while obtaining the best performance. From the implementation point of view, we describe the key issues and challenges of big data on cloud and on local system as well. At last, the challenges come across in implementing MapReduce functions over Hadoop and the analysis of standalone, clustered and virtualized systems over our test-bed.

References
  • "What is Apache Hadoop," http://hortonworks. com /hadoop/,2011-2014
  • "Gartner IT Glossary," http://www. gartner. com/it-glossary/big-data/,2013.
  • "Fern Helper, Bringing big data to the enterprise ," http://www. ibm. com/software/ in/data/ bigdata/, January 2012
  • Cloudera. http://www. cloudera. com/.
  • J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Proc. of OSDI, 2004.
  • RedHat Enterprises virtualization workbook for student.
  • Big Data Processing in Cloud Computing Environments College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China
  • Virtualization in Linux a Key Component for Cloud Computing
  • "White Paper: Ten Things You Need to Know About Virtualization. " www. datacore. com
  • B. Golden. Virtualization for Dummies, Wiley: Hoboken, New Jersey: 2008.
  • Implementation of MapReduce Algorithm and Nutch Distributed File System in Nutch published by IJCA 2011
  • Pipeline computing. From Wikipedia en. wikipedia. org/wiki/Pipeline_(computing
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Cloud computing Big data Hadoop HDFS Map Reduce virtualization

Powered by PhDFocusTM