Wiley India Pvt Ltd

Publication Year 2014

ISBN 9788126550517

ISBN-10 8126550511


Number of Pages 240
Language English

Computer programming

Big data has become an industry in itself, and Hadoop is the vehicle that extracts information from the growing massive data sets today's companies are holding. Big-data processing is a cross section of technical disciplines, including distributed systems (processing, storage and networking), systems deployment and management, data science and software development. Hadoop For Dummies educates readers about the new value of big data and how Hadoop can exploit that data to create real value.

Table of Contents

Introducing Hadoop and Seeing What It's Good For
Common Use Cases for Big Data in Hadoop
Setting Up Your Hadoop Environment
Storing Data in Hadoop - The Hadoop Distributed File System
Reading and Writing Data
MapReduce Programming
Frameworks for Processing Data in Hadoop - YARN and MapReduce
Pig - Hadoop Programming Made Easier
Statistical Analysis in Hadoop
Developing and Scheduling Application Workflows with Oozie
Hadoop and the Data Warehouse - Friends or Foes?
Extremely Big Tables - Storing Data in HBase
Applying Structure to Hadoop Data with Hive
Integrating Hadoop with Relational Databases Using Sqoop
The Holy Grail - Native SQL Access to Hadoop Data
Deploying Hadoop
Administering Your Hadoop Cluster
Ten Hadoop Resources Worthy of a Bookmark
Ten Reasons to Adopt Hadoop