Format:
1 online resource (x, 248 pages)
Edition:
1st ed.
ISBN:
9781782161097
Series Statement:
Quick answers to common problems
Content:
Cover -- Copyright -- Credits -- About the Authors -- About the Reviewer -- www.PacktPub.com -- Table of Contents -- Preface -- Chapter 1: Developing Hive -- Introduction -- Deploying Hive on a Hadoop cluster -- Deploying Hive Metastore -- Installing Hive -- Configuring HCatalog -- Understanding different components of Hive -- Compiling Hive from source -- Hive packages -- Debugging Hive -- Running Hive -- Changing configurations at runtime -- Chapter 2: Services in Hive -- Introducing HiveServer2 -- Understanding HiveServer2 properties -- Configuring HiveServer2 high availability -- Using HiveServer2 Clients -- Introducing the Hive metastore service -- Configuring high availability of metastore service -- Introducing Hue -- Chapter 3: Understanding the Hive Data Model -- Introduction -- Using numeric data types -- Using string data types -- Using Date/Time data types -- Using miscellaneous data types -- Using complex data types -- Using operators -- Partitioning -- Partitioning a managed table -- Partitioning an external table -- Bucketing -- Chapter 4: Hive Data Definition Language -- Introduction -- Creating a database schema -- Dropping a database schema -- Altering a database schema -- Using a database schema -- Showing database schemas -- Describing a database schema -- Creating tables -- Dropping tables -- Truncating tables -- Renaming tables -- Altering table properties -- Creating views -- Dropping views -- Altering the view properties -- Altering the view as select -- Showing tables -- Showing partitions -- Show the table properties -- Showing create table -- HCatalog -- WebHCat -- Chapter 5: Hive Data Manipulation Language -- Introduction -- Loading files into tables -- Inserting data into Hive tables from queries -- Inserting data into dynamic partitions -- Writing data into files from queries -- Enabling transactions in Hive
Content:
Inserting values into tables from SQL -- Updating data -- Deleting data -- Chapter 6: Hive Extensibility Features -- Introduction -- Serialization and deserialization formats and data types -- Exploring views -- Exploring indexes -- Hive partitioning -- Creating buckets in Hive -- Analytics functions in Hive -- Windowing in Hive -- File formats -- Chapter 7: Joins and Join Optimization -- Understanding the joins concept -- Using a left/right/full outer join -- Using a left semi join -- Using a cross join -- Using a map-side join -- Using a bucket map join -- Using a bucket sort merge map join -- Using a skew join -- Chapter 8: Statistics in Hive -- Bringing statistics in to Hive -- Table and partition statistics in Hive -- Column statistics in Hive -- Top K statistics in Hive -- Chapter 9: Functions in Hive -- Using built-in functions -- Using the built-in User-defined Aggregation Function (UDAF) -- Using the built-in User Defined Table Function (UDTF) -- Creating custom User-Defined Functions (UDF) -- Chapter 10: Hive Tuning -- Enabling predicate pushdown optimizations in Hive -- Optimizations to reduce the number of map -- Sampling -- Chapter 11: Hive Security -- Securing Hadoop -- Authorizing Hive -- Configuring the SQL standards-based authorization -- Authenticating Hive -- Chapter 12: Hive Integration with Other Frameworks -- Working with Apache Spark -- Working with Accumulo -- Working with HBase -- Working with Google Drill -- Index
Additional Edition:
9781782161080
Additional Edition:
Also published as print edition 978-1-78216-108-0
Additional Edition:
Print version: Bansal, Hanish. Apache Hive Cookbook. Birmingham : Packt Publishing, c2016
Language:
English
URL:
Full text
(license required)