Product
Hive
Marketplace AppsGoogle Workspace Project Management
A distributed, fault-tolerant data warehouse system that enables analytics at a massive scale.
☆☆☆☆☆ 0.0 Based on 0 Reviews
Hive
Learn More
About the Hive
Apache Hive is an open-source data warehousing software project built on top of Apache Hadoop. It provides a SQL-like interface called HiveQL, allowing users to query, read, write, and manage petabytes of data residing in distributed storage systems like Hadoop Distributed File System (HDFS), Amazon S3, Azure Blob Storage, and Google Cloud Storage. Hive transforms HiveQL queries into batch processing jobs (MapReduce, Apache Tez, or Spark jobs) that run on Hadoop's distributed job scheduling framework (YARN). It's designed for offline batch processing and large-scale data analysis, making big data accessible to users familiar with SQL without needing to write complex low-level code.