Impala
Cloudera first released Impala publicly in 2012 and donated it to the Apache Software Foundation in late 2015, when it entered the Apache Incubator. In late 2017, Impala became a top-level Apache project. It provides a SQL query layer on top of data stored in HBase or in files on HDFS.
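As a rough illustration of what "a SQL query layer on top of files on HDFS" can look like in practice, the sketch below builds an Impala `CREATE EXTERNAL TABLE` statement that points at files already sitting in an HDFS directory, and shows one possible way to run a query from Python. The table name, column names, HDFS path, and host are hypothetical. The `impyla` client is an assumption, not something this page prescribes; any Impala-compatible client would do.

```python
def external_table_ddl(table, columns, hdfs_dir, file_format="PARQUET"):
    """Build a CREATE EXTERNAL TABLE statement over files already in HDFS.

    `columns` is a list of (name, sql_type) pairs. All names used in the
    examples are hypothetical and chosen only for illustration.
    """
    cols = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} ({cols}) "
        f"STORED AS {file_format} LOCATION '{hdfs_dir}'"
    )


def run_select(sql, host="impalad.example.com", port=21050):
    """Run a SELECT against an Impala daemon and return all rows.

    Assumes the third-party `impyla` package is installed and that `host`
    (a placeholder here) accepts HiveServer2 connections on `port`.
    """
    # Imported lazily so the DDL helper above works without impyla installed.
    from impala.dbapi import connect

    conn = connect(host=host, port=port)
    try:
        cur = conn.cursor()
        cur.execute(sql)
        return cur.fetchall()
    finally:
        conn.close()


# Hypothetical table over Parquet files stored under /data/web_logs.
ddl = external_table_ddl(
    "web_logs",
    [("ts", "TIMESTAMP"), ("url", "STRING")],
    "/data/web_logs",
)
print(ddl)
```

Once such a table exists, a call like `run_select("SELECT COUNT(*) FROM web_logs")` would query the files in place; the data is not copied into a separate store.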
Logo | |
---|---|
Website | |
Repository | |
Byline | Apache Impala is the open source, native analytic database for Apache Hadoop. Impala provides low latency and high concurrency for BI/analytic queries on Hadoop (not delivered by batch frameworks such as Apache Hive). |
License | Apache 2.0 |
Project age | 10 years 10 months |
Backers | Apache (Governed by), Cloudera (Creator) |
Size score (1 to 10, higher is better) | 7.25 |
Trend score (1 to 10, higher is better) | 3.0 |
Education Resources

URL | Resource Type | Description |
---|---|---|
| | Documentation | This is a link to Impala documentation in PDF. |

Git Commit Statistics

Statistics computed using Git data through March 31, 2022.

Statistic | Lifetime | Last 12 Months |
---|---|---|
Commits | 38,936 | 4,441 |
Lines committed | 154,371,062 | 1,673,939 |
Unique committers | 246 | 47 |
Core committers | 5 | 14 |

Similar Projects

Project | Size Score | Trend Score | Byline |
---|---|---|---|
ClickHouse | 9.75 | 8.75 | ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time. |
CouchDB | 6.0 | 4.0 | CouchDB is an open source NoSQL database. |
Apache Hive | 8.75 | 4.0 | The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. |