Impala

Cloudera created Impala, first releasing it publicly in 2012, and donated it to Apache in 2016. In late 2017, Impala became a top-level Apache project. It provides a SQL query layer on top of data stored in HBase or in files on HDFS.
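
To make the idea of a SQL query layer concrete, here is a minimal sketch of a Python client submitting an aggregate query to Impala. It assumes the third-party impyla package, an Impala daemon reachable on port 21050 (a common default for client connections, though deployments vary), and a hypothetical table named web_logs with a page column. None of these names come from this page; they are placeholders for illustration.

```python
def build_top_pages_query(table: str, limit: int = 10) -> str:
    """Build a simple aggregate query of the kind a BI tool might send.

    `table` and the `page` column are hypothetical; adjust to your schema.
    """
    # Only allow plain identifiers so the table name cannot smuggle in SQL.
    if not table.replace("_", "").isalnum():
        raise ValueError(f"unsafe table name: {table!r}")
    if limit < 1:
        raise ValueError("limit must be positive")
    return (
        f"SELECT page, COUNT(*) AS hits FROM {table} "
        f"GROUP BY page ORDER BY hits DESC LIMIT {limit}"
    )


def run_query(host: str, sql: str, port: int = 21050):
    """Run `sql` against an Impala daemon and return all result rows.

    Requires the third-party `impyla` package. It is imported lazily so the
    query builder above can be used without it installed.
    """
    from impala.dbapi import connect  # provided by impyla

    conn = connect(host=host, port=port)
    try:
        cursor = conn.cursor()
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        conn.close()
```

Given a reachable cluster, `run_query("impalad.example.com", build_top_pages_query("web_logs"))` would return the result rows; the host name is a placeholder. The same SQL applies whether the table is backed by HDFS files or mapped onto HBase, which is the point of the query layer.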

Logo: ../_images/apache_impala-small.png

Website: https://impala.apache.org/

Repository: https://github.com/apache/impala

Byline: Apache Impala is the open source, native analytic database for Apache Hadoop. Impala provides low latency and high concurrency for BI/analytic queries on Hadoop (not delivered by batch frameworks such as Apache Hive).

License: Apache 2.0

Project age: 11 years 6 months

Backers: Apache (Governed by), Cloudera (Creator)

Size score (1 to 10, higher is better): 7.25

Trend score (1 to 10, higher is better): 4.5

Education Resources

| URL | Resource Type | Description |
| --- | --- | --- |
| https://impala.apache.org/docs/build/impala-3.4.pdf | Documentation | This is a link to Impala documentation in PDF. |

Git Commit Statistics

Statistics computed using Git data through November 30, 2022.

| Statistic | Lifetime | Last 12 Months |
| --- | --- | --- |
| Commits | 46,569 | 3,553 |
| Lines committed | 158,181,514 | 868,217 |
| Unique committers | 257 | 43 |
| Core committers | 5 | 13 |

Figure: Apache Impala monthly commits (../_images/apache_impala-monthly-commits.png)
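
This page does not describe how its statistics were derived, so the following is only a rough sketch of how commit and committer counts like these could be computed from Git history. It assumes input lines produced by `git log --pretty=format:%an%x09%aI` (author name, a tab, then the ISO 8601 author date), treats "committer" as the commit author, and approximates "last 12 months" as the 365 days before the cutoff. The function name `summarize` and all of these choices are assumptions. "Lines committed" and "core committers" are not covered, since their definitions are not given here.

```python
from datetime import datetime, timedelta, timezone


def summarize(log_lines, as_of):
    """Count commits and unique authors, lifetime and in the 365 days before `as_of`.

    Each line must look like "<author name>\t<ISO 8601 date with UTC offset>".
    `as_of` must be a timezone-aware datetime so comparisons are valid.
    """
    window_start = as_of - timedelta(days=365)
    stats = {
        "lifetime": {"commits": 0, "authors": set()},
        "last_12_months": {"commits": 0, "authors": set()},
    }
    for line in log_lines:
        line = line.strip()
        if not line:
            continue
        # Split on the last tab so author names containing tabs still parse.
        author, _, stamp = line.rpartition("\t")
        when = datetime.fromisoformat(stamp)
        if when > as_of:
            continue  # ignore commits made after the cutoff date
        stats["lifetime"]["commits"] += 1
        stats["lifetime"]["authors"].add(author)
        if when > window_start:
            stats["last_12_months"]["commits"] += 1
            stats["last_12_months"]["authors"].add(author)
    return {
        period: {"commits": s["commits"], "unique_committers": len(s["authors"])}
        for period, s in stats.items()
    }


# Made-up sample data, not taken from the Impala repository.
sample = [
    "Alice\t2015-03-01T10:00:00+00:00",
    "Bob\t2022-06-15T09:30:00+02:00",
    "Alice\t2022-11-29T23:00:00-08:00",
    "Carol\t2023-01-05T12:00:00+00:00",
]
cutoff = datetime(2022, 11, 30, 23, 59, 59, tzinfo=timezone.utc)
result = summarize(sample, cutoff)
```

Counting by author name is a simplification: the same person can appear under several names or email addresses, so a real analysis would likely need to normalize identities first.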

Similar Projects

| Project | Size Score | Trend Score | Byline |
| --- | --- | --- | --- |
| ClickHouse | 9.75 | 8.75 | ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time. |
| CouchDB | 6.25 | 5.25 | CouchDB is an open source NoSQL database. |
| Hive | 9.25 | 4.75 | The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive. |