Apache Drill lets you query non-relational datastores. It supports HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS, and local files. Its aggregation and query mechanism can even join data across multiple database backends. The query plan optimizer is datastore-aware and uses each backend's native filter and query mechanisms where they are available.
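As a rough illustration of a cross-backend join, the sketch below builds a request for Drill's REST query endpoint using only the Python standard library. It assumes a Drillbit is listening on localhost port 8047 and that storage plugins named `mongo` and `dfs` are enabled. The database, file path, and column names are hypothetical and would need to match your own data.

```python
import json
import urllib.request

# Assumption: a local Drillbit exposing its REST API on the default web port.
DRILL_URL = "http://localhost:8047/query.json"

# Hypothetical example: join a MongoDB collection with a JSON file on the
# file system. The plugin names (mongo, dfs), the path, and the columns are
# placeholders, not taken from the Drill project.
CROSS_BACKEND_SQL = """
SELECT u.name, o.total
FROM mongo.shop.`users` AS u
JOIN dfs.`/data/orders.json` AS o
  ON u.user_id = o.user_id
WHERE o.total > 100
""".strip()


def build_request(sql, url=DRILL_URL):
    """Build (but do not send) a POST request carrying one SQL query."""
    body = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql, url=DRILL_URL):
    """Send the query to Drill and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(sql, url)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `run_query(CROSS_BACKEND_SQL)` against a running Drillbit should return a JSON object whose `rows` entry holds the result rows. How much of the `WHERE` clause is pushed down to each backend depends on the storage plugin in use.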
Recent Releases
1.0.0, 21 May 2015 00:25
major feature:
Create an abstraction for repeated value vectors.
Ensure DrillScanRel differentiates skip-all, scan-all, and scan-some in a backward-compatible fashion.
ScanBatch throws a NullPointerException instead of returning OUT_OF_MEMORY.
AbstractRecordBatch.buildSchema() should properly handle OUT_OF_MEMORY outcome.
Add support for context UDFs: user (and its synonyms session_user, system_user) and current_schema.
RootExec implementations should properly handle IterOutcome.OUT_OF_MEMORY.
properly handle OutOfMemoryException.
AllocationHelper.allocateNew() doesn't have a consistent behavior when it can't allocate.