Apache Drill Installation, Storage Plugins Configuration for HBASE, HIVE, CP and DFS for beginners.



Drill is a very useful query engine it provide the facility to use it for multi-purpose. Apache drill definition is “Apache Drill is a low latency distributed query engine for large-scale datasets, including structured and semi-structured/nested data. Drill is designed to scale to several thousands of nodes and query petabytes of data at interactive speeds that BI/Analytics environments require.”

Drill is also beneficial for short, interactive ad-hoc queries on large-scale data sets. Drill is capable of querying nested data in formats like JSON and Parquet and performing dynamic schema discovery. Drill does not require a centralized metadata repository in Hadoop.