Introduction
Drill is a very useful query engine it provide the facility to use it for multi-purpose. Apache drill definition is “Apache Drill is a low latency distributed query engine for large-scale datasets, including structured and semi-structured/nested data. Drill is designed to scale to several thousands of nodes and query petabytes of data at interactive speeds that BI/Analytics environments require.”
Drill is also beneficial for short, interactive ad-hoc queries on large-scale data sets. Drill is capable of querying nested data in formats like JSON and Parquet and performing dynamic schema discovery. Drill does not require a centralized metadata repository in Hadoop.
(more…)