Apache Hive is a data warehouse infrastructure for querying and managing large data sets that reside in a distributed storage system. Hive is built on top of Hadoop and was originally developed at Facebook. Hadoop includes the Hadoop Distributed File System (HDFS) and MapReduce. It is not possible to store a large amount of data on…
Cloud computing refers to any situation in which computing is done in a remote location (out in the clouds) rather than on your portable device or desktop, with the computing power tapped over an internet connection. At a basic level, cloud computing is simply a means of delivering IT resources as services.
Hadoop is an open source framework from the Apache Software Foundation for writing and running distributed applications that process large amounts of data. It is well suited to high-volume data processing, such as searching and indexing huge data sets. It is simple, scalable, robust, and accessible.