Image

Hadoop With NOSQL:-

Apache hadoop is the most popular open source tool developed by google for big data processing and most demanding in the market.

NoSQL simply means "Not Only SQL"that means NoSQL is bigger than the SQL. It supports both SQL and Non-SQL. Like Hadoop, NoSQL is also developed for the distributed and parallel computing. The difference is Hadoop is not a database system but is a software ecosystem that allows for massively parallel computing. But, NoSQL is created especially as a database framework. The NoSQL consists of mostly unstructured data. But the framework also supports SQL which is a relational database model. Since, most of the data in NoSQL is unstructured, NoSQL is very efficient by nature. The unstructured data can be stored in the multiple processing nodes, and often across multiple servers.

NoSql databases like HBASE lot of importance. HBASE is widely used over hadoop these days. 

HADOOP WITH AWS:-

What is AWS?:-

Amazon Web Services (AWS) is a Amazon.com subsidiary which offers cloud-computing services at very affordable rates, therefore making its customer base strong from small scale companies like Pinterest (which has just 5 employees) to big enterprises like D-Link.

Apache Hadoop:-

Hadoop also includes a distributed storage system, the Hadoop Distributed File System (HDFS), which stores data across local disks of your cluster in large blocks. HDFS is automatically installed with Hadoop on your Amazon EMR cluster, and you can use HDFS along with AmazonS3 to store your input and output data.

Hadoop allows clustering commodity hardware together to analyze massive data sets in parallel. There are many applications and execution engines in the Hadoop ecosystem, providing a variety of tools to match the needs of your analytics workloads. Amazon makes it easy to create and manage fully configured, elastic clusters of Amazon EC2 instances running Hadoop and other applications in the Hadoop ecosystem.