BigData - Blog

Channel: BigData - Blog

Image may be NSFW.
Clik here to view.

Introduction To BigData

June 2, 2014, 11:17 pm

Big data is defined as data that is too big, fast & hard for existing tools to process. Here, “too big” means that now a days organizations have to deal with petabyte scale collections of data...

View Article

Image may be NSFW.
Clik here to view.

How Google Transformed Big Data To A Life Saver Technology?

June 2, 2014, 11:25 pm

In 2009 a new virus was discovered ,combining elements of bird flu & Seasonal flu the virus strain dubbed H1N1 and spread quickly similar to Spanish flu in 1918 that infected half billion and...

View Article

Image may be NSFW.
Clik here to view.

Applications Of Big Data

June 9, 2014, 12:29 am

Medical Records The extreme cost of healthcare in the U.S. can be reduced with the adoption of electronic patient medical records. Many companies are searching way out to explore through large...

View Article

Characteristics Of Big Data

June 13, 2014, 12:18 pm

Volume – The quantity of data that is generated is very important it is the size of the data which determines the value and potential of the data under consideration and whether it can actually be...

View Article

Big Data Analysis

June 24, 2014, 11:19 am

Big data analytics refers to the process of collecting, organizing and analyzing large sets of data ("big data") to discover patterns and other useful information. Big data analytics help in...

View Article

Relational Database Management System (RDBMS)

July 5, 2014, 12:47 am

Traditional RDBMS (relational database management system) have been the conventional standard for database management throughout the age of the internet. This is also known as Traditional row-column...

View Article

NoSQL

July 5, 2014, 12:49 am

NoSQL(also known as "Not Only SQL") represents a completely different framework of databases that allows high-performance, agile processing of information at massive scale i.e. it is a database...

View Article

Hadoop

July 5, 2014, 12:53 am

Apache Hadoop is an open source framework for writing and running distributed application that process large amounts of data. Their are some key distinction of Hadoop which give it an edge over...

View Article

Hadoop Distributed File System

July 22, 2014, 5:38 am

The Hadoop Distributed File System (HDFS) is the primary storage system used by Hadoop. HDFS provides high-performance access to data across Hadoop clusters. HDFS has become a key tool for managing...

View Article

Hadoop Cluster

July 22, 2014, 5:44 am

A Hadoop cluster is a special type of computational cluster designed specifically for storing and analyzing huge amounts of unstructured data in a distributed computing environment. These clusters run...

View Article

Map Reduce

August 1, 2014, 6:57 am

MapReduce is a software framework that allows developers to write programs to process massive amounts of unstructured data in parallel across a distributed cluster of processors or stand-alone...

View Article

Image may be NSFW.
Clik here to view.

Simple MapReduce Approach For Word Count

August 4, 2014, 7:01 am

The word count operation takes place in two stages a mapper phase and a reducer phase. In mapper phase first the sentence is tokenized into words then we form a key value pair with these words where...

View Article

How Big Data Helped UPS to save millions ?

August 4, 2014, 7:08 am

United Parcel Service Inc.(UPS) the world’s biggest package shipping company is using Big Data from customers, drivers and vehicles in a new route guidance system that will save time and money and...

View Article

Challenges in Big Data

August 12, 2014, 5:08 am

The challenges in Big Data are the implementation hurdles which require immediate attention. If these challenges are not handled they may lead to technology failure and also some unpleasant results....

View Article

New Technological Advancements In Big Data

August 14, 2014, 9:21 am

There are two new technological advancements in Big Data as mentioned below: Spark by Apache Quantum Computing

View Article

Spark

August 18, 2014, 3:20 am

Spark is new technology that is on the top of Hadoop Distributed File System (HDFS) that is characterized as “a fast and general engine for large-scale data processing.” Spark have few key features...

View Article

Quantum Computing

August 26, 2014, 6:46 am

Quantum computing may be the future of most high-end data centers. This is because as the demand to intelligently process a growing volume of online data grows so the limits of silicon chip...

View Article

IBM starting up with Big Data in India

September 22, 2014, 4:19 am

In the American crime drama series Person of Interest, a machine predicts whether a person can be a victim or a perpetrator of a crime. Then it's up to a data scientist to find that person and prevent...

View Article

Architecture Of Apache Hadoop

December 12, 2014, 7:10 am

Apache Hadoop has two pillar1.YARN - Yet Another Resource Negotiator (assigns CPU, memory, and storage to applications running on a hadoop cluster. The first generation of Hadoop could only run...

View Article

Scheduling In Apache Hadoop

December 19, 2014, 9:44 am

Apache Hadoop by default uses FIFO scheduling (That I will explain you In my coming post) and 5 scheduling priorities to schedule jobs from job queue(I think we should sometime arrange operating system...

View Article

More Pages to Explore .....

Latest Images