Tag - HADOOP Cluster

Case Studies


Resolved – “Incompatible clusterIds in…” in Multi Node Hadoop Cluster Setup

Many startups and small companies, along with their customers, are currently working on data analytics, ML, AI and related solutions. Because of budget constraints, some of them prefer not to use cloud-based storage. Instead, to process ingested data, they build a basic data lake on HDFS. While doing so, they might encounter the exception "org.apache.hadoop.hdfs.server.common.Storage: java.io.IOException: Incompatible clusterIDs in /home/...." while starting the Name Node or Master Node in a multi-node Hadoop cluster. This may occur in the following scenarios: ...
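For context, Hadoop records a clusterID entry in the `current/VERSION` file of each NameNode and DataNode storage directory, and this exception is reported when those values disagree. The sketch below is a minimal illustration under that assumption, not the resolution described in the full post: it parses two such files and compares their clusterID entries. The function names are hypothetical and chosen only for this example.

```python
from pathlib import Path


def read_version_file(path):
    """Parse a Hadoop VERSION file (key=value lines, '#' comments) into a dict."""
    props = {}
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        # Skip blank lines and the leading timestamp comment.
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def cluster_ids_match(namenode_version, datanode_version):
    """Return True only when both files carry the same, non-missing clusterID."""
    nn_id = read_version_file(namenode_version).get("clusterID")
    dn_id = read_version_file(datanode_version).get("clusterID")
    return nn_id is not None and nn_id == dn_id
```

As a usage example, `cluster_ids_match("/path/to/name/current/VERSION", "/path/to/data/current/VERSION")` returns `False` when the two storage directories belong to differently formatted clusters. Both paths are placeholders for the directories configured in `dfs.namenode.name.dir` and `dfs.datanode.data.dir`.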


Data Ingestion phase for migrating enterprise data into Hadoop Data Lake

Big Data solutions help extract valuable information that supports accurate strategic business decisions. The exponential growth of digitalization, social media, telecommunications and similar sources is fueling enormous data generation everywhere. Before processing such huge volumes of data, we need an efficient distributed storage mechanism that can hold any form of data, from structured to unstructured. The Hadoop Distributed File System (HDFS) can be used efficiently as a data lake by installing it on a multi-node cluster....


Research Papers & Publications

1. Effective image analysis on Twitter streaming using Hadoop Eco System on Amazon Web Service EC2

We have published a research paper on Hadoop and its ecosystem, based on a real-time case study, in the “International Journal of Advanced Research in Computer Science and Software Engineering”.

ISSN: 2277 128X
Title: Effective Image Analysis on Twitter Streaming using Hadoop Eco System on Amazon Web Service EC2
Paper ID: V5I9-0359
URL: Effective image analysis on twitter streaming using Hadoop Eco System on Amazon Web Service EC2
Abstract: Twitter is becoming the...
