admin's picture

Big Data and Data Science Notes

Here I will include notes on Big Data and Data Science concepts in general. There will be separate books on specific technologies like Hadoop.  

admin's picture

Apache Zookeeper Notes

Apache ZooKeeper is a software project of the Apache Software Foundation, providing an open source distributed configuration service, synchronization service, and naming registry for large distributed systems.

admin's picture

RDBMS and SQL Notes

A relational database management system (RDBMS) is a database management system (DBMS) that is based on the relational model. Structured Query Language (SQL) is a special-purpose programming language designed for managing data held in a RDBMS.

admin's picture

Apache Kafka Notes

Apache Kafka is an open source publish-subscribe based distributed messaging system. From the architecture perspective, Kafka is closer to traditional messaging systems such as ActiveMQ or RabitMQ. However from a Big Data and Hadoop perspective, Kafka can be compared with Scribe or Flume as it is useful for processing activity stream data.

admin's picture

Why do we need Big Data technologies?

Before learning about big data, let us quickly see the motivations for Big Data and similar technologies.

We saw the three V's of Big data already and not surprisingly, the motivations for Big Data and similar technologies are also based on these three V's: Variety, Velocity and Volume.

Pages

About US

We are few software engineers who are working on, but  still trying to learn more on Data technologies like Big Data, Hadoop etc. The blog section is actually our learning notes.

Contact

Please first use the contact form for any queries. You may also use the contact numbers below for urgent queries.
Tel 1: +91-9916711099
Tel 2: +91-8884636274