Hadoop and Spark – Big Data Technologies

Apache Hadoop is an open-source, scalable, and fault-tolerant framework written in Java. Hadoop captures more than 90% of the big data market. It efficiently processes large volumes of data on a cluster of commodity hardware and provides an efficient framework for running jobs across multiple nodes of a cluster. A cluster is a group of systems connected via a LAN. The basic Hadoop programming language is Java, … Continue reading Hadoop and Spark – Big Data Technologies

All You Need To Know About Kubernetes!

Google open-sourced the Kubernetes project in 2014. It is a portable, extensible, open-source platform for managing containerized workloads and services. It has a large, rapidly growing ecosystem. The name Kubernetes originates from Greek, meaning helmsman or pilot, and is the root of governor and cybernetic. K8s is an abbreviation derived by replacing the eight letters “ubernete” with “8”. Of course, containerized workflows can be complicated. … Continue reading All You Need To Know About Kubernetes!

Virtual Machines, Containers, and Dockers

There are tons of articles on the Internet about containers, VMs, and Docker. While reading them, I found that each one mentioned one or more features of the technology that none of the others covered. So I decided to write a post on these topics, summarizing the articles that I have read and the videos I … Continue reading Virtual Machines, Containers, and Dockers