Monthly Archive: May 2018

Apache Spark: Python vs Scala

In big data analysis, Apache Spark is one of the most popular frameworks. Apache Spark is written in Scala. It has APIs for Scala, Python, Java, and R, so we can work with any of

Hive: A Warehousing Tool

Hive is a data warehouse infrastructure tool used for processing structured data in Hadoop. Primarily used to summarize and manage big data, Hive makes querying and analysis easy. Hive data warehouse software facilitates querying and managing large datasets residing

Let’s Understand Data Lake, Data Warehouse and Database

“Data lakes, data warehouses, and databases”: all of these are terms used in data management. But what exactly do they mean, and are they the same or different from one another? Let’s explore in this article.

What is a Data Lake?

Data lakes are becoming increasingly important as people, especially in business and technology, want to perform broad data exploration and discovery. Bringing all of the data, or most of it, together into a single place can be

How to Build Big Data Analytics Infrastructure

Ref: https://www.datasciencecentral.com/profiles/blogs/big-data-analytics-infrastructure Big data can bring huge benefits to businesses of all sizes. However, as with any business project, proper preparation and planning are essential, especially when it comes to infrastructure. Until recently it was hard for companies

Everything about Kotlin

Kotlin is a general-purpose, open-source, statically typed “pragmatic” programming language for the JVM and Android that combines object-oriented and functional programming features. It is focused on interoperability, safety, clarity, and tooling support. Kotlin originated at JetBrains,

10 Essential Books for Deep Learning

Deep learning is a significant part of what makes up the broader subject of machine learning. Still relatively new, its popularity is constantly growing and so it makes sense that people would want to read and learn

Understanding Fast Data and its Importance in an IoT-driven world

The Internet of Things, and now the Industrial Internet of Things, are both making a great impact on the world, so lots of people are analyzing the impact of the Internet of Things and the Industrial Internet of Things on

11 Most Significant Tips for Learning Python Programming

Stack Overflow data indicates the increasing use of Python — possibly encouraged by its data science friendliness — has driven it to new levels of popularity, making it the “fastest-growing major programming language.” That conclusion comes from

Let’s Learn R – Leading Tool for Machine Learning, Statistics, and Data Analysis

R is among the most widely used of all data analytics tools and languages, with a usage share of about 70%, because it is free, open-source software that is easily extended with a large number of packages. For these reasons, R programming