Big Data Resources

January in IoT: Building and Succeeding With IoT

Check out this month's compilation of IoT news and tutorials, including advice for getting started with IoT, building with Arduinos and RPis, and monetizing IoT data.

January 22, 2018

by Mike Gates

· 7,858 Views · 2 Likes

Messaging is a critical technology to execute the data pipelines that are a crucial part of application design. Learn about the different types of messaging.

January 19, 2018

by Jon Bock

· 86,288 Views · 8 Likes

Introduction to Azure Data Lake

In the first part of a series on the Azure Data Lake, get an understanding of the concept behind data lakes and learn how they work.

January 16, 2018

by Joydeep Das

· 22,855 Views · 3 Likes

Data Profiling With Oracle Data Mining

Data profiling is an important step to take before processing data. Learn how to do it with Oracle Data Mining, which is easily implemented in Oracle SQL Developer.

January 11, 2018

by Emrah Mete

· 9,695 Views · 5 Likes

3 Best IoT Frameworks for Beginners

With hundreds of available platforms to start your IoT project with, which one do you choose? We shed some light on the subject and look at three of the best choices.

January 10, 2018

by Raseel Bhagat

· 26,710 Views · 11 Likes

10 Best Frameworks and Libraries for AI

Look at some high-quality libraries that are used for artificial intelligence, their pros and cons, and some of their features.

January 10, 2018

by Anton Shaleynikov

· 182,323 Views · 30 Likes

Building an IoT Notification System

Here's a great way to build your own IoT notification system that sends alerts to multiple devices using an ESP8266, PushingBox, and relatively few lines of code.

Updated January 7, 2018

by Francesco Azzola

· 28,483 Views · 9 Likes

How Does Spark Use MapReduce?

Apache Spark does use MapReduce — but only the idea of it, not the exact implementation. Confused? Let's talk about an example.

January 4, 2018

by Anubhav Tarar

· 38,983 Views · 3 Likes

Ingesting IoT Sensor Data Into S3 With an RPI3

StreamSets Data Collector Edge is a lightweight agent used to create end-to-end data flow pipelines. We'll use it help stream data collected from a sensor.

December 30, 2017

by Rathnadevi Manivannan

· 10,444 Views · 4 Likes

2018 Big Data Predictions (Part 2)

Big data continues to get bigger, and is increasingly analyzed in the cloud or on the edge. Explore this and more intriguing information in this research article.

December 26, 2017

by Tom Smith

CORE

· 11,084 Views · 2 Likes

Installing the ELK Stack on AWS: A Step-by-Step Guide

Want to bring in the ELK stack for your AWS logging and monitoring needs? This guide will get you set up with the open source solution.

December 22, 2017

by Asaf Yigal

· 43,390 Views · 8 Likes

Elasticsearch for Dummies

Get to know the basics of Elasticsearch, its advantages, how to install it, and how to index documents using Elasticsearch.

December 20, 2017

by Alex Mailajalam

· 9,949 Views · 1 Like

DDD: Part I (Introduction)

Learn about DDD- Domain Driven Design- which focuses on software development through collaboration between technical experts and domain experts.

December 14, 2017

by M Yauri at-Tamimi

· 45,446 Views · 25 Likes

Using Jolt in Big Data Streams to Remove Nulls

Learn how to use Jolt code within your big data streams to remove null values with some example source data and JSON code.

December 9, 2017

by Tim Spann

CORE

· 11,851 Views · 3 Likes

Apache Kafka: How to Load Test With JMeter

In today's post, we take a look at how you can load test your Apache Kafka setup using JMeter. Read on for more details.

December 7, 2017

by Roman Aladev

· 21,498 Views · 8 Likes

2017 DevOps Surprises

You never know what the year will bring in development, like DevOps and Cloud's strong positive correlation that begets containers and microservices.

December 7, 2017

by Tom Smith

CORE

· 9,253 Views · 3 Likes

Sensor Data Quality Management Using PySpark and Seaborn

Learn how to check data for required values, validate data types, and detect integrity violation using data quality management (DQM).

December 2, 2017

by Rathnadevi Manivannan

· 11,270 Views · 5 Likes

AI and Machine Learning Trends for 2018: What to Expect

Now is the time to take a look at the most possible and promising machine learning and AI trends for the upcoming year and ask ourselves if we are ready for them.

December 1, 2017

by Dmitry Budko

· 51,880 Views · 15 Likes

Exploratory and Confirmatory Analysis: What's the Difference?

Learn about the differences and uses of exploratory data analysis and confirmatory analysis by considering the process a detective goes through.

November 29, 2017

by Shelby Blitz

· 18,587 Views · 1 Like

Quick Start With Apache Livy

Learn how to get started with Apache Livy, a project in the process of being incubated by Apache that interacts with Apache Spark through a REST interface.

November 28, 2017

by Guglielmo Iozzia

· 47,714 Views · 7 Likes

The Latest Big Data Topics