Big Data Resources

Data Visualization Using Apache Zeppelin

Apache Zeppelin — an open-source data analytics and visualization platform — helps us analyze the data to gain insight and to improve and enhance business decisions.

October 17, 2017

by Kunal Nagar

· 32,259 Views · 2 Likes

51 Database Terms You Need to Know

Some of the biggest terms that you need to know when it comes to databases.

October 17, 2017

by Sarah Davis

· 47,949 Views · 28 Likes

Anomaly Detection With Kafka Streams

Learn how to perform anomaly detection using Kafka Streams with an example of a loan payment website that needs to send an alert if the payment is too high.

October 17, 2017

by Ajmal Karuthakantakath

· 18,346 Views · 5 Likes

Stream Processing With Apache Flink

See how to get started with writing stream processing algorithms using Apache Flink. by reading a stream of Wikipedia edits and getting some meaningful data out of it.

October 16, 2017

by Ivan Mushketyk

· 19,892 Views · 9 Likes

How Much Data Is Created on the Internet Each Day?

Some quick stats: 656 million tweets go out per day, and 15,220,700 texts are sent every minute. This makes for LOTS of data. Read on for more shocking stats!

October 16, 2017

by Jeff Schultz

· 16,083 Views · 3 Likes

API Response Tracking With StreamSets, Elasticsearch, and Kibana

Learn how to track JSON response data from a RESTful API using Elasticsearch and Kibana to capture and visualize the alerts.

October 15, 2017

by Rathnadevi Manivannan

· 12,319 Views · 3 Likes

Getting Started With Batch Processing Using Apache Flink

If you've been following software development news recently you probably heard about the new project called Apache Flink. I've already written about it a bit...

October 13, 2017

by Ivan Mushketyk

· 15,661 Views · 9 Likes

Multiple Datacenter Replication With InfluxDB

Learn the most basic patterns for replicating data between two different clusters of InfluxEnterprise so that you can go forth and replicate.

October 13, 2017

by Dave Patton

· 11,064 Views · 1 Like

Variable Selection and Big Data Analytics in Credit Score Modeling

The variable selection process in the credit score modeling process is critical to finding key information. Learn how to do it to get a good understanding of your data!

October 12, 2017

by Natasha Mashanovich

· 10,098 Views · 3 Likes

Deep Learning vs. Machine Learning

If you have often wondered to yourself about the difference between machine learning and deep learning, read on to get a detailed comparison in simple layman language.

October 12, 2017

by Faizan Shaikh

· 18,775 Views · 6 Likes

Real-Time Activity Tracking With Kafka

More than a third of the Fortune 500 companies now use Kafka in production — and for good reason. In this article, learn how to track real-time activity using Kafka.

October 12, 2017

by Chamath Kirinde

· 42,170 Views · 39 Likes

Keeping the Web API Layer in Kafka With a REST Proxy

Kafka is the quickest way I have seen to get started with real-time data streams. However, I've noticed many Apache products diverting from REST.

October 9, 2017

by Kin Lane

· 17,622 Views · 4 Likes

What Problems Do Microservices Solve?

Microservices allow for the decoupling of monolithic apps so that legacy enterprises can pursue their digital transformation.

October 9, 2017

by Tom Smith

CORE

· 8,975 Views · 4 Likes

Bending CAP Theorem in Geo-Distributed Deployments With CRDTs

Learn about CRDTs, which maintain the additional decrements between syncs and across multiple concurrent writes with full correctness.

Updated October 5, 2017

by Cihan B.

· 6,985 Views · 5 Likes

Data Science and Credit Scorecard Modeling Methodology

Data scientists are responsible for designing and developing accurate, useful, and stable models. This is especially important when it comes to credit risk models.

October 5, 2017

by Natasha Mashanovich

· 14,589 Views · 4 Likes

How to Train TensorFlow Models Using GPUs

GPUs can accelerate the training of machine learning models. In this post, explore the setup of a GPU-enabled AWS instance to train a neural network in TensorFlow.

October 4, 2017

by Kseniya Savitsina

· 62,260 Views · 5 Likes

Using Big Data and Predictive Analytics for Credit Scoring

Learn how data is analyzed and boiled down to a single value — a credit score — using statistical, machine learning, and predictive analytics techniques.

October 4, 2017

by Natasha Mashanovich

· 19,546 Views · 4 Likes

Comparison API for Apache Kafka

Learn about a variety of use cases for Kafka and Kafka's API — from from consuming and writing data to streams to more reactive approaches with Akka.

October 3, 2017

by Artem Rukavytsia

· 9,385 Views · 10 Likes

Processing Hierarchical Data Using Spark GraphX Pregel API

Learn about using the GraphX Pregel API, a very powerful tool that can be used to solve iterative problems and pretty much any graph computation.

September 30, 2017

by Suraj Bang

· 16,276 Views · 2 Likes

Top 12 AI Tools, Libraries, and Platforms

If you're looking to start an AI project but don't know where to start, check out this article. We've listed the top 12 AI tools, libraries, and platforms, what they are typically used for, what pros and cons they come with, and more!

September 27, 2017

by Sarah Davis

· 37,368 Views · 10 Likes

The Latest Big Data Topics