Big Data Resources

Modify Kafka Topic Partitions Count and An Example of Partition Reassignment in Strimzi

I started learning how we can increase the number of Kafka topic partitions and also how we reassign partitions of topics to different replicas of the Kafka setup.

August 4, 2021

by Chandra Shekhar Pandey

· 14,364 Views · 2 Likes

Stream Analytics and Workflow Engines

The following article will show you how to solve some of the challenges related to stream analytics and workflow engines with the help of workflow technology.

July 30, 2021

by Ralph Soika

· 5,493 Views · 3 Likes

A Beginner's Guide to Machine Learning: What Aspiring Data Scientists Should Know

Learn all about machine learning and it's different subsets, such as supervised learning, unsupervised learning, and the subsets within those tops.

Updated July 30, 2021

by Manoj Rupareliya

· 25,958 Views · 20 Likes

Building a Database Written in Node.js From the Ground Up

Node is lightweight and scalable, allows us to develop quickly, and npm has incredible packages. Read the tutorial to find out more!

Updated July 27, 2021

by Margo McCabe

· 19,095 Views · 9 Likes

Running Apache Spark on Kubernetes

This article covers using Spark on K8s to overcome dependency on cloud providers and running Apache Spark on Kubernetes.

July 26, 2021

by Ramiro Alvarez Fernandez

· 11,631 Views · 3 Likes

Top 5 AI and Machine Learning Trends For 2022

Here are some top trends that your business should start preparing for now.

July 24, 2021

by Joydeep Bhattacharya

CORE

· 19,743 Views · 7 Likes

Kafka-Streams - Tips on How to Decrease Re-Balancing Impact for Real-Time Event Processing On Highly Loaded Topics

Need to reduce re-balancing time on Kafka consumer group during deployment and understand pitfalls of Kafka Streams? Read this article to learn about the factors that affect re-balance latency.

July 24, 2021

by Vasyl Sarzhynskyi

· 44,444 Views · 6 Likes

Explaining Cloud Computing in Layman Terms

This post explains how cloud computing uses the internet to store, access, and secure applications, data servers, and networking hardware and software.

July 23, 2021

by Usama Amin

· 9,633 Views · 3 Likes

Next-Gen Data Pipes With Spark, Kafka and k8s

This article examines the architecture patterns and provides some sample code for the readers to implement in their own environment.

July 23, 2021

by Subhendu Dey

· 13,263 Views · 4 Likes

Data Platform: Data Ingestion Engine for Data Lake

In this article, learn how to design and build an automated Data Ingestion Engine based on Spark and Databricks features.

July 22, 2021

by Miguel Garcia

CORE

· 12,622 Views · 6 Likes

Popular JavaScript TreeGrid Components for Productive Data Management

A side-by-side review of commercial JavaScript TreeGrid components

July 21, 2021

by Ivan Petrenko

· 5,797 Views · 2 Likes

5 Examples of How Big Data in Delivery Service Can Transform Logistic Industry

Learn how big data influences the logistic industry in 2021. Raise your delivery service profit, acquire prestige, and scale by implementing state-of-art big data solutions.

July 20, 2021

by Ilon Adams

· 11,160 Views · 9 Likes

AWS Data Pipeline vs Glue vs Lambda: Who Is a Clear Winner?

In this article, you will see a comparison between AWS Data Pipeline, Glue and Lambda

July 17, 2021

by Alex Jordan

· 26,514 Views · 2 Likes

Sensors and Actuators in IoT - Enabling Industrial Automation

In IoT, automation is enabled by connecting data to a machine. Sensors and actuators in IoT represent these two end points of the system.

July 16, 2021

by Madhuri Jadhav

· 24,294 Views · 2 Likes

What Is Data Locality?

This article covers leveraging Data Locality in your Big Data processing.

July 16, 2021

by Tomasz Lelek

· 11,480 Views · 4 Likes

Snowflake vs. Redshift: Which Cloud Data Warehouse Is Right for You?

Analysis of Snowflake and Redshift's scalability, performance, support, security, and more to help determine which one is the best fit for your business.

July 7, 2021

by Ben Putano

· 6,392 Views · 3 Likes

EC2 Instance Types: the Good, the Bad, and the Ugly

Let's explore EC2 instance types and choose ones that offer the best price-performance combo. Let's discuss the best practices for AWS cost optimization.

July 1, 2021

by Vito Clover

· 7,686 Views · 3 Likes

The SOC Technology Stack: XDR, SIEM, WAF, and More

A SOC is composed of a wide range of processes and technologies, as well as a team of security experts. The team often employs automation to support their efforts.

June 27, 2021

by Eddie Segal

· 11,601 Views · 2 Likes

How Carbon Uses PrestoDB With Ahana to Power Real-Time Customer Dashboards

Why ad tech company Carbon chose Presto on AWS for SQL data lake analytics.

June 25, 2021

by Jordan Hoggart

· 13,354 Views · 6 Likes

Integration Patterns in Microservices World

In this in-depth article, architects and developers can use this information to guide their integration solutions.

June 24, 2021

by Jignesh Karia

· 33,737 Views · 19 Likes

The Latest Big Data Topics