Learn how to get started with Apache Livy, a project in the process of being incubated by Apache that interacts with Apache Spark through a REST interface.
What is Apache Kafka, and what can is be used for? Dive deep into what Apache Kafka is all about and learn how to create a Kafka cluster with three brokers.
Settle in to hear the story of a Dutch company that transformed manual elevator logging into a digital industry standard and opened the door for predictive maintenance.
Beacons, communications protocols, edge gateways, integrators and more, if you develop IoT solutions, here are the essential phrases for your hobby or career.
Buckle up for a 30-minute talk about the current state of IoT data and a demo that tackles MQTT, TLS, load balancing, session persistence, and plenty more.
Feature hashing is a valuable tool in the data scientist's arsenal. Learn how to use it as a fast, efficient, flexible technique for feature extraction that can scale to sparse, high-dimensional data.
InfluxDB was designed for time series data, and Elasticsearch wasn't. However, many people use Elasticsearch for this purpose. Is one database better than the other?
Some of the shortcomings of Elasticsearch can be overcome using some Logstash plugins. Check out how to use them to aggregate and index data into Elasticsearch.
Learn about reading data from different data sources such as Amazon Simple Storage Service (S3) and flat files, and writing the data into HDFS using Kafka in StreamSets.
While the rest of the world embraces smart city technology in fits and starts, here is where Amsterdam shines with open data and citizen-led startup initiatives.
By: Natasha Mashanovich, Senior Data Scientist at World Programming, UK Scorecard development describes how to turn data into a scorecard model, assuming tha...