Blog - Dimajix Staging

Uncategorized

This is part 3 of a series on data engineering in a big data environment.…

Big Data PySpark Spark

This is part 2 of a series on data engineering in a big data environment.…

Big Data Spark

This is part 1 of a series on data engineering in a big data environment.…

Most attendees of dimajix Spark workshops seem to like the hands-on approach I am offering…

Amazon Elastic MapReduce (EMR) is something wonderful if you need compute capacity on demand. I…

Traditionally HDFS was the primary storage for Hadoop (and therefore also for Apache Spark). Naturally…

Working with PySpark: Currently, Apache Spark with its bindings PySpark and SparkR is the processing…

So the other day I wanted to look into using Druid as a reporting backend…

dominik_adm1n, 23 March 2016