Cosmos DB Stored Procedures – handling continuation

I recently did some work involving stored procedures in Cosmos DB. There are a few techniques to learn when our stored procedures handle large data sets. It is all about continuation. In this article, we’ll look at how to build a robust and scalable stored procedure. We’ll start with a naïve approach and then get …

Event Hubs ingestion performance and throughput

Azure Event Hubs is a data streaming platform offered as a service. At its core, it is an ingestion service. We’ve looked at Event Hubs as the ingestion end of Azure Stream Analytics in two recent articles (here & here). Here we look at the client-side performance of different techniques and scenarios. For instance, we’ll consider both AMQP & …

Implementing & Automating Azure Stream Analytics Pipeline

In our last article, we set out to build a resilient architecture around streaming events. In this article, we are going to build the solution. We are going to use an ARM template to automate the deployment. We will also go through the configuration of the different services. The goal of the architecture was to allow a …

Azure Databricks – Parsing escaping CSV files in Spark

In previous weeks, we’ve looked at Azure Databricks, Azure’s managed Spark cluster service. We then looked at Resilient Distributed Datasets (RDDs) & Spark SQL / Data Frames. We also looked at an example of a more tedious transformation prior to querying, using the H-1B Visa Petitions 2011-2016 data set (from Kaggle). Here, we’re going to look …