Skip to Content
andy.cutler

andy.cutler

I do things with data platforms

andy.cutler
andy.cutler

I do things with data platforms

  • Home
  • Blog
  • GitHub
  • About
  • RSS
    • Home
    • Parquet
Optimisation

How does Serverless SQL Pools deal with different file schemas? Part 2 – Parquet

Andy June 30, 2022 1 Comments

Welcome to part 2 in this blog series in which we're looking at what happens when schema changes are made in source data lake files which is then queried by…

Optimisation

10 Billion Rows: Parquet File Size and Distribution When using CETAS

Andy July 5, 2021 0 Comments

When using Serverless SQL Pools to write data to Azure Storage/Data Lake Gen2 using the CREATE EXTERNAL TABLE AS SELECT (CETAS) syntax, the number of source rows and size of…

Recent Posts

  • Dopamine
  • 2025
  • Fabric Architecture: Azure Tenants
  • Microsoft Fabric Architecture
  • Beta Deployment Framework for Materialized Lake Views in Fabric

Recent Comments

  • Onur on Dopamine
  • Dave Wentzel on Dopamine
  • Dopamine – andy.cutler on 2025
  • ScottC on 2025
  • Andy Leonard on 2025

You Missed

Featured

Dopamine

Featured

2025

Fabric Featured

Fabric Architecture: Azure Tenants

Fabric Picks

Microsoft Fabric Architecture

andy.cutler

andy.cutler

I do things with data platforms

Copyright © All rights reserved | Blogus by Themeansar.