andy.cutler

I do things with data platforms

Optimisation

10 Billion Rows: Parquet File Size and Distribution When using CETAS

Andy, July 5, 2021

When using Serverless SQL Pools to write data to Azure Storage/Data Lake Gen2 using the CREATE EXTERNAL TABLE AS SELECT (CETAS) syntax, the number of source rows and size of…
