Simon Willison’s Weblog

Subscribe

14th November 2023

TIL Summing columns in remote Parquet files using DuckDB — [vivym/midjourney-messages](https://huggingface.co/datasets/vivym/midjourney-messages) on Hugging Face is a large (~8GB) dataset consisting of 55,082,563 Midjourney images - each one with the prompt and a URL to the image hosted on Discord.

This is a beat by Simon Willison, posted on 14th November 2023.

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe