Simon Willison’s Weblog

Items tagged gpt3, scraping in 2023

scrapeghost (via) Scraping is a really interesting application for large language model tools like GPT3. James Turk’s scrapeghost is a very neatly designed entrant into this space: it’s a Python library and CLI tool that can be pointed at any URL and given a roughly defined schema (using a neat mini schema language). It then uses GPT3 to scrape the page and tries to return the results in the supplied format. # 26th March 2023, 5:29 am
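
To make the general pattern concrete, here is a minimal sketch of schema-guided scraping with a language model. This is not scrapeghost's actual API or implementation: `build_prompt`, `scrape` and `fake_model` are hypothetical names invented for illustration, and the model call is replaced by a stub so the example runs without an API key. A real version would swap the stub for a call to a hosted model.

```python
import json
from typing import Callable


def build_prompt(schema: dict, html: str) -> str:
    """Combine the desired output shape and the page content into one prompt."""
    return (
        "Extract data from the HTML below and reply with JSON only, "
        "matching this schema:\n"
        f"{json.dumps(schema, indent=2)}\n\nHTML:\n{html}"
    )


def scrape(html: str, schema: dict, complete: Callable[[str], str]) -> dict:
    """Ask the model for JSON, then check it contains every top-level schema key."""
    raw = complete(build_prompt(schema, html))
    data = json.loads(raw)  # raises ValueError if the reply is not valid JSON
    if not isinstance(data, dict):
        raise ValueError("model response is not a JSON object")
    missing = set(schema) - set(data)
    if missing:
        raise ValueError(f"model response is missing fields: {sorted(missing)}")
    # Keep only the fields the schema asked for, dropping any extras.
    return {key: data[key] for key in schema}


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM call; always returns the same JSON."""
    return json.dumps(
        {"name": "Ada Lovelace", "role": "Mathematician", "extra": "ignored"}
    )


schema = {"name": "string", "role": "string"}
html = "<h1>Ada Lovelace</h1><p>Mathematician</p>"
result = scrape(html, schema, fake_model)
print(result)
```

The validation step is the important part of the design: the model only "tries" to return the supplied format, so the caller has to check the reply before trusting it.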
