Simon Willison's Weblog

Subscribe

Extracting data from unstructured text and images with Datasette and GPT-4 Turbo. Datasette Extract is a new Datasette plugin that uses GPT-4 Turbo (released to general availability today) and GPT-4 Vision to extract structured data from unstructured text and images.

I put together a video demo of the plugin in action today, and posted it to the Datasette Cloud blog along with screenshots and a tutorial describing how to use it.

Posted 9th April 2024 at 11:03 pm

Recent articles

projects 516 ai 1808 datasette 452 datasette-cloud 47 openai 385 generative-ai 1600 gpt-4 43 llms 1566 vision-llms 82 structured-extraction 11

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe