Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
updated at July 7, 2024, 1 p.m.
3 +0
162 +0
18 +0