id | 242643811 |
name | aws-pdf-textract-pipeline |
full_name | aeksco/aws-pdf-textract-pipeline |
html_url | https://github.com/aeksco/aws-pdf-textract-pipeline |
description | Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript |
created_at | Feb. 24, 2020, 4:08 a.m. |
updated_at | July 7, 2024, 1 p.m. |
pushed_at | June 5, 2024, 1:57 p.m. |
size | 1,738 |
stargazers_count | 162 |
watchers_count | 3 |
forks_count | 18 |
open_issues | 5 |
language | TypeScript |
awesome_list |
https://github.com/kalaiser/awesome-cdk
|