AI-powered ETL pipeline designed to extract, deduplicate, and digitize 25 years of noisy legacy business invoices using Python and Google Gemini.
-
Updated
Feb 17, 2026 - JavaScript
AI-powered ETL pipeline designed to extract, deduplicate, and digitize 25 years of noisy legacy business invoices using Python and Google Gemini.
Turn any CSV file into a queryable REST API instantly. Auto-detects schemas, supports filtering, full-text search, sorting, pagination, and field projection. Zero config — drop CSVs in a folder, get a JSON API.
GPU/CUDA-accelerated batch decoding of COBOL numeric fields (COMP-3/zoned) for high-throughput legacy-data ingest. Part of KOBOLD. Apache-2.0.
Add a description, image, and links to the legacy-data topic page so that developers can more easily learn about it.
To associate your repository with the legacy-data topic, visit your repo's landing page and select "manage topics."