Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
-
Updated
Mar 5, 2020 - Scala
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Automated assistance for the schema development lifecycle
A tool to automatically infer columns data types in .csv files
Manifest-driven schema & ingestion for labeled property graphs (GSTL): define once in YAML/Python, ingest from CSV/SQL/RDF/SPARQL/API, load to ArangoDB, Neo4j, TigerGraph, FalkorDB, Memgraph, or NebulaGraph.
NoSQL Data Engineering
Cursor for Data — a local-first AI workspace that understands your datasets: discovers relationships, builds semantic models, and answers questions with DuckDB. CLI + browser app.
A Polars plugin for JSON schema inference from string columns using genson-rs.
TablePilot:本地优先的复杂表格智能分析工作台,把混乱 Excel/CSV/TXT 转化为质量修复计划、洞察卡片和可解释报告。 | Local-first messy table analysis workbench for repair plans, insight cards, and explainable reports over Excel/CSV/TXT files.
Schema inference for semistructured data using Formal Concept Analysis
Learn API schemas from live traffic and generate OpenAPI specs. Analyze MCP/JSON-RPC agent sessions for wasted calls. Zero-infrastructure local reverse proxy.
Progressive data shaping for Rust — the missing layer between serde_json::Value and your types
A tiny CLI that reads JSON and infers a clean, type-oriented YAML summary. Perfect for exploring APIs, documenting unknown data, or bootstrapping schemas.
Lattice is a JSON document store with real-time schema processing, indexing during ingestion, and a SQL-like interface for querying data
Auto-infer scraping schemas from pages with repeated content
ColumnCore is a high-performance analytical database system designed for beginners or small projects. It supports a rich SQL dialect, runs within the same process as the application, has a vectorized query execution engine, and uses a columnar storage format.
Enterprise auto-etl Technical Architecture focusing on Scalability and High Performance.
kafka, apache-kafka, data-profiling, schema-inference, data-contracts, avro, json, streaming, data-quality, data-engineering, cli, observability, python, hyperloglog
A deterministic engine that transforms messy, user-uploaded CSVs into clean, schema-compliant, import-ready data.
Add a description, image, and links to the schema-inference topic page so that developers can more easily learn about it.
To associate your repository with the schema-inference topic, visit your repo's landing page and select "manage topics."