Skip to content

MahmoodKhalil57/ProtoElamite

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Proto-Elamite Corpus Analysis

Computational analysis of the CDLI Proto-Elamite corpus (1,467 tablets from Susa, ~3100 BCE). Extracts testable knowledge about Proto-Elamite numerical systems, administrative technology, and livestock management.

Quick Start

# 1. Clone with the dataset submodule (sfu-natlang/pe-sign-value-data)
git clone --recurse-submodules https://github.com/MahmoodKhalil57/ProtoElamite
#    (already cloned? run: git submodule update --init)

# 2. Verify all 9 primary claims in one command
bun run verify-all.ts

Expected output: 9/9 claims PASS in ~0.1 seconds.

The Proto-Elamite sign-value data lives in the pe-sign-value-data/ submodule, pinned to sfu-natlang/pe-sign-value-data (SFU Natural Language Lab).

What This Project Found

Nine reproducible, falsifiable claims — all independently verified:

  1. N45 = 100 in decimal context (not 120 as previously assumed)
  2. N30C = 3 × N24 (capacity system base ratio)
  3. Standardized measures 6.1 and 150.1 — canonical vessel sizes
  4. Dual-system accounting convention — parallel decimal + capacity sub-totals
  5. Tabular bookkeeping with cross-footing — spreadsheet precursor at ~3100 BCE
  6. Animal production norm of 1.0 by-product per standard-tier animal (7 data points)
  7. Three-tier yield hierarchy matching sheep/goat/cattle biology
  8. Meluhha trade reference on P008239 (me-lu-ha = Indus Valley)
  9. zu-M003~B authorization formula (13 tablets, standardized closing mark)
  10. M054+M340 is an accounting identity (honest null result — NOT a recipe)

See CLAIMS.md for full evidence and falsification criteria.

What This Project Did NOT Find

Production recipes (metallurgical ratios, dye mordants, ceramic formulas, agricultural timings). The corpus is administrative — tablets record receipts, distributions, and censuses, not manufacturing processes. See GOAL-ASSESSMENT.md for the honest reframe.

Documentation

File Purpose
README.md This file — project entry point
CLAIMS.md 10 reproducible, falsifiable claims with evidence
FINDINGS.md All 25 detailed findings from the investigation
ROADMAP.md Research roadmap ordered by confidence
GOAL-ASSESSMENT.md What can and cannot be extracted from this corpus

Analysis Scripts

Verification (run these first)

Core analysis

Focused investigations

Numerical System (confirmed)

Decimal

  • N01 = 1
  • N14 = 10
  • N45 = 100 (confirmed — corrects prior N45=120 interpretation)
  • N34 = 60
  • N50 = 1000
  • N08 = 0.5 (fraction)

Capacity

  • N24 = 1 (base unit)
  • N30C = 3 × N24 (confirmed)
  • N30D = 15 × N24 (best hypothesis)
  • N39B = 75 × N24 (best hypothesis)
  • Base ratios: 1 : 3 : 15 : 75 (factors ×3 × 5 × 5)

Standardized measures

  • 6.1 = 1(N24) + 2(N30C) — small standard (15 tablets)
  • 150.1 = 2(N39B) + 1(N24) — large double-measure (44 tablets)

Sign Identifications (confirmed via Dahl 2005 + our analysis)

Sign Meaning Source
M362 Animal (livestock) Dahl 2005
M362+X Animal of household X Dahl 2005
M036 Grain container Dahl 2005
M260–M270 Beer vessels Dahl 2005
M269 Milk/butter/oil containers Dahl 2005
M288 (pu₂) Unit marker / totalizer Our analysis
M157 Document header (general accounts) Our analysis
M388 Section marker + semantic determiner Our analysis
M106/M106~A Animal by-product (capacity) Our analysis + Dahl context
M309~A Standard allocation (always qty=1) Our analysis
M102~E Standard pair (always qty=2) Our analysis
M206~B "me" phonetic (in me-lu-ha) Desset 2022
M301 "lu" phonetic Dahl + Desset
M263 "ha" phonetic Dahl + Desset

Requirements

  • Bun runtime (v1.3+): curl -fsSL https://bun.sh/install | bash
  • CDLI data: git clone https://github.com/sfu-natlang/pe-sign-value-data

Place this project directory next to pe-sign-value-data/ so scripts can find the corpus.

Project Structure

ProtoElamite/
├── README.md              (this file)
├── CLAIMS.md              (falsifiable claims)
├── FINDINGS.md            (detailed findings)
├── ROADMAP.md             (research roadmap)
├── GOAL-ASSESSMENT.md     (honest assessment)
├── CLAUDE.md              (project instructions)
├── verify-all.ts          (unified pipeline — start here)
├── parse-corpus.ts        (ATF parser)
├── analyze.ts             (syllabary tool)
├── [20+ analysis scripts] (focused investigations)
├── corpus-parsed.json     (cached corpus analysis)
└── pe-sign-value-data/    (CDLI corpus, git submodule)

Citing This Work

When referring to specific claims, cite the claim number and the verification script:

"N45 = 100 in decimal context (Claim 1, verified by P008031, P008136, P008019 via verify-all.ts)."

External Sources

License

Research code for academic/educational use. The CDLI corpus data is under its own license (see pe-sign-value-data).

About

Computational analysis of the CDLI Proto-Elamite corpus (1,467 tablets from Susa, ~3100 BCE) — extracts 9 reproducible, falsifiable claims about early numerical systems, administrative technology, and livestock management.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors