Add DATA-MODEL.md — comprehensive taxonomy doc for research team review
Covers all 5 pipeline stages, 8 cross-cutting systems, entity
relationships, content-addressing model, design principles,
and open research questions. Written for researchers and
architects who need to understand the full system without
reading code.