Five phases. Day-one shippable.
Phase 0 done. Phases 1 & 2 active. The rest is dated against design partner traction, not vibes.
The format ships. Recordings become questionable files.
The thing exists. Recordings go in. Sealed .0wav files come out, ready to question without ever re-running anything.
Sub-second follow-ups against the cached profile
The "second question is free" promise gets its measurable proof point. Every follow-up returns in under a second, against the profile already inside the file.
Behavioral query layer
Cadence, talk time, sentiment by speaker. The questions journalists, analysts, and operators actually want to ask — answered in the file.
Multimodal v2.0 — audio plus video
The cues a transcript cannot carry — gaze, posture, breath, room tone — added to the same second-by-second timeline as the words. The .0wav becomes a behavioral container, not just an audio one.
Storage gets smaller. Signal stays whole.
As storage gets cheaper, the file gets cheaper to keep. Behavioral signal stays untouched.
Streaming
Built for the day a million recordings hit the pipeline at once. The format does not change. The throughput does.