A running measurement system, not a whitepaper.
Alice captures keystroke-level temporal data and submitted text from daily writing sessions. Six signal families measure two orthogonal cognitive channels: motor/dynamical execution (via a native Rust engine) and semantic/propositional output (via deterministic text analysis). The architecture enforces six validity constraints by design, not by policy.
What is captured
Every writing session produces two data channels, recorded simultaneously:
- Key-down and key-up timestamps (millisecond precision)
- Character identity for each event
- Full temporal microstructure: inter-key intervals, hold durations, flight times
- Deletion events and revision sequences
- Pause architecture (location, duration, context)
- Final response text
- Word count and session duration
- Burst sequences (text produced between pauses)
- Revision density and deletion patterns
- The question that prompted the response (for context-aware analysis)
Neither channel alone is sufficient. A decline in lexical diversity with stable production fluency means vocabulary contraction. The same decline with slowed production means retrieval difficulty under cognitive load. You need both to distinguish them.
The signal engine
Every signal family — behavioral and semantic, including cross-session comparisons — is computed by a single native Rust engine compiled to a platform-specific binary. Rust is the single source of truth. There is no fallback implementation, no TypeScript shadow path, and no LLM anywhere in the signal pipeline. A measurement instrument cannot have two sources of truth.
Complexity and structure of the temporal process. How ordered or disordered your keystroke timing is, and at what scales.
The statistical shape of your keystroke timing distribution. How your motor system executes the physical act of typing.
The structural shape of how you wrote. Pause patterns, burst dynamics, revision behavior.
What you wrote, measured as density metrics. Propositional content, lexical choice, epistemic stance, cohesion structure.
How your current session relates to your own history. Longitudinal consistency and drift.
Seven-dimensional state vector derived from session summaries. Fluency, deliberation, revision, commitment, volatility, thermal, presence. Convergence is a derived composite (Euclidean distance from personal center in 7D space).
Each signal family has defined failure modes. When a session produces insufficient data for reliable estimation, the signal returns null with a typed error variant (InsufficientData, ZeroVariance, DegenerateValue). A missing measurement is better than a wrong one.
Calibration
A single writing session cannot distinguish a bad day from a trend. Alice uses within-person calibration to separate transient state (fatigue, illness, distraction) from trajectory shifts.
Brief calibration task before the main writing session. Establishes today's motor baseline: are your fingers fast or slow right now, independent of what you're thinking about?
Deviation is measured against your own accumulated history, not a population average. Your "normal" is defined by your data, not a cohort mean.
A single outlier is noise. A consistent drift over weeks is signal. The longitudinal record makes this distinction possible in ways that cross-sectional designs cannot.
Design constraints enforced by architecture
These are not guidelines. They are properties of the system that cannot be violated without changing the code.
Implementation
This is not a concept. It is a running system producing measurements every day. The gap between "running" and "validated" is the work that remains.