What Alice measures and how.

Six signal families computed from every writing session. Each signal has a research basis, a defined failure mode, and an honest validation status. This page is the instrument card.

Validation tiers

Validated

Published method, replicated across multiple studies. Applied as described in the source literature.

Adapted

Published method applied to a new context (keystroke timing from ecological writing rather than lab tasks). The method is sound; the application is novel.

Experimental

Novel to Alice. Informed by literature but not a direct implementation of a published method. Treated as hypothesis, not established measurement.

Dynamical signals

Nonlinear complexity and structure of the keystroke timing series. Treats the inter-keystroke interval (IKI) sequence as output of a complex adaptive system. Organized into nine theoretical sub-families spanning complexity, scaling, causality, and mode decomposition.

Computed in: Rust native engine (dynamical.rs)

Permutation entropy

Permutation entropy spectrum Validated

Complexity of the IKI series at multiple temporal scales (orders 3-7). High PE = unpredictable rhythm. The spectrum separates local complexity from global structure.

Bandt & Pompe 2002

Weighted permutation entropy Validated

Amplitude-weighted PE that captures both ordinal pattern frequency and the magnitude of fluctuations. More sensitive to large IKI excursions than standard PE.

Fadlallah et al. 2013

Statistical complexity Validated

Jensen-Shannon complexity: the product of entropy and disequilibrium. Distinguishes structured complexity from randomness. Maximum at the edge between order and disorder.

Rosso et al. 2007

Forbidden pattern fraction Validated

Proportion of ordinal patterns that never occur. Deterministic systems forbid patterns; stochastic systems explore them all. Sensitive to low-dimensional structure in the IKI series.

Amigó et al. 2007

Lempel-Ziv complexity Validated

Algorithmic complexity of the symbolized IKI series via LZ76 parsing. Measures compressibility of the temporal sequence. Complements PE by capturing non-ordinal structure.

Lempel & Ziv 1976

Detrended fluctuation analysis

DFA alpha Validated

Detrended fluctuation analysis. Self-affine scaling exponent of the IKI series. Alpha near 0.5 = white noise (no temporal memory). Alpha > 1 = long-range correlations (persistent rhythm).

Peng et al. 1994

MF-DFA spectrum width Validated

Width of the multifractal singularity spectrum. Measures richness of scaling behavior. Wide spectrum = multiple coexisting temporal dynamics. The most ghost-resistant behavioral dimension by 4x: the reconstruction adversary cannot reproduce multifractal structure from a single stochastic process.

Kantelhardt et al. 2002

MF-DFA asymmetry and peak alpha Validated

Asymmetry of the singularity spectrum (left- vs right-skewed multifractal structure) and the Holder exponent at the spectrum peak. Together they characterize whether large or small fluctuations dominate the multifractal signature.

Kantelhardt et al. 2002

Temporal irreversibility Validated

Statistical asymmetry under time reversal. Nonzero irreversibility implies the IKI series is generated by a dissipative, non-equilibrium process rather than a passive noise source.

Lacasa et al. 2012

Spectral analysis

IKI power spectral density Adapted

Five spectral features from Welch PSD of the IKI series: spectral slope (1/f exponent), respiratory-band peak frequency, peak typing frequency, low/high frequency ratio, and fast/slow variance ratio. Spectral slope distinguishes random (flat) from self-organized (steep) timing. The respiratory peak detects physiological entrainment of typing rhythm.

Welch 1967; adapted to keystroke domain

Ordinal pattern transition network

OPTN transition entropy and forbidden transitions Adapted

Entropy of the transition matrix between successive ordinal patterns, and the count of transitions that never occur. Captures temporal dependencies that standard PE discards by treating patterns as independent.

Small 2013

Recurrence quantification

RQA metrics Validated

Six metrics from reconstructed phase-space attractor: determinism (repeatable patterns), laminarity (laminar states), trapping time (average diagonal length), recurrence rate (overall self-similarity), recurrence time entropy, and mean recurrence time.

Webber & Zbilut 2005

Recurrence networks

Recurrence network topology Validated

Graph-theoretic analysis of the recurrence matrix: transitivity (clustering of recurrent states), average path length (separation between attractor regions), clustering coefficient, and assortativity (degree correlation). Reveals geometric structure of the phase-space attractor that RQA diagonal/vertical line statistics miss.

Donner et al. 2010

Causal emergence

Effective information and causal emergence index Adapted

Effective information (EI) quantifies determinism and degeneracy of temporal transitions. The causal emergence index (CEI) measures whether coarse-graining the IKI series increases its causal structure. Optimal causal scale identifies the temporal resolution at which the system is most informative about itself.

Hoel 2017

PID synergy and redundancy Adapted

Partial information decomposition of hold and flight time contributions to IKI prediction. Synergy = information available only when both channels are observed together. Redundancy = information available from either channel alone. High synergy suggests integrated motor-cognitive control.

Williams & Beer 2010

Criticality

Branching ratio and avalanche exponent Adapted

Branching ratio measures propagation of activity (IKI exceedances). Ratio near 1.0 = critical dynamics (self-organized). Avalanche size exponent characterizes the power-law distribution of burst cascades. Together they test whether the keystroke process operates near a critical point.

Beggs & Plenz 2003

Dynamic mode decomposition

DMD modes Adapted

Data-driven decomposition of the IKI series into spatial-temporal modes: dominant oscillatory frequency, decay rate (damping vs growth), active mode count, and modal spectral entropy. Extracts the dominant rhythmic components without assuming stationarity.

Schmid 2010

Pause mixture model

Gaussian mixture pause decomposition Experimental

IKI distribution decomposed into Gaussian mixture components via BIC-selected model order. Component count reflects the number of distinct timing regimes. Motor proportion identifies the fast (automatic) component. Cognitive load index measures the relative weight of slow (deliberative) components.

Novel; informed by pause classification literature

Transfer entropy

Transfer entropy Adapted

Information flow between hold times and flight times using the KSG estimator. Measures causal coupling between motor execution (hold) and cognitive planning (flight). Dominance indicates which channel drives the other.

Schreiber 2000; Kraskov et al. 2004

Motor signals

Statistical shape of the keystroke timing distribution. How the motor system executes the physical act of typing, independent of what is being written.

Computed in: Rust native engine (motor.rs)

Ex-Gaussian parameters (mu, sigma, tau) Validated

IKI distribution decomposed into Gaussian (mu, sigma) and exponential tail (tau) via MLE/EM. Tau captures attentional lapses. Tau proportion = tau / (mu + tau).

Lacouture & Cousineau 2008; Zulueta et al. 2018 (BiAffect)

Sample entropy Validated

Regularity measure of the IKI series. Low = rhythmic, metronomic. High = chaotic, irregular. More robust than approximate entropy for short series.

Richman & Moorman 2000

Motor jerk Adapted

Third derivative of the IKI position series. Measures smoothness of the temporal trajectory. High jerk = abrupt rhythm changes.

Flash & Hogan 1985 (minimum jerk model)

Lapse rate Adapted

Proportion of IKIs exceeding mu + 3 sigma. Captures attentional lapses as statistical outliers in the timing distribution.

Derived from psychomotor vigilance task methodology

Tempo drift Experimental

Linear trend in IKI over session duration. Negative = speeding up (warming up). Positive = slowing down (fatigue).

Novel; informed by fatigue literature

Digraph latency profile Validated

Hold and flight times per bigram (key pair). Biometrically stable across sessions. Used for motor fingerprint stability tracking.

Monrose & Rubin 2000; keystroke biometrics literature

IKI compression ratio Adapted

Lempel-Ziv complexity of the quantized IKI series. Low = repetitive rhythm. High = complex temporal structure.

Lempel & Ziv 1976

Adjacent hold-time covariance Adapted

Correlation between consecutive hold times. Captures finger coordination and motor planning continuity.

Kim et al. 2024

Multiscale entropy (MSE) and complexity index Validated

Sample entropy computed at multiple coarse-graining scales. The complexity index (area under the MSE curve) quantifies overall multi-scale regularity. Healthy systems show high complexity across scales; degraded systems lose complexity at coarse scales first.

Costa et al. 2002

Ex-Gaussian Fisher trace Adapted

Trace of the Fisher information matrix at the MLE/EM parameter estimates. Measures estimation precision: high trace = tight confidence region around (mu, sigma, tau). Low trace = flat likelihood, unreliable parameter estimates. Provides a per-session quality gate for ex-Gaussian parameters.

Fisher information; adapted to ex-Gaussian MLE

Hold-flight rank correlation Adapted

Spearman rank correlation between hold duration and subsequent flight time. Measures the monotonic coupling between motor execution and transition planning, robust to distributional outliers.

Adapted from motor coupling literature

Process signals

Structural shape of the writing session from event log replay. How the text was produced, revised, and structured over time.

Computed in: Rust native engine (process.rs)

Pause classification Validated

Pauses categorized by location: within-word (lexical retrieval), between-word (local planning), between-sentence (high-level planning). Distribution reveals cognitive load allocation.

Wengelin 2006; Deane 2015

R-burst / I-burst classification Validated

Revision bursts (ending in deletion) vs inscription bursts (ending in new text). High R-burst ratio = revision-heavy process. Captures writing strategy independent of output quality.

Chenoweth & Hayes 2001; Baaijen et al. 2012

Abandoned thought count Experimental

Deletions of entire thoughts (heuristic: large deletions at paragraph boundaries). Genuine cognitive revision, not typo correction.

Informed by Faigley & Witte 1981 revision taxonomy

Phase transition point Experimental

Normalized session time when deletions overtake insertions. Early transition = early commitment. Late = extended exploration before settling.

Novel; informed by Hayes & Flower 1980 process model

Strategy shift count Experimental

Detected changes in typing style (IKI distribution shift) within a session. Multiple shifts suggest exploration across approaches.

Novel

Semantic signals

Text-level linguistic analysis of the submitted response. What was written, independent of how it was typed.

Computed in: Rust (src-rs/src/process.rs + semantic helpers)

Idea density Validated

Propositions per word. The original Nun Study biomarker: low idea density in early writing predicted Alzheimer's decades later.

Snowdon et al. 1996

Lexical sophistication Validated

Inverse mean word frequency rank. Higher = rarer vocabulary. Proxy for active vocabulary breadth.

Kyle & Crossley 2015 (TAALES)

Epistemic stance Adapted

Booster / (booster + hedge) ratio. High = assertive, certain. Low = tentative, hedging. Captures confidence independent of content.

Hyland 2005

Integrative complexity Adapted

Contrastive + integrative connective density. Higher = more differentiation and integration of competing perspectives.

Suedfeld & Tetlock 1977

Deep cohesion Validated

Causal, temporal, and intentional connective density. Measures how explicitly the text links ideas across sentences.

McNamara et al. 2014 (Coh-Metrix)

Text compression ratio Validated

Gzip compression ratio as Kolmogorov complexity proxy. High = more information per character. Low = repetitive or formulaic.

Cilibrasi & Vitanyi 2005

Discourse coherence (global) Adapted

Average pairwise semantic similarity across all sentence pairs in the response. Measures overall thematic unity: high global coherence = a tightly focused response. Low = scattered or multi-topic.

Foltz et al. 1998 (LSA coherence)

Discourse coherence (local) Adapted

Average semantic similarity between adjacent sentence pairs. Measures thought-to-thought continuity: high local coherence = smooth transitions. Low = abrupt topic jumps.

Adapted from adjacent-sentence coherence (McNamara et al. 2014)

Global/local coherence ratio Experimental

Ratio of global to local coherence. High = thematically tight but with some local jumps (structured argument). Low = locally smooth but globally diffuse (stream of consciousness).

Novel

Coherence decay slope Experimental

Rate at which pairwise semantic similarity decays with sentence distance. Steep decay = ideas are locally clustered. Flat decay = thematic connections span the full response.

Novel

Cross-session signals

Longitudinal measures comparing the current entry to prior entries. Novelty, cognitive trajectory, and stability over time.

Computed in: Rust (src-rs/src/ — cross-session helpers)

Self-perplexity Adapted

Character trigram model trained on all prior entries, scored against today's text. Higher = more novel language relative to personal history.

Adapted from language modeling perplexity

NCD trajectory (lags 1, 3, 7, 30) Validated

Normalized compression distance between today's text and entries at various time lags. Measures semantic distance over time. High NCD = divergent content.

Cilibrasi & Vitanyi 2005

Digraph stability Adapted

Cosine similarity of today's digraph latency profile to rolling baseline. Motor fingerprint stability. Low = something changed physically.

Adapted from keystroke biometrics (Monrose & Rubin 2000)

Text network density Adapted

Co-occurrence graph density from sliding window analysis. Higher density = more interconnected concept network in the response.

Adapted from InfraNodus methodology (Paranyushkin 2019)

Vocabulary recurrence decay Experimental

Exponential decay rate of Jaccard vocabulary similarity across time lags. Fast decay = rapidly diversifying vocabulary. Slow = stable lexicon.

Novel

Motor self-perplexity Experimental

Character trigram model trained on quantized IKI history, scored against today's motor timing. The motor analogue of textual self-perplexity. Higher = today's typing rhythm is more novel relative to motor history.

Novel

Text network communities Adapted

Number of distinct communities detected in the word co-occurrence graph via modularity-based clustering. More communities = more distinct conceptual clusters in the response.

Adapted from community detection (Blondel et al. 2008)

Bridging ratio Experimental

Proportion of edges in the co-occurrence graph that connect different communities. High bridging = integrative writing that links distinct conceptual clusters. Low = compartmentalized thinking.

Novel

Behavioral state engine

7-dimensional behavioral state computed per session from the signals above. Each dimension is z-scored against personal history. No AI. Pure math.

Computed in: Rust (src-rs/src/ — state engine)

Fluency Adapted

DFA alpha (long-range correlation) + P-burst length (sustained production) + tempo drift. Captures whether the writing flowed or fragmented.

Baaijen et al. 2012; Deane 2015

Deliberation Adapted

Permutation entropy + first keystroke latency + between-sentence pause density. High = effortful, exploratory thinking before and during writing.

Baaijen et al. 2012; Deane 2015

Revision Adapted

Abandoned thought count + R-burst/I-burst ratio + inverse commitment. How much the writer restructured after committing text.

Faigley & Witte 1981; Chenoweth & Hayes 2001

Commitment Experimental

Final/typed ratio + phase transition point + inverse strategy shift count. Whether the writer stuck with a direction or explored multiple approaches.

Novel composite

Volatility Experimental

Euclidean distance from previous entry in 4D subspace. How behaviorally different this session was from the last.

Novel; informed by PersDyn variability construct (Sosnowska et al. 2019)

Thermal Adapted

Ex-Gaussian tau (attentional lapses) + lapse rate + correction rate. Cognitive-motor disruption, editing heat.

Zulueta et al. 2018 (BiAffect)

Presence Adapted

RQA determinism + transfer entropy dominance + inverse tab-away rate. Sustained coherent engagement with the writing task.

Webber & Zbilut 2005; Schreiber 2000

Dynamics layer

Per-dimension trait dynamics computed from the accumulating state history. Baseline, variability, attractor force, and empirical coupling between dimensions.

Computed in: Rust (src-rs/src/dynamical.rs)

Baseline

Rolling mean of each dimension. The stable set point the person returns to.

Variability

Rolling standard deviation. How much the dimension fluctuates session to session.

Attractor force

Ornstein-Uhlenbeck mean-reversion from lag-1 autocorrelation of deviations. High = rigid (snaps back). Low = malleable (shifts persist).

Coupling matrix

Signed lagged cross-correlations between all dimension pairs. Discovers which dimensions causally influence each other for this specific person.

PersDyn model: Sosnowska, Kuppens, De Fruyt & Hofmans (KU Leuven, 2019). Attractor dynamics: Kuppens, Oravecz & Tuerlinckx 2010. Coupling: Critcher (Berkeley xLab).

Known limitations

Sample size

n=1. All baselines, dynamics, and coupling are intra-individual. No between-person generalization is possible at current scale.

Signal-to-construct mapping

The mapping between keystroke features and cognitive constructs is hypothesized from psycholinguistic literature, not validated in a longitudinal ecological context.

Linguistic analysis depth

Semantic signals use word-list-based heuristics, not parsed syntax trees. Idea density is approximated via verb/preposition ratios, not full propositional analysis.

Question confound

Different questions elicit different writing behaviors. The question itself is an intervention. Calibration sessions (neutral prompts) partially control for this but do not eliminate it.

Device sensitivity

Keystroke dynamics are device-specific. Switching keyboards changes the motor baseline. Device type is recorded and calibration is device-scoped, but cross-device comparison is unreliable.

Why this instrument needs to exist Architecture overview Read the papers