Instrument focus shifts, edit churn, and idle gaps to estimate flow disruptions. Use lightweight experience sampling and NASA-TLX variants sparingly to avoid poisoning the moment. Correlate interruptions with outcomes, then refine sensing thresholds and delivery patterns to reduce unnecessary nudges while preserving meaningful assistance. Share findings with design partners to translate insights into precise, empathetic UI changes.
Design experiments that respect users. Use interleaving and ghost suggestions to estimate value without forcing bad experiences. Keep holdout cohorts for invariant baselines. Pre-register metrics and stopping rules. Audit for subgroup harm, consent fatigue, and over-optimization that trades trust for short-term gains. Publish learnings internally to prevent metric gaming and reinforce principled decision-making.