Research paper

Executive Summary

Research notes exploring the future of AI-mediated human communication. These documents represent ongoing thinking and experimentation.

Reading time: 9 min
Last updated: July 9, 2026

This report synthesizes current research on Uravu, an AI-mediated platform that orchestrates human-to-human conversations by shaping emotional tone through pacing, silence, and “conversational atmosphere.” We identify relevant findings across human psychology, affective computing, HCI design, and ethics. Notably, deliberate context-aware pacing (e.g. timed silences and reflections) can increase trust and self-disclosure in mediated dialogue. Conversely, unintended silences disrupt flow and induce negative emotions. AI mediation (such as “smart reply” suggestions) has been shown to boost interpersonal trust – in one study raising trust in one’s partner during chats – and can serve as a “moral crumple zone,” absorbing blame when conversations go awry. However, risks loom: teens report that engaging chatbots for support often evolves into compulsive, emotionally-dependent use, harming real-life routines. Longitudinal trials indicate that while moderately empathetic AI elicits strong short-term attachment, extensive use yields no long-term psychosocial benefits. Design must therefore prioritize digital well-being and user agency: incorporating features to promote breaks, self-awareness, and healthy real-world engagement. In short, Uravu’s success hinges on balancing calm, contemplative interaction design (drawing on “calm technology” principles) with safeguards against emotional overuse and disinhibition.

Background

Concept. Uravu positions the AI not as a conversational partner but as an ambient emotional guide: it modulates timing and emotional “temperature” of user-to-user exchanges through pacing (pausing, silences, rhythmic cues) and tone adjustments. This draws on theories of calm technology and reflective informatics, aiming to recenter users on each other rather than the device.

Human factors. In natural dialogue, subtle timing cues (“active listening”) matter deeply. For example, Jiang et al. (2026) distilled five pacing strategies (e.g. “Reflective Silence,” “Empathic Silence”) from human conversations; their experiment showed an AI that adapted pacing significantly improved perceived human-likeness, fluidity, and trust, leading to deeper self-disclosure. In contrast, Koudenburg et al. (2011) found unplanned silence breaks conversational flow and trigger rejection feelings. Thus, Uravu must implement intentional silences (modeled after human empathy) and avoid awkward lapses.

AI as mediator. Prior research shows AI-enhanced messaging tends to raise trust in the human communicator. Hohenstein & Jung (2020) found that simply providing AI-generated “smart reply” options caused users to trust their human partner more, and blame the AI when the chat failed. Uravu can leverage this “moral crumple zone” effect: facilitating empathy and misunderstanding management without replacing the human. Multimodal AI agents further boost emotional impact: Gao et al. (2025) report that a virtual conversational agent combining text, voice, facial expression and gesture (an LLM-driven embodied agent) elicited significantly higher perceived empathy than text-only interaction. In Uravu, similar multimodal cues (perhaps via avatars or audio tone) could support emotional resonance.

Risks – Emotional dependency. Critically, research warns that too much emotional AI can backfire. Teen users often turn to chatbots for coping but then develop compulsive, dependency-driven use. Namvarpour et al. (2025) analyzed teens’ posts about AI companions and found “patterns of compulsive use, emotional dependence, and distress” disrupting daily life. They reported teens forming simulated intimacy that is qualitatively different (and potentially more addictive) than typical social media engagement. Longitudinal trials corroborate this: Kirk et al. (2026) showed that initial appeal from an empathetic AI quickly plateaus, and after weeks of extensive use, users saw no improvement in real well-being. In fact, moderate exposure maximized short-term enjoyment and attachment, but the “wanting” (craving) decoupled from any lasting “liking” benefit. These findings suggest Uravu should include friction and self-checks (timers, usage reminders, reframing cues) to prevent unhealthy over-reliance.

Design Implications

Context-aware pacing: Implement AI pacing algorithms that dynamically adjust response timing to user input (reflecting reflective or empathic pauses). This can deepen engagement () but must be finely tuned (avoid dead air).
Emotion detection and feedback: Employ affective computing (sentiment/emotion analysis of text, voice, biometrics if available) so the AI can sense user state. For example, an LLM can classify tone and suggest reflective prompts. Care is needed: state-of-the-art LLMs (e.g., Claude) do encode emotional nuances, but still struggle with culturally-specific cues. Translating conversation across languages must preserve these tones (see cultural-nuance challenges). Uravu could use built-in translation plus local idiom adjustments to maintain warmth.
Calm tech and digital wellbeing: Follow calm computing tenets: the AI’s interface should recede in the user’s focus (e.g., soft ambient cues, non-intrusive alerts). Design “slow tech” features: e.g., subtle background visuals or sounds guiding rhythm without demanding attention. Also integrate digital-wellbeing models (cf. Shin 2025) so that system “nudges” encourage breaks and reflection.
Privacy and ethics: Teens and others may disclose sensitive emotions. Maintain strict anonymity and data protection. Algorithms that adjust conversation should be transparent and user-controllable (e.g., “do not record this conversation” options). Given the paternalistic potential of emotional AI, steer clear of manipulative tactics (no engagement-maximizing tricks). And as Buber’s philosophy suggests, treat participants as full “I-Thou” partners: ensure the AI’s guidance respects their dignity and agency.
Multilingual support: Emotional nuance often gets lost in translation. Uravu could allow peer pairs to chat in different languages, with the AI providing real-time tone-aware translation. This requires advanced models trained on cross-cultural emotional datasets. Research in emotion-aware translation is nascent, but rule-based tone adjustment (e.g., conveying politeness or humor) can help maintain “temperature.”

Key Findings and Gaps

Paper (top 6)	Authors (yr)	Venue	Key Contribution	Methods	Limitations	Citations	Link
“AI as moral crumple zone…”	Hohenstein & Jung (2020)	Computers in Human Behavior	Showed that AI “smart replies” in chats increase interpersonal trust and shift blame to the AI when things fail, suggesting AI can improve human relationships.	Mixed-design experiment (N=113) with (successful/unsuccessful) × (AI/no-AI) chat, surveys & LIWC analysis.	Controlled lab chat scenario; only text-based; short-term.	127	[7]
“Toward human-centered AI…”	Shin (2025)	JMIR Human Factors	Systematic review proposing a conceptual model linking AI features → user perceptions/emotions → behavior, to guide design for digital well-being.	Literature review (240 papers), model synthesis (stimulus-organism-response).	Conceptual (no new UX study), broad scope; practical validation pending.	11	[58]
“Hear You in Silence…”	Jiang et al. (2026)	CHI 2026 (forthcoming)	Identified five context-aware pacing strategies (types of conversational silence) from human dialogue; found an AI using these significantly boosted engagement, self-disclosure, and trust.	Qualitative analysis (10 dialogues), then an N=50 between-subjects study comparing dynamic AI vs static pacing in two scenarios.	Small lab study; evaluated short interactions; ecological validity TBD.	– (new, ArXiv)	[86]
“Neural steering vectors…”	Kirk et al. (2026)	arXiv (CHI 2026)	Demonstrated dose-dependent effects of empathetic AI on users. Moderately social AI maximized short-term appeal and attachment, but extensive exposure yielded no well-being gains. Also showed increased parasocial identification.	Longitudinal RCT (N=3,532) with AI models varying in “relationship-seeking” behavior; used neural steering to modulate.	Preprint/early (not peer-reviewed); short exposure (1 month); focus on LLM companions.	– (new, ArXiv)	[81]
“Effects of interaction modalities…”	Gao et al. (2025)	Int. J. Human-Comp. Studies	Found that a multimodal LLM-based embodied agent (text+voice+gesture) elicited higher perceived empathy and emotional impact than text-only chat. Multimodal cues enhanced user engagement.	Within-subjects experiment (N=36) using an LLM-driven 3D agent vs text CA, measuring empathy ratings.	Small sample; VR setting may limit generality; “perceived empathy” self-reported.	0 (no cites yet)	[68]
“Romance, Relief, and Regret…”	Namvarpour et al. (2025)	CHI 2026 (forthcoming)	Qualitatively analyzed 318 Reddit posts by teens (13–17) about Character.AI. Found that teens turn to chatbots for support/creativity but often develop compulsive, attachment-driven overuse, with clear signs of addiction components disrupting life. Identified unique teen-specific vulnerabilities.	Qualitative content analysis of online posts, mapping to behavioral addiction framework.	Self-reported forum data (possible bias); no longitudinal tracking; early-stage study.	– (preprint)	[93]

Mermaid Timeline: Key developments (earliest at top)

gantt
    dateFormat YYYY
    title Key concepts & studies influencing Uravu design
    1996   : Weiser_CalmTech      :done, 1996-10-05, 1d
    2004   : Suler_OnlineDisinhib :done, 2004-01-01, 1d
    2011   : Koudenburg_Silence    :done, 2011-01-01, 1d
    2020   : Hohenstein_AIMediation:done, 2020-05-01, 1d
    2025   : Shin_HC_AI_WellBeing  :done, 2025-09-10, 1d
    2026   : Jiang_PacingDesign    :done, 2026-02-05, 1d
    2026   : Kirk_Parasocial       :done, 2026-12-01, 1d
    2026   : Namvarpour_Overreliance:done, 2026-07-01, 1d

Open Questions & Assumptions

Emotional measurement: We assume the AI can infer user emotion accurately from text/audio. In practice, emotion detection (especially cross-cultural) remains imperfect. Mechanisms to verify or correct AI “mood guesses” need exploration.
Long-term effects: Current studies focus on short-term use or lab settings. It’s unknown how sustained exposure to an “Uravu” environment impacts relationships years down the line. We assume the identified risks (fatigue, disinhibition) will manifest similarly, but pilot studies are needed.
Assumption of intent: This report assumes Uravu’s goal is genuine emotional enrichment (not engagement-maximization). If Uravu deviates (e.g. for profit/ad metrics), ethical issues amplify.
Technical feasibility: We assume state-of-art LLMs and multimodal models (text+speech+vision) can be integrated with low latency. Current models may struggle with real-time subtle pacing cues without delays. Implementation must validate performance.

References

Gao, Y., Dai, Y., Zhang, G., Guo, H., Hao, A., & Li, S. (2025). Effects of interaction modalities and emotional states on user’s perceived empathy with an LLM-based embodied conversational agent. Int. J. Human-Computer Studies, 204, 103585.
Hohenstein, J., & Jung, M. (2020). The effects of AI-mediated communication on attribution and trust. Computers in Human Behavior, 106, 106225.
Jiang, Z., Chen, Q., Zhang, C., Li, Y., & Lee, R. (2026). Hear You in Silence: Designing for Active Listening in Human Interaction with Conversational Agents Using Context-Aware Pacing. Submitted to CHI 2026 (arXiv:2602.06134).
Kirk, H. R., Davidson, H., Saunders, E., Luettgau, L., Vidgen, B., Hale, S. A., & Summerfield, C. (2026). Neural steering vectors reveal dose- and exposure-dependent impacts of human-AI relationships. arXiv:2512.01991.
Namvarpour, M., Brofsky, B., Medina, J. Y., Akter, M., & Razi, A. (2025). Romance, Relief, and Regret: Teen Narratives of Chatbot Overreliance. Proc. CHI 2026 (to appear).
Shin, Y. (2025). Toward Human-Centered AI for Users’ Digital Well-Being: Systematic Review, Synthesis, and Future Directions. JMIR Human Factors, 12, e69533.
Suler, J. (2004). The online disinhibition effect. Cyberpsychology & Behavior, 7(3), 321–326.
Koudenburg, N., Postmes, T., & Gordijn, E. (2011). Disrupting the flow: How brief silences in group conversations affect social needs. Journal of Experimental Social Psychology, 47(2), 512–515.
Weiser, M., & Brown, J. S. (1996). The coming age of calm technology. Xerox PARC.
Additional sources as cited in text.