Research
How meaning, belief, and influence are encoded in language at scale.
Building the computational tools to detect distortion, model its spread, and audit the systems that amplify it.
How meaning, belief, and influence are encoded in language at scale.
Building the computational tools to detect distortion, model its spread, and audit the systems that amplify it.
I model misinformation not as a monolith but as a set of community-specific rhetorical strategies that spread through algorithmic and social feedback loops. My current work combines self-exciting point processes (Hawkes models) for diffusion, causal audits of algorithmic amplification, and cross-genre/cross-platform retrieval to move from describing misinformation to quantifying how and where it gains traction. This line is directly relevant to trust-and-safety and platform-governance settings.
Projects:
The anatomy of belief-driven distortion [ supported by the Society of Family Planning ]
Pragmatic Language Understanding [ supported by the U.S. National Science Foundation and the AI Innovation Institute ]
Semantic Drift in Medical Information [ supported by the U.S. National Science Foundation ]
Propaganda, Fallacies, and Media Integrity [ supported by the AI Innovation Institute ]
Selected Publications:
Much online influence operates beyond literal meaning: through deflection, fallacy, framing, and stance. I build models for these pragmatic phenomena, drawing on argumentation theory (Toulmin, rhetorical structure theory) and pragmatic language inference to detect things like whataboutism and fallacious reasoning that are missed by purely semantic analysis.
Projects:
The anatomy of belief-driven distortion [ supported by the Society of Family Planning ]
Pragmatic Language Understanding [ supported by the U.S. National Science Foundation and the AI Innovation Institute ]
Propaganda, Fallacies, and Media Integrity [ supported by the AI Innovation Institute ]
Selected Publications:
Khiem Phi, Noushin Salek Faramarzi, Chenlu Wang, and Ritwik BanerjeeFindings of the Association for Computational Linguistics ACLdoi : 10.18653/v1/2024.findings-acl.750Noushin Salek Faramarzi, Fateme Hashemi Chaleshtori, Hossein Shirazi, Indrakshi Ray, and Ritwik BanerjeeWWW '23 Companion: Companion Proceedings of the ACM Web Conference 2023doi : 10.1145/3543873.3587643I study how natural-language systems can reason about, and be held accountable to, privacy law and regulatory constraints — from auditing the fidelity of stated app permissions to investigating medical data against privacy statutes.
Projects:
A Framework for Investigating Live Medical Data against Privacy Laws [ supported by the U.S. National Science Foundation ]
Selected Publications:
I apply the methods above to biomedical and clinical text: verifying health claims against the scientific literature, extracting clinical events, and supporting downstream prediction tasks.
Projects:
Computational Linguistics and AI for Patients (CLAP) [ supported by a SUNY Small Team Multidisciplinary Award and a SUNY Seed Grant. ]
Selected Publications: