Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants

Bibliographic Details
Title: Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
Authors: Vekaria, Yash, Canino, Aurelio Loris, Levitsky, Jonathan, Ciechonski, Alex, Callejo, Patricia, Mandalari, Anna Maria, Shafiq, Zubair
Publication Year: 2025
Collection: Computer Science
Subject Terms: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Cryptography and Security, Computer Science - Computers and Society, I.2, I.2.1, I.2.7, H.3.4, K.4, K.4.1, H.1, H.1.2, H.5.2, H.4.3
More Details: Generative AI (GenAI) browser assistants integrate powerful capabilities of GenAI in web browsers to provide rich experiences such as question answering, content summarization, and agentic navigation. These assistants, available today as browser extensions, can not only track detailed browsing activity such as search and click data, but can also autonomously perform tasks such as filling forms, raising significant privacy concerns. It is crucial to understand the design and operation of GenAI browser extensions, including how they collect, store, process, and share user data. To this end, we study their ability to profile users and personalize their responses based on explicit or inferred demographic attributes and interests of users. We perform network traffic analysis and use a novel prompting framework to audit tracking, profiling, and personalization by the ten most popular GenAI browser assistant extensions. We find that instead of relying on local in-browser models, these assistants largely depend on server-side APIs, which can be auto-invoked without explicit user interaction. When invoked, they collect and share webpage content, often the full HTML DOM and sometimes even the user's form inputs, with their first-party servers. Some assistants also share identifiers and user prompts with third-party trackers such as Google Analytics. The collection and sharing continues even if a webpage contains sensitive information such as health or personal information such as name or SSN entered in a web form. We find that several GenAI browser assistants infer demographic attributes such as age, gender, income, and interests and use this profile--which carries across browsing contexts--to personalize responses. In summary, our work shows that GenAI browser assistants can and do collect personal and sensitive information for profiling and personalization with little to no safeguards.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2503.16586
Accession Number: edsarx.2503.16586
Database: arXiv
More Details
Description not available.