You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: arxiv.json
+35Lines changed: 35 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -43580,5 +43580,40 @@
43580
43580
"pub_date": "2025-03-26",
43581
43581
"summary": "Thematic analysis (TA) is a widely used qualitative approach for uncovering latent meanings in unstructured text data. TA provides valuable insights in healthcare but is resource-intensive. Large Language Models (LLMs) have been introduced to perform TA, yet their applications in healthcare remain unexplored. Here, we propose TAMA: A Human-AI Collaborative Thematic Analysis framework using Multi-Agent LLMs for clinical interviews. We leverage the scalability and coherence of multi-agent systems through structured conversations between agents and coordinate the expertise of cardiac experts in TA. Using interview transcripts from parents of children with Anomalous Aortic Origin of a Coronary Artery (AAOCA), a rare congenital heart disease, we demonstrate that TAMA outperforms existing LLM-assisted TA approaches, achieving higher thematic hit rate, coverage, and distinctiveness. TAMA demonstrates strong potential for automated TA in clinical settings by leveraging multi-agent LLM systems with human-in-the-loop integration by enhancing quality while significantly reducing manual workload.",
"title": "CombiGCN: An effective GCN model for Recommender System",
43586
+
"url": "http://arxiv.org/abs/2503.21471v1",
43587
+
"pub_date": "2025-03-27",
43588
+
"summary": "Graph Neural Networks (GNNs) have opened up a potential line of research for collaborative filtering (CF). The key power of GNNs is based on injecting collaborative signal into user and item embeddings which will contain information about user-item interactions after that. However, there are still some unsatisfactory points for a CF model that GNNs could have done better. The way in which the collaborative signal are extracted through an implicit feedback matrix that is essentially built on top of the message-passing architecture of GNNs, and it only helps to update the embedding based on the value of the items (or users) embeddings neighboring. By identifying the similarity weight of users through their interaction history, a key concept of CF, we endeavor to build a user-user weighted connection graph based on their similarity weight. In this study, we propose a recommendation framework, CombiGCN, in which item embeddings are only linearly propagated on the user-item interaction graph, while user embeddings are propagated simultaneously on both the user-user weighted connection graph and user-item interaction graph graphs with Light Graph Convolution (LGC) and combined in a simpler method by using the weighted sum of the embeddings for each layer. We also conducted experiments comparing CombiGCN with several state-of-the-art models on three real-world datasets.",
"title": "Improvement Graph Convolution Collaborative Filtering with Weighted\n addition input",
43593
+
"url": "http://arxiv.org/abs/2503.21468v1",
43594
+
"pub_date": "2025-03-27",
43595
+
"summary": "Graph Neural Networks have been extensively applied in the field of machine learning to find features of graphs, and recommendation systems are no exception. The ratings of users on considered items can be represented by graphs which are input for many efficient models to find out the characteristics of the users and the items. From these insights, relevant items are recommended to users. However, user's decisions on the items have varying degrees of effects on different users, and this information should be learned so as not to be lost in the process of information mining. In this publication, we propose to build an additional graph showing the recommended weight of an item to a target user to improve the accuracy of GNN models. Although the users' friendships were not recorded, their correlation was still evident through the commonalities in consumption behavior. We build a model WiGCN (Weighted input GCN) to describe and experiment on well-known datasets. Conclusions will be stated after comparing our results with state-of-the-art such as GCMC, NGCF and LightGCN. The source code is also included at https://github.com/trantin84/WiGCN.",
"title": "Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval",
43600
+
"url": "http://arxiv.org/abs/2503.21237v1",
43601
+
"pub_date": "2025-03-27",
43602
+
"summary": "Advancements in retrieving accessible information have evolved faster in the last few years compared to the decades since the internet's creation. Search engines, like Google, have been the number one way to find relevant data. They have always relied on the user's abilities to find the best information in its billions of links and sources at everybody's fingertips. The advent of large language models (LLMs) has completely transformed the field of information retrieval. The LLMs excel not only at retrieving relevant knowledge but also at summarizing it effectively, making information more accessible and consumable for users. On top of it, the rise of AI Agents has introduced another aspect to information retrieval i.e. dynamic information retrieval which enables the integration of real-time data such as weather forecasts, and financial data with the knowledge base to curate context-aware knowledge. However, despite these advancements the agents remain susceptible to issues of bias and fairness, challenges deeply rooted within the knowledge base and training of LLMs. This study introduces a novel approach to bias-aware knowledge retrieval by leveraging agentic framework and the innovative use of bias detectors as tools to identify and highlight inherent biases in the retrieved content. By empowering users with transparency and awareness, this approach aims to foster more equitable information systems and promote the development of responsible AI.",
43603
+
"translated": "与互联网诞生后的数十年相比,近年在可访问信息检索领域取得的进展更为迅猛。以谷歌为代表的搜索引擎始终是获取相关数据的首要途径,但其运作一直依赖于用户从海量链接和触手可及的信息源中筛选最优内容的能力。大型语言模型(LLMs)的出现彻底革新了信息检索领域——这些模型不仅擅长检索相关知识,更能高效进行信息摘要,大幅提升信息的可获取性与可理解性。而AI智能体的兴起则为信息检索增添了动态维度:通过将天气预报、金融数据等实时信息与知识库整合,实现情境感知的知识构建。然而尽管取得这些突破,当前系统仍难以规避偏见与公平性问题,这些挑战根植于知识库构建与LLM训练过程之中。本研究提出了一种基于智能体框架的创新方法,通过将偏见检测器作为工具来识别并凸显检索内容中的固有偏差,实现偏差感知的知识检索。该方案通过增强信息透明度和用户认知,致力于构建更公平的信息系统,推动负责任人工智能的发展。\n\n(翻译说明:\n1. 专业术语处理:LLMs、AI Agents等缩写首次出现时保留英文并标注中文全称,后续直接使用缩写\n2. 技术概念转换:\"dynamic information retrieval\"译为\"动态信息检索\",\"context-aware knowledge\"译为\"情境感知知识\"符合计算机领域术语规范\n3. 长句拆分:将原文复合长句按中文表达习惯分解为多个短句,如通过破折号和冒号保持逻辑衔接\n4. 被动语态转换:\"have been relied on\"转化为主动式\"其运作依赖于\"更符合中文表达\n5. 学术风格保持:使用\"构建\"、\"规避\"等正式词汇,避免口语化表达\n6. 创新点突出:通过\"破折号+转折词\"的结构强调技术突破,使用\"方案\"、\"致力于\"等体现研究价值的表述)"
43604
+
},
43605
+
{
43606
+
"title": "Are We Solving a Well-Defined Problem? A Task-Centric Perspective on\n Recommendation Tasks",
43607
+
"url": "http://arxiv.org/abs/2503.21188v1",
43608
+
"pub_date": "2025-03-27",
43609
+
"summary": "Recommender systems (RecSys) leverage user interaction history to predict and suggest relevant items, shaping user experiences across various domains. While many studies adopt a general problem definition, i.e., to recommend preferred items to users based on past interactions, such abstraction often lacks the domain-specific nuances necessary for practical deployment. However, models are frequently evaluated using datasets from online recommender platforms, which inherently reflect these specificities. In this paper, we analyze RecSys task formulations, emphasizing key components such as input-output structures, temporal dynamics, and candidate item selection. All these factors directly impact offline evaluation. We further examine the complexities of user-item interactions, including decision-making costs, multi-step engagements, and unobservable interactions, which may influence model design and loss functions. Additionally, we explore the balance between task specificity and model generalizability, highlighting how well-defined task formulations serve as the foundation for robust evaluation and effective solution development. By clarifying task definitions and their implications, this work provides a structured perspective on RecSys research. The goal is to help researchers better navigate the field, particularly in understanding specificities of the RecSys tasks and ensuring fair and meaningful evaluations.",
"title": "Network Density Analysis of Health Seeking Behavior in Metro Manila: A\n Retrospective Analysis on COVID-19 Google Trends Data",
43614
+
"url": "http://arxiv.org/abs/2503.21162v1",
43615
+
"pub_date": "2025-03-27",
43616
+
"summary": "This study examined the temporal aspect of COVID-19-related health-seeking behavior in Metro Manila, National Capital Region, Philippines through a network density analysis of Google Trends data. A total of 15 keywords across five categories (English symptoms, Filipino symptoms, face wearing, quarantine, and new normal) were examined using both 15-day and 30-day rolling windows from March 2020 to March 2021. The methodology involved constructing network graphs using distance correlation coefficients at varying thresholds (0.4, 0.5, 0.6, and 0.8) and analyzing the time-series data of network density and clustering coefficients. Results revealed three key findings: (1) an inverse relationship between the threshold values and network metrics, indicating that higher thresholds provide more meaningful keyword relationships; (2) exceptionally high network connectivity during the initial pandemic months followed by gradual decline; and (3) distinct patterns in keyword relationships, transitioning from policy-focused searches to more symptom-specific queries as the pandemic temporally progressed. The 30-day window analysis showed more stable, but less search activities compared to the 15-day windows, suggesting stronger correlations in immediate search behaviors. These insights are helpful for health communication because it emphasizes the need of a strategic and conscientious information dissemination from the government or the private sector based on the networked search behavior (e.g. prioritizing to inform select symptoms rather than an overview of what the coronavirus is).",
0 commit comments