Divide-and-conquer Test-time Adaptation with Retrieval-augmentation for Text-attributed Graph Foundation Model
-
摘要: 文本属性图基础模型(TAG-FM)旨在通过在大规模图数据集上预训练, 实现稳健的跨域泛化能力. 尽管TAG-FM已在多项下游任务中展现出良好的性能, 但是通过深入分析发现, 其在泛化能力方面仍存在关键弱点: 域适应阈值效应, 即, 当预训练域与测试域之间的分布偏移超过某一临界阈值时, 模型性能将出现显著退化. 由于基础模型复杂度高且测试数据稀缺, 引入测试时自适应方法成为缓解该问题的可行方案. 然而现有测试时自适应方法在实例层和域层均存在局限, 严重制约着TAG-FM对分布外数据的适应性能. 针对此问题, 提出一种面向TAG-FM的“检索—分治”测试时自适应方法, 该方法通过检索增强的特征共形融合模块生成融合图结构和节点文本语义的统一特征, 并引入一种分而治之的测试时自适应策略, 通过渐进式自适应过程学习泛化性语义. 在7个基准文本属性图数据集和12个严重域偏移文本属性图数据集上的实验结果表明, 本文方法能够显著增强TAG-FM的跨域泛化能力.Abstract: Text-attributed graph foundation model (TAG-FM) is designed to attain robust cross-domain generalization capability through pretraining on large-scale graph datasets. Although TAG-FM has demonstrated promising performance on various downstream tasks, our in-depth analysis reveals a critical limitation in its generalization capability: The domain adaptation threshold effect, i.e., model performance undergoes catastrophic degradation when distributional shifts between pretraining and testing domains surpass critical thresholds. Due to the excessive foundation model complexity and scarcity of testing data, a feasible solution to mitigate this issue is to introduce test-time adaptation approaches for TAG-FM. Nevertheless, existing test-time adaptation approaches exhibit substantial limitations at both the instance and domain levels, severely constraining the ability of TAG-FM to adapt to out-of-distribution data. To address this issue, we innovatively propose the divide-and-conquer test-time adaptation with retrieval-augmentation for TAG-FM. Specifically, a retrieval-augmented feature conformal integration module is introduced to construct consolidated node features by integrating graph structures with node textual semantics. Furthermore, a divide-and-conquer test-time adaptation strategy is developed to progressively capture generalizable semantics through an iterative adaptation process. Empirically, we verify that the proposed method achieves the state-of-the-art performance on 7 benchmark text-attributed graph datasets and 12 constructed text-attributed graph datasets exhibiting severe domain shifts, which confirms that our method effectively enhances the cross-domain generalization capability of TAG-FM.1)
1 1本文将TAG-FM的预训练数据视为预训练域, 将Cora、CiteSeer等测试数据集视为测试域, 一个测试数据集可视为一个测试域. 值得注意的是, TAG-FM的预训练数据仅用于探索实验, 所提方法的实际实现并未使用任何TAG-FM预训练数据.2)2 2对任意节点$ v_n $, 均存在对应的统一特征$ z_n $, 故可反向利用$ z_n $检索节点$ v_n $. -
表 1 文本属性图数据集统计
Table 1 Statistics of text-attributed graph datasets
数据集 节点数 边数 领域 类别数 Cora 2708 5429 学术 7 CiteSeer 3186 4277 学术 6 Ele-Photo 48362 500928 电子商务 12 Ele-Computers 87229 721081 电子商务 10 Books-History 41551 358574 电子商务 12 WikiCS 11701 215863 维基百科 10 Instagram 11339 144010 社交媒体 2 表 2 类别相关提示模板汇总
Table 2 Summary of the class-relevant prompt templates
数据集 提示模板 Cora this paper has a topic on {class} {class_desc} CiteSeer good paper of {class} {class_desc} Ele-Photo this product belongs to {class} {class_desc} Ele-Computers is {class} category {class_desc} Books-History this book belongs to {class} {class_desc} WikiCS it belongs to {class} research area {class_desc} Instagram {class} {class_desc} 表 3 构建查询$ q_n$的提示模板汇总
Table 3 Summary of prompt templates used to construct the query $ q_n$
数据集 构建查询$q_n$的提示模板 Cora The textual description of the citation network centered on node ‘n{id}’ is shown above. Each node in the network represents a scholarly article, and each edge signifies a citation relationship between articles. The textual description of node ‘n{id}’ is: ‘{desc}’. Please summarize the core content of the article represented by node ‘n{id}’ using the information provided above in 200 words: CiteSeer The textual description of the citation network centered on node ‘n{id}’ is shown above. Each node in the network represents a scholarly article, and each edge signifies a citation relationship between articles. The textual description of node ‘n{id}’ is: ‘{desc}’. Please summarize the core content of the article represented by node ‘n{id}’ using the information provided above in 200 words: Ele-Photo The textual description of the Amazon Electronics graph centered on node ‘n{id}’ is shown above. Each node in the graph represents a electronic product, and each edge signifies frequent co-purchases or co-views. The user review with the most votes of node ‘n{id}’ is: ‘{desc}’. Please summarize the information provided above and re-describe the electronic product represented by node ‘n{id}’ based on the results of the summary in 200 words: Ele-Computers The textual description of the Amazon Electronics graph centered on node ‘n{id}’ is shown above. Each node in the graph represents a electronic product, and each edge signifies frequent co-purchases or co-views. The user review with the most votes of node ‘n{id}’ is: ‘{desc}’. Please summarize the information provided above and re-describe the electronic product represented by node ‘n{id}’ based on the results of the summary in 200 words: Books-
HistoryThe textual description of the Amazon-Books graph centered on node ‘n{id}’ is shown above. Each node in the graph represents a book focusing on items labeled as 'History', and each edge signifies frequent co-purchases or co-views between two books. The title and description of the book of node ‘n{id}’ is: ‘{desc}’. Please summarize the information provided above and re-describe the history book represented by node ‘n{id}’ based on the results of the summary in 200 words: WikiCS The textual description of the Wikipedia-based graph centered on node ‘n{id}’ is shown above. The textual description of node ‘n{id}’ is ‘{desc}’. Please summarize the information provided above and re-describe node ‘n{id}’ based on the results of the summary in 200 words: Instagram The textual description of the social network centered on node ‘n{id}’ is shown above. Each node in the graph signifys user, and each edge signifies following relationships. The textual description of node ‘n{id}’ is: ‘{desc}’. Please summarize the information provided above and re-describe the user represented by node ‘n{id}’ based on the results of the summary in 200 words: 表 4 轻度域偏移数据集上的节点分类任务准确率(%)
Table 4 Accuracy for the node classification task on mild domain shift datasets(%)
方法 Cora CiteSeer WikiCS Instagram Ele-Photo Ele-Computers Books-History 平均值 BERT[58] 19.56 $\pm$ 0.98 33.26 $\pm$ 2.35 29.37 $\pm$ 0.00 57.02 $\pm$ 0.57 21.80 $\pm$ 0.14 13.88 $\pm$ 0.29 9.95 $\pm$ 0.42 26.41 SBERT[59] 54.35 $\pm$ 1.26 50.47 $\pm$ 0.90 48.16 $\pm$ 0.00 48.34 $\pm$ 1.23 35.96 $\pm$ 0.44 41.82 $\pm$ 0.22 30.45 $\pm$ 0.19 44.22 DeBERTa[60] 16.42 $\pm$ 1.26 16.42 $\pm$ 1.26 15.29 $\pm$ 0.00 39.81 $\pm$ 0.58 12.38 $\pm$ 0.26 10.62 $\pm$ 0.15 9.70 $\pm$ 0.26 17.23 E5[61] 44.65 $\pm$ 0.82 42.57 $\pm$ 0.54 31.49 $\pm$ 0.00 61.28 $\pm$ 0.97 35.14 $\pm$ 0.28 16.54 $\pm$ 0.14 12.92 $\pm$ 0.48 34.94 Qwen2-7B-Instruct[62] 61.44 $\pm$ 1.29 53.57 $\pm$ 0.86 58.72 $\pm$ 0.25 39.13 $\pm$ 0.78 45.55 $\pm$ 0.12 59.18 $\pm$ 0.20 23.79 $\pm$ 0.34 48.77 Qwen2-72B-Instruct[62] 62.18 $\pm$ 0.98 60.97 $\pm$ 0.87 60.91 $\pm$ 0.08 47.70 $\pm$ 0.31 52.41 $\pm$ 0.39 60.88 $\pm$ 0.30 53.56 $\pm$ 0.64 56.94 LLaMA3.1-8B-Instruct[11] 57.75 $\pm$ 1.21 53.54 $\pm$ 1.71 58.32 $\pm$ 0.21 39.37 $\pm$ 1.14 34.38 $\pm$ 0.25 46.98 $\pm$ 0.21 22.28 $\pm$ 0.18 44.66 LLaMA3.1-70B-Instruct[11] 65.72 $\pm$ 1.24 62.79 $\pm$ 1.24 62.82 $\pm$ 0.04 43.68 $\pm$ 0.52 51.26 $\pm$ 0.53 61.62 $\pm$ 0.42 53.33 $\pm$ 0.55 57.32 OFA[12] 37.25 $\pm$ 1.38 29.64 $\pm$ 0.19 45.52 $\pm$ 1.06 32.71 $\pm$ 0.16 33.03 $\pm$ 0.64 22.09 $\pm$ 0.39 16.87 $\pm$ 0.93 31.02 ZeroG[13] 62.32 $\pm$ 1.91 52.55 $\pm$ 1.23 54.93 $\pm$ 0.06 48.97 $\pm$ 0.78 45.12 $\pm$ 0.65 56.20 $\pm$ 0.35 40.74 $\pm$ 0.65 51.55 GraphGPT[14] 23.25 $\pm$ 1.45 18.04 $\pm$ 1.45 6.30 $\pm$ 0.26 45.12 $\pm$ 1.16 7.62 $\pm$ 0.22 29.71 $\pm$ 0.83 15.92 $\pm$ 0.14 20.85 LLaGA[15] 21.44 $\pm$ 0.65 16.07 $\pm$ 1.15 2.65 $\pm$ 0.72 41.12 $\pm$ 0.94 6.50 $\pm$ 0.53 23.10 $\pm$ 0.33 11.17 $\pm$ 0.58 17.44 DGI[63] 24.03 $\pm$ 1.40 18.71 $\pm$ 1.22 18.86 $\pm$ 0.25 61.42 $\pm$ 1.12 13.96 $\pm$ 0.17 27.12 $\pm$ 0.03 15.77 $\pm$ 0.02 25.70 GRACE[64] 13.69 $\pm$ 1.27 22.88 $\pm$ 1.49 16.07 $\pm$ 0.32 62.23 $\pm$ 0.93 10.16 $\pm$ 0.13 10.94 $\pm$ 0.12 32.39 $\pm$ 0.11 24.05 BGRL[65] 31.99 $\pm$ 1.06 26.50 $\pm$ 1.22 18.35 $\pm$ 0.22 61.45 $\pm$ 0.82 5.21 $\pm$ 0.22 24.12 $\pm$ 0.22 16.28 $\pm$ 0.35 26.27 GraphMAE[66] 23.25 $\pm$ 1.07 20.75 $\pm$ 0.88 12.14 $\pm$ 0.20 62.39 $\pm$ 0.84 12.53 $\pm$ 0.08 8.36 $\pm$ 0.06 21.76 $\pm$ 0.17 23.03 G2P2[67] 41.51 $\pm$ 0.78 51.02 $\pm$ 0.62 31.92 $\pm$ 0.15 52.87 $\pm$ 0.78 22.21 $\pm$ 0.12 32.52 $\pm$ 0.13 26.18 $\pm$ 0.25 36.89 GraphCLIP[19] 67.31 $\pm$ 1.76 63.13 $\pm$ 1.13 70.19 $\pm$ 0.10 64.05 $\pm$ 0.34 53.40 $\pm$ 0.64 62.04 $\pm$ 0.21 53.88 $\pm$ 0.35 62.00 DORA 70.29 $\pm$ 0.54 67.82 $\pm$ 0.52 72.42 $\pm$ 0.67 64.83 $\pm$ 0.25 54.93 $\pm$ 0.42 63.25 $\pm$ 0.50 54.69 $\pm$ 0.43 64.03 表 5 不同测试时自适应方法在轻度域偏移数据集上的节点分类任务准确率(%)
Table 5 Accuracy for the node classification task on mild domain shift datasets with different TTA methods(%)
方法 Cora CiteSeer WikiCS Instagram Ele-Photo Ele-Computers Books-History 平均值 Tent[21] 64.27 $\pm$ 1.35 63.06 $\pm$ 2.49 57.27 $\pm$ 0.83 50.56 $\pm$ 0.39 44.90 $\pm$ 0.14 49.35 $\pm$ 0.20 24.99 $\pm$ 0.48 50.63 GTrans[24] 69.37 $\pm$ 0.60 66.98 $\pm$ 1.12 71.89 $\pm$ 0.77 64.02 $\pm$ 0.19 54.83 $\pm$ 0.29 61.16 $\pm$ 0.37 50.92 $\pm$ 0.28 62.74 Matcha[22] 69.49 $\pm$ 1.13 66.78 $\pm$ 1.39 72.82 $\pm$ 0.89 64.02 $\pm$ 0.50 51.34 $\pm$ 0.74 56.06 $\pm$ 0.30 45.34 $\pm$ 1.35 60.84 DORA 70.29 $\pm$ 0.54 67.82 $\pm$ 0.52 72.42 $\pm$ 0.67 64.83 $\pm$ 0.25 54.93 $\pm$ 0.42 63.25 $\pm$ 0.50 54.69 $\pm$ 0.43 64.03 表 6 消融实验结果(%)
Table 6 The results of the ablation experiment (%)
RFCI D&C-TTA Cora WikiCS Instagram RAS DFCI $\times$ $\times$ $\times$ 67.31 $\pm$ 1.76 70.19 $\pm$ 0.10 64.05 $\pm$ 0.34 $\times$ $ \checkmark$ $ \checkmark$ 69.13 $\pm$ 1.23 71.39 $\pm$ 0.74 64.21 $\pm$ 0.33 $ \checkmark$ $\times$ $ \checkmark$ 70.23 $\pm$ 0.57 72.16 $\pm$ 0.68 64.60 $\pm$ 0.31 $ \checkmark$ $ \checkmark$ $\times$ 70.05 $\pm$ 0.46 71.41 $\pm$ 1.35 64.80 $\pm$ 0.28 $ \checkmark$ $ \checkmark$ $ \checkmark$ 70.29 $\pm$ 0.54 72.42 $\pm$ 0.67 64.83 $\pm$ 0.25 -
[1] Rozemberczki B, Allen C, Sarkar R. Multi-scale attributed node embedding. Journal Of Complex Networks, 2021, 9(2): cnab014 doi: 10.1093/comnet/cnab014 [2] Hu W, Fey M, Zitnik M, Dong Y, Ren H, Liu B, et al. Open graph benchmark: Datasets for machine learning on graphs. Advances In Neural Information Processing Systems, 2020, 33: 22118−22133 [3] Pareja A, Domeniconi G, Chen J, Ma T, Suzumura T, Kanezashi H, et al. Evolvegcn: Evolving graph convolutional networks for dynamic graphs. Proceedings Of The AAAI Conference On Artificial Intelligence, 2020, 34: 5363−5370 doi: 10.1609/aaai.v34i04.5984 [4] 王振东, 徐振宇, 李大海, 王俊岭. 面向入侵检测的元图神经网络构建与分析. 自动化学报, 2023, 49(7): 1530−1548Wang Zhen-Dong, Xu Zhen-Yu, Li Da-Hai, Wang Jun-Ling. Construction and Analysis of Meta Graph Neural Network for Intrusion Detection. Acta Automatica Sinica, 2023, 49(7): 1530−1548 [5] 陈露, 尚家兴, 刘大江, 张玉芳, 倪晚成. 基于异构图神经网络的可解释兵棋态势预测方法. 自动化学报, 2025, 51(6): 1248−1260 doi: 10.16383/j.aas.c240468Chen Lu, Shang Jia-Xing, Liu Da-Jiang, Zhang Yu-Fang, Ni Wan-Cheng. An interpretable wargame situation prediction method based on heterogeneous graph neural networks. Acta Automatica Sinica, 2025, 51(6): 1248−1260 doi: 10.16383/j.aas.c240468 [6] Li Y, Hooi B. Prompt-Based Zero- and Few-Shot Node Classification: A Multimodal Approach. arXiv preprint arXiv: 2307.11572, 2023. [7] Tang Y, Qiu R, Liu Y, Li X, Huang, Z. CaseGNN: Graph neural networks for legal case retrieval with text-attributed graphs. European Conference On Information Retrieval. Cham: Springer Nature Switzerland, 2024: 80-95. [8] He X, Tian Y, Sun Y, Chawla N, Laurent T, LeCun Y, et al. G-retriever: Retrieval-augmented generation for textual graph understanding and question answering. Advances In Neural Information Processing Systems, 2024, 37: 132876−132907 doi: 10.52202/079017-4224 [9] 徐正斐, 辛欣. 基于大语言模型的中文实体链接实证研究. 自动化学报, 2025, 51(2): 327−342 doi: 10.16383/j.aas.c240069Xu Zheng-Fei, Xin Xin. An empirical study of Chinese entity linking based on large language model. Acta Automatica Sinica, 2025, 51(2): 327−342 doi: 10.16383/j.aas.c240069 [10] 秦龙, 武万森, 刘丹, 胡越, 尹全军, 阳东升, 王飞跃. 基于大语言模型的复杂任务自主规划处理框架. 自动化学报, 2024, 50(4): 862−872 doi: 10.16383/j.aas.c240088Qin Long, Wu Wan-Sen, Liu Dan, Hu Yue, Yin Quan-Jun, Yang Dong-Sheng, Wang Fei-Yue. Autonomous planning and processing framework for complex tasks based on large language models. Acta Automatica Sinica, 2024, 50(4): 862−872 doi: 10.16383/j.aas.c240088 [11] Vavekanand R, Sam K. Llama 3.1: An in-depth analysis of the next-generation large language model. Preprint, July, 2024. [12] Liu H, Feng J, Kong L, Liang N, Tao D, Chen Y, et al. One for all: Towards training one graph model for all classification tasks. arXiv preprint arXiv: 2310.00149, 2023. [13] Li Y, Wang P, Li Z, Yu J, Li J. Zerog: Investigating cross-dataset zero-shot transferability in graphs. Proceedings Of The 30th ACM SIGKDD Conference On Knowledge Discovery And Data Mining, 20241725−1735 [14] Tang J, Yang Y, Wei W, Shi L, Su L, Cheng S, et al. Graphgpt: Graph instruction tuning for large language models. Proceedings Of The 47th International ACM SIGIR Conference On Research And Development In Information Retrieval, 2024491−500 [15] Chen R, Zhao T, Jaiswal A, Shah N, Wang Z. Llaga: Large language and graph assistant. arXiv preprint arXiv: 2402.08170, 2024. [16] Rosenfeld J. Scaling Laws for Deep Learning. arXiv preprint arXiv: 2108.07686, 2021. [17] Hernandez D, Kaplan J, Henighan T, McCandlish S. Scaling Laws for Transfer. arXiv preprint arXiv: 2102.01293, 2021. [18] Radford A, Kim J, Hallacy C, Ramesh A, Goh G, Agarwal S, et al. Learning Transferable Visual Models From Natural Language Supervision. Proceedings Of The 38th International Conference On Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event, 2021, 139: 8748−8763 [19] Zhu Y, Shi H, Wang X, Liu Y, Wang Y, Peng B, et al. GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs. Proceedings Of The ACM On Web Conference 2025, 20252183−2197 [20] Han Z, Gui X, Sun H, Yin Y, Li S. Towards accurate and robust domain adaptation under multiple noisy environments. IEEE Transactions On Pattern Analysis And Machine Intelligence, 2022, 45(5): 6460−6479 [21] Wang D, Shelhamer E, Liu S, Olshausen B, Darrell T. Tent: Fully test-time adaptation by entropy minimization. arXiv preprint arXiv: 2006.10726, 2020. [22] Bao W, Zeng Z, Liu Z, Tong H, He J. Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation. arXiv preprint arXiv: 2410.06976, 2024. [23] Ju M, Zhao T, Yu W, Shah N, Ye Y. Graphpatcher: Mitigating degree bias for graph neural networks via test-time augmentation. Advances In Neural Information Processing Systems, 2023, 36: 55785−55801 doi: 10.52202/075280-2434 [24] Jin W, Zhao T, Ding J, Liu Y, Tang J, Shah N. Empowering graph representation learning with test-time graph transformation. arXiv preprint arXiv: 2210.03561, 2022. [25] Chatterjee S, Diaconis P. The sample size required in importance sampling. The Annals Of Applied Probability, 2018, 28: 1099−1135 [26] McAllester D, Stratos K. Formal limitations on the measurement of mutual information. International Conference On Artificial Intelligence And Statistics, 2020875−884 [27] Su G, Wang H, Wang J, et al. Large Language Models Meet Text-Attributed Graphs: A Survey of Integration Frameworks and Applications. arXiv preprint arXiv: 2510.21131, 2025. [28] Beiranvand A, Vahidipour S M. Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex. arXiv preprint arXiv: 2504.12474, 2025. [29] Khoshraftar S, Abedini N, Hajian A. GraphiT: Efficient Node Classification on Text-Attributed Graphs with Prompt Optimized LLMs. Companion Proceedings of the ACM on Web Conference 2025, 20251824−1829 [30] Zhang Q, Bian Y, Kong X, et al. COME: Test-time Adaption by Conservatively Minimizing Entropy. The Thirteenth International Conference on Learning Representations. 2025. [31] Lee J, Jung D, Lee S, Park J, Shin J, Hwang U, et al. Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors. arXiv preprint arXiv: 2403.07366, 2024. [32] Hu Y, Qiao C, Geng X, et al. Selective Label Enhancement Learning for Test-Time Adaptation. The Thirteenth International Conference on Learning Representations. 2025. [33] Chen G, Zhang J, Xiao X, Li Y. Graphtta: Test time adaptation on graph neural networks. arXiv preprint arXiv: 2208.09126, 2022. [34] Wang Y, Li C, Jin W, Li R, Zhao J, Tang J, et al. Test-time training for graph neural networks. arXiv preprint arXiv: 2210.08813, 2022. [35] Mao H, Du L, Zheng Y, Fu Q, Li Z, Chen X, et al. Source free graph unsupervised domain adaptation. Proceedings Of The 17th ACM International Conference On Web Search And Data Mining, 2024520−528 [36] Zheng X, Huang W, Zhou C, et al. Test-Time Graph Neural Dataset Search With Generative Projection. Forty-second International Conference on Machine Learning. 2025. [37] Zhao Y, Zhang Q, Luo X, et al. Test-time Adaptation on Graphs via Adaptive Subgraph-based Selection and Regularized Prototypes. Forty-second International Conference on Machine Learning. 2025. [38] Gadermayr M, Eschweiler D, Klinkhammer B M, et al. Gradual domain adaptation for segmenting whole slide images showing pathological variability. International conference on image and signal processing, 2018461−469 [39] Kumar A, Ma T, Liang P. Understanding self-training for gradual domain adaptation. International conference on machine learning, 20205468−5479 [40] Chen H Y, Chao W L. Gradual domain adaptation without indexed intermediate domains. Advances in neural information processing systems, 2021, 34: 8201−8214 [41] Lei P I, Chen X, Sheng Y, et al. Gradual domain adaptation for graph learning. arXiv preprint arXiv: 2501.17443, 2025. [42] Zeng Z, Qiu R, Bao W, et al. Pave your own path: Graph gradual domain adaptation on fused gromov-wasserstein geodesics[J]. arXiv preprint arXiv: 2505.12709, 2025. [43] Rampášek L, Galkin M, Dwivedi V, Luu A, Wolf G, Beaini D. Recipe for a general, powerful, scalable graph transformer. Advances In Neural Information Processing Systems, 2022, 35: 14501−14515 doi: 10.52202/068431-1054 [44] 胡正平, 孟鹏权. 全局孤立性和局部同质性图表示的随机游走显著目标检测算法. 自动化学报, 2011, 37(10): 1279−1284 doi: 10.3724/SP.J.1004.2011.01279HU Zheng-Ping, MENG Peng-Quan. Graph Presentation Random Walk Salient Object Detection Algorithm Based on Global Isolation and Local Homogeneity. ACTA AUTOMATICA SINICA, 2011, 37(10): 1279−1284 doi: 10.3724/SP.J.1004.2011.01279 [45] Bienstock D, Goemans M, Simchi-Levi D, Williamson D. A note on the prize collecting traveling salesman problem. Mathematical Programming, 1993, 59(1): 413−420 [46] Han Z, Zhang C, Fu H, Zhou J. Trusted multi-view classification with dynamic evidential fusion. IEEE Transactions On Pattern Analysis And Machine Intelligence, 2022, 45: 2551−2566 [47] Liu W, Wang X, Owens J, Li Y. Energy-based out-of-distribution detection. Advances In Neural Information Processing Systems, 2020, 33: 21464−21475 doi: 10.1109/ijcnn55064.2022.9892318 [48] Vinutha H, Poornima B, Sagar B. Detection of outliers using interquartile range technique from intrusion dataset. Information and decision sciences: Proceedings of the 6th international conference on ficta. Singapore: Springer Singapore, 2018511−518 [49] Niu S, Wu J, Zhang Y, Chen Y, Zheng S, Zhao P, et al. Efficient test-time model adaptation without forgetting. International Conference On Machine Learning, 202216888−16905 [50] Sen P, Namata G, Bilgic M, Getoor L, Galligher B, Eliassi-Rad T. Collective classification in network data. AI Magazine, 2008, 29(3): 93−93 doi: 10.1609/aimag.v29i3.2157 [51] Giles C, Bollacker K, Lawrence S. CiteSeer: An automatic citation indexing system. Proceedings Of The Third ACM Conference On Digital Libraries, 199889−98 [52] Yan H, Li C, Long R, Yan C, Zhao J, Zhuang W, et al. A Comprehensive Study on Text-attributed Graphs: Benchmarking and Rethinking. Advances in Neural Information Processing Systems, 2023, 36: 17238−17264 doi: 10.52202/075280-0755 [53] Ni J, Li J, McAuley J. Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects. Proceedings Of The 2019 Conference On Empirical Methods In Natural Language Processing And The 9th International Joint Conference On Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019. 2019: 188-197. [54] Mernyei P, Cangea C. Wiki-cs: A wikipedia-based benchmark for graph neural networks. arXiv preprint arXiv: 2007.02901, 2020. [55] Huang X, Han K, Yang Y, Bao D, Tao Q, Chai Z, et al. Can GNN be Good Adapter for LLMs?. Proceedings of the ACM Web Conference 2024, 2024893−904 [56] Reimers N, Gurevych I. Making monolingual sentence embeddings multilingual using knowledge distillation. arXiv preprint arXiv: 2004.09813, 2020. [57] Vinutha H, Poornima B, Sagar B. Detection of outliers using interquartile range technique from intrusion dataset. Information And Decision Sciences: Proceedings Of The 6th International Conference On Ficta, 2018511−518 [58] Devlin J, Chang M, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings Of The 2019 Conference Of The North American Chapter Of The Association For Computational Linguistics: Human Language Technologies, Volume 1 (long And Short Papers), 20194171−4186 [59] Reimers N, Gurevych I. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv: 1908.10084, 2019. [60] He P, Liu X, Gao J, Chen W. Deberta: Decoding-enhanced bert with disentangled attention. arXiv preprint arXiv: 2006.03654, 2020. [61] Wang L, Yang N, Huang X, Jiao B, Yang L, Jiang D, et al. Text embeddings by weakly-supervised contrastive pre-training. arXiv preprint arXiv: 2212.03533, 2022. [62] Yang A, Yang B, Zhang B, Hui B, Zheng B, Yu B, et al. Qwen2. 5 technical report. arXiv preprint arXiv: 2412.15115, 2024. [63] Velickovic P, Fedus W, Hamilton W, Liò P, Bengio Y, Hjelm R. Deep graph infomax. arXiv preprint arXiv: 1809.10341, 2018. [64] Zhu Y, Xu Y, Yu F, Liu Q, Wu S, Wang L. Deep graph contrastive representation learning. arXiv preprint arXiv: 2006.04131, 2020. [65] Thakoor S, Tallec C, Azar M, Munos R, Veličković P, Valko M. Bootstrapped representation learning on graphs. ICLR 2021 Workshop On Geometrical And Topological Representation Learning. 2021. [66] Hou Z, Liu X, Cen Y, Dong Y, Yang H, Wang C, et al. Graphmae: Self-supervised masked graph autoencoders. Proceedings Of The 28th ACM SIGKDD Conference On Knowledge Discovery And Data Mining, 2022594−604 [67] Wen Z, Fang Y. Augmenting low-resource text classification with graph-grounded pre-training and prompting. Proceedings Of The 46th International ACM SIGIR Conference On Research And Development In Information Retrieval, 2023506−516 [68] Maaten L, Hinton G. Visualizing data using t-SNE. Journal Of Machine Learning Research, 2008, 9(Nov): 2579−2605 -
计量
- 文章访问数: 6
- HTML全文浏览量: 2
- 被引次数: 0
下载: