Optimal RoPE extension via Bayesian Optimization for training-free length generalization
2025
Transformers are designed to process input of variable length without resource constraints. However, their performance significantly deteriorates when the input surpasses a threshold slightly larger...
Multimodal marvels of deep learning in medical diagnosis using image, speech, and text: A comprehensive review of COVID-19 detection
2025
This study presents a comprehensive review of the potential of multimodal deep learning (DL) in medical diagnosis, using COVID-19 as a case example. Motivated by the success of artificial intelligence...
ChatLLM network: More brains, more intelligence
2025
Dialogue-based language models mark a huge milestone in the field of artificial intelligence, by their impressive ability to interact with users, as well as a series of challenging tasks prompted by...
Solving the enigma: Enhancing faithfulness and comprehensibility in explanations of deep networks
2025
The accelerated progress of artificial intelligence (AI) has popularized deep learning models across various domains, yet their inherent opacity poses challenges, particularly in critical fields like...
Methodologies and their comparison in complex compound aspect-based sentiment analysis: A survey
2025
Sentiment analysis as a part of natural language processing (NLP) has received much attention following the demand to understand people’s opinions. Aspect-based sentiment analysis (ABSA) is a fine-grained...
FedGPA: Federated Learning with Global Personalized Aggregation
2025
A significant challenge in Federated Learning (FL) is addressing the heterogeneity of local data distribution across clients. Personalized Federated Learning (PFL), an emerging method aimed at overcoming...
DFM: Dialogue foundation model for universal large-scale dialogue-oriented task learning
2025
Building a universal conversational agent has been a long-standing goal of the dialogue research community. Most previous works only focus on a small set of dialogue tasks. In this work, we aim to build...
Client: Cross-variable linear integrated enhanced transformer for multivariate long-term time series forecasting
2025
Long-term time series forecasting (LTSF) is crucial in modern society, playing a pivotal role in facilitating long-term planning and developing early warning systems. While many Transformer-based models...
SafeCast: Risk-responsive motion forecasting for autonomous vehicles
2025
Accurate motion forecasting is essential for the safety and reliability of autonomous driving (AD) systems. While existing methods have made significant progress, they often overlook explicit safety...
Multi-spatial semantic information aggregation network for 3D human motion prediction
Available online 3 September 2025
In recent years, GCN-based methods have achieved great success in skeleton-based human motion prediction tasks due to the human body graph structure. However, existing methods leveraged single semantic...
Proactive Recommendation in Social Networks: Steering user interest with causal inference
Available online 2 September 2025
Recommending items that solely cater to users’ historical interests narrows users’ horizons. Recent works have considered steering target users beyond their historical interests by directly adjusting...
Adaptive negative representations for graph contrastive learning
2024
Graph contrastive learning (GCL) has emerged as a promising paradigm for learning graph representations. Recently, the idea of hard negatives is introduced to GCL, which can provide more challenging...
How to generate popular post headlines on social media?
2024
Posts, as important containers of user-generated-content on social media, are of tremendous social influence and commercial value. As an integral component of post, headline has decisive influence on...
PM2.5 forecasting under distribution shift: A graph learning approach
2024
We present a new benchmark task for graph-based machine learning, aiming to predict future air quality (PM2.5 concentration) observed by a geographically distributed network of environmental sensors....
Label-aware debiased causal reasoning for Natural Language Inference
2024
Recently, researchers have argued that the impressive performance of Natural Language Inference (NLI) models is highly due to the spurious correlations existing in training data, which makes models...
Boosting graph search with attention network for solving the general orienteering problem
2024
Recently, several studies explore to use neural networks(NNs) to solve different routing problems, which is an auspicious direction. These studies usually design an encoder–decoder based framework that...
An ecosystem for personal knowledge graphs: A survey and research roadmap
2024
This paper presents an ecosystem for personal knowledge graphs (PKGs), commonly defined as resources of structured information about entities related to an individual, their attributes, and the relations...
Few-shot Named Entity Recognition via encoder and class intervention
2024
In the real world, the large and complex nature of text increases the difficulty of tagging and results in a limited amount of tagged text. Few-shot Named Entity Recognition(NER) only uses a small amount...
CPT: Colorful Prompt Tuning for pre-trained vision-language models
2024
Vision-Language Pre-training (VLP) models have shown promising capabilities in grounding natural language in image data, facilitating a broad range of cross-modal tasks. However, we note that there...
Enhancing neural network classification using fractional-order activation functions
2024
In this paper, a series of novel activation functions is presented, which is derived using the improved Riemann–Liouville conformable fractional derivative (RLCFD). This study investigates the use of...
GPT understands, too
2024
Prompting a pretrained language model with natural language patterns has been proved effective for natural language understanding (NLU). However, our preliminary study reveals that manual discrete prompts...
Improving trajectory classification through Kramers–Moyal coefficients
2024
Trajectory classification focuses on predicting the class or category of a moving object based on its observed movement over time. The classification of trajectory data using classical approaches can...
MindLLM: Lightweight large language model pre-training, evaluation and domain application
2024
Large Language Models (LLMs) have demonstrated remarkable performance across various natural language tasks, marking significant strides towards general artificial intelligence. While general artificial...
Authorship style transfer with inverse transfer data augmentation
2024
Authorship style transfer aims to modify the style of neutral text to match the unique speaking or writing style of a particular individual. While Large Language Models (LLMs) present promising solutions,...
Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature
2024
Biomedical knowledge is typically organized in a relational scheme, such as chemical-disease relation, gene-disease relation, and gene-pathway relation. Biomedical scientists heavily rely on search...