Ongoing and recently accepted works
Multi-modal Graph Contrastive Learning with HyperGNN
Dynamic Graph Contrastive Learning
Tackling Oversmoothing in GNN
Transformer Model for Hypergraph
Triplet attention Networks
Reinforcement Learning-based Hypergraph Model for Hyperedge Prediction Task (predicting member nodes)
***A summary of the ongoing projects will be posted soon***
Topic: Expanding the Horizons of Hypergraph Neural Networks: Advanced Methodologiesย
๐HyperGCL: Multi-Modal Graph Contrastive Learning via Learnable Hypergraph Views --IJCNN'25 (Rank A)
โก๏ธ Recent advancements in Graph Contrastive Learning (GCL) have demonstrated remarkable effectiveness in improving graph representations. However, relying on predefined augmentations (e.g., node dropping, edge perturbation, and attribute masking) may result in the loss of task-relevant information and a lack of adaptability to diverse input data. Furthermore, the selection of negative samples remains rarely explored. In this paper, we introduce HyperGCL, a novel multimodal GCL framework from a hypergraph perspective. HyperGCL constructs three distinct hypergraph views by jointly utilizing the input graph's structure and attributes, enabling a comprehensive integration of multiple modalities in contrastive learning. A learnable adaptive topology augmentation technique enhances these views by preserving important relations and filtering out noise. View-specific encoders capture essential characteristics from each view, while a network-aware contrastive loss leverages the underlying topology to define positive and negative samples effectively. Extensive experiments on benchmark datasets demonstrate that HyperGCL achieves state-of-the-art node classification performance.ย
๐Topology-guided Hypergraph Transformer Network: Unveiling Structural Insights for Improved Representations (Stanford Graph Learning Workshop'24)
โก๏ธInspired by the success of using transformers in NLP and CV, researchers have explored their potentiality in Graph Learning, leading to the development of advanced models like Graph Transformer, Graphormer, Gophormer, Graph-Bert, etc.
๐กIncorporating graph topology information is crucial in adapting transformers for graph data. Without this, the application of transformers becomes akin to processing IID data, which fails to capture the essence of graph structures and becomes less meaningful.
๐ Our research has identified a notable gap in this area, especially regarding ๐๐ฒ๐ฉ๐๐ซ๐ ๐ซ๐๐ฉ๐ก ๐๐ซ๐๐ง๐ฌ๐๐จ๐ซ๐ฆ๐๐ซ๐ฌ. While working on this, we found only a few papers worked on the modeling transformers for hypergraphs (hypergraphs transformers), and even then, they often overlooked the vital aspect of hypergraph topology information. Moreover, they are highly limited for Heterogeneous hypergraphs, resulting in a lack of generalization.
โก To address these challenges, we present a novel approach: a ๐ป๐๐๐๐๐๐๐-๐ฎ๐๐๐ ๐๐ ๐ฏ๐๐๐๐๐๐๐๐๐ ๐ป๐๐๐๐๐๐๐๐๐๐ ๐ต๐๐๐๐๐๐ (๐ป๐ฏ๐ป๐ต) designed explicitly for hypergraph learning. This model is a significant stride in graph analysis๐ฅ, incorporating different innovative ๐ฌ๐ญ๐ซ๐ฎ๐๐ญ๐ฎ๐ซ๐-๐ฉ๐จ๐ฌ๐ข๐ญ๐ข๐จ๐ง ๐๐ง๐๐จ๐๐๐ซ๐ฌ. These encoders are instrumental in embedding both structural and spatial information of hypernodes, ensuring the model's sensitivity to the unique characteristics of hypergraphs.
โก Additionally, we have developed different unique ๐ฅ๐จ๐๐๐ฅ ๐๐ง๐ ๐ ๐ฅ๐จ๐๐๐ฅ ๐ฌ๐ญ๐ซ๐ฎ๐๐ญ๐ฎ๐ซ๐ ๐ฅ๐๐๐ซ๐ง๐ข๐ง๐ ๐ฆ๐จ๐๐ฎ๐ฅ๐๐ฌ working as topological inductive bias. These are integrated within the self-attention networks of our model, playing a crucial role in discerning the structural importance of hypernodes for a hyperedge and hyperedges for hypernodes in a hypergraph.
๐ Through these advancements, our research aims to provide a more robust and nuanced approach to hypergraph learning, leveraging the complexities of hypergraph structures to enhance the capabilities of transformer models for hypergraphs.
Topic: Tackling Oversmoothing in GNN
โก๏ธGraph Neural Network (GNN) achieves great success for node-level and graph-level tasks via encoding meaningful topological structures of networks in various domains, ranging from social to biological networks. However, repeated aggregation operations lead to excessive mixing of node representations, particularly in dense regions with multiple GNN layers, resulting in nearly indistinguishable embeddings. This phenomenon leads to the oversmoothing problem that hampers downstream graph analytics tasks. To overcome this issue, we propose a novel and flexible truss-based graph sparsification model that prunes edges from dense regions of the graph. Pruning redundant edges in dense regions helps to prevent the aggregation of excessive neighborhood information during hierarchical message passing and pooling in GNN models. We then utilize our sparsification model in the state-of-the-art baseline GNNs and pooling models, such as GIN, SAGPool, GMT, DiffPool, MinCutPool, HGP-SL, DMonPool, and AdamGNN. Extensive experiments on different real-world datasets show that our model significantly improves the performance of the baseline GNN models in the graph classification task.ย
Topic: Dynamic Graph Learning
๐DyGCL: Dynamic Graph Contrastive Learning For Event Prediction-- IEEE BigData'24ย
โก๏ธPredicting events, ranging from political unrest to disease outbreaks and criminal activities, stands as a pivotal task in proactively addressing emerging challenges. Despite the richness of textual data as a source for event detection, it is challenging to extract contextual information from documents due to their complex structure and the dynamic evolution of events. In response to this challenge, dynamic Graph Neural Networks (GNNs) have emerged as a promising tool for capturing the intricate patterns embedded within textual data graphs. Nevertheless, many models in this domain primarily rely on local node-level representations, overlooking the essential global graph-level context. However, both node-level and graph-level representations are critical for effective event prediction. Node-level representations provide insight into the local structure, while graph-level representations offer an understanding of the global structure and the evaluation of temporal graphs. To address these challenges, in this paper, we propose a Dynamic Graph Contrastive Learning (DyGCL) method for event prediction. Our model DyGCL first employs a local view encoder to effectively capture the local dynamic structure of input graphs as the evolving node representations. Then, it performs a global view encoder to perceive the hierarchical dynamic graph representation of the input graphs. Finally, the graph representations from both encoders, optimized via contrastive learning, are combined with an attention mechanism and utilized to predict future events. Our extensive experiments demonstrate that our proposed method outperforms the state-of-the-art methods for event prediction on six real-world datasets.ย
Topic: Revolutionizing Drug Discovery with Novel Graph/Hypergraph Learning Approaches
โก๏ธHyGNN is an attention-based encoder-decoder architecture designed to predict drug-drug interactions (DDIs). HyGNN relies on the hypothesis that similar drugs behave similarly, are likely to interact with the same drugs, and two drugs are similar if they have similar substructures as functional groups in their SMILES strings. To properly depict the structural-based similarity between drugs, we present them in a novel hypergraph setting (Drug-Hypergraph), representing drugs as hyperedges connecting many substructures as nodes.
After constructing the hypergraph, we develop a Hypergraph Neural Network (HyGNN), a model that learns the DDIs by generating and using the representation of hyperedges as drugs. HyGNN proposes ๐ ๐ง๐จ๐ฏ๐๐ฅ ๐ก๐ฒ๐ฉ๐๐ซ๐ ๐ซ๐๐ฉ๐ก ๐๐๐ ๐ ๐๐ง๐๐จ๐๐๐ซ ๐๏ธ consisting of two layers of attention (node and edge level) mechanism which can precisely ๐๐ข๐ฌ๐๐จ๐ฏ๐๐ซ ๐ข๐ฆ๐ฉ๐จ๐ซ๐ญ๐๐ง๐ญ ๐ฌ๐ฎ๐๐ฌ๐ญ๐ซ๐ฎ๐๐ญ๐ฎ๐ซ๐๐ฌย ๐ from a chemical compound. Finally, a decoder is exploited to predict DDIs.
Extensive experiments are performed on the proposed methods and some baselines as well. Results show that the proposed HyGNN can accurately predict DDIs, and its performance significantly outperforms all the baselines ๐ฅ, including two notable pioneer research works, CASTER and Decagon, even for ๐๐ซ๐๐ง๐ ๐ง๐๐ฐ ๐๐ซ๐ฎ๐ ๐ฌ.ย
๐HeTAN: Heterogeneous Graph Triplet Attention Network for Drug Repurposing-- DSAA'24 (Rank A)
โก๏ธModeling the interactions between drugs, targets, and diseases has significant implications for drug discovery, precision medicine, and personalized treatments. Current computational approaches consider pairwise interaction, including drug-target or drug-disease interaction individually. On the other hand, within human metabolic systems, the interaction of drugs with protein targets in cells influences target activities. Moving beyond binary relationships and exploring tighter relationships together as triple is essential to understanding drugsโ mechanism of action (MoAs). Moreover, considering the heterogeneity of drugs, targets, and diseases, along with their distinct characteristics, it is critical to model these complex interactions appropriately. To address these challenges, we develop a novel Heterogeneous Graph Triplet Attention Network (HeTAN) by modeling the interconnectedness of all entities in a heterogeneous graph. HeTAN introduces a novel triplet message passing and triplet-wise attention mechanism within this heterogeneous graph structure. In contrast to focusing only on pairwise attention as the importance of an entity for the other, we define triplet attention to model the importance of pairs for the other in the drug-target-disease triplet prediction problem. We perform extensive experiments on real-world datasets and our results show that HeTAN outperforms several baselines, demonstrating its superior performance in uncovering novel drug-target-disease relationships.ย
๐DDI Prediction via Heterogeneous Graph Attention Networkย ย ย ย ย ย ย ACM-BCB'23 and BioKDD @ KDD'22
โก๏ธDrug-drug interaction (DDI) is the activity that occurs when the impact of one drug changes when combined with another. DDIs may obstruct, increase, or decrease the intended effect of either drug or, in the worst-case scenario, create adverse side effects. In this paper, we present a novel heterogeneous graph attention model, HAN-DDI, to predict drug-drug interactions. We create a heterogeneous network of drugs with different biological entities. Then, we develop a heterogeneous graph attention network to learn DDIs using the relations of drugs with other entities. It consists of an attention-based heterogeneous graph node encoder for obtaining drug node representations and a decoder for predicting drug-drug interactions. Further, we utilize comprehensive experiments to evaluate our model and to compare it with state-of-the-art models. Experimental results show that our proposed method, HAN-DDI, outperforms the baselines significantly and accurately predicts DDIs, even for new drugs.
๐Drug-Drug Interaction Prediction: a Purely SMILES Based Approachย ย ย ย ย ย IEEE BigData'21ย
โก๏ธIn this paper, we propose a novel method for predicting DDIs based on the vital chemical substructure of drugs extracted from their SMILES strings. We construct a graph that connects drugs based on their common functional chemical substructures. Furthermore, we apply different well-known graph neural network (GNN) methods to generate drug embeddings. Drug embeddings of individual drugs are concatenated to generate features of drug pairs. Finally, drug pair features are fed to different machine learning (ML) classifiers for DDI prediction. We evaluate our model on the DrugBank dataset. Our result shows promising results, and our model outperforms a baseline model based on different DDI representation creation methods.
Topic: ย Advancing Sequence Data Analysis through Innovative Graph/Hypergraph Learning Models
๐Seq-HyGAN: Sequence Classification via Hypergraph Attention Network ย ACM CIKM'23 (Rank - A/A*) & MLG @ KDD'23ย
โก๏ธExtracting meaningful features from sequences and devising effective similarity measures are vital for sequence data mining tasks, particularly sequence classification. While Neural Network models are commonly used to learn features of sequence automatically, they are limited to capturing adjacent structural connection information and ignore global, higher-order information between the sequences.ย
To address these challenges, we propose a novel Hypergraph Attention Network model, namely Seq-HyGAN, for sequence classification problems. To capture the complex structural similarity between sequence data, we create a novel hypergraph model by defining higher-order relations between subsequences (nodes) extracted from sequences (hyperedges). Subsequently, we introduce a Sequence Hypergraph Attention Network that learns sequence features by considering the significance of subsequences and sequences to one another. It consists of three levels of aggregation with attention that captures different levels of context. At the first level, it generates node embedding that incorporates global context by aggregating hyperedge embeddings. At the second level, the model refines node embeddings for each hyperedge. It captures local context by aggregating neighboring node embeddings in the same hyperedge and considering the subsequence position in a sequence. Finally, at the third level, it generates sequence embedding by aggregating node embeddings from both global (level 1) and local (level 2) perspectives, resulting in a comprehensive representation of the sequencesย
Through extensive experiments, we demonstrate the effectiveness of our proposed Seq-HyGAN model in accurately classifying sequence data, outperforming several state-of-the-art methods by a significant margin.ย
๐Drug Abuse Detection in Twitter-sphere: Graph-Based Approachย ย ย ย ย ย ย ย ย IEEE BigData'21ย
โก๏ธThe rate of non-medical use of opioid drugs has increased markedly since the early 2000s. Many studies have been done to detect Drug Abuse (DA) events from social media data using machine learning and deep learning concepts. Moreover, Graph Neural Networks (GNNs) have recently become popular in text classification tasks due to their high accuracy and capability to handle complex structures. In this work, we collect drugs-related Twitter data (tweets) and build text graphs (corpus-level and document-level) to capture word-word, document-word, and document-document relations. Then we apply different GNN models on those text graphs and thus turn the text classification task into a node classification (for corpus-level graph) and graph classification (for document-level graph) task to detect DA events. Finally, we compare our graph-based DA detection models with different types of baseline models, including rule-based, traditional machine learning, and deep learning models. Our result shows that graph-based models outperform traditional machine learning and deep learning-based models.
Topic: Biomedical knowledge mining from Social Networks
โก๏ธOpioid Use Disorder (OUD) is one of the most severe healthcare problems in the USA. People addicted to opioids need various treatments, including Medication-Assisted Treatment (MAT), proper counseling, and behavioral therapies. However, during the peak time of the COVID-19 pandemic, the supply of emergency medications was seriously disrupted. Patients faced severe medical care scarcity since many pharmaceutical companies, drugstores, and local pharmacies were closed. Import-export was also canceled to consent to the government emergency law, i.e., lockdown, quarantine, and isolation. These circumstances and their negative effects on OUD patientsโ psychology could have led them to drop out of MAT medications and be persuaded to resume illicit opioid use. This project involves collecting and analyzing a large volume of Twitter data related to MAT medications for OUD patients. We discover the Active MAT Medicine Users (AMMUs) on Twitter. For this, we build a seed dictionary of words related to OUD and MAT and apply association rules to expand it. Further, AMMUsโ tweet posts are studied โbefore the pandemicโ (BP) and โduring the pandemicโ (DP) to understand how drug behaviors and habits have changed due to COVID-19. We also perform sentiment analysis on Tweets to determine the impact of the COVID-19 pandemic on the psychology of AMMUs. Our analysis shows that the use of MAT medications has decreased by around 30.54%, where the use of illicit drugs and other prescription opioids increased by 18.06% and 12.12%, respectively, based on AMMUsโ tweets posted during the lockdown compared with before the lockdown statistics. The COVID-19 pandemic and lockdown may result in the resumption of illegal and prescription opioid abuse by OUD patients. Necessary steps and precautions should be taken by health care providers to ensure the emergency supply of medicines and also psychological support and thus prevent patients from illicit opioid use.