The Science and Information (SAI) Organization
  • Home
  • About Us
  • Journals
  • Conferences
  • Contact Us

Publication Links

  • IJACSA
  • Author Guidelines
  • Publication Policies
  • Outstanding Reviewers

IJACSA

  • About the Journal
  • Call for Papers
  • Editorial Board
  • Author Guidelines
  • Submit your Paper
  • Current Issue
  • Archives
  • Indexing
  • Fees/ APC
  • Reviewers
  • Apply as a Reviewer

IJARAI

  • About the Journal
  • Archives
  • Indexing & Archiving

Special Issues

  • Home
  • Archives
  • Proposals
  • ICONS_BA 2025

Computer Vision Conference (CVC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Computing Conference

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Intelligent Systems Conference (IntelliSys)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Future Technologies Conference (FTC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact
  • Home
  • Call for Papers
  • Editorial Board
  • Guidelines
  • Submit
  • Current Issue
  • Archives
  • Indexing
  • Fees
  • Reviewers
  • RSS Feed

DOI: 10.14569/IJACSA.2026.0170125
PDF

Attention-Enhanced Hierarchical Transformer for Multimodal Integration of Mammograms and Clinical Data

Author 1: N. Kannaiya Raja
Author 2: V S Krushnasamy
Author 3: Nurilla Mahamatov
Author 4: Prasad Devarasetty
Author 5: S.T. Gopukumar
Author 6: Sanjiv Rao Godla
Author 7: Vuda Sreenivasa Rao

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 17 Issue 1, 2026.

  • Abstract and Keywords
  • How to Cite this Article
  • {} BibTeX Source

Abstract: Breast cancer has been listed as one of the leading causes of death amongst women all over the world, and the current diagnostic techniques, which are founded on the manual examination of mammograms or individual clinical presentations, are often subjective, neither being consistent nor generalizable. The existing computer-aided diagnosis (CAD) systems are also characterized by significant weaknesses related to poor multimodal integration, no interpretability, and vulnerability to class imbalance. In order to address the inadequacy, the present study introduces an advanced multimodal deep learning framework named Hybrid Graph-Generative Transformer (HGGT), designed to integrate high-resolution mammographic images with the clinical, demographic, proteomic, and histological data pertinent to the patient. The HGGT network is a hierarchical Swin Transformer and CNN-based feature extraction, a Graph Attention Network (GAT) (to identify clinical variable interaction), and a contrastive cross-modal generative fusion system (to match the different modalities). The diagnostic head employs a Bayesian uncertainty-aware classifier to ensure more reliability in the prediction of malignancy. It is trained on 5-fold cross-validation, AdamW, and a cosine annealing scheduler, which is set on Python 3.10. It is demonstrated by the performance of the CBIS-DDSM mammography dataset and a corresponding clinical dataset consisting of over 400 patients that HGGT is much superior with 98.2% accuracy, 98.7% precision, 98.5% recall, 99.2% F1-score, and 99.1% AUC-ROC, having a significant advantage over the established models of ResNet50, EfficientNet-B0 and GAN-enhanced CNN classifier. Overall, the HGGT framework is delivering a scalable, interpretable, and highly accurate diagnosis solution that was a huge improvement over the existing unimodal and poorly integrated CAD system in the detection of breast cancer.

Keywords: Breast cancer diagnosis; multimodal deep learning; Graph Attention Network; Bayesian uncertainty estimation; explainable AI

N. Kannaiya Raja, V S Krushnasamy, Nurilla Mahamatov, Prasad Devarasetty, S.T. Gopukumar, Sanjiv Rao Godla and Vuda Sreenivasa Rao. “Attention-Enhanced Hierarchical Transformer for Multimodal Integration of Mammograms and Clinical Data”. International Journal of Advanced Computer Science and Applications (IJACSA) 17.1 (2026). http://dx.doi.org/10.14569/IJACSA.2026.0170125

@article{Raja2026,
title = {Attention-Enhanced Hierarchical Transformer for Multimodal Integration of Mammograms and Clinical Data},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2026.0170125},
url = {http://dx.doi.org/10.14569/IJACSA.2026.0170125},
year = {2026},
publisher = {The Science and Information Organization},
volume = {17},
number = {1},
author = {N. Kannaiya Raja and V S Krushnasamy and Nurilla Mahamatov and Prasad Devarasetty and S.T. Gopukumar and Sanjiv Rao Godla and Vuda Sreenivasa Rao}
}



Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

IJACSA

Upcoming Conferences

Computer Vision Conference (CVC) 2026

21-22 May 2026

  • Amsterdam, The Netherlands

Computing Conference 2026

9-10 July 2026

  • London, United Kingdom

Artificial Intelligence Conference 2026

3-4 September 2026

  • Amsterdam, The Netherlands

Future Technologies Conference (FTC) 2026

15-16 October 2026

  • Berlin, Germany
The Science and Information (SAI) Organization
BACK TO TOP

Computer Science Journal

  • About the Journal
  • Call for Papers
  • Submit Paper
  • Indexing

Our Conferences

  • Computer Vision Conference
  • Computing Conference
  • Intelligent Systems Conference
  • Future Technologies Conference

Help & Support

  • Contact Us
  • About Us
  • Terms and Conditions
  • Privacy Policy

The Science and Information (SAI) Organization Limited is a company registered in England and Wales under Company Number 8933205.