Dual-Attention Multimodal Network for Explainable Lung Nodule Malignancy Assessment Through Integration of CT Imaging and Molecular Phase-Separation Biomarkers

Authors

  • Kenneth A. Richards Department of Computer Science, Binghamton University, Binghamton, NY, USA.
  • Trancis Breene Department of Computer Science, University of Houston, Houston, TX, USA.

Keywords:

lung nodule malignancy, dual-attention network, multimodal fusion, computed tomography, phase separation, explainable AI, clinical decision support, federated learning

Abstract

Lung cancer remains the leading cause of cancer mortality worldwide, with low-dose computed tomography screening generating a large number of indeterminate pulmonary nodules that demand precise risk stratification. While deep learning approaches applied to CT images have substantially improved nodule classification, they often lack the molecular contextualization necessary for personalized decision-making and fail to provide actionable explanations for their predictions. Concurrently, molecular biology has uncovered the relevance of biomolecular condensates formed through liquid-liquid phase separation, such as the YAP-MAML2 transcriptional co-activator complexes, as drivers of aggressive tumor phenotypes. This paper proposes a dual-attention multimodal network architecture that integrates volumetric CT imaging with molecular phase-separation biomarkers to deliver explainable malignancy assessment. The system employs a dual-attention mechanism that combines channel-spatial self-attention within the imaging pathway and cross-modal attention between image embeddings and condensate state features, generating spatial attention maps that highlight nodule regions alongside molecular feature importance scores. We discuss system-level structural trade-offs including data harmonization from heterogeneous clinical sources, federated learning strategies to preserve patient privacy, computational sustainability, robustness to domain shift, and fairness across demographic subgroups. The deployment framework is situated within emerging governance standards and regulatory pathways for artificial intelligence as a medical device, emphasizing explainability as a cornerstone for clinical trust. By bridging radiological phenotyping and molecular biophysics, the proposed architecture addresses critical gaps in multimodal fusion for lung cancer diagnostics and charts a course toward integrated precision oncology platforms.

References

1. Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., & Bray, F. (2021). Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 71(3), 209–249.

2. Shen, W., Zhou, M., Yang, F., Yang, C., & Tian, J. (2017). Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification. Pattern Recognition, 61, 663–673.

3. Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015 (pp. 234–241). Springer.

4. Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., & Lu, H. (2019). Dual attention network for scene segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 3146–3154).

5. Shin, Y., & Brangwynne, C. P. (2017). Liquid phase condensation in cell physiology and disease. Science, 357(6357), eaaf4382.

6. Boeynaems, S., Alberti, S., Fawzi, N. L., Mittag, T., Polymenidou, M., Rousseau, F., ... & Fuxreiter, M. (2018). Protein phase separation: A new phase in cell biology. Trends in Cell Biology, 28(6), 420–435.

7. Chung, C. I., Yang, J., Yang, X., Liu, H., Ma, Z., Szulzewsky, F., ... & Shu, X. (2024). Phase separation of YAP-MAML2 differentially regulates the transcriptome. Proceedings of the National Academy of Sciences, 121(7), e2310430121.

8. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems, 30.

9. Baltrusaitis, T., Ahuja, C., & Morency, L. P. (2019). Multimodal machine learning: A survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(2), 423–443.

10. Chen, R. J., Lu, M. Y., Weng, W. H., Chen, T. Y., Williamson, D. F. K., Manz, T., ... & Mahmood, F. (2022). Multimodal co-attention transformer for survival prediction in gigapixel whole slide images. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 4015–4025).

11. Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-CAM: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 618–626).

12. Tjoa, E., & Guan, C. (2020). A survey on explainable artificial intelligence (XAI): Toward medical XAI. IEEE Transactions on Neural Networks and Learning Systems, 32(11), 4793–4813.

13. Ardila, D., Kiraly, A. P., Bharadwaj, S., Choi, B., Reicher, J. J., Peng, L., ... & Shetty, S. (2019). End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nature Medicine, 25(6), 954–961.

14. Chang, C., Fu, M., Chen, X., Feng, S., Zhang, M., Zhou, X., ... & Liu, Z. (2025, November). Research on PDU-Net Lung Nodule Segmentation Algorithm Based on Path Aggregation and Dual Attention. In 2025 4th International Conference on Image Processing, Computer Vision and Machine Learning (ICICML) (pp. 1897-1900). IEEE.

15. Finlayson, S. G., Bowers, J. D., Ito, J., Zittrain, J. L., Beam, A. L., & Kohane, I. S. (2019). Adversarial attacks on medical AI systems. Science, 363(6433), 1287–1289.

16. Obermeyer, Z., Powers, B., Vogeli, C., & Mullainathan, S. (2019). Dissecting racial bias in an algorithm used to manage the health of populations. Science, 366(6464), 447–453.

17. Rieke, N., Hancox, J., Li, W., Milletari, F., Roth, H. R., Albarqouni, S., ... & Cardoso, M. J. (2020). The future of digital health with federated learning. NPJ Digital Medicine, 3(1), 119.

18. Cath, C. (2018). Governing artificial intelligence: Ethical, legal and technical opportunities and challenges. Philosophical Transactions of the Royal Society A, 376(2133), 20180080.

19. Strubell, E., Ganesh, A., & McCallum, A. (2019). Energy and policy considerations for deep learning in NLP. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 3645–3650).

20. Lyon, A. S., Peeples, W. B., & Rosen, M. K. (2021). A framework for understanding the functions of biomolecular condensates across scales. Nature Reviews Molecular Cell Biology, 22(3), 215–235.

Downloads

Published

2026-06-12

How to Cite

Kenneth A. Richards, & Trancis Breene. (2026). Dual-Attention Multimodal Network for Explainable Lung Nodule Malignancy Assessment Through Integration of CT Imaging and Molecular Phase-Separation Biomarkers. International Journal of Clinical and Translational Medicine, 1(1). Retrieved from https://ijctmed.org/index.php/home/article/view/137