Utilizing image super-resolution to overcome information bottlenecks in vision transformers

Jamal  Zraqou; Riyad  Alrousan; Bilal  Sowan; Jawad  Alkhatib

doi:10.53894/ijirss.v8i3.7383

Social Sciences

Jamal Zraqou, Riyad Alrousan, Bilal Sowan, Jawad Alkhatib

https://doi.org/10.53894/ijirss.v8i3.7383

Issue
Vol. 8 No. 3 (2025)

Keywords:

CNN, Feature map intensity, Information bottlenecks, Super-resolution, Swin transformer.

PDF

Abstract

This research tends to solve the information bottleneck challenge in vision transformer-based solutions for image super-resolution, where the intensity of the feature map reduces in deeper network layers, thus affecting model performance. LITRL, the Layer-Interconnected Transformer with Residual Links, provides stability to the information flow by means of the dense residual connections between the layers, with the aim of preventing spatial information loss. The methodology involves the integration of the Swin transformer architecture and new schemes of interconnections to maintain vital spatial features in the whole network. Experimental results show that the LITRL-based method gives better results on traditional benchmark datasets (Set5, Set14, BSD100, Urban100, Manga109), in terms of quantitative (PSNR, SSIM) and qualitative evaluation. At 4×, LITRL obtains PSNR/SSIM of 40.37/0.9628 on Set 5 and 35.70/0.9408 on Urban100 with far higher performance than comparable methods. The proposed LITRL model dramatically reduces the information bottleneck of transformer-based super-resolution. It retains fundamental spatial information due to the dense-residual connections, giving rise to sharper images with more natural textures and fewer artefacts. Practical Implications: The excellent performance of LITRL in generating complex textures and structures that, in turn, enables accurate reconstruction, makes the method particularly useful for the tasks where the retention of a high level of fidelity of the image enhancement is imperative, i.e., for medical imaging, analysis of satellite images, and developing digital content while requiring a reasonable computational efficiency.

Authors

Jamal Zraqou

Department of Computer Science, Faculty of Information Technology, University of Petra, Amman, Jordan.

https://orcid.org/0000-0001-9060-7188

Jamal.Zraqou@uop.edu.jo (Primary Contact)

Riyad Alrousan

Department of Design & Visual Communication, School of SABE), German Jordanian University (GJU), Amman, Jordan.

https://orcid.org/0000-0001-6383-1117

Bilal Sowan

Department of Business Intelligence & Data Analytics, Faculty of Administrative & Financial Sciences, UOP, Amman, Jordan.

https://orcid.org/0000-0002-1933-4196

Jawad Alkhatib

Department of Computer Engineering, Prince Mohamad Bin Fahd University, Dhahran, Saudi Arabia.

https://orcid.org/0000-0001-7611-7887

Zraqou, J. ., Alrousan, R. ., Sowan, B. ., & Alkhatib, J. . (2025). Utilizing image super-resolution to overcome information bottlenecks in vision transformers. International Journal of Innovative Research and Scientific Studies, 8(3), 3734–3749. https://doi.org/10.53894/ijirss.v8i3.7383

Download Citation

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

	All	Since 2020
Citations	2836	2676
h-index	22	21
i10-index	71	71

Article Sidebar

Abstract

Authors

Article Details

Cited byView all

Cited by